Enabling ML Based Research

Slides:



Advertisements
Similar presentations
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Advertisements

Careers in CS & Engineering. CS & Engineering careers are not all this….
Topics in Computational Biology (COSI 230a) Pengyu Hong 09/02/2005.
An Overview of the NISE Network Presentation Overview NISE Network Network Community Educational Products Get More Involved.
Data Processing Capabilities (Quantum & Dimensions – TOM)
Welcome to HTCondor Week #14 (year #29 for our project)
On Education Gerrit C. van der Veer most work done by Anne Bowser Elizabeth Churchill Jennifer Preece.
Curriculum Transformation Moving towards the Reality.
© Heikki Topi Data Science and Computing Education ACM Education Council Portland, OR September 16-17, 2014 Heikki Topi, Bentley University.
Panel Discussion Part I Methodology Ideas from adult MR brain segmentation are used in neonatal MR brain segmentation. However, additional challenges.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Discovery Informatics Workshop Social Computing Challenges DRAFT.
Three Critical Matters in Big Data Projects for e- Science Kerk F. Kee, Ph.D. Assistant Professor, Chapman University Orange, California
Plumbing the Computing Platforms of Big Data Dilma Da Silva Professor & Department Head Computer Science & Engineering Texas A&M University.
College of Engineering and Science Louisiana Tech University College of Engineering and Science Connecting Mathematics with Engineering and the Sciences.
OMIS 694, Big Data Analytics
MDL Information Systems, Inc. Powering the Process of Invention Donna del Rey Director, Business Planning
Pengenalan Ilmu Komputasi. Computational Science??
The Impact of CIRTL on Member Institutions Lucas Hill, Ann E. Austin Michigan State University Summary Methods What do institutions gain from Network Participation?
FACULTY EXTERNSHIP OPPORTUNITIES IN DATA SCIENCE AND DATA ANALYTICS Facilitated by: FilAm Software Technology, Clark Freeport Zone Ecuiti, San Francisco,
AN OPPORTUNITY TO ENTER INTO A JOINT VENTURE PARTNERSHIP WITH NTIS FOR DATA INNOVATION NTIS Information Session Department of Commerce Auditorium and Webcast.
András Benczúr Head, “Big Data – Momentum” Research Group Big Data Analytics Institute for Computer.
Software Engineering Experimentation
EGI towards H2020 Feedback (from survey)
Good afternoon. My name is Maura Donovan and I am Executive Director of the Office of Economic Development at the University and a representative of the.
Advanced Research Computing
EPSCoR means “Experimental Program to Stimulate Competitive Research
H2020 Future and Emerging Technologies (FET) FET-Open
Industry Advisory Board
Engineering (Richard D. Braatz and Umberto Ravaioli)
The Fort Worth ISD Learning Model
Big Data Panel Discussion
CSPA & Digital Transformation
CARER Proposal Writing Workshop November 2004
KnowEnG: A SCALABLE KNOWLEDGE ENGINE FOR LARGE SCALE GENOMIC DATA
Kostas Seferis, i2S Data science and e-infrastructures can help aquaculture to improve performance and sustainability!
Future Trends in Nuclear Physics Computing Workshop
Mike Gualtieri, Principal Analyst Rowan Curran, Researcher
Professional Services in Open Source
Sergei V. Gleyzer University of Florida
Applying for NSF CAREER Grants
Azure Hybrid Use Benefit Overview
Next Generation cargo ERP Enterprise Resource Prediction
Driving Digital Business with SAP Digital Business Services
Why it is good for startups to partner with Industry leaders
Paradigms 2.0: Supporting Collaborative Departmental Change
Exploiting the multidisciplinary environment in Oulu
Facilitation guide for Building Team EQ skills.
School of Information Management Nanjing University China
The Q Improvement Lab August 2017.
Thanks to all of you for attending
OMIS 665, Big Data Analytics
What are your Career Options?
 Deep Analytical Talent  Data Savvy Professionals  Technology and Data Enablers.
Exploiting the multidisciplinary environment in Oulu
The Curry School of Education October 3, 2018
Technical Capabilities
Emerging Information Technologies I
Reflections Scott Klemmer · 03 June 2008
The NASA/IPAC Teacher Archive Research Program
Learning loves company
Exploiting the multidisciplinary environment in Oulu
Digital Science Center
MAZARS’ CONSULTING PRACTICE Helping your Business Venture Further
Coventry University, UK
CS 239 – Big Data Systems Fall 2018
Thank you to our Sponsors
BCoN Data Integration Workshop, University of Kansas, Feb 13-14, 2018
Stanford AI in Medicine Imaging Center.
Presentation transcript:

Enabling ML Based Research Leslie Yeh Johnson, Google University Relations lesliey@google.com Thank you for inviting me to this meeting. I’m learning a lot and this is a wonderful opportunity for me to speak to and get feedback from the materials community.

Goal/Question How could a Google+NSF partnership enable more domain scientists to take advantage of Machine Learning based research approaches? Hard science data is exploding Linda says this is the direction that Materials Science is heading So is the technology to process it (Cloud capabilities, ML hardware optimization) -> Google is very invested in this space. Back in the spring I came here, not actually here, to talk about how Google could help Materials researchers take advantage ML/data based research approaches. We implemented a few ideas, and I want to go over those briefly and learnings. But largely, I’d like to have a discussion with this group about what could be done that really leverages a Google+NSF partnership. ML -> making sense of a lot of existing data to make predictions about new data. More data -> get smarter and make better predictions. Rather than computers processing rules, the program learns from examples.

I am not a Machine Learning researcher/expert. My team manages the interface between research at Google, and external researchers, but largely in the computer science fields. And what I’m hoping to have with you all is a conversation about how best to work with domain scientists, and to additionally leverage the NSF to do this in an effective and scalable way.

Possible Points of Leverage Education Data Implementation

Education Motivation Machine Learning ML Applicability

Data What to gather How much How to store/access More data is only useful with careful design

Implementation Experiment Design ML Implementation Tools

Broad Toolkit of Alphabet Technologies Machine learning Deep Learning Visual search Sequence prediction Logistic regression at scale Other systems implementing almost every ML algorithm Scale Cloud, DataFlow Borg for billions of core hours Imaging Tech from Geo, Search, Photos, Android Ongoing research Experiment design Parameter tuning via batched experimental design DOE modeling

So Far

Education @ NSF Meetings May 2017 Materials Workshop July 2017 Quantum Science Summer School Machine Learning Crash Course 15 hour course -> 3 hours + optional consulting

Google Accelerated Sciences Team Increase the rate of scientific discovery by leveraging Google’s extensive knowledge and technology Projects should have Science upside: large scientific/humanitarian benefit. World class partners: deep understanding and desire to collaborate. Unique leverage: of Google algorithmic expertise and large scale. Business upside: revolutionize an important industry. What success looks like Participate and ‘ship’ important breakthroughs Transform science via development and sharing of exemplary research methods Structure: 2-4 Googlers working with scientists to combine domain knowledge & specific Google expertise

NSF BIGDATA Critical Techniques, Technologies and Methodologies for Advancing Foundations and Applications of Big Data Sciences and Engineering SYNOPSIS The BIGDATA program seeks novel approaches in computer science, statistics, computational science, and mathematics, along with innovative applications in domain science, including social and behavioral sciences, education, biology, the physical sciences, and engineering that lead towards the further development of the interdisciplinary field of data science. Downside: have to be far along on the Education/data/Implementation curve to take advantage

Discussion What is most needed? What are the barriers? What can domain scientists take advantage of? What can uniquely be offered by NSF+Google?