Machine Science Distilling Free-Form Natural Laws from Experimental Data Hod Lipson, Cornell University.

Slides:



Advertisements
Similar presentations
 CHEM.B Apply the mole concept to representative particles (e.g., counting, determining mass of atoms, ions, molecules, and/or formula units). 
Advertisements

CSE 301 History of Computing Analog Computing. Analog Computers Instead of computing with numbers, one builds a physical model (an analog) of the system.
Biologically Inspired AI (mostly GAs). Some Examples of Biologically Inspired Computation Neural networks Evolutionary computation (e.g., genetic algorithms)
High Throughput Computing and Protein Structure Stephen E. Hamby.
DERIVATIVES Linear Approximations and Differentials In this section, we will learn about: Linear approximations and differentials and their applications.
Digression: Symbolic Regression Suppose you are a criminologist, and you have some data about recidivism. Suppose you are a criminologist, and you have.
As you come in, please: Get out a sheet of paper and put your name on it. Write a definition, from your own memory, for these terms: hypothesis, scientific.
LINEAR REGRESSION: On to Predictions! Or: “How to amaze your friends and baffle your enemies”
The Nature of Science and Scientific Inquiry
The simple pendulum Energy approach q T m mg PH421:Oscillations F09
5.5 Solving Trigonometric Equations Example 1 A) Is a solution to ? B) Is a solution to cos x = sin 2x ?
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Feedback Control Systems (FCS)
Some Techniques in Deterministic Modeling for Mathematical Biology By:Ryan Borek, Dallas Hamann, Heather Parsons, Carrie Ruda, Carissa Staples, Erik Wolf.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Genetic Programming.
Resilient Machines Through Continuous Self-Modeling Pattern Recognition Seung-Hyun Lee Soft Computing Lab. Josh Bongard,Victor Zykov, and Hod.
Initial Value Problem: Find y=f(x) if y’ = f(x,y) at y(a)=b (1) Closed-form solution: explicit formula -y’ = y at y(0)=1 (Separable) Ans: y = e^x (2)
DERIVATIVES 3. We have seen that a curve lies very close to its tangent line near the point of tangency. DERIVATIVES.
Copyright © Cengage Learning. All rights reserved. 2 Derivatives.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Large-amplitude oscillations in a Townsend discharge in low- current limit Vladimir Khudik, Alex Shvydky (Plasma Dynamics Corp., MI) Abstract We have developed.
Improved Gene Expression Programming to Solve the Inverse Problem for Ordinary Differential Equations Kangshun Li Professor, Ph.D Professor, Ph.D College.
Variants of the 1D Wave Equation Jason Batchelder 6/28/07.
Printing Functional Systems
Aim: Differentials Course: Calculus Do Now: Aim: Differential? Isn’t that part of a car’s drive transmission? Find the equation of the tangent line for.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.
Chapter 1 The Science of Biology. (What is science?) The Nature of Science.
Look-ahead Linear Regression Trees (LLRT)
Linear Regression Handbook Chapter. Experimental Testing Data are collected, in scientific experiments, to test the relationship between various measurable.
The Evolutionary Modeling and Short-range Climatic Prediction for Meteorological Element Time Series K. Q. Yu, Y. H. Zhou, J. A. Yang Institute of Heavy.
Solving Iterated Functions Using Genetic Programming Michael Schmidt Hod Lipson 2010 HUMIES Competition f(f(x))
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Josh Bongard † & Hod Lipson Computational Synthesis Laboratory Cornell University † Current Address: Department of Computer Science.
G ENETIC P ROGRAMMING Ranga Rodrigo March 17,
Numerical Methods for Solving ODEs Euler Method. Ordinary Differential Equations  A differential equation is an equation in which includes derivatives.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Do Now : Evaluate when x = 6, y = -2 and z = The Quadratic Formula and the Discriminant Objectives Students will be able to: 1)Solve quadratic.
The Double Pendulum by Franziska von Herrath & Scott Mandell.
Ch 1.3: Classification of Differential Equations
Oscillatory motion (chapter twelve)
Statistics in WR: Lecture 1 Key Themes – Knowledge discovery in hydrology – Introduction to probability and statistics – Definition of random variables.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Differential Equations Linear Equations with Variable Coefficients.
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
The Hydrogen Atom The only atom that can be solved exactly.
Modeling & Simulation of Dynamic Systems (MSDS)
ESTIMATION METHODS We know how to calculate confidence intervals for estimates of  and  2 Now, we need procedures to calculate  and  2, themselves.
Brendan Jones Christian Fadul. The equation of motion for a linear oscillator does not have a single analytical solution. As the components of the system.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
PRIOR READING: Main 1.1, 2.1 Taylor 5.1, 5.2 SIMPLE HARMONIC MOTION: NEWTON’S LAW
Indefinite Integrals -1.  Primitive or Antiderivative  Indefinite Integral  Standard Elementary Integrals  Fundamental Rules of Integration  Methods.
Modelling & Simulation of Semiconductor Devices Lecture 1 & 2 Introduction to Modelling & Simulation.
Introduction to Differential Equations
Selected Topics in CI I Genetic Programming Dr. Widodo Budiharto 2014.
Automated Reverse Engineering of Nonlinear Dynamical Systems
12.1 – What is Stoichiometry?
S519: Evaluation of Information Systems
PHYSICS Introduction.
Devil physics The baddest class on campus IB Physics
Simple Harmonic Motion
The Science of Biology.
Copyright © Cengage Learning. All rights reserved.
Copyright © 2014 John Wiley & Sons, Inc. All rights reserved.
7.1 Draw Scatter Plots and Best Fitting Lines
Residuals (resids).
Regression and Correlation of Data
Bootstrapping and Bootstrapping Regression Models
Presentation transcript:

Machine Science Distilling Free-Form Natural Laws from Experimental Data Hod Lipson, Cornell University

Lipson & Pollack, Nature 406, 2000

Camera View

Adapting in simulation Simulator Evolve Controller In Simulation Try it in reality! Download Crossing The Reality Gap

Adapting in reality Evolve Controller In Reality Try it ! Too many Physical Trials

Simulation & Reality Evolve Controller “Simulator” Try it in reality! Build Collect Sensor DataEvolve Simulator Evolve Simulators Evolve Robots

Servo Actuators Tilt Sensors

Emergent Self-Model With Josh Bongard and Victor Zykov, Science 2006

Damage Recovery With Josh Bongard and Victor Zykov, Science 2006

Random Predicted Physical

System Identification ?

Perturbations

Photo: Floris van Breugel

Structural Damage Diagnosis With Wilkins Aquino

. com With Jeff Clune, Jason Yosinski

Symbolic Regression f(x)=e x sin(|x|) What function describes this data? John Koza, 1992

Encoding Equations * sin – x1x1 3 f(x)f(x) + x2x2 -7 Building Blocks: + - * / sin cos exp log … etc sin(x 2 ) x 1 *sin(x 2 ) (x 1 – 3)*sin(x 2 ) (x 1 – 3)*sin(-7 + x 2 ) John Koza, 1992

Michael D. Schmidt, Hod Lipson (2006) Models: Expression trees Subject to mutation and selection Experiments: Data-points Subject to mutation and selection {const,+,-,*,/,sin,cos,exp,log,abs}

Solution Accuracy Entire Dataset Coevolved Dataset

Solution Complexity Entire Dataset Coevolved Dataset

Semi-empirical mass formula Modeling the binding energy of an atomic nucleus R 2 = Weizsäcker’s Formula: R 2 = Inferred Formula:

Systems of Differential Equations Regress on derivative timex 1 x 2 … … … … … ………… State Variables dx 1 /dtx 2 /dt … … … … … ……… Derivatives

Original Equations Inferred Equations Inferring Biological Networks With Michael Schmidt, John Wikswo (Vanderbilt), Jerry Jenkins (CFDRC)

Charles Richter

Ingmar Zanger, John Amend

Wet Data, Unknown System With Michael Schmidt (Cornell) and Gurol Suel (UT Southwestern)

Wet Data, Unknown System With Michael Schmidt (Cornell) and Gurol Suel (UT Southwestern)

Cell #1 Cell #2 Cell #3-60 …

Blue Dots = data points, Green Line = regressed fit = =

Biologist’s Inferred Model: Gurol Suel, et. al., Science 2007 Symbolic Regression Inferred Time-Delay Model:

Withheld Test Set #1 Fit

Withheld Test Set #2 Fit

Withheld Test Set #3 Fit

Looking For Invariants

Data Mining

42 42+x-x 42+1/(1000+x 2 ) ?

From Data: x y… ……… δxδx δyδy δyδy δxδx,, … Calculate partial derivatives Numerically: δfδf δxδx δfδf δyδy From Equation: δx’δx’ δy’δy’ δy’δy’ δx’δx’,, … Calculate predicted partial derivatives Symbolically:

x y x 3 + x – y 2 – 1.5 = 0x 2 + y 2 – 16 = 0x 2 + y 2 + z 2 – 1 = 0 x y x z Homework CircleElliptic CurveSphere

Linear Oscillator Coefficients may have different scales and offsets each run

Pendulum H L

Double Linear Oscillator would be plus for Lagrangian

xyz z=f(x,y) ? dy/dt=f(x,y) ? f(x,y,z)=0 ?

Run…

xyz z=f(x,y) ? dy/dt=f(x,y) ? f(x,y,z)=0 ? I feel lucky

Von Thomas Hermanowski, Dr. Andreas Rick, Dr. Jochen Weber

Particle Accelerators angle Distortion Correction (GE X-Ray Detectors) 5x Resolution Improvement Jay Schuren

Concluding Remarks Correlation is enough. Faced with massive data, [the Scientific Method] is becoming obsolete. We can stop looking for models. “ ” Chris Anderson Wired The data deluge accelerates our ability to hypothesize, model, and test.

Theoretical physicists are not yet obsolete, but scientists have taken steps toward replacing themselves

I am worried that we have enjoyed a brief window in human history where we could actually understand things, but that period may be coming to an end. -- Steve Strogatz The end of insight