Think like an experimentalist 10/11/10. Melissa, you’re a modeler! And I do “systems biology”. So model this data for me!!

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

Design of Experiments Lecture I
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Lessons Learned from Measuring Cell Response by Quantitative Automated Microscopy FDA Workshop, Potency Measurements for Cellular and Gene Therapy Products,
Modelling and Identification of dynamical gene interactions Ronald Westra, Ralf Peeters Systems Theory Group Department of Mathematics Maastricht University.
Biological pathway and systems analysis An introduction.
Experimental Design, Response Surface Analysis, and Optimization
Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
Aspects of Conditional Simulation and estimation of hydraulic conductivity in coastal aquifers" Luit Jan Slooten.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy LSU ---- Geaux Tigers! April 2009.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
Maximum likelihood (ML) and likelihood ratio (LR) test
Geostatistical structural analysis of TransCom data for development of time-dependent inversion Erwan Gloaguen, Emanuel Gloor, Jorge Sarmiento and TransCom.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy ENAR March 2009.
An Overview of Today’s Class
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy Psychiatric Biostatistics Symposium May 2009.
1 Validation and Verification of Simulation Models.
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
Data mining and statistical learning, lecture 3 Outline  Ordinary least squares regression  Ridge regression.
The Calibration Process
Maximum likelihood (ML)
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
1 Linear Methods for Classification Lecture Notes for CMPUT 466/551 Nilanjan Ray.
Variation, Validity, & Variables Lesson 3. Research Methods & Statistics n Integral relationship l Must consider both during planning n Research Methods.
Chapter 1: Introduction to Statistics
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
 1  Outline  stages and topics in simulation  generation of random variates.
Chapter 1: Introduction to Statistics. 2 Statistics A set of methods and rules for organizing, summarizing, and interpreting information.
© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs.
ENM 503 Lesson 1 – Methods and Models The why’s, how’s, and what’s of mathematical modeling A model is a representation in mathematical terms of some real.
Chapter 21 R(x) Algorithm a) Anomaly Detection b) Matched Filter.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Experimentation in Computer Science (Part 1). Outline  Empirical Strategies  Measurement  Experiment Process.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Simulation is the process of studying the behavior of a real system by using a model that replicates the behavior of the system under different scenarios.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
WHAT IS THE NATURE OF SCIENCE?. SCIENTIFIC WORLD VIEW 1.The Universe Is Understandable. 2.The Universe Is a Vast Single System In Which the Basic Rules.
Solution of a Partial Differential Equations using the Method of Lines
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
QUANTITATIVE RESEARCH Presented by SANIA IQBAL M.Ed Course Instructor SIR RASOOL BUKSH RAISANI.
NCAF Manchester July 2000 Graham Hesketh Information Engineering Group Rolls-Royce Strategic Research Centre.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Module III Multivariate Analysis Techniques- Framework, Factor Analysis, Cluster Analysis and Conjoint Analysis Research Report.
Single-Subject and Correlational Research Bring Schraw et al.
9-1 ESTIMATION Session Factors Affecting Confidence Interval Estimates The factors that determine the width of a confidence interval are: 1.The.
+ EXPERIMENTAL INVESTIGATIONS An experimental investigation is one in which a control is identified. The variables are measured in an effort to gather.
1 Tom Edgar’s Contribution to Model Reduction as an introduction to Global Sensitivity Analysis Procedure Accounting for Effect of Available Experimental.
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Computacion Inteligente Least-Square Methods for System Identification.
Bayesian Optimization. Problem Formulation Goal  Discover the X that maximizes Y  Global optimization Active experimentation  We can choose which values.
Research Design
High-throughput genomic profiling of tumor-infiltrating leukocytes
WHAT IS THE NATURE OF SCIENCE?
Journal club Jun , Zhen.
What is the scientific method?
Scientific Method.
Utilizing Spectrophotometry in Life Science
Chapter Eight: Quantitative Methods
The Road to Success in AP Physics
I. Statistical Tests: Why do we use them? What do they involve?
BA 275 Quantitative Business Methods
What is Science?.
Chapter 1 – Investigating scientifically
Design Of Experiment Eng. Ibrahim Kuhail.
Modeling and Simulation: Exploring Dynamic System Behaviour
Observation Information we get from our senses alone.
Presentation transcript:

Think like an experimentalist 10/11/10

Melissa, you’re a modeler! And I do “systems biology”. So model this data for me!!

Useful dataUnuseful data No indication of treatment times given in figure legend Multiple time points, multiple variables, independent loading control, Pixel density quantification

Aspects to consider for experimental design that complements modeling (from Kreutz & Timmer) In comparison to classical biochemical studies, establishment of mechanistic mathematical models requires a relative large amount of data Measurements obtained by experimental repetitions have to be comparable on a quantitative not only on a qualitative level A measure of confidence is required for each data point The number of measured conditions should clearly exceed the number of all unknown model parameters Validation of dynamic models requires measurements of the time dependency after external perturbations Perturbations of a single player (e.g. by knockout, over-expression and similar techniques) provide valuable information for the establishment of a mechanistic model

Aspects to consider for experimental design that complements modeling (cont.) The biochemical mechanisms between the observables should be reasonably known The predictive power of mathematical models increases with the level of available knowledge. It could therefore be preferable to concentrate experimental efforts on well understood subsystems If the modeled proteins could not be observed directly, measurements of other proteins that interact with the players of interest, can be informative. The amount of information from such additional observables depends on the required enlargement of the model The velocity of the underlying dynamics indicates meaningful sampling intervals  t. The measurements should seem relatively smooth. If the considered hypothesis are characterized by a different dynamics, this difference determines proper sampling times

Aspects to consider for experimental design that complements modeling (cont.) Steady-state concentrations provide useful information The number of molecules per cell or the total concentration is a very useful information. The order of magnitude of the number of molecules (i.e. tens or thousands) per cellular compartment has to be known Thresholds for a qualitative change of the system behavior, i.e. the switching conditions, are insightful information Calibration measurements with known protein concentrations are advantageous because the number of scaling parameters is reduced The specificity of the experimental technique is crucial for quantitative interpretation of the measurements For the applied measurement techniques, the relationship between the output (e.g. intensities) and the underlying truth (e.g. concentrations) has to be known. Usually, a linear dependency is preferable

Optimal Sampling Experiments cost time and $$ Often, a minimal set of measurements are made for time-dependent data Limited by reagents Space on plate # of cells/quantity of tissue Replicates often limited to n=3

Modeling was performed based upon existing data. If experiments were to be redone, would there be a better set of time points to select??

This is an optimization problem Several approaches, such as: – Compute the determinant of the Fisher information matrix (inverse of the covariance matrix of the estimated parameters) – Use an initial guess of [T 0, T 1 ] N and initial guess of parameter values – Re-estimate parameter values – Re-calculate det(F) – Iterate to maximize F for new sampling time vector Kutalik et al, Biosystems, 2004

x x x x x x

x x x x x x x x x

Question: is any experimentalist going to go through this exercise prior to collecting data? Reality: “heuristic” approach usually taken

Features: Measurements of the time dependency after external perturbation Perturbations of a single player to provide information for the establishment of a mechanistic model Measurements obtained by experimental repetitions have to be comparable on a quantitative not only on a qualitative level A measure of confidence is required for each data point For the applied measurement technique, the relationship between the intensity and the underlying truth has to be known. Usually, a linear dependency is preferable Note how sampling interval changes over timecourse

What data would be useful for determining unknown parameters? What are the units here?

P = pro-inflammatory cytokines Salva et al, Ped Pulmonol, 1996 McAllister et al, J Immunol, 2005.