Biostatistics for Dummies Biomedical Computing Cross-Training Seminar October 18 th, 2002.

Slides:



Advertisements
Similar presentations
STATISTICS Joint and Conditional Distributions
Advertisements

Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Autocorrelation Functions and ARIMA Modelling
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Forecasting Using the Simple Linear Regression Model and Correlation
Probability & Statistical Inference Lecture 9
Time Series Building 1. Model Identification
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Objectives (BPS chapter 24)
STAT 497 APPLIED TIME SERIES ANALYSIS
Building and Testing a Theory Steps Decide on what it is you want to explain or predict. 2. Identify the variables that you believe are important.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Chapter Topics Types of Regression Models
Modern methods The classical approach: MethodProsCons Time series regression Easy to implement Fairly easy to interpret Covariates may be added (normalization)
Simple Linear Regression Analysis
Ordinary Kriging Process in ArcGIS
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Simple Linear Regression Analysis
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Variance and covariance Sums of squares General linear models.
Regression and Correlation Methods Judy Zhong Ph.D.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Simple Linear Regression
TIME SERIES by H.V.S. DE SILVA DEPARTMENT OF MATHEMATICS
STAT 497 LECTURE NOTES 2.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Linear Stationary Processes. ARMA models. This lecture introduces the basic linear models for stationary processes. Considering only stationary processes.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Geographic Information Science
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Biomedical Research. What is Biomedical Research Biomedical research is the area of science devoted to the study of the processes of life; prevention.
It’s About Time Mark Otto U. S. Fish and Wildlife Service.
Week 11 Introduction A time series is an ordered sequence of observations. The ordering of the observations is usually through time, but may also be taken.
K. Ensor, STAT Spring 2005 Model selection/diagnostics Akaike’s Information Criterion (AIC) –A measure of fit plus a penalty term for the number.
MARKETING RESEARCH CHAPTER 18 :Correlation and Regression.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
Week 21 Stochastic Process - Introduction Stochastic processes are processes that proceed randomly in time. Rather than consider fixed random variables.
ECE-7000: Nonlinear Dynamical Systems Overfitting and model costs Overfitting  The more free parameters a model has, the better it can be adapted.
Estimation Method of Moments (MM) Methods of Moment estimation is a general method where equations for estimating parameters are found by equating population.
STATISTICS Joint and Conditional Distributions Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Lab for Remote Sensing Hydrology and Spatial Modeling Dept of Bioenvironmental Systems Engineering National Taiwan University 1/45 GEOSTATISTICS INTRODUCTION.
The Box-Jenkins (ARIMA) Methodology
ECE-7000: Nonlinear Dynamical Systems 2. Linear tools and general considerations 2.1 Stationarity and sampling - In principle, the more a scientific measurement.
There is a hypothesis about dependent and independent variables The relation is supposed to be linear We have a hypothesis about the distribution of errors.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Spatial Point Processes Eric Feigelson Institut d’Astrophysique April 2014.
1 Guess the Covered Word Goal 1 EOC Review 2 Scientific Method A process that guides the search for answers to a question.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
EC 827 Module 2 Forecasting a Single Variable from its own History.
CPH Dr. Charnigo Chap. 11 Notes Figure 11.2 provides a diagram which shows, at a glance, what a neural network does. Inputs X 1, X 2,.., X P are.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Inference for Least Squares Lines
Linear Regression.
STATISTICS Joint and Conditional Distributions
Ch3: Model Building through Regression
Biomedical Research.
Regression Analysis Week 4.
Stochastic Hydrology Random Field Simulation
STOCHASTIC HYDROLOGY Random Processes
数据的矩阵描述.
CORRELATION AND MULTIPLE REGRESSION ANALYSIS
CH2 Time series.
Presentation transcript:

Biostatistics for Dummies Biomedical Computing Cross-Training Seminar October 18 th, 2002

What is “Biostatistics”? Techniques Mathematics Statistics Computing Data Medicine Biology

What is “Biostatistics”? Biological data Knowledge of biological process

Common Applications (Medical and otherwise) Clinical medicine Epidemiologic studies Biological laboratory research Biological field research Genetics Environmental health Health services Ecology Fisheries Wildlife biology Agriculture Forestry

Biostatisticians Work Develop study design Conduct analysis Oversee and regulate Determine policy Training researchers Development of new methods

Some Statistics on Biostatistics Internet search (Google) > 210,000 hits > 50 Graduate Programs in U.S. Too much to cover in one hour!

Center Focus MSU strengths Computational simulation in physical sciences Environmental health sciences Bioinformatics is crowded Computational simulation in environmental health sciences Build on appreciable MSU strength Establish ourselves Unique capability Particular appeal to NIEHS

Focus of Seminar Statistical methodologies Computational simulation in environmental health sciences Can be classified as “biostatistics” Stochastic modeling Time series Spatial statistics*

The Application Of interest Cancer incidence rate Pesticide exposure Of concern Age Gender Race Socioeconomic status Objectives Suitably adjust cancer incidence rate Determine if relationship exists Develop model Explain relationship Estimate cancer rate Predict cancer rate

The Data N.S.S. & U.S. Dept. of Commerce National T.I.S. ( , by county) Number of acres harvested Type of crop MS State Dept. Health Central Cancer Registry (1996 – 1998, by person) Tumor type Age Gender Race County of residence Cancer morbidity Crude incidence/100,000 Age adjusted incidence/100,000

Why (Bio)statistics? Statistics Science of uncertainty Model order from disorder Disorder exists Large scale rational explanation Smaller scale residual uncertainty Chaos Deterministic equation Randomness x0x0 Entropy

(Bio)statistical Data Independent identically distributed Inhomogeneous data Dependent data Time series Spatial statistics

Time Series Identically distributed Time dependent Equally spaced Randomness

Objectives in Time Series Graphical description Time plots Correlation plots Spectral plots Modeling Inference Prediction

Time Series Models Linear Models Covariance stationary Constant mean Constant variance Covariance function of distance in time  (t) ~ i.i.d Zero mean Finite variance  square summable

Nonlinear Time Series Amplitude-frequency dependence Jump phenomenon Harmonics Synchronization Limit cycles Biomedical applications Respiration Lupus-erythematosis Urinary introgen excretion Neural science Human pupillary system

Some Nonlinear Models Nonlinear AR Additive noise Threshold AR Smoothed TAR Markov chain driven Fractals Amplitude- dependent exponential AR Bilinear AR with conditional heteroscedasticity Functional coefficient AR

A Threshold Model

Describing Correlation Autocorrelation AR: exponential decay MA: 0 past q Partial autocorrelation AR: 0 past p MA: exponential decay Cross-correlation Relationship to spectral density

Spatial Statistics* Data components Spatial locations S = {s 1,s 2,…,s n } Observable variable {Z(s 1 ),Z(s 2 ),…,Z(s n )} s  D  R k Correlation Data structures Geostatistical Lattice Point patterns or marked spatial point processes Objects Assumptions on Z and D

Biological Applications Geostatistics Soil science Public health Lattice Remote sensing Medical imaging Point patterns Tumor growth rate In vitro cell growth

Spatial Temporal Models Combine time series with spatial data Application Time element time Pesticide exposure develop cancer Spatial element Proximity to pesticide use