Introduction Osborn.

Slides:



Advertisements
Similar presentations
Random Sampling and Data Description
Advertisements

Nemours Biomedical Research Statistics March 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
PSYC512: Research Methods PSYC512: Research Methods Lecture 8 Brian P. Dyre University of Idaho.
 Catalogue No: BS-338  Credit Hours: 3  Text Book: Advanced Engineering Mathematics by E.Kreyszig  Reference Books  Probability and Statistics by.
Visual Displays of Data and Basic Descriptive Statistics
● Midterm exam next Monday in class ● Bring your own blue books ● Closed book. One page cheat sheet and calculators allowed. ● Exam emphasizes understanding.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Hands-on Introduction to R. Outline R : A powerful Platform for Statistical Analysis Why bother learning R ? Data, data, data, I cannot make bricks without.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.
Chapter 21 Basic Statistics.
Hands-on Introduction to R. We live in oceans of data. Computers are essential to record and help analyse it. Competent scientists speak C/C++, Java,
Review of Chapters 1- 6 We review some important themes from the first 6 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
 Statistics The Baaaasics. “For most biologists, statistics is just a useful tool, like a microscope, and knowing the detailed mathematical basis of.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Determination of Sample Size: A Review of Statistical Theory
1 Results from Lab 0 Guessed values are biased towards the high side. Judgment sample means are biased toward the high side and are more variable.
Chapter 10: Introduction to Statistical Inference.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
The field of statistics deals with the collection,
Reasoning in Psychology Using Statistics Psychology
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Hands-on Introduction to R. We live in oceans of data. Computers are essential to record and help analyse it. Competent scientists speak C/C++, Java,
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Statistics Descriptive Statistics. Statistics Introduction Descriptive Statistics Collections, organizations, summary and presentation of data Inferential.
Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.
SESSION 1 & 2 Last Update 15 th February 2011 Introduction to Statistics.
Analyze Phase Inferential Statistics
Statistical Inference
Chapter 8: Estimating with Confidence
Statistics in Management
Data Analysis.
Chapter 8: Estimating with Confidence
Overview of probability and statistics
Psychology 202a Advanced Psychological Statistics
Statistical Data Analysis
Sampling Distributions
Central Tendency and Variability
Statistical Reasoning in Everyday Life
IET 603 Quality Assurance in Science & Technology
Statistics in Applied Science and Technology
Introduction to Inferential Statistics
Inferential statistics,
The Nature of Probability and Statistics
Reasoning in Psychology Using Statistics
An Introduction to Statistics
Discrete Event Simulation - 4
(or why should we learn this stuff?)
Chapter 8: Estimating with Confidence
Statistical Data Analysis
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Constructing and Interpreting Visual Displays of Data
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
CS639: Data Management for Data Science
Chapter 8: Estimating with Confidence
DESIGN OF EXPERIMENT (DOE)
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Sampling Distributions
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Statistics Review (It’s not so scary).
Presentation transcript:

Introduction Osborn

“Legal” Science Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is reliable Has empirical testing been done? Falsifiability Has the science been subject to peer review? Are there known error rates? Is there general acceptance? Frye Standard (1928) essentially Federal Government and 26(-ish) States are “Daubert States”

Measurement and Randomness Any time an observation is made, one is making a “measurement” Experimental error is inherent in every measurement Refers to variation in observations between repetitions of the same experiment. It is unavoidable and many sources contribute Error in a statistical context is a technical termBHH

Measurement and Randomness Experimental error is a form of randomness Randomness: inherent unpredictability in a process The the outcomes of the process follow a probability distribution Statistical tools are used to both: Describe the randomness Make inferences taking into account the randomness Careful!: Bad data, assumptions and models lead to garbage (GIGO)

Probability Frequency: ratio of the number of observations of interest (ni) to the total number of observations (N) Probability (frequentist): frequency of observation i in the limit of a very large number of observations We will almost always use this definition It is EMPIRICAL!

Frequency Roll a “fair” die 20 times (N = 20). What is the frequency of obtaining 2 (n2 = ?)? Let’s do this with simulation (Monte Carlo): In R: Result: n2 = 2 freq2 = 2/20 = 0.1

What is Statistics?? Study of relationships in data Descriptive Statistics – techniques to summarize data E.g. mean, median, mode, range, standard deviation, stem and leaf plots, histograms, box and whiskers plots, etc. Inferential Statistics – techniques to draw conclusions from a given data set taking into account inherent randomness E.g. confidence intervals, hypothesis testing, Bayes’ theorem, forecasting, etc.

Data Random variables - All measurements have an associated “randomness” component Randomness –patternless, unstructured, typical, total ignoranceChaitin, Claude Any experiment/observation recorded is a random variate

T.I.C. of Gasoline

Observations from the T.I.C. GC-MS instrument output for a gasoline :

Population and Sample Almost all of statistics is based on a sample drawn from a population. Population: The totality of observations that might occur as a result of repeatedly performing an experiment Why not measure the whole population? Usually impossible Likely wasteful Population should be relevant. Part logic Part guess Part philosophy….

Data and Sampling Sample Representations: Representative Sample Population Biased Samples Population Population Sample Sample

Parameters and Statistics Parameter: any function of the population Statistic: any function of a sample from the population Statistics are used to estimate population parameters Statistics can be biased or unbiased Sample average is an unbiased estimator for population mean We may construct distributions for statistics Populations have distributions for observations Samples have distributions for observations and statistics

What is ? R : A powerful Platform for Statistical Analysis Why bother learning R ? Basic Graphing Basic Data Summary and Analysis Tools Basic Statistical Inference Tools We will learn R and Rstudio Getting Help Basic input/output and calculating Visualizing with Graphing

Finding our way around R/RStudio Script Window Command Line

Handy Commands: x <- 4 x <- “text goes in quotes” Basic Input and Output Numeric input x <- 4 variables: store information :Assignment operator x <- “text goes in quotes” Text (character) input

Get help on an R command: Handy Commands: Get help on an R command: If you know the name: ?command name ?plot brings up html on plot command If you don’t know the name: Use Google (my favorite) ??key word

R is driven by functions: Handy Commands: R is driven by functions: func(arguement1, argument2) input to function goes in parenthesis function name function returns something; gets dumped into x x <- func(arg1, arg2)

Handy Commands: Matrices: X User defined functions syntax: X[,1] returns column 1 of matrix X X[3,] returns row 3 of matrix X Handy functions for data frames and matrices: dim, nrow, ncol, rbind, cbind User defined functions syntax: func.name <- function(arguements) { do something return(output) } To use it: func.name(values)

R commands not to forget for today <- (assignment or “gets”) ? (to get help with a command) : (range operator) c (“collect”) sample seq (generate a sequence) plot library install.packages (to install libraries you don’t have) For matrices and vectors: x[,3] vs. x[3,] vs. x[,] vs. x[3,3] vs. x[] vs. x[1:3] etc…