Statistics: Examples and Exercises 20.109 Fall 2010 Module 1 Day 7.

Slides:



Advertisements
Similar presentations
Topics Today: Case I: t-test single mean: Does a particular sample belong to a hypothesized population? Thursday: Case II: t-test independent means: Are.
Advertisements

Sampling: Final and Initial Sample Size Determination
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
T-tests Computing a t-test  the t statistic  the t distribution Measures of Effect Size  Confidence Intervals  Cohen’s d.
Single-Sample t-Test What is the Purpose of a Single-Sample t- Test? How is it Different from a z-Test?What Are the Assumptions?
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Chap 11-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 11 Hypothesis Testing II Statistics for Business and Economics.
Parameter Estimation Chapter 8 Homework: 1-7, 9, 10.
Today Today: Chapter 10 Sections from Chapter 10: Recommended Questions: 10.1, 10.2, 10-8, 10-10, 10.17,
BHS Methods in Behavioral Sciences I
1/45 Chapter 11 Hypothesis Testing II EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008.
A Decision-Making Approach
Statistics 101 Class 9. Overview Last class Last class Our FAVORATE 3 distributions Our FAVORATE 3 distributions The one sample Z-test The one sample.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
8-4 Testing a Claim About a Mean
Chapter 11: Inference for Distributions
Today Concepts underlying inferential statistics
PSY 307 – Statistics for the Behavioral Sciences
Statistics Or Do our Data mean Diddly?. Why are stat important Sometimes two data sets look different, but aren’t Other times, two data sets don’t look.
PSY 307 – Statistics for the Behavioral Sciences
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 8 Introduction to the t Test.
Chapter 5 Sampling and Statistics Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Analysis & Interpretation: Individual Variables Independently Chapter 12.
Education 793 Class Notes T-tests 29 October 2003.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
© 2002 Thomson / South-Western Slide 8-1 Chapter 8 Estimation with Single Samples.
Population: a data set representing the entire entity of interest - What is a population? Sample: a data set representing a portion of a population Population.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
Mid-Term Review Final Review Statistical for Business (1)(2)
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
CHAPTER 11 DAY 1. Assumptions for Inference About a Mean  Our data are a simple random sample (SRS) of size n from the population.  Observations from.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
CHEMISTRY ANALYTICAL CHEMISTRY Fall Lecture 6.
Chapter 4 Statistics Tools to accept or reject conclusion from experimental measurements Deal with random error only.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Descriptive Statistics Used to describe a data set –Mean, minimum, maximum Usually include information on data variability (error) –Standard deviation.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Lecture Slides Elementary Statistics Twelfth Edition
1 Probability and Statistics Confidence Intervals.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
T tests comparing two means t tests comparing two means.
Essential Statistics Chapter 171 Two-Sample Problems.
Testing Differences in Means (t-tests) Dr. Richard Jackson © Mercer University 2005 All Rights Reserved.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Part 1: Chapters 7 to 9. 95% within 2 standard deviations 68% within 1 standard deviations 99.7% within 3 standard deviations.
Chapter 4. Inference about Process Quality
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Math 4030 – 10a Tests for Population Mean(s)
Chapter 11 Hypothesis Testing II
Chapter 4 Statistics.
Sampling Distributions
Basic Practice of Statistics - 3rd Edition Two-Sample Problems
Essential Statistics Two-Sample Problems - Two-sample t procedures -
Statistical Inference for the Mean: t-test
Inferences from Matched Pairs
Presentation transcript:

Statistics: Examples and Exercises Fall 2010 Module 1 Day 7

Your Data and Statistics "Figures often beguile me," he wrote, "particularly when I have the arranging of them myself; in which case the remark attributed to Disraeli would often apply with justice and force: 'There are three kinds of lies: lies, damned lies, and statistics.'” Quote from Mark Twain, Chapters from My Autobiography, 1906

Why are stats important Sometimes two data sets look different, but aren’t Other times, two data sets don’t look that different, but are.

Why are stats important Informed experimental design is very powerful Save time, money, experimental subjects, patients, lab animals …….

Normal Distribution The data are centered around the mean The data are distributed symmetrically around the mean

Mean μ vs The entire population mean is μ Sample population mean is As your sample population gets larger, Data Set – 2, 3, 4, 4, 5, 5, 6, 6, 7, 7, 8, 9 Mean

Standard Deviation Describes how data are expected to vary from the mean σ is s.d. of population s is s.d. of sample μ = 50 σ = 20

Meaning of Standard Deviation Red, Green, Blue all same mean Different standard deviation

Meaning of Standard Deviation Data with a larger spread (blue and green) have a larger Standard Deviation

Standard Deviation 68% of values are within 1 standard deviation 95% of values are within 2 standard deviations of the mean

Statistical Significance How do we know that two data sets are truly different

Recap: Probability density function p(x) a x p(x) Normalized x is a random number Probability that is ab

95% confidence interval of an estimate A range such that 95% of replicate estimates would be within it 95% of area p(x)

95% Confidence interval for a normally distributed variable # data points t Note: Uncertainty decreases proportionally to So take more data! Increasingly accurate estimate of

Example 3 measurements of absorbance at 600 nm: 0.110, 0.115, % confidence limit? Soln:

Confidence Intervals Use t to find interval containing μ if is known Example: t 95 = < μ <8.6 I am 95% confident that the population mean lies between 6.4 and 8.6 HawksCyclones X 1 7.5X s 1 1.0s 2 1.4

T-tests Compare confidence intervals to see if data sets are significantly different Assumptions – Data are normally distributed – The mean is independent of the standard deviation μ ≠ f(σ) Various types – One sample t-test Are these data different than the entire population? – Two sample t-test Do these two data sets come from different populations? – Paired t-test Do individual changes show an overall change?

Use t-test to compare means We have and – Do they come from different populations? Are and different? Null Hypothesis H o : – = Alternative Hypothesis H a : – > t statistic tests H o. If t < 0.05, then reject H o and accept H a

T-test Illustration Two populations that are significantly different, with X 2 larger than X 1

T-test Illustration Two populations that are not significantly different, but X 2 is still larger than X 1

Exercise: Find 99% Confidence MITHarvard X X s s t calc = 1.79 t 99 = ? t calc ? T 99 Go to table in notes to find t 99 with 11 degrees of freedom t=?

Today and Thursday’s Experiments Transfections today Measure fluorescence via Bioanalyzer on Thursday

Thursday’s Experiments: Bioanalyzer

Bioanalyzer Output

Targeted cells showed green fluorescence via flow cytometry at expected frequency. Jonnalagadda, et al DNA Repair. (4) FACS Data

FACS vs. Bioanalyzer Ultimate readout will be fluorescence intensity in red and green channels for each cell FACS measures thousands of events, while the Bioanalyzer measures hundreds What can this mean for your statistics???

Example Bioanalyzer Data Live cells will be labeled red, HR cells will also be green Positive Control

Example Bioanalyzer Data Live cells will be labeled red, HR cells will also be green Possible Experimental Sample Output

Excel Example: Day 8 Results Fluorescence Intensity Cell 33  3 +  : EGFP

Conclusion Due to the nature of the data – Look at gating for individual cell data – Consider a Gaussian distribution for significance when comparing across conditions and groups Think about how much data you have within each population and use different distributions to think about certainty in your data

Extra Slides

Application HawksCyclones X 1 7.5X s 1 1.0s t calc = 4.6 t 95 = 2.2 t calc > t 95 Go to table in notes to find t 95 with 11 degrees of freedom (12-1) (The excel sheet does a different comparison)

Figure 2