Introduction to Basic Statistical Methodology

Slides:



Advertisements
Similar presentations
Introduction to Basic Statistical Methodology. CHAPTER 1 ~ Introduction ~
Advertisements

Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
9.2a Tests about a Population Proportion Target Goal: I can check the conditions for carrying out a test about a population proportion. I can perform a.
Statistical Issues in Research Planning and Evaluation
Drawing Samples in “Observational Studies” Sample vs. the Population How to Draw a Random Sample What Determines the “Margin of Error” of a Poll?
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) What is the uncertainty.
Chapter 7 Sampling and Sampling Distributions
Correlation 2 Computations, and the best fitting line.
Click on image for full.pdf article Links in article to access datasets.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Discrete Probability Distributions
Part III: Inference Topic 6 Sampling and Sampling Distributions
Chapter 9 Hypothesis Testing.
Presentation 12 Chi-Square test.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Tue, Oct 23, 2007.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Evidence Based Medicine
Binomial distribution Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Chapter 20 Testing hypotheses about proportions
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
Chapter-8 Chi-square test. Ⅰ The mathematical properties of chi-square distribution  Types of chi-square tests  Chi-square test  Chi-square distribution.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Chapter 1 Overview and Descriptive Statistics 1111.1 - Populations, Samples and Processes 1111.2 - Pictorial and Tabular Methods in Descriptive.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests Business Statistics: A First Course Fifth Edition.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter Outline Goodness of Fit test Test of Independence.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Hypothesis Testing Notes. Hypothesis Testing  Is used to determine whether the difference in two groups is likely to be caused by chance  If you flip.
Health and Disease in Populations 2002 Sources of variation (1) Paul Burton! Jane Hutton.
Essential Questions How do we use simulations and hypothesis testing to compare treatments from a randomized experiment?
CHAPTER Basic Definitions and Properties  P opulation Characteristics = “Parameters”  S ample Characteristics = “Statistics”  R andom Variables.
STATISTICAL INFERENCES
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Results: How to interpret and report statistical findings Today’s agenda: 1)A bit about statistical inference, as it is commonly described in scientific.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Agenda n Probability n Sampling error n Hypothesis Testing n Significance level.
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Statistics for Managers Using Microsoft® Excel 5th Edition
Presentation 12 Chi-Square test.
Introduction to Statistics for Engineers
Statistics Hypothesis Test PHANTOMS
STAT 311 Chapter 1 - Overview and Descriptive Statistics
One-Sample Inference for Proportions
Chapter 11 Chi-Square Tests.
FINAL EXAMINATION STUDY MATERIAL III
CHAPTER 6 Statistical Inference & Hypothesis Testing
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Lecture 13 Sections 5.4 – 5.6 Objectives:
What does it mean to say that the results of an experiment are (or are not) statistically significant? The significance level,  (conventionally set to.
Week 11 Chapter 17. Testing Hypotheses about Proportions
Two-sided p-values (1.4) and Theory-based approaches (1.5)
Making Data-Based Decisions
Chapter 11: Introduction to Hypothesis Testing Lecture 5b
Essential Statistics Introduction to Inference
Chapter 10 Analyzing the Association Between Categorical Variables
Statistical Analysis Chi-Square.
Chapter 11 Chi-Square Tests.
P-VALUE.
Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population.
STATISTICS IN A NUTSHELL
Testing Hypotheses about a Population Proportion
CHAPTER 10 Comparing Two Populations or Groups
Chapter 1 Overview and Descriptive Statistics
Basic Practice of Statistics - 5th Edition Introduction to Inference
Presentation transcript:

Introduction to Basic Statistical Methodology

CHAPTER 1 ~ Introduction ~

What is “random variation” in the distribution of a population? Examples: Toasting time, Temperature settings, etc.… POPULATION 1: Little to no variation (e.g., product manufacturing) In engineering situations such as this, we try to maintain “quality control”… i.e., “tight tolerance levels,” high precision, low variability. O O O O O But what about a population of, say, people?

What is “random variation” in the distribution of a population? Examples: Body Temperature (F) POPULATION 1: Little to no variation (e.g., clones) Density Most individual values ≈ population mean value NOT REALISTIC!!! Very little variation about the mean! 98.6 F

What is “random variation” in the distribution of a population? Examples: Gender, Race, Age, Height, Drug Response (e.g., cholesterol level),… Examples: Body Temperature (F) POPULATION 2: Much variation (more realistic) POPULATION 2: Much variation (more realistic) Density Much more variation about the mean!

What are “statistics,” and how can they be applied to real issues? Example: Suppose a certain company insists that it complies with “gender equality” regulations among its employee population, i.e., approx. 50% male and 50% female. GLOBAL OPERATION DYNAMICS, INC. To test this claim, let us select a random sample of n = 100 employees, and count X = the number of males. (If the claim is true, then we expect X  50.)     X = 64 males (+ 36 females)      etc.      Questions: If the claim is true, how likely is this experimental result? (“p-value”) Could the difference (14 males) be due to random chance variation, or is it statistically significant?

The experiment in this problem can be modeled by a random sequence of n = 100 independent coin tosses (Heads = Male, Tails = Female). It can be mathematically proved that, if the coin is “fair” (“unbiased”), then in 100 tosses: probability of obtaining at least 0 Heads away from 50 is = 1.0000 “certainty” probability of obtaining at least 1 Head away from 50 is = 0.9204 probability of obtaining at least 2 Heads away from 50 is = 0.7644 probability of obtaining at least 3 Heads away from 50 is = 0.6173 probability of obtaining at least 4 Heads away from 50 is = 0.4841 probability of obtaining at least 5 Heads away from 50 is = 0.3682 probability of obtaining at least 6 Heads away from 50 is = 0.2713 probability of obtaining at least 7 Heads away from 50 is = 0.1933 probability of obtaining at least 8 Heads away from 50 is = 0.1332 probability of obtaining at least 9 Heads away from 50 is = 0.0886 probability of obtaining at least 10 Heads away from 50 is = 0.0569 probability of obtaining at least 11 Heads away from 50 is = 0.0352 probability of obtaining at least 12 Heads away from 50 is = 0.0210 probability of obtaining at least 13 Heads away from 50 is = 0.0120 probability of obtaining at least 14 Heads away from 50 is = 0.0066 etc.  0 ...... …..from 0 to 100 Heads….. The  = .05 cutoff is called the significance level. 0.0066 is called the p-value of the sample. Because our p-value (.0066) is less than the significance level (.05), our data suggest that the coin is indeed biased, in favor of Heads. Likewise, our evidence suggests that employee gender in this company is biased, in favor of Males.

What are “statistics,” and how can they be applied to real issues? Example: Suppose a certain company insists that it complies with “gender equality” regulations among its employee population, i.e., approx. 50% male and 50% female. GLOBAL OPERATION DYNAMICS, INC. HYPOTHESIS EXPERIMENT To test this claim, let us select a random sample of n = 100 employees, and count X = the number of males. (If the claim is true, then we expect X  50.)   OBSERVATIONS     X = 64 males (+ 36 females)    etc.       Questions: If the claim is true, how likely is this experimental result? (“p-value”) Could the difference (14 males) be due to random chance variation, or is it statistically significant?

PROBABILITY THEORY ...... ANALYSIS CONCLUSION The experiment in this problem can be modeled by a random sequence of n = 100 independent coin tosses (Heads = Male, Tails = Female). It can be mathematically proved that, if the coin is “fair” (“unbiased”), then in 100 tosses: probability of obtaining at least 0 Heads away from 50 is = 1.0000 “certainty” probability of obtaining at least 1 Head away from 50 is = 0.9204 probability of obtaining at least 2 Heads away from 50 is = 0.7644 probability of obtaining at least 3 Heads away from 50 is = 0.6173 probability of obtaining at least 4 Heads away from 50 is = 0.4841 probability of obtaining at least 5 Heads away from 50 is = 0.3682 probability of obtaining at least 6 Heads away from 50 is = 0.2713 probability of obtaining at least 7 Heads away from 50 is = 0.1933 probability of obtaining at least 8 Heads away from 50 is = 0.1332 probability of obtaining at least 9 Heads away from 50 is = 0.0886 probability of obtaining at least 10 Heads away from 50 is = 0.0569 probability of obtaining at least 11 Heads away from 50 is = 0.0352 probability of obtaining at least 12 Heads away from 50 is = 0.0210 probability of obtaining at least 13 Heads away from 50 is = 0.0120 probability of obtaining at least 14 Heads away from 50 is = 0.0066 etc.  0 ...... PROBABILITY The  = .05 cutoff is called the significance level. THEORY ANALYSIS 0.0066 is called the p-value of the sample. Because our p-value (.0066) is less than the significance level (.05), our data suggest that the coin is indeed biased, in favor of Heads. Likewise, our evidence suggests that employee gender in this company is biased, in favor of Males. CONCLUSION

“Classical Scientific Method” Hypothesis – Define the study population... What’s the question? Experiment – Designed to test hypothesis Observations – Collect sample measurements Analysis – Do the data formally tend to support or refute the hypothesis, and with what strength? (Lots of juicy formulas...) Conclusion – Reject or retain hypothesis; is the result statistically significant? Interpretation – Translate findings in context! Statistics is implemented in each step of the classical scientific method!