Chi-Squared Analysis Stickrath.

Slides:



Advertisements
Similar presentations
What is Chi Square? Chi square is a “goodness of fit” test.
Advertisements

Chi-Square Test Chi-square is a statistical test commonly used to compare observed data with data we would expect to obtain according to a specific hypothesis.
Chi-Squared Tutorial This is significantly important. Get your AP Equations and Formulas sheet.
What is a χ2 (Chi-square) test used for?
Laws of Probability and Chi Square
Quantitative Skills 4: The Chi-Square Test
Hypothesis Testing IV Chi Square.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) What is the uncertainty.
What is a χ2 (Chi-square) test used for?
Please turn in your signed syllabus. We will be going to get textbooks shortly after class starts. Homework: Reading Guide – Chapter 2: The Chemical Context.
Evaluation (practice). 2 Predicting performance  Assume the estimated error rate is 25%. How close is this to the true error rate?  Depends on the amount.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Naked mole rats are a burrowing rodent
Chi-square notes. What is a Chi-test used for? Pronounced like kite, not like cheese! This test is used to check if the difference between expected and.
Chi-square Goodness of Fit Test
Chi-Squared (  2 ) Analysis AP Biology Unit 4 What is Chi-Squared? In genetics, you can predict genotypes based on probability (expected results) Chi-squared.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Chi-Squared Test.
Hypothesis Testing:.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
AP Biology. Chi-Square Purpose: To determine if a deviation from expected results is significant. (or was it just chance) Purpose: To determine if a deviation.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Chi Square.
Review of Statistics Group Results. Which type of statistics? When comparing two group scores-Use the t-test. When comparing more than two scores: Use.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Chi Square Analysis  Use to see if the observed value varies from the expected value.  Null Hypothesis – There is no difference between the observed.
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Chi-Squared (  2 ) Analysis AP Biology Unit 4 What is Chi-Squared? In genetics, you can predict genotypes based on probability (expected results) Chi-squared.
Test for Significant Differences T- Tests. T- Test T-test – is a statistical test that compares two data sets, and determines if there is a significant.
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Chi Squared Test. Why Chi Squared? To test to see if, when we collect data, is the variation we see due to chance or due to something else?
Chi square analysis Just when you thought statistics was over!!
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
Test of Goodness of Fit Lecture 43 Section 14.1 – 14.3 Fri, Apr 8, 2005.
Welcome to MM570 Psychological Statistics
Chi-square Test also called chi- squared or χ2 distribution.
Chi-Square Test. Chi-Square (χ 2 ) Test Used to determine if there is a significant difference between the expected and observed data Null hypothesis:
Science Practice 2: The student can use mathematics appropriately. Science Practice 5: The student can perform data analysis and evaluation of evidence.
Chi square Test. Chi squared tests are used to determine whether the difference between an observed and expected frequency distribution is statistically.
Did Mendel fake is data? Do a quick internet search and can you find opinions that support or reject this point of view. Does it matter? Should it matter?
The Chi Square Equation Statistics in Biology. Background The chi square (χ 2 ) test is a statistical test to compare observed results with theoretical.
DRAWING INFERENCES FROM DATA THE CHI SQUARE TEST.
Chi Square Pg 302. Why Chi - Squared ▪Biologists and other scientists use relationships they have discovered in the lab to predict events that might happen.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
AP Biology Heredity PowerPoint presentation text copied directly from NJCTL with corrections made as needed. Graphics may have been substituted with a.
Chi-Square (χ 2 ) Analysis Statistical Analysis of Genetic Data.
Scientists typically collect data on a sample of a population and use this data to draw conclusions, or make inferences, about the entire population. (for.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
Test of Goodness of Fit Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007.
Chi-Squared (2) Analysis
Chi-Squared Χ2 Analysis
Chi Squared Test.
Statistical Analysis Chi Square (X2).
Patterns of inheritance
Chi-Squared (2) Analysis
Inferential Statistics
Chi Square SBI3UP.
Chi Square.
Chi square.
Chi Squared Test.
Statistical Analysis Chi-Square.
Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007
Chi2 (A.K.A X2).
How do you know if the variation in data is the result of random chance or environmental factors? O is the observed value E is the expected value.
Chi square.
Lecture 43 Section 14.1 – 14.3 Mon, Nov 28, 2005
Presentation transcript:

Chi-Squared Analysis Stickrath

Chi-Squared Analysis Suppose I bet you $1,000 that I can predict whether heads or tails will turn up each time you flip a coin. The first time I say, “heads” you flip the coin and it is heads. I got lucky The second time I say, “heads” you flip the coin and it is heads

Chi-Squared Analysis The third time, fourth time, fifth time, sixth time, seven time, eighth time, and so on I predict heads. Each time you flip heads. At what point do you suspect that I am using a two-headed coin? When do you stop chalking it up to chance and accuse me of using a two-headed coin? You can use statistics to back up your accusations and save yourself $1,000

Chi-Squared Analysis Start with the assumption (null-hypothesis) that the results of the coin flip are due to chance It is easier to disprove something than to prove it You will attempt to disprove your null-hypothesis By showing that it is NOT due to chance you can accuse me of cheating

Chi-Square Test Heads 20 10 100 Tails -10 Comparison of observed results and expected results Null-hypothesis: It is purely due to chance X2 value = Sum of (Observed – Expected)2 Expected X2 value = 20 Categories Observed Expected (O-E) (O-E)2 (O-E)2/E Heads 20 10 100 Tails -10

What if we do a second experiment with a new coin and obtain the results below Null-hypothesis: It is purely due to chance X2 value = Sum of (Observed – Expected)2 Expected X2 value = 0.2 Categories Observed Expected (O-E) (O-E)2 (O-E)2/E Heads 11 10 1 0.1 Tails 9 -1

What conclusion would you make from the data for the two coins? Which data is legitimately due to chance, and which data is not due to chance? In the case of the first coin (two-headed) the chi-squared (X2) value is 20 In the case of the second coin (regular) the chi-squared (X2) value is 0.2 So, the higher the (X2) value…the _______ likely the results are due to chance The lower the (X2) value…the _______ likely the results are due to chance

How low is low enough? The null-hypothesis is that your results are due to chance You are attempting to disprove the null-hypothesis It is easier to disprove something than to prove it How can chi-squared (X2) analysis be used to disprove the null-hypothesis There’s an app for that (actually a chart) To follow the chart you must know two things Degrees of Freedom p-value

Degrees of Freedom The number of values in the final calculation of a statistic that are free to change Let’s say I give you 4 numbers and tell you that they must add up to 100. In addition, I tell you that one of the numbers is 50. The three remaining numbers could be a variety of values as long as the overall total is 100 Choice 1 Choice 2 Choice 3 Number 1 = 50 Number 1 = 50 Number 1 = 50 Number 2 = 30 Number 2 = 5 Number 2 = ? Number 3 = 10 Number 3 = 25 Number 3 = ? Number 4 = 10 Number 4 = 20 Number 4 = ?. There are many more choices that fulfill the conditions

Degrees of Freedom In the example above you have 4 options, one of which is a fixed value (50) 3 numbers are free to change 3 degrees of freedom What if I said you have 5 options, one of which is a fixed value (50) 4 numbers are free to change 4 degrees of freedom The more options you have, the more degrees of freedom you have Generally, in biology degrees of freedom = # categories -1

p-value The null-hypothesis is that your results are due to chance p-value: probability that the null-hypothesis is valid (true) High p-value means null-hypothesis is true Low p-value means that the null-hypothesis is untrue How low is low enough? The significant p-value is 0.05 (5%) A p-value less than 0.05 means that it is less than 5% likely that the results are due to chance A p-value greater than 0.05 means that it is more than 5% likely that the results are due to chance

Two-headed coin Big X2 = Small p-value = Not due to chance = Statistically Significant Data The X2 value for our two-headed coin was 20 The number of options were 2 (heads or tails) = 1 degree of freedom The significant p-value is always 0.05 or less Critical value for 1 degree of freedom is 3.84 20 is greater than 3.84 so p-value is less than 0.05 20 = 0.000001 = Not due to chance = Statistically significant

Regular coin Small X2 = Large p-value = Due to chance = Statistically Insignificant Data The X2 value for our two-headed coin was 0.2 The number of options were 2 (heads or tails) = 1 degree of freedom The significant p-value is always 0.05 or less Critical value for 1 degree of freedom is 3.84 0.2 is lower than 3.84 so p-value is more than 0.05 0.2 = 0.65 = Due to chance = Statistically insignificant

Simple vs. Complex In the case of the two-headed coin, you have simple expectations 50:50 heads to tails What about more complex problems?

Teaching Example 100 students took my exam X2 =11.93 Degrees of Freedom = # categories -1 = 5-1 = 4 Categories Observed Expected (O-E) (O-E)2 (O-E)2/E A 20 15 5 25 1.67 B 22 -3 9 0.36 C 35 40 -5 0.63 D 23 8 64 4.27 F

Did my student meet my expectations? Degrees of Freedom = # categories -1 = 5-1 = 4