Statistical Analysis Chi Square (X2).

Slides:



Advertisements
Similar presentations
The Chi-Square Test The chi-square test is a statistical test commonly used to compare the observed results of a genetic cross with the expected results.
Advertisements

AP Biology.  Segregation of the alleles into gametes is like a coin toss (heads or tails = equal probability)  Rule of Multiplication  Probability.
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Chi-Squared Tutorial This is significantly important. Get your AP Equations and Formulas sheet.
Laws of Probability and Chi Square
Quantitative Skills 4: The Chi-Square Test
Chi-square notes. What is a Chi-test used for? Pronounced like kite, not like cheese! This test is used to check if the difference between expected and.
Chapter 11 Section 2 Applying Mendel’s Principles
11.4 Hardy-Wineberg Equilibrium. Equation - used to predict genotype frequencies in a population Predicted genotype frequencies are compared with Actual.
Chi-Squared Test.
AP Biology. Chi-Square Purpose: To determine if a deviation from expected results is significant. (or was it just chance) Purpose: To determine if a deviation.
Chi Square.
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory. How.
Chi Squared Test. Why Chi Squared? To test to see if, when we collect data, is the variation we see due to chance or due to something else?
Chi square analysis Just when you thought statistics was over!!
The Chi Square Test “The Goodness Fit” Test. Purpose Used in genetics to determine how close to the expected ratio your data is Determines if the experiment.
Chi Square Analysis The chi square analysis allows you to use statistics to determine if your data “good” or not. In our fruit fly labs we are using laws.
Probability and Punnett Squares. Tossing Coins If you toss a coin, what is the probability of getting heads? Tails? If you toss a coin 10 times, how many.
How do you know when your data aren’t “close enough”? …and hand grenades!
Corn Lab CORN LAB: CORN LAB: Looking at Monohybrid Crosses and Chi Square Tests.
Did Mendel fake is data? Do a quick internet search and can you find opinions that support or reject this point of view. Does it matter? Should it matter?
The Chi Square Equation Statistics in Biology. Background The chi square (χ 2 ) test is a statistical test to compare observed results with theoretical.
III. Statistics and chi-square How do you know if your data fits your hypothesis? (3:1, 9:3:3:1, etc.) For example, suppose you get the following data.
Hardy-Weinberg Equilibrium When mating is completely random, the zygotic frequencies expected in the next generation may be predicted from the knowledge.
AP Biology Heredity PowerPoint presentation text copied directly from NJCTL with corrections made as needed. Graphics may have been substituted with a.
Chi-Square (χ 2 ) Analysis Statistical Analysis of Genetic Data.
Genetics and Statistics A Tale of Two Hypotheses.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Statistical Analysis: Chi Square
Chi-Squared Χ2 Analysis
11.2 Applying Mendel’s Principles
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Chi Squared Test.
Cell Cycle and Chi Square
Patterns of inheritance
Chi-Square Analysis AP Biology.
Chi-Square Test.
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Chi Square SBI3UP.
Chi Square.
MENDELIAN GENETICS CHI SQUARE ANALYSIS
Probability & Genetics
The Chi-Square Test The chi-square test is a statistical test commonly used to compare the observed results of a genetic cross with the expected results.
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Chi square.
Chi Squared Test.
Chi-Square Analysis.
Chi-Square Test.
Probability.
Probability & Genetics
Statistical Analysis Chi-Square.
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Statistical Analysis: Chi Square
Chi-Square Test.
Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007
Lecture 38 Section 14.5 Mon, Dec 4, 2006
Chi2 (A.K.A X2).
The Chi-Square Test The chi-square test is a statistical test commonly used to compare the observed results of a genetic cross with the expected results.
What is a χ2 (Chi-square) test used for?
Chi-Square Analysis AP Biology.
How do you know if the variation in data is the result of random chance or environmental factors? O is the observed value E is the expected value.
Genetic Statistics.
20 May 2019 Chi2 Test For Genetics Help sheet.
11-2 Probability and Punnett Squares
Chi square.
Chi-Square Analysis AP Biology.
Corn Genetics and Chi-Square Analysis
Presentation transcript:

Statistical Analysis Chi Square (X2)

Chi Square - a test of statistical significance Chi Square measures whether any deviations from the predicted norm that occurs in experimental results exceeds the deviation that might occur by chance. If it is found to be consistent with chance then it can be said to be due to genetics. X2 = ∑( d2 / e ) Chi square deviation expected Sum squared

Flip a coin 100 times –. you expect 50. heads and 50 tails Flip a coin 100 times – you expect 50 heads and 50 tails X2 = ∑( d2 / e ) Heads Tails Observed values 45 55 Expected (e) 50 Deviation (d) 5 Deviation squared (d2) 25 d2 / e 25/50 = 0.5 ∑( d2 / e ) 0.5 + 0.5 = 1

Flip a coin 500 times –. you expect 250. heads and 250 tails Flip a coin 500 times – you expect 250 heads and 250 tails X2 = ∑( d2 / e ) Heads Tails Observed values 245 255 Expected (e) 250 Deviation (d) 5 Deviation squared (d2) 25 d2 / e 25/250 = 0.1 ∑( d2 / e ) 0.1 + 0.1 = 0.2

Sample Size Notice that the sample size is very important. The larger your sample size, the closer your data should be to the expected data if the trait you are examining is due to chance (as genetics should be).

Phenotypes The number of possible phenotypes is also very important. In the case of coin flipping, there are 2 “phenotypes” In statistics we refer to this as having one degree of freedom. The degrees of freedom is on less than the total number of possible phenotypes. 4 phenotypes means 3 degrees of freedom 6 phenotypes means 5 degrees of freedom

How we use this information How we use this information - refer to the table This table gives the probability that an amount of deviation would occur simply by chance. df 0.99 0.97 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.02 0.01 1 --- 0.001 0.004 0.016 0.060 0.15 0.46 1.07 1.64 2.706 3.841 5.024 6.635 2 0.051 0.103 0.211 0.45 0.71 1.39 2.41 3.22 4.605 5.991 7.378 9.210 3 0.11 0.216 0.352 0.584 1.01 1.42 2.37 3.66 4.64 6.251 7.815 9.348 11.34 4 0.29 0.484 0.711 1.064 1.65 2.20 3.36 4.88 5.99 7.779 9.488 11.14 13.27 5 0.55 0.831 1.145 1.610 2.34 3.00 4.35 6.06 7.29 9.236 11.07 12.83 15.08

From our first coin toss example, X2 = 1 degrees of freedom (df) = 1 Therefore there is about a 30% probability that this was due to chance. This is not considered statistically significant. df 0.99 0.97 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.02 0.01 1 --- 0.001 0.004 0.016 0.060 0.15 0.46 1.07 1.64 2.706 3.841 5.024 6.635 2 0.051 0.103 0.211 0.45 0.71 1.39 2.41 3.22 4.605 5.991 7.378 9.210 3 0.11 0.216 0.352 0.584 1.01 1.42 2.37 3.66 4.64 6.251 7.815 9.348 11.34 4 0.29 0.484 0.711 1.064 1.65 2.20 3.36 4.88 5.99 7.779 9.488 11.14 13.27 5 0.55 0.831 1.145 1.610 2.34 3.00 4.35 6.06 7.29 9.236 11.07 12.83 15.08

From our second coin toss example, X2 = 0 From our second coin toss example, X2 = 0.2 degrees of freedom (df) = 1 Therefore there is less than 70% probability that this was due to chance. This is not considered statistically significant. But it is better! df 0.99 0.97 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.02 0.01 1 --- 0.001 0.004 0.016 0.060 0.15 0.46 1.07 1.64 2.706 3.841 5.024 6.635 2 0.051 0.103 0.211 0.45 0.71 1.39 2.41 3.22 4.605 5.991 7.378 9.210 3 0.11 0.216 0.352 0.584 1.01 1.42 2.37 3.66 4.64 6.251 7.815 9.348 11.34 4 0.29 0.484 0.711 1.064 1.65 2.20 3.36 4.88 5.99 7.779 9.488 11.14 13.27 5 0.55 0.831 1.145 1.610 2.34 3.00 4.35 6.06 7.29 9.236 11.07 12.83 15.08

Coins are not Genetics In most scientific research we do not want the results to be due to chance so for data to be considered statistically significant, the X2 result must be 5% or less. In genetics we want to find consistent data that shows high probability due to chance so the higher the probability due to chance, the better. To be considered statistically significant, the X2 result must be 95% or more.

Looking at our corn cobs One group had the following data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 169 61 53 17 300 RP Rp rP rp Observed values Expected (e) Deviation (d) Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at our corn cobs One group had the following data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 169 61 53 17 300 RP Rp rP rp Observed values 169 61 53 17 Expected (e) Deviation (d) Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at our corn cobs One group had the following data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 169 61 53 17 300 RP Rp rP rp Observed values 169 61 53 17 Expected (e) 56 19 Deviation (d) Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at our corn cobs One group had the following data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 169 61 53 17 300 RP Rp rP rp Observed values 169 61 53 17 Expected (e) 56 19 Deviation (d) 5 3 2 Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at our corn cobs One group had the following data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 169 61 53 17 300 RP Rp rP rp Observed values 169 61 53 17 Expected (e) 56 19 Deviation (d) 5 3 2 Deviation squared (d2) 25 9 4 d2 / e ∑( d2 / e )

Looking at our corn cobs One group had the following data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 169 61 53 17 300 RP Rp rP rp Observed values 169 61 53 17 Expected (e) 56 19 Deviation (d) 5 3 2 Deviation squared (d2) 25 9 4 d2 / e 0.45 0.16 0.21 ∑( d2 / e )

Looking at our corn cobs One group had the following data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 169 61 53 17 300 RP Rp rP rp Observed values 169 61 53 17 Expected (e) 56 19 Deviation (d) 5 3 2 Deviation squared (d2) 25 9 4 d2 / e 0.45 0.16 0.21 ∑( d2 / e ) 0 + 0.45 + 0.16 + 0.21 = 0.82

X2 = 0.82 degrees of freedom (df) = 3 0.99 0.97 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.02 0.01 1 --- 0.001 0.004 0.016 0.060 0.15 0.46 1.07 1.64 2.706 3.841 5.024 6.635 2 0.051 0.103 0.211 0.45 0.71 1.39 2.41 3.22 4.605 5.991 7.378 9.210 3 0.11 0.216 0.352 0.584 1.01 1.42 2.37 3.66 4.64 6.251 7.815 9.348 11.34 4 0.29 0.484 0.711 1.064 1.65 2.20 3.36 4.88 5.99 7.779 9.488 11.14 13.27 5 0.55 0.831 1.145 1.610 2.34 3.00 4.35 6.06 7.29 9.236 11.07 12.83 15.08

X2 = 0.82 degrees of freedom (df) = 3 Therefore there is about 85% probability that this was due to chance. This is not considered statistically significant. df 0.99 0.97 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.02 0.01 1 --- 0.001 0.004 0.016 0.060 0.15 0.46 1.07 1.64 2.706 3.841 5.024 6.635 2 0.051 0.103 0.211 0.45 0.71 1.39 2.41 3.22 4.605 5.991 7.378 9.210 3 0.11 0.216 0.352 0.584 1.01 1.42 2.37 3.66 4.64 6.251 7.815 9.348 11.34 4 0.29 0.484 0.711 1.064 1.65 2.20 3.36 4.88 5.99 7.779 9.488 11.14 13.27 5 0.55 0.831 1.145 1.610 2.34 3.00 4.35 6.06 7.29 9.236 11.07 12.83 15.08

Looking at the corn cob class data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 2378 660 661 258 3957 RP Rp rP rp Observed values Expected (e) Deviation (d) Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at the corn cob class data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 2378 660 661 258 3957 RP Rp rP rp Observed values 2378 660 661 258 Expected (e) Deviation (d) Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at the corn cob class data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 2378 660 661 258 3957 RP Rp rP rp Observed values 2378 660 661 258 Expected (e) 2226 742 247 Deviation (d) Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at the corn cob class data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 2378 660 661 258 3957 RP Rp rP rp Observed values 2378 660 661 258 Expected (e) 2226 742 247 Deviation (d) 152 82 81 11 Deviation squared (d2) d2 / e ∑( d2 / e )

Looking at the corn cob class data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 2378 660 661 258 3957 RP Rp rP rp Observed values 2378 660 661 258 Expected (e) 2226 742 247 Deviation (d) 152 82 81 11 Deviation squared (d2) 23104 6724 6561 121 d2 / e ∑( d2 / e )

Looking at the corn cob class data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 2378 660 661 258 3957 RP Rp rP rp Observed values 2378 660 661 258 Expected (e) 2226 742 247 Deviation (d) 152 82 81 11 Deviation squared (d2) 23104 6724 6561 121 d2 / e 10.38 9.06 8.8 0.49 ∑( d2 / e )

Looking at the corn cob class data Round, Purple Round, Yellow Wrinkled, Purple Wrinkled, Yellow Total 2378 660 661 258 3957 RP Rp rP rp Observed values 2378 660 661 258 Expected (e) 2226 742 247 Deviation (d) 152 82 81 11 Deviation squared (d2) 23104 6724 6561 121 d2 / e 10.38 9.06 8.8 0.49 ∑( d2 / e ) 10.38 + 9.06 + 8.8 + 0.49 = 28.73

X2 = 28.73 degrees of freedom (df) = 3 0.99 0.97 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.02 0.01 1 --- 0.001 0.004 0.016 0.060 0.15 0.46 1.07 1.64 2.706 3.841 5.024 6.635 2 0.051 0.103 0.211 0.45 0.71 1.39 2.41 3.22 4.605 5.991 7.378 9.210 3 0.11 0.216 0.352 0.584 1.01 1.42 2.37 3.66 4.64 6.251 7.815 9.348 11.34 4 0.29 0.484 0.711 1.064 1.65 2.20 3.36 4.88 5.99 7.779 9.488 11.14 13.27 5 0.55 0.831 1.145 1.610 2.34 3.00 4.35 6.06 7.29 9.236 11.07 12.83 15.08

X2 = 28.73 degrees of freedom (df) = 3 Therefore there is less than 1% probability that this was due to chance. This is really bad data!!! df 0.99 0.97 0.95 0.90 0.80 0.70 0.50 0.30 0.20 0.10 0.05 0.02 0.01 1 --- 0.001 0.004 0.016 0.060 0.15 0.46 1.07 1.64 2.706 3.841 5.024 6.635 2 0.051 0.103 0.211 0.45 0.71 1.39 2.41 3.22 4.605 5.991 7.378 9.210 3 0.11 0.216 0.352 0.584 1.01 1.42 2.37 3.66 4.64 6.251 7.815 9.348 11.34 4 0.29 0.484 0.711 1.064 1.65 2.20 3.36 4.88 5.99 7.779 9.488 11.14 13.27 5 0.55 0.831 1.145 1.610 2.34 3.00 4.35 6.06 7.29 9.236 11.07 12.83 15.08

Chi-Square Test Problems question b) Green Yellow Blue Observed values 19 18 37 Expected (e) 18.5 Deviation (d) 0.5 Deviation squared (d2) 0.25 d2 / e 0.0135 ∑( d2 / e ) 0.0135 + 0.0135 + 0 = 0.027 This indicates just under 99% probability due to chance. Therefore we can accept this as statistically significant.

Chi-Square Test Problems question c) RP Rp rP rp Observed values 216 65 79 21 Expected (e) 214 71 24 Deviation (d) 2 6 8 3 Deviation squared (d2) 4 36 64 9 d2 / e 0.019 0.51 0.90 0.375 ∑( d2 / e ) 0.019 + 0.51 + 0.90 + 0.375 = 1.804 This indicates about 65% probability due to chance. Therefore we can not accept this as statistically significant.

Chi-Square Test Problems question d) Ty tY TY ty Observed values 193 184 556 61 Expected (e) 186 559 62 Deviation (d) 7 2 3 1 Deviation squared (d2) 49 4 9 d2 / e 0.26 0.02 ∑( d2 / e ) 0.26 + 0.02 + 0.02 + 0.02 = 0.32 This indicates about 96% probability due to chance. Therefore we can accept this as statistically significant.