Chi-Square Analysis Goodness of Fit "Linkage Studies of the Tomato" (Trans. Royal Canad. Inst. (1931))

Slides:



Advertisements
Similar presentations
Statistics for A2 biology Statistics is..... The branch of mathematics dealing with... Uncertainty, or Probability.
Advertisements

AP Biology.  Segregation of the alleles into gametes is like a coin toss (heads or tails = equal probability)  Rule of Multiplication  Probability.
Exploring Mendelian Genetics
 R = Red eyesRr X Rr  r = White eyes. Used to confirm whether a set of data follows a specific probability distribution. IE…how likely is it that deviations.
Chi-Square Test Chi-square is a statistical test commonly used to compare observed data with data we would expect to obtain according to a specific hypothesis.
Mendelian Genetics. Genes- genetic material on a chromosome that codes for a specific trait Genotype- the genetic makeup of the organism Phenotype- the.
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Laws of Probability and Chi Square
Chi-Square Analysis Mendel’s Peas and the Goodness of Fit Test.
Chapter 11 Inference for Distributions of Categorical Data
Gregor Johann Mendel “Father of Genetics”. Mendel’s success Came up with an elegant model of experimental design –chose a good “model” organism: Pisum.
Chi Square Analyses: Comparing Frequency Distributions.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
Chapter 26: Comparing Counts
Chi-square Goodness of Fit Test
Chi-Square Analysis Test of Independence. We will now apply the principles of Chi-Square analysis to determine if two variables are independent of one.
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Chapter 13: Inference for Tables – Chi-Square Procedures
11.4 Hardy-Wineberg Equilibrium. Equation - used to predict genotype frequencies in a population Predicted genotype frequencies are compared with Actual.
Chi-Squared Test.
Chi Square AP Biology.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
11.2 Probability and Punnett Squares
Chapter 11: Inference for Distributions of Categorical Data.
Chapter 11: Inference for Distributions of Categorical Data
Chapter 3 – Basic Principles of Heredity. Johann Gregor Mendel (1822 – 1884) Pisum sativum Rapid growth; lots of offspring Self fertilize with a single.
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory. How.
Chi-Squared (  2 ) Analysis AP Biology Unit 4 What is Chi-Squared? In genetics, you can predict genotypes based on probability (expected results) Chi-squared.
Dihybrid crosses.
Chi-Square Analysis Test of Homogeneity. Sometimes we compare samples of different populations for certain characteristics. This data is often presented.
Chi square analysis Just when you thought statistics was over!!
Fruit Fly Basics Drosophila melanogaster. Wild Type Phenotype Red eyes Tan Body Black Rings on abdomen Normal Wings.
6.3 Genetics and Predictions. You will be able to: 1.Explain how probability is used in genetic predictions 2.To construct a Punnett Square for monohybrid.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
+ Chapter 11 Inference for Distributions of Categorical Data 11.1Chi-Square Goodness-of-Fit Tests 11.2Inference for Relationships.
State the ‘null hypothesis’ State the ‘alternative hypothesis’ State either one-tailed or two-tailed test State the chosen statistical test with reasons.
Chi-Square Analysis AP Biology.
Mendel’s Laws of Heredity
Lecture 11. The chi-square test for goodness of fit.
Chapter 3 Lecture Concepts of Genetics Tenth Edition Mendelian Genetics.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Did Mendel fake is data? Do a quick internet search and can you find opinions that support or reject this point of view. Does it matter? Should it matter?
III. Statistics and chi-square How do you know if your data fits your hypothesis? (3:1, 9:3:3:1, etc.) For example, suppose you get the following data.
Chi-Square Analysis Test of Homogeneity. Sometimes we compare samples of different populations for certain characteristics. When we do so we are performing.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
AP Biology Heredity PowerPoint presentation text copied directly from NJCTL with corrections made as needed. Graphics may have been substituted with a.
Chi-Square Analysis AP Biology.
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
21. The chi-square test for goodness of fit
GENETICS AND HEREDITY.
Chapter 25 Comparing Counts.
Patterns of inheritance
Chi-Square Analysis AP Biology.
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
MENDELIAN GENETICS CHI SQUARE ANALYSIS
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Chi-Square Analysis.
Chi-Square Analysis AP Biology.
The Chi Square Test A statistical method used to determine goodness of fit Goodness of fit refers to how close the observed data are to those predicted.
Chi-Square Analysis AP Biology.
Chapter 26 Comparing Counts.
Chi-Square Analysis AP Biology.
UNIT V CHISQUARE DISTRIBUTION
Chapter 26 Comparing Counts.
Chi-Square Analysis AP Biology.
Inference for Distributions of Categorical Data
Presentation transcript:

Chi-Square Analysis Goodness of Fit "Linkage Studies of the Tomato" (Trans. Royal Canad. Inst. (1931))

We will develop the use of the distribution through an example from biology. Consider two different characteristics of tomatoes, leaf shape and plant size. The leaf shape may be potato-leafed or cut- leafed, and the plant may be tall or dwarf. If we cross a tall cut-leaf tomato with a dwarf potato-leaf tomato and examine the progeny we will discover a uniform F 1 generation. The traits tall and cut-leaf are each dominant, while dwarf and potato-leaf are recessive. We use the letter T for height, and C for leaf shape, so the alleles are T, t, C, and c.

Tall cut-leaf tomato Dwarf potato-leaf tomato We will examine a Punnett square to illustrate this dihybrid cross. Notice the uniformity among the offspring, all are TtCc.

Now we cross the F 1 among themselves to produce the F 2 : gametes TCTctCtc TCTTCCTTCcTtCCTtCc TcTTCcTTccTtCcTtcc tCTtCCTtCcttCCttCc tcTtCcTtccttCcttcc

Now we identify the tall cut-leaf tomatoes: gametes TCTctCtc TCTTCCTTCcTtCCTtCc TcTTCcTTccTtCcTtcc tCTtCCTtCcttCCttCc tcTtCcTtccttCcttcc ttCc

Now we identify the tall potato-leaf tomatoes: gametes TCTctCtc TCTTCCTTCcTtCCTtCc TcTTCcTTccTtCcTtcc tCTtCCTtCcttCCttCc tcTtCcTtccttCcttcc

Next we identify the dwarf cut-leaf tomatoes: gametes TCTctCtc TCTTCCTTCcTtCCTtCc TcTTCcTTccTtCcTtcc tCTtCCTtCcttCCttCc tcTtCcTtccttCcttcc

gametes TCTctCtc TCTTCCTTCcTtCCTtCc TcTTCcTTccTtCcTtcc tCTtCCTtCcttCCttCc tcTtCcTtccttCcttcc Finally, the last type of tomato is dwarf potato-leaf:

So now we have four phenotypes (different physical forms) of tomatoes originating from the single phenotype of the F 1 generation. They are, along with their genotypes and expected frequencies: Tall cut-leaf TTCC, TTCc, TtCC, TtCc Tall potato- leaf TTcc, Ttcc Dwarf cut-leafttCC, ttCc Dwarf potato- leaf ttcc

If our understanding of genetics is correct and we have constructed the crosses we believe we have, we expect the proportions of the four phenotypes to fit our calculations. With the distribution, we are able to test to see if groups of individuals are present in the same proportions as we expect. This is rather like conducting multiple Z-tests for proportions, all at once. In this example we carry out the dihybrid cross to produce an F 1 generation, and, as expected, the F 1 are all of the same phenotype, tall and cut-leaf. Further, the F 1 are crossed among themselves to produce the F 2 generation. We record the numbers of individuals in each category.

The following table gives the observed numbers of each category. Phenotype Observed Expected frequency Tall cut-leaf926 Tall potato- leaf 288 Dwarf cut- leaf 293 Dwarf potato-leaf 104

To make a test for “goodness of fit” we start as with all other tests of significance, with a null hypothesis. Step 1: H 0 : The F 2 generation is comprised of four phenotypes in the proportions predicted by our calculations (based on Mendelian genetics). H a : The F 2 generation is not comprised of four phenotypes in the proportions predicted by our calculations. Another way of saying this is that for the null hypothesis the population fits our expected pattern, and for the alternate hypothesis, it does not fit our pattern.

Step 2: Assumptions: Our first assumption is that our data are counts. (We cannot use proportions or means.) With, we do not always have a sample of a population, and sometimes examine an entire population, as with this example. We must ensure that we have a representative sample, when we work from a sample. 1. All expected counts must be one or more. In order to check assumptions for this goodness of fit test we must calculate the expected counts for each category. Then we must meet two criteria: 2. No more than 20% of the counts may be less than 5.

We calculate the expected counts by finding the total number of observations and multiplying that by each expected frequency. Phenotype Observed counts Expected frequency Expected counts Tall cut-leaf926 Tall potato-leaf288 Dwarf cut-leaf293 Dwarf potato- leaf 104

As you can see, all expected counts are greater than 5, so all assumptions are met. Step 3: The formula for the test statistic is: where o = observed counts, and e = expected counts This calculation needs to be made in the graphing calculator. Enter the observed counts in L 1. Enter the expected frequencies in L 2, as exact numbers. (Enter numbers like 1/3, directly, as fractions, never round to just.3 or.33.)

In L 3 multiply L 2 by This will give the expected counts. The sum of L 1 can be found using 1-Var Stats. Now in L 4, enter, this will give you the contribution for each category. Finally, is the sum of L 4. For this problem, the statistic is In, we always need to know and report the degrees of freedom. The degrees of freedom are the number of categories minus one. Here we have 3 degrees of freedom.

Step 4:

Step 5:Step 6: Fail to reject H 0, a test statistic this large may occur by chance alone almost 70% of the time. Step 7: We lack strong evidence that the pattern of tomato phenotypes is different from the expected. That is, the F 2 generation are present in the expected proportions. The area can also be found with cdf(1.4687,10^99,3).

THE END