Download presentation
Presentation is loading. Please wait.
Published byCori Marsh Modified over 9 years ago
1
Populations: defining and identifying
3
Two major paradigms for defining populations Ecological paradigm A group of individuals of the same species that co-occur in space and time and have an opportunity to interact with each other. Evolutionary paradigm A group of individuals of the same species living in close enough proximity that any member of the group can potentially mate with any other member.
5
Cocoa from 32 abandoned estates in Trinidad 88 Imperial College Selection (ICS) clones conserved in the International Cocoa Genebank, Trinidad, assayed for 35 microsatellite loci Unweighted pair group method used to construct dendrogram of relatedness between individuals The different colored groups can be identified by eye, or identified with the computer program “STRUCTURE” (as was done here).
6
Yellow perch The yellow perch plays a significant role in the survival and success of the double-crested cormorant and other birds, predatory fish, commercial fisherman, and sport fisherman in the Great Lakes region. This fish must be properly managed in order to prevent the trophic structure and economy of the Great Lakes region from collapsing. The yellow perch (Perca flavescens) is found in the United States and Canada, and looks similar to the European perch but are paler. It is in the same family as the walleye, but in a different family from white perch.
7
mt DNA Control region haplotype frequency patterning for Yellow Perch spawning site groups across North America
8
Relationships among mtDNA haplotypes of Yellow Perch
9
Allele distribution for six representative Yellow Perch microsatellite loci among selected regions. Rings represent loci, colors within a ring represent alleles.
10
Bayesian assignment of Yellow Perch genetic structure, using STRUCTURE. Vertical bars represent individuals, colors within a bar represent probability of assignment to a cluster. 8 microsatellite loci, 25 collection sites, N= 495 fish, K=10
11
Inference of population structure using multi-locus genotype data Pritchard, Stephens, and Donnelly (2000) Falush, Stephens, and Pritchard (2003) STRUCTURE V2.1 Pritchard, J.K., and Wen, W. (2004)
12
Main objective of “structure” Assign individuals to populations on the bases of their genotypes, while simultaneously estimating population allele frequencies Infer number of populations “K” in the process
13
Other objectives Begin with a set of predefined populations and to classify individuals of unknown origin Identify the extent of admixture of individuals Infer the origin of particular loci in the sampled individuals
14
Structure is a Bayesian Model Based method of clustering many assumptions about parameters and distributions
15
Four basic models 1. Model without admixture each individual is assumed to originate in one (only one) of K populations 2. Model with admixture each individual is assumed to have inherited some proportion of its ancestry from each of K populations
16
Four basic models 3. Linkage model “Chunks” of chromosomes as derived as intact units from one or another K population and all allele copies on the same “chunk” derive from the same population.
17
Four basic models 4. F model The populations all diverged from a common ancestral population at the same time, but allows that the populations may have experienced different amounts of drift since the divergence event
18
Assumptions The main modeling assumptions are Hardy- Weinberg equilibrium (HW) within populations and complete linkage equilibrium (LD) between loci within populations The model accounts for the presence of HW or LD by introducing population structure and attempts to find populations groupings that (as far as possible) are not in disequilibrium
19
Hardy-Weinberg Gives relationship between gene frequencies and genotypic frequencies, assuming random mating F(AA)=p 2 F(Aa)=2pq F(aa)=q 2 The extent of a randomly mating population is predicted from STUCTURE using HW predictions
20
Two locus structure: linkage disequilibrium A1A1 A2A2 B1B1 B2B2
21
Relationship between allele frequency and haplotype frequency Haplotype frequencies x 11 = frequency of A 1 B 1 x 12 = frequency of A 1 B 2 x 21 = frequency of A 2 B 1 x 22 = frequency of A 2 B 2 Allele frequencies are the "marginal" totals p 1 = x 11 + x 12 q 1 = x 21 + x 22 p 2 = x 11 + x 21 q 2 = x 12 + x 22
22
Non-random associations of alleles between loci Expected value of a product between two random variables does not equal the product of two expectations: x 11 p 1 p 2 x 12 p 1 q 2 x 21 q 1 p 2 x 22 q 1 q 2
23
D=covariance of alleles between loci x 11 = p 1 p 2 + D x 12 = p 1 q 2 - D x 21 = q 1 p 2 - D x 22 = q 1 q 2 + D D=x 11 - p 1 p 2 or D=x 11 x 22 - x 12 x 21
24
Evolution of LD Establishment: LD=1 with new mutations Declines with time Increases with “hitchhiking” or SNPs associated with selected variants Increases by chance in small populations Doesn’t decline that fast in areas of low recombination Decreases with physical distance between SNPs (recombination)
26
Pairwise comparison of LD along chromosomes, high LD is red, low LD is green
27
Bayesian procedure employed by STRUCTURE Step 1: estimate the allele frequencies for each population assuming that the population of origin of each individual is known. Step 2: estimate the population of origin of each individual, assuming that the population allele frequencies are known. Iterate several times using “Markov-Chain Monte-Carlo” procedure
28
Good and bad things about “structure” When populations are real, most efficient way to estimate number of populations K and the membership of individuals to populations When populations are more continuous (for example a continuous cline), can impose incorrect structure on data, and create an arbitrary number of artificial groups.
29
Human variation and differentiation Hundreds of microsatellites now available ALU markers Can evolutionary history be reconstructed Are there distinct “races” Are certain populations less diverse
31
K is set to 3 We place individuals in three groups, without prior knowledge of group membership More loci, the better identification of groups
34
Human Genome Diversity Panel 55 Indigenous Populations from 5 Continents: Africa, Americas, Asia, Europe, Oceania, total of 1,056 people 377 microsatellite markers assayed Noah Rosenberg et al, Science, 2002
36
Structure within structure
37
Jun Li et al, Science, 2008 Human Genome Diversity Panel, 938 individuals from 51 populations, 5 continents 650,000 SNP Markers
41
Bayesian prior for population assignment
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.