Mapping a disease locus Fig. 11.A A1D A2d A1d d A2d A1 A2.

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

Qualitative and Quantitative traits
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
Genetics & The Work of Mendel
Basics of Linkage Analysis
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
QTL Mapping R. M. Sundaram.
MALD Mapping by Admixture Linkage Disequilibrium.
1 15 The Genetic Basis of Complex Inheritance. 2 Multifactorial Traits Multifactorial traits are determined by multiple genetic and environmental factors.
Extensions to Mendel’s Observation Types of Dominance Relationships Between Alleles of Same Locus: Complete Dominance Incomplete Dominance Codominance.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
The Inheritance of Complex Traits
Office hours Wednesday 3-4pm 304A Stanley Hall. Fig Association mapping (qualitative)
Finding “the gene” for cystic fibrosis. Why is this in quotes? A.CF is not caused by a gene, it’s caused by multiple genes. B.CF is not caused by genetic.
Variation in submergence tolerance
Office hours 3-4pm Wednesdays 304A Stanley Hall. Simulation/theory Expect 0.09 of a locus to reach LOD=3 by chance.
Office hours 3-4pm Wednesday, Dec A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100 Final exam 12:30pm Saturday, Dec Cory Hall.
How to find genetic determinants of naturally varying traits?
Simulation/theory With modest marker spacing in a human study, LOD of 3 is 9% likely to be a false positive.
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
Office hours 304A Stanley Hall next week 3-4pm Monday Nov 24.
2050 VLSB. Dad phase unknown A1 A2 0.5 (total # meioses) Odds = 1/2[(1-r) n r k ]+ 1/2[(1-r) n r k ]odds ratio What single r value best explains the data?
Office hours 3-4pm Wednesdays 304A Stanley Hall More realistic situation: in dad, phase of alleles unknown A1d d D A2d A1 A2 or A1d A2D.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Observing Patterns in Inherited Traits
Two loci AA/AA x BB/BB AB/AB (F1) AA AA long chrom short chrom locus 1 locus 2 Parent A BB BB long chrom short chrom locus 1 locus 2 Parent B Locus1/Locus.
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
Broad-Sense Heritability Index
Multifactorial Traits
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
A gene is composed of strings of bases (A,G, C, T) held together by a sugar phosphate backbone. Reminder - nucleotides are the building blocks.
Non-Mendelian Genetics
Dihybrid (or greater) Crosses: Review For either genotype or phenotype, the expected outcomes of a particular cross can be calculated by multiplying the.
Introduction to Linkage Analysis Pak Sham Twin Workshop 2003.
Announcements: Proposal resubmission deadline 4/23 (Thursday).
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Quantitative Genetics
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
INTRODUCTION TO ASSOCIATION MAPPING
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Who was Mendel? Mendel – first to gather evidence of patterns by which parents transmit genes to offspring.
Mapping and cloning Human Genes. Finding a gene based on phenotype ’s of DNA markers mapped onto each chromosome – high density linkage map. 2.
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
1 Balanced Translocation detected by FISH. 2 Red- Chrom. 5 probe Green- Chrom. 8 probe.
Genetic Theory Pak Sham SGDP, IoP, London, UK. Theory Model Data Inference Experiment Formulation Interpretation.
Epistasis / Multi-locus Modelling Shaun Purcell, Pak Sham SGDP, IoP, London, UK.
Lecture 22: Quantitative Traits II
Linkage. Announcements Problem set 1 is available for download. Due April 14. class videos are available from a link on the schedule web page, and at.
1 Genetic Mapping Establishing relative positions of genes along chromosomes using recombination frequencies Enables location of important disease genes.
Types of genome maps Physical – based on bp Genetic/ linkage – based on recombination from Thomas Hunt Morgan's 1916 ''A Critique of the Theory of Evolution'',
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
STT2073 Plant Breeding and Improvement. Quality vs Quantity Quality: Appearance of fruit/plant/seed – size, colour – flavour, taste, texture – shelflife.
Association Mapping in Families Gonçalo Abecasis University of Oxford.
Active Lecture PowerPoint ® Presentation for Essentials of Genetics Seventh Edition Klug, Cummings, Spencer, Palladino Copyright © 2010 Pearson Education,
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
GENOME ORGANIZATION AS REVEALED BY GENOME MAPPING WHY MAP GENOMES? HOW TO MAP GENOMES?
Identifying QTLs in experimental crosses
upstream vs. ORF binding and gene expression?
Migrant Studies Migrant Studies: vary environment, keep genetics constant: Evaluate incidence of disorder among ethnically-similar individuals living.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Linkage, Recombination, and Eukaryotic Gene Mapping
Chapter 7 Multifactorial Traits
Polygenic Inheritance
Presentation transcript:

Mapping a disease locus Fig. 11.A A1D A2d A1d d A2d A1 A2

Mapping a disease locus Fig. 11.A A1D A2d A1d d D A2

Mapping a disease locus Fig. 11.A A1d d A2D A1D A2d (sperm) A1 A2

LOD scores Odds = P(pedigree | r) P(pedigree | r = 0.5) r = genetic distance between marker and disease locus Odds = (1-r) n r k 0.5 (total # meioses) Odds = = Data >6 times more likely under LINKED hypothesis than under UNLINKED hypothesis. k = 1 recomb, n = 7 non-recomb. A1 A2

Just a point estimate observed recombination fraction = 1/8 = 12.5 cM Disease- causing mutation Restriction fragment length polymorphism True distance 30 cM this is our observation

LOD scores rodds ?? Odds = P(pedigree | r) P(pedigree | r = 0.5) Odds = (1-r) n r k 0.5 (total # meioses) k = 1 recomb, n = 7 non-recomb.

How to get an overall estimate of probability of linkage? A.Multiply odds together B.Add odds together C.Take the largest odds D.Take the average odds Given r Odds 1 Given r Odds 2 Given r Odds 3 1,22,3 1,21,3 2,31,3 1,22,3 1,2 1,3 2,31,3 2,3 1,31,22,3 2,2 Combining families

More realistic situation: in dad, phase of alleles unknown A1d d D A2d A1 A2 or A1d A2D

More realistic situation: in dad, phase of alleles unknown Odds = 1/2[(1-r) n r k ]P(pedigree|r) A1 A2 A1D A2d + 1/2[(1-r) n r k ] assume one phase for dad 7 non-recomb, 1 recomb (k = # recomb, n = # non-recomb) A1d A2D assume the other phase for dad 1 non-recomb, 7 recomb

In real life this correction does matter… best r = best r = Accounting for both phases Using only one phase family 1: 10 meioses, 1 (or 9) apparent recombinants family 2: 10 meioses, 4 (or 6) apparent recombinants family 3: 10 meioses, 3 (or 7) apparent recombinants family 4: 10 meioses, 3 (or 7) apparent recombinants total LOD = LOD(family 1) + LOD(family 2) + LOD(family 3) + LOD(family 4)

Modern genetic scans (single family)

Age of onset in breast cancer age of onset

Coins Odds = P(your flips | r) P(your flips | r = 0.5) r = intrinsic probability of coming up heads (bias) Odds = (1-r) n r k 0.5 (total # flips)

Coins 3 heads2 heads1 heads rodds heads rodds heads rodds rodds rodds r = intrinsic probability of coming up heads (bias)

Coins By chance, can get good LOD score for just about anything. The more students you have flipping coins, the more likely you are to see this “unlikely” combination. The multiple testing problem

Coins Probability of one student observing 0 heads and 4 tails: 1/16 Estimated number of students out of 70 observing 4 tails: 70*(1/16) = 4 Probability of one student observing 1 head and 3 tails: 4/16 Estimated number of students out of 70 observing 1 heads and 3 tails: 70*(4/16)= 17.5 Probability of one student observing 2 heads and 2 tails: 6/16 Estimated number of students out of 70 observing 2 heads and 2 tails: 70*(6/16) = 26.3 … We would need to see >4 students get 0 heads and 4 tails before we believe any coins are biased.

Simulation/theory Expect 0.09 of a locus to reach LOD=3 by chance.

Simulation/theory But this would change in a different organism, with different number of markers, etc. So in practice, everyone does their own simulation specific to their own study.

Candidate gene approach Hypothesize that causal variant will be in known pigment gene or regulator. NOT randomly chosen markers genome-wide.

Candidate gene approach Red progeny have RFLP pattern like red parent

Affected sib pair method 2,22,3 2,2 4,41,3 1,4 … Sib pairsObserved Expected under null Same allele 2(1/2)*2 Different allele 0(1/2)*2  2 =  (O - E) 2 E Test for significant allele sharing. Total # families

Qualitative but polygenic Fig Two loci. Need one dominant allele at each locus to get phenotype.

“A weak locus”: need lots of data AAbb aaBB AaBb Flower color inter-mate Two loci. Need one dominant allele at each locus to get phenotype. AABb AaBb aaBbAaBB aaBB Aabb Genotype at marker close to A locus purplewhite Top allele 31 Bottom allele 23

More generally (one locus): AA x BB AB (F1) AB x AB AA AB BA BB (F2)

25% 50%25% “Effect of having a B” AA AB BA BB AA AB BA BB Effect of a B allele is the same regardless of genotype: additive 1 locus, incomplete dominance

1 locus, complete dominance 75%25% AB BA AA BB Dominance is a kind of epistasis: nonadditive

A real example (F2’s) CC x SS CS CS x CS CC SS CS SC (F2’s)

Quantitative trait linkage test (F2’s) Not counting recombinants. Statistical test for goodness of fit.

Locus effect vs. parents C3H parent F2’s, C/C at marker F2’s, C/S at marker F2’s, S/S at marker SWR parent Homozygotes do not look like parent. What do you infer? A single varying locus does not explain the data

>1 locus controlling trait (One mouse family)

A weak locus C3H parent F2’s, C/C at marker F2’s, C/S at marker F2’s, S/S at marker SWR parent Most loci underlying human disease look like this. “Effect of having an S allele”

Heritability in exptal organisms Genetic variance = total var - “environmental var” Heritability H 2 = ee tt  g  =  t  -  e  g/tg/t

Heritability in humans: MZ twins 23B8-2B1E53E C5_1.jpg hrpics/sarahandsandra.jpg Each individual = z ij Total mean sq =  (z ij - z) 2 T Mean each pair = z i Within pairs mean sq =  (z ij - z i ) 2 N Between pairs mean sq =  (z i - z) 2 N-1 =  b 2 =  w 2 =  t 2 h 2 =  b 2  w 2 t2t2

Linkage mapping (quantitative) intoleranttolerant

Transgenic test

Fine-mapping: new markers Best marker Position of true causal variant Because you have to hunt through by hand to find the causal gene, and test experimentally. The smaller the region, the better.

Fine-mapping: new markers Position of true causal variant Increased marker density

Two loci, incomplete dominance

2-locus interaction Effect of J at locus 2 Locus 2 is epistatic to locus 1: effects of locus 1 are masked in individuals with JJ or JL,LJ at locus 2 Locus 2 follows a dominance model: JJ and JL,LJ have the same phenotype, LL differs “The dominant allele of locus 2 does the masking”

NO progeny as extreme as diploid hybrid

Three mutant genes From pathogenic strain Alleles from the same strain at different genes/loci can have different effects.

Linked mutations of opposite effect Path Lab Very unlikely

Fine-mapping Inject into golden larvae Golden uninjected WT uninjected

No truncation in humans, but… No other species have the Thr allele: what does this mean? Could be deleterious, just an accidental mutation. Could be advantageous for some humans, no other species.

Correlates with human differences AA AG GG Allele is rare Perhaps explains phenotypic variation among people of African ancestry Thr Thr, Ala Ala

Association mapping (qualitative) Fig Blue alleles at markers are on the same haplotype as the M allele of the disease locus

Association scan, qualitative osteoarthritiscontrols C’s G’s47433  2 test

Fine-mapping -log(  2 p-value) rs377472

Beginnings of molecular confirmation coding polymorphisms

Association scan, quantitative

Association vs. linkage Strong, easy to detect, but rare in population; may not be reflective of common disease. Also, hard to collect family data. Common but weak effects; need 1000’s of samples to detect. If no common cause, can fail. Unrelated individuals Related individuals

diabetescontrol Gm23270 no Gm Association mapping causal loci “Gm is protective against diabetes?”

Association and admixture these are all the Caucasians…

Association and admixture Cases Controls = = Don’t believe any one locus is causative!

Genotyping by array Fig. 11.8

Coding sequence array Fig. 1.13

Marker is linked to polymorphism in expression regulation cascade ORF TF G kinase TF G G

Marker is linked to polymorphism in expression regulation cascade ORF TF G kinase TF mRNA level shows linkage to locus of polymorphic regulator(s).

Clinical applications Colored curves = fat mass at different body locations

Association of human transcripts linkage (families) assoc (unrelated)

Protein inheritance

PSI+ phenotypes 50°C Genetically distinct S. cerevisiae strains