Experimental Design and Data Structure Supplement to Lecture 8 Fall 2015 1.

Slides:



Advertisements
Similar presentations
15 The Genetic Basis of Complex Inheritance
Advertisements

The genetic dissection of complex traits
Planning breeding programs for impact
Frary et al. Advanced Backcross QTL analysis of a Lycopersicon esculentum x L. pennellii cross and identification of possible orthologs in the Solanaceae.
GENETICS AND VARIABILITY IN CROP PLANTS. Genetics and variability of traits are grouped by:  Qualitative traits Traits that show variability that can.
Association Mapping as a Breeding Strategy
Identification of markers linked to Selenium tolerance genes
Qualitative and Quantitative traits
Tutorial #1 by Ma’ayan Fishelson
ASSOCIATION MAPPING WITH TASSEL Presenter: VG SHOBHANA PhD Student CPMB.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Chapter 9: Genetic linkage and maps in breeding applications
QTL Mapping R. M. Sundaram.
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Quantitative Genetics Theoretical justification Estimation of heritability –Family studies –Response to selection –Inbred strain comparisons Quantitative.
Quantitative Genetics
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
What is a QTL? What are QTL?. Current methods for QTL  Single Marker Methods ( Student, 17?? )  t-tests  Interval Mapping Method (Lander and Botstein,
Mendelian Genetics Simple Probabilities & a Little Luck.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Standardization of Pedigree Collection. Genetics of Alzheimer’s Disease Alzheimer’s Disease Gene 1 Gene 2 Environmental Factor 1 Environmental Factor.
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
Module 7: Estimating Genetic Variances – Why estimate genetic variances? – Single factor mating designs PBG 650 Advanced Plant Breeding.
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Broad-Sense Heritability Index
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Mapping populations Controlled crosses between two parents –two alleles/locus, gene frequencies = 0.5 –gametic phase disequilibrium is due to linkage,
PBG 650 Advanced Plant Breeding
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Quantitative Genetics
INTRODUCTION TO ASSOCIATION MAPPING
Who was Mendel? Mendel – first to gather evidence of patterns by which parents transmit genes to offspring.
1 How many genes? Mapping mouse traits Lecture 1, Statistics 246 January 20, 2004.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Simple-Sequence Length Polymorphisms SSLPs Short tandemly repeated DNA sequences that are present in variable copy numbers at a given locus. Scattered.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Chapter 22 - Quantitative genetics: Traits with a continuous distribution of phenotypes are called continuous traits (e.g., height, weight, growth rate,
Why you should know about experimental crosses. To save you from embarrassment.
What is a QTL? Quantitative trait locus (loci) Region of chromosome that contributes to variation in a quantitative trait Generally used to study “complex.
Types of genome maps Physical – based on bp Genetic/ linkage – based on recombination from Thomas Hunt Morgan's 1916 ''A Critique of the Theory of Evolution'',
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Plant Breeding and Improvement STT Variation Environmental variation Heritable variation.
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
Exam Critical Concepts Genetics Chapters
Overview What is Plant Breeding? Basic Genetics Mendelian Genetics
Lecture 2: Genetic mapping and linkage analysis in farm animals
Restriction Fragment Length Polymorphism. Definition The variation in the length of DNA fragments produced by a restriction endonuclease that cuts at.
GENOME ORGANIZATION AS REVEALED BY GENOME MAPPING WHY MAP GENOMES? HOW TO MAP GENOMES?
Simple-Sequence Length Polymorphisms
Ø Novel approaches for linkage mapping in dairy cattle
upstream vs. ORF binding and gene expression?
Chapter Seven: Extending Mendelian Genetics
Quantitative traits Lecture 13 By Ms. Shumaila Azam
DNA Marker Lecture 10 BY Ms. Shumaila Azam
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Gene Linkage and Genetic Mapping
Mapping Quantitative Trait Loci
Applied Molecular Genetics Molecular Marker and Technique
Linkage, Recombination, and Eukaryotic Gene Mapping
Genome-wide Associations
Genome-wide Association Studies
Sequential Steps in Genome Mapping
Lecture 9: QTL Mapping II: Outbred Populations
Cancer as a Complex Genetic Trait
Presentation transcript:

Experimental Design and Data Structure Supplement to Lecture 8 Fall

QTL Mapping Data Marker Data:  Molecular markers: specific patterns of DNA sequences; polymorphic, abundant, neutral, co-dominant or dominant.  examples: RFLP, SSR, RAPD, AFLP, VNTR  Markers data are categorical (i.e., different classifications):  presence or absence of a band of molecular segment.  the number of categories depends on mapping population and marker type  examples:  two marker types (homozygote or heterozygote) for backcross population  three marker types for F 2 population with co-dominant markers.  Markers contain information about segregation at various positions of a genome in a population. 2

Quantitative trait data:  Measurement of a phenotype.  examples:  12 week body weight of mouse  grain yield of maize  little size of pigs  blood pressure  disease resistant score,  expression (traits) from microarrays…  Continuous or discrete data.  Quantitative trait data contain information about segregation and effects of QTL in a population. 3

Quantitative Trait Loci Quantitative Trait Loci (QTL): the regions or genes whose variation has an effect on a trait in a population. The statistical task of mapping QTL is to detect and estimate the association between the variation at the phenotypic level (trait data) and the variation at genetic level (marker data) in terms of number, positions, effects and interaction of QTL. 4

Experimental Designs Traditional experimental designs for locating QTL start with two parental inbred lines, P 1 and P 2, differing both in trait values and in the marker (M, N, …) variants or alleles (M 1, M 2, N 1, N 2,…) they carry.  in practice, markers are sought that have different variants (alleles) in the parents.  Advantage:  F 1 is heterozygote for all loci  which differ in P 1 and P 2  maximum linkage disequilibrium.  for mapping QTL  this type of experimental design has the maximum power. marker and trait information 5

or Backcross and F 2 Designs 6 Notation: “r” denotes recombination

 Backcross (BC):  two genotypes at a locus  simple to analyze  F 2 :  three genotypes at a locus,  can estimate both additive and dominance effects  more complex for data analysis, particularly for multiple QTL with epistasis (i.e., interaction)  more opportunity and information to examine genetic structure or architecture of QTL  more power than BC for QTL analysis 7

Other commonly used inbred line crosses  Note: QTL-Cartographer deals with the below, and there are additional details in Lynch and Walsh (1998)  Advanced intercross  selfing (SF t ): F 2 = F 1 × F 1 ; F 3 = F 2 × F 2 ; F 4 = F 3 × F 3 ; · · · through selfing):  continual selfing (6+ generations) leads to recombinant inbred lines (RI lines)  Doubled haploid (RI 0 ): similar to BC and RI in analysis  Repeated backcross:  B 1t, B 2t ; B 12 =P 1 × B 1 ; B 13 =P 1 × B 12 ; B 14 =P 1 × B 13 ; … 8

Recombinant inbred lines (RI lines):  selfing (RI 1 = SF t, t > 6)  brother-sister mating (RI 2 )  more mapping resolution as more recombination occurs when constructing RI lines.  May improve the measurement of mean phenotype of a line with multiple individuals, i.e., increase heritability.  potentially a very big advantage for QTL analysis,  a big factor for power calculation and sample size requirement. 9

Recombinant Inbred Lines By selfing (RI 1) By brother-sister mating (RI 2 ) 10

Outcross populations  Cross segregating populations (no inbred available):  similar model and analysis procedure used as inbred cross, but more complex in analysis.  need to estimate the probability of allelic origin for each genomic point from observed markers.  less powerful for QTL analysis  QTL alleles may not be preferentially fixed in the parental populations  more difficult for power calculation (more unknowns) 11

Half-sib families  analyze the segregation of one parent  similar to backcross in model and analysis.  less powerful for QTL detection  more uncontrollable variability in the other parents.  analyze allelic effect difference in one parent, not the allelic effect difference between widely differentiated inbred lines, populations and species.  generally the relevant heritability is low for QTL analysis. 12

 four genotypes at a locus  possible to estimate allelic substitution effects for male and female parents and their interaction (dominance). ♂ α = [ac + ad] – [bc + bd] ♀ α = [ac + bc] – [ad + bd] β = [ac + bd] – [ad + bc]  doubled information for QTL analysis compared to half-sibs  should be more powerful.  Note: If we use the double pseudo-backcross approach for mapping analysis, we do NOT utilize full genetic information,  it actually uses less than half the information available.  not powerful for QTL identification.  Power calculation depends on how the data are analyzed. ♂ ab ac, ad, bc, bd cd ♀ Full-sib families ♀♂♀♂ ab cacbc dadbd 13

Human populations Pedigree :  limited by sample size  linkage analysis  association between markers and disease locus is due to recent genetic linkage  suited for Mendelian diseases, not for complex diseases Case-Control:  large population available for study  association analysis  association between markers and disease locus is due to historical genetic linkage, and restricted to short regions  Can be used for complex diseases 14

Designs to reduce sample size and increase statistical power  Selective genotyping  Bulked segregant analysis  Progeny testing  Replicated progeny  Granddaughter design 15

Examples: Mapping Data  Two data sets are used as examples for various analyses:  Mouse (Table 1)  Maize (Table 2) 16

 Mouse data (Dragani et al Mammalian Genome 6: ) :  backcross population (B 1 )  103 individuals (sample size n =103)  181 microsatellite markers (SSR: simple sequence repeats) distributed across 20 chromosomes  including 14 markers on chromosome X  chromosome X is used here as an example  quantitative trait is 12 week body weight (BW)  Throughout, we use the trait data and marker data on chromosome X to illustrate the analyses of segregation, linkage, single marker analysis, interval mapping and composite interval mapping (which also uses some markers on other chromosomes). 17

1 = AA homozygote genotype; 0 = Aa heterozygote genotype 18

 Maize data:  F 2 population.  171 lines (sample size n =171)  132 markers distributed on 10 chromosomes,  including 12 markers on a chromosome used as an example.  quantitative trait is disease resistant score.  Eventually, we will use the trait data and the marker data to illustrate segregation, linkage, single marker analysis, interval mapping and composite interval mapping (which also uses some markers on other chromosomes).  A partial data are shown in Table 2. 19

2 = AA homozygote genotype of P1; 1 = Aa heterozygote genotype; 0 = aa homozygote genotype of P2 20