Statistical Issues in Human Genetics Jonathan L. Haines Ph.D. Center for Human Genetics Research Vanderbilt University Medical Center.

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

Lecture 45 Prof Duncan Shaw. Applications - finding genes Currently much interest in medical research, in finding the genes causing disease Sometimes.
The next generation Chapters 9, 10, 17 in the course textbook, especially pages , ,
Lecture 41 Prof Duncan Shaw. Genetic Variation Already know that genes have different alleles - how do these arise? Process of mutation - an alteration/change.
Lecture 39 Prof Duncan Shaw. Meiosis and Recombination Chromosomes pair upDNA replication Chiasmata form Recombination 1st cell division 2nd cell divisionGametes.
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Note that the genetic map is different for men and women Recombination frequency is higher in meiosis in women.
Identification of markers linked to Selenium tolerance genes
Qualitative and Quantitative traits
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
Tutorial #1 by Ma’ayan Fishelson
ASSOCIATION MAPPING WITH TASSEL Presenter: VG SHOBHANA PhD Student CPMB.
Genetic Analysis in Human Disease
Basic Concepts in Genetics References:
Mapping Genes for SLE: A Paradigm for Human Disease? Stephen S. Rich, Ph.D. Department of Public Health Sciences Wake Forest University School of Medicine.
Genetic linkage analysis Dotan Schreiber According to a series of presentations by M. Fishelson.
Basics of Linkage Analysis
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Eric Jorgenson Epidemiology 217 2/21/12
GGAW - Oct, 2001M-W LIN Study Design for Linkage, Association and TDT Studies 林明薇 Ming-Wei Lin, PhD 陽明大學醫學系家庭醫學科 台北榮民總醫院教學研究部.
Human Gene Mapping & Disease Gene Identification Cont.
Genetic Analysis.
Human Genetics Genetic Epidemiology.
QTL Mapping R. M. Sundaram.
Quantitative Genetics
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
2050 VLSB. Dad phase unknown A1 A2 0.5 (total # meioses) Odds = 1/2[(1-r) n r k ]+ 1/2[(1-r) n r k ]odds ratio What single r value best explains the data?
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Chapter 12: Inheritance Patterns and Human Genetics
Chapter 14 – The Human Genome
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Standardization of Pedigree Collection. Genetics of Alzheimer’s Disease Alzheimer’s Disease Gene 1 Gene 2 Environmental Factor 1 Environmental Factor.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Analyzing DNA Differences PHAR 308 March 2009 Dr. Tim Bloom.
Multifactorial Traits
Introduction to Genetics
Family-Based Association Tests
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
Non-Mendelian Genetics
CS177 Lecture 10 SNPs and Human Genetic Variation
Introduction to Linkage Analysis Pak Sham Twin Workshop 2003.
Genome-Wide Association Study (GWAS)
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Quantitative Genetics
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
What’s the Difference? Genetic and Common Diseases.
Genetics of Complex Disease Bert Gold, Ph.D., F.A.C.M.G.
An quick overview of human genetic linkage analysis
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
Pedigrees.
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Using Merlin in Rheumatoid Arthritis Analyses Wei V. Chen 05/05/2004.
Types of genome maps Physical – based on bp Genetic/ linkage – based on recombination from Thomas Hunt Morgan's 1916 ''A Critique of the Theory of Evolution'',
Pedigree Charts The family tree of genetics. Learning Objective I can use the correct symbols to put together a pedigree chart I can predict a genotype.
Chromosomal Inheritance and Human Heredity. Human Chromosomes Karyotype – a picture of an organism’s chromosomes We take pictures during mitosis when.
Genetics in Clinical Research Jonathan L. Haines, Ph.D. Center for Human Genetics Research 7/16/04.
Single Nucleotide Polymorphisms (SNPs
KA 4: Ante- and postnatal screening
Migrant Studies Migrant Studies: vary environment, keep genetics constant: Evaluate incidence of disorder among ethnically-similar individuals living.
Recombination (Crossing Over)
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Different mode and types of inheritance
Power to detect QTL Association
Linkage analysis and genetic mapping
Introduction to Genetics
Presentation transcript:

Statistical Issues in Human Genetics Jonathan L. Haines Ph.D. Center for Human Genetics Research Vanderbilt University Medical Center

COMMON COMPLEX DISEASE Complex Disease Environmen t Genes

COMMON COMPLEX DISEASE Complex Disease Environmen t Genes

What Can The Genes Tell Us? Give us a better understanding of the underlying biology of the trait in question Serve as direct targets for better treatments –Pharmacogenetics –Interventions Give us better predictions of who might develop disease Give us better predictions of the course of the disease Lead to knowledge that can help find a cure or prevention

Watson and Crick started it all in 1953 with the description of DNA 53 Year Anniversary of the paper will be in April. Both Won Nobel Prize

The DNA Between Individuals is Identical. All differences are in the 0.1% of DNA that varies. ACCGTCCAGGACCGTCCAGG ACCGTGCAGGACCGTGCAGG It’s hard to believe sometimes!

HUMAN CHROMOSOMES

Single-Nucleotide Polymorphisms (SNPs) One of the most common types of variation GATCCTGTAGCT GATCCTCTAGCT Extremely frequent across the genome (~1/400 bp) -> high resolution Easy to genotype -> high-throughput techniques G/C 1 st Chromosome 2 nd Chromosome GATCCTGTAGCT GATCCTCTAGCT Normal Affected < Normal < Disease

What are We Looking For? Human Genome Chromosome Gene (DNA) EarthCityStreetAddress Band

640 cubic yards 1/100 cubic inch 3,000 MB 1 x MB It really is like finding a needle in a haystack! (and a very BIG haystack, at that)

The Genome Sequence is not THE answer!

1. Define Phenotype a. Consistency b. Accuracy 2. Define the Genetic Component a. Twin Studies b. Adoption Studies c. Family Studies d. Heritability e. Segregation Analysis 3. Define Experimental Design 4. Ascertain Families a. Case-Control b. Singleton c. Sib Pairs d. Affected Relative Pairs 5. Collect Data a. Family Histories b. Clinical Results c. Risk Factors d. DNA Samples 6. Perform Genotype Generation a. Genomic Screen b. Candidate Gene 8. Identify, Test, and Localize Regions of Interest 9. Bioinformatics and Gene Identification 10. Identify Susceptibility Variation(s) 11. Define Interactions a. Gene-Gene b. Gene-Environment 7. Analyze data a. Model-dependent Lod score b. Model-independent sib-pair, relative pair c. Association studies case-control, family-based Disease Gene Discovery In Complex Disease

CLASSES OF HUMAN GENETIC DISEASE Diseases of Simple Genetic Architecture –Can tell how trait is passed in a family: follows a recognizable pattern –One gene per family –Often called Mendelian disease –Usually quite rare in population –“Causative” gene Diseases of Complex Genetic Architecture –No clear pattern of inheritance –Moderate to strong evidence of being inherited –Common in population: cancer, heart disease, dementia etc. –Involves many genes or genes and environment –“Susceptibility” genes

CLASSES OF HUMAN GENETIC DISEASE

Modes of Inheritance Autosomal Dominant –Huntington disease Autosomal Recessive –Cystic fibrosis X-linked –Duchenne muscular dystrophy Mitochondrial –Leber Optic atrophy Additive –HLA-DR in multiple sclerosis Combinations of the above –RP (39 loci), Nonsyndromic deafness

Linkage Analysis Traces the segregation of the trait through a family Traces the segregation of the chromosomes through a family Statistically measures the correlation of the segregation of the trait with the segregation of the chromosome

A SAMPLE PEDIGREE The RED chromosome is key

Measures of Linkage Parametric Vs Non-Parametric Two major approaches toward linkage analysis Parametric: Defines a genetic model of the action of the trait locus (loci). This allows more complete use of the available data (inheritance patterns and phenotype information). –The historical approach towards linkage analysis. Development driven by need to map simple Mendelian diseases –Quite powerful when model is correctly defined Non-Parametric: Uses either a partial genetic model or no genetic model. Relies on estimates of allele/ haplotype/region sharing across relatives. Makes far fewer assumptions about the action of the underlying trait locus(loci).

Linkage Analysis Families –Affected sibpairs –Affected relative pairs –Extended families Traits –Qualitative (affected or not) –Quantitative (ordinal, continuous) There are numerous different methods that can be applied These methods differ dramatically depending on the types of families and traits

Recombination: Nature’s way of making new combinations of genetic variants A. B. C. D. A. A diploid cell. B. DNA replication and pairing of homologous chromosomes to form bivalent. C. Chiasma are formed between the chromatids of homologous chromosomes D. Recombination is complete by the end of prophase I.

Linkage Analysis in Humans Measure the rate of recombination between two or more loci on a chromosome Can be done with any loci, but primary application is to find the location of a trait variant by measuring linkage to known marker variants.

LOD Score Analysis The likelihood ratio as defined by Morton (1955): L(pedigree|  = x) L(pedigree |  = 0.50) where  represents the recombination fraction and where 0  x  When all meioses are “scorable”, the LR is constructed as: L.R. = The LOD score (z) is the log 10 (L.R.) : z(  ) is the lod score at a particular value of the recombination fraction : z(  ) is the maximum lod score, which occurs at the MLE of the recombination fraction 

CLASSES OF HUMAN GENETIC DISEASE

Large FamiliesSmall Families Linkage Analysis Association Studies Family-BasedCase-Control Study Designs

Linkage vs. Association Linkage Association Shared within Families Shared across Families

TESTING CANDIDATE GENES DiseaseNormal 5/20 Gene is not important

TESTING CANDIDATE GENES DiseaseNormal 10/205/20 Gene may be important

Two Basic Study Designs for Association Analysis Case-Control Advantages –Power –Ascertainment Disadvantages –Sensitivity to assumptions –Matching Family-Based –Parent-child Trio –Discordant sibpairs Advantages –Use existing samples –Robustness to assumptions Disadvantages –Ascertainment –Power

METHODS FOR FAMILY- BASED ASSOCIATION STUDIES –Parent-Child AFBAC TDT HHRR QTDT –Sibpair S-TDT DAT Sibship –SDT –WSDT –FBAT Pedigree –Transmit –PDT –FBAT

TRANSMISSION DISEQUILIBRIUM TEST (TDT) Examines transmission of alleles to affected individuals Requires: –Linkage (transmission through meioses); and –Association (specific alleles) Test of linkage if association assumed Test of association if linkage assumed Test of linkage AND association if neither assumed Uses the non-transmitted alleles, effectively, as the control group. Can make “pseudocontrol” by creating genotype of the two non-transmitted alleles Requires phenotype only for the child

TDT calculation AB CD Transmitted Non-Transmitted (B-C) 2 TDT= (B+C) With > 5 per cell, this follows a  2 distribution with 1 df

12 11 TDT Transmitted 1 2 Not transmitted

2212 TDT Transmitted 1 2 Not transmitted

TDT Transmitted 1 2 Not transmitted

TDT Example AB CD Transmitted Non-Transmitted (B-C) 2 TDT= (B+C) Transmitted Non-Transmitted (42-25) 2 TDT= (42+25) = 4.31

Two Basic Study Designs for Association Analysis Case-Control Advantages –Power –Ascertainment Disadvantages –Sensitivity to assumptions –Matching

Analysis of Case-Control Data Standard epidemiological approaches can be used Qualitative trait –Logistic regression Quantitative trait –Linear regression The usual concerns about matching but must also worry about false-positives from population substructure

Incorporating Genetics into Your Studies Obtain appropriate IRB approval –DNA studies are quite common –Template language exists for IRB approval and consent forms –Genetic Studies Ascertainment Core (GSAC) can help –Kelly Taylor: Collect family history information Obtain DNA sample –Venipuncture –Buccal wash/swab –Finger stick Extract/Store DNA –DNA Resources Core can help –Cara Sutcliffe:

What Can The Genes Tell Us? Give us a better understanding of the underlying biology of the trait in question Serve as direct targets for better treatments –Pharmacogenetics –Interventions Give us better predictions of who might develop disease Give us better predictions of the course of the disease Lead to knowledge that can help find a cure or prevention