Aspects of Genetics and Genomics in Cancer Research Li Hsu Biostatistics and Biomathematics Program Fred Hutchinson Cancer Research Center.

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

Review of main points from last week Medical costs escalating largely due to new technology This is an ethical/social problem with major conseq. Many new.
CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Association Tests for Rare Variants Using Sequence Data
Note that the genetic map is different for men and women Recombination frequency is higher in meiosis in women.
Cancer: a genetic disease of inherited and somatic mutations n Gene mutations and/or genetic instability are involved in many cancers. n Viruses and environmental.
Genetic Analysis in Human Disease
Basics of Linkage Analysis
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Ferdinand van ’t Hooft Cardiovascular Genetics and Genomics Group Karolinska Institutet, Stockholm, Sweden Genome-Wide Association Study GWAS
Retinoblastoma by Michele Chasteen What is Retinoblastoma? n n Tumor of the eye that can occur at a high frequency in children and sporadically at an.
Estimating the penetrances of breast and ovarian cancer in the carriers of BRCA1/2 mutations Silvano Presciuttini University of Pisa, Italy.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
How to find genetic determinants of naturally varying traits?
PERFORMANCE OF DIFFERENT MODELS PREDICTING THE PRE-TEST PROBABILITY OF CARRYING MUTATIONS IN BRCA1 OR BRCA2 IN 330 ITALIAN FAMILIES Silvano Presciuttini,
. Learning – EM in The ABO locus Tutorial #9 © Ilan Gronau.
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
Mapping Basics MUPGRET Workshop June 18, Randomly Intermated P1 x P2  F1  SELF F …… One seed from each used for next generation.
BRCA2 Blue: Rad51; Green: BRCA2
BIO341 Meiotic mapping of whole genomes (methods for simultaneously evaluating linkage relationships among large numbers of loci)
KinSNP Software for homozygosity mapping of disease genes using SNP microarrays El-Ad David Amir 1, Ofer Bartal 1, Yoni Sheinin 2, Ruti Parvari 2 and Vered.
BRCA1 and BRCA2 Mutations and Breast Cancer: An Integrated Approach Using Four Epidemiologic Parameters Monica McClain, PhD. Assistant Director, Biometry.
BRCA Genes Dallas Henson.
BRCA Mutations and Breast Cancer Ruth Phillips and Patty Ashby.
BRCA1/2 Mutation Testing and Breast/Ovarian Cancer in the Ashkenazi Jewish Population Glenn E Palomaki, B.S. Foundation for Blood Research Scarborough,
The Cancer Pedigree BRCA What?. Outline Introduction: Understanding the weight of genetics in Ovarian Breast Cancer BRCA 1 and BRCA 2 Genes – Function.
Robust and powerful sibpair test for rare variant association
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Angelina Jolie The White Coat Wonder. Rational  The purpose of our research is to enrich the Premed-A community with the knowledge of other cancers caused.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Unknown genetic predisposition in familial breast cancer can lie deep in family tree San Ming Wang University of Nebraska Medical Center.
Case(Control)-Free Multi-SNP Combinations in Case-Control Studies Dumitru Brinza and Alexander Zelikovsky Combinatorial Search (CS) for Disease-Association:
Next-Generation Sequencing
Non-Mendelian Genetics
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
CS177 Lecture 10 SNPs and Human Genetic Variation
Introduction to Linkage Analysis Pak Sham Twin Workshop 2003.
Gene Hunting: Linkage and Association
Quantitative Genetics
Large-scale recombination rate patterns are conserved among human populations David Serre McGill University and Genome Quebec Innovation Center UQAM January.
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
Lecture 13: Linkage Analysis VI Date: 10/08/02  Complex models  Pedigrees  Elston-Stewart Algorithm  Lander-Green Algorithm.
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
FINE SCALE MAPPING ANDREW MORRIS Wellcome Trust Centre for Human Genetics March 7, 2003.
Sir Archibald E Garrod – alcaptonuria – black urine - (Madness of King George)
California Pacific Medical Center
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
1 Balanced Translocation detected by FISH. 2 Red- Chrom. 5 probe Green- Chrom. 8 probe.
Association mapping for mendelian, and complex disorders January 16Bafna, BfB.
Javad Jamshidi Fasa University of Medical Sciences, December 2015 Cancer Genetics Session 4 Medical Genetics.
The International Consortium. The International HapMap Project.
Lectures 7 – Oct 19, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.
Chapter 22 - Quantitative genetics: Traits with a continuous distribution of phenotypes are called continuous traits (e.g., height, weight, growth rate,
1 Genetic Mapping Establishing relative positions of genes along chromosomes using recombination frequencies Enables location of important disease genes.
An atlas of genetic influences on human blood metabolites Nature Genetics 2014 Jun;46(6)
Linkage and Mapping Bonus #2 due now. The relationship between genes and traits is often complex Complexities include: Complex relationships between alleles.
Recombination (Crossing Over)
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Exercise: Effect of the IL6R gene on IL-6R concentration
Heredity, Gene Regulation, and Development
Balanced Translocation detected by FISH
Heredity, Gene Regulation, and Development
Specific Tumor Suppressor Genes
Presentation transcript:

Aspects of Genetics and Genomics in Cancer Research Li Hsu Biostatistics and Biomathematics Program Fred Hutchinson Cancer Research Center

Outline Cancer facts Linkage analysis of family studies Genome-wide association studies

Etiology of Cancer The etiology of cancer is multifactorial, with genetic, environmental, medical, and lifestyle factors interacting to produce a given malignancy. The breakthroughs in high throughput genotyping technologies have made it possible for systematically identifying genes that are responsible for disease occurrence.

BRCA1 and Breast Cancer BRCA1 (breast cancer 1) is a human gene that belongs to a class of genes known as tumor suppressors, which maintains genomic integrity to prevent uncontrolled proliferation. Variations in the gene have been implicated in a number of hereditary cancers, namely breast, ovarian and prostate. The BRCA1 gene is located on the long (q) arm of chromosome 17 at 38Mb.

Probability of developing breast cancer by age (Chen et al. 2009) carriers Non-carriers

Probability of Developing Breast Cancer for BRCA1 carriers Average PersonBRCA1 Carrier Age %(1.7%-2.7%)18.8%(8.2%-2.3%) Age %( %)31.3%(14.3%-61.2%) Age %(6.0%-9.0%)45.4%(22.7%-74.3%) Age %(8.4%-12.5%)54.9%(30.4%-81.4%)

How was BRCA1 found?

Linkage Analysis 1/2 3/4 1/3 2/4 3/4 3/2 1/4 1/23/2

Assume disease gene (D) is rare with full penetrance 1/2 3/4 1/3 2/4 3/4 3/2 1/4 1/23/2 d/d D/d d/D d/d D/d d/dD/dd/dD/d

Linkage Analysis (continued) Disease allele (D) originally in chromosome with allele 3 How often does D co-segregate with allele 3 (non-recombinant)?

Assume disease gene (D) is rare with full penetrance 1/2 3/4 1/3 2/4 3/4 3/2 1/4 1/23/2 d/d D/d d/D d/d D/d d/dD/dd/dD/d

Linkage Analysis (continued) Disease allele (D) originally in chromosome with allele 3 How often does D co-segregate with allele 3 (non-recombinant)? –5 meiosises How often is D separated from allele 3 (recombinant)?

Assume disease gene (D) is rare with full penetrance 1/2 3/4 1/3 2/4 3/4 3/2 1/4 1/23/2 d/d D/d d/D d/d D/d d/dD/dd/dD/d

Linkage Analysis (continued) Disease allele (D) originally in chromosome with allele 3 How often does D co-segregate with allele 3 (non-recombinant)? –5 meiosises How often is D separated from allele 3 (recombinant)? –1 meiosis

Likelihood function Set a parameter θ which measures the distance between allele 3 and D by how frequently they recombine. The likelihood function L(θ) = (1- θ) 5 θ The maximum likelihood estimate is 1/6 LOD = log 10 L(1/6)/L(1/2) = 0.63 LOD for 7 families = 7x0.63 = 4.41

Issues Linkage analysis has narrowed down to a region about 1Mb. However it took another four years before the BRCA1 gene was mapped. Reduced penetrance, phenocopy, and genetic heterogeneity are among the factors that limit the success of the linkage analysis. Relevance of the findings to the population at large.

Genome-Wide Association Studies(GWAS) The Human Genome Project began in 1990 and completed in 2003.

Part of sequence from Chromosome 7 AGACGGAGTTTCACTCTTGTTGCCAACCTGGAGTGCAGTGGCGTGATCTCAGCTCACTGCACACTCCGCTTTCC/TGG TTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGTCACACACCACCACGCCCGGCTAATTTTTG TATTTTTAGTAGAGTTGGGGTTTCACCATGTTGGCCAGACTGGTCTCGAACTCCTGACCTTGTGATCCGCCAGCCTCT GCCTCCCAAAGAGCTGGGATTACAGGCGTGAGCCACCGCGCTCGGCCCTTTGCATCAATTTCTACAGCTTGTTTTCTT TGCCTGGACTTTACAAGTCTTACCTTGTTCTGCCTTCAGATATTTGTGTGGTCTCATTCTGGTGTGCCAGTAGCTAAAA ATCCATGATTTGCTCTCATCCCACTCCTGTTGTTCATCTCCTCTTATCTGGGGTCACA/CTATCTCTTCGTGATTGCATTC TGATCCCCAGTACTTAGCATGTGCGTAACAACTCTGCCTCTGCTTTCCCAGGCTGTTGATGGGGTGCTGTTCATGCCT CAGAAAAATGCATTGTAAGTTAAATTATTAAAGATTTTAAATATAGGAAAAAAGTAAGCAAACATAAGGAACAAAAAG GAAAGAACATGTATTCTAATCCATTATTTATTATACAATTAAGAAATTTGGAAACTTTAGATTACACTGCTTTTAGAGAT GGAGATGTAGTAAGTCTTTTACTCTTTACAAAATACATGTGTTAGCAATTTTGGGAAGAATAGTAACTCACCCGAACA GTGTAATGTGAATATGTCACTTACTAGAGGAAAGAAGGCACTTGAAAAACATCTCTAAACCGTATAAAAACAATTACA TCATAATGATGAAAACCCAAGGAATTTTTTTAGAAAACATTACCAGGGCTAATAACAAAGTAGAGCCACATGTCATTT ATCTTCCCTTTGTGTCTGTGTGAGAATTCTAGAGTTATATTTGTACATAGCATGGAAAAATGAGAGGCTAGTTTATCAA CTAGTTCATTTTTAAAAGTCTAACACATCCTAGGTATAGGTGAACTGTCCTCCTGCCAATGTATTGCACATTTGTGCCC AGATCCAGCATAGGGTATGTTTGCCATTTACAAACGTTTATGTCTTAAGAGAGGAAATATGAAGAGCAAAACAGTGCA TGCTGGAGAGAGAAAGCTGATACAAATATAAATGAAACAATAATTGGAAAAATTGAGAAACTACTCATTTTCTAAATT ACTCATGTATTTTCCTAGAATTTAAGTCTTTTAATTTTTGATAAATCCCAATGTGAGACAAGATAAGTATTAGTGATGGT ATGAGTAATTAATATCTGTTATATAATATTCATTTTCATAGTGGAAGAAATAAAATAAAGGTTGTGATGATTGTTGATTA TTTTTTCTAGAGGGGTTGTCAGGGAAAGAAATTGCTTTTTTTCATTCTCTCTTTCCACTAAGAAAGTTCAACTATTAATT TAGGCACATACAATAATTACTCCATTCTAAAATGCCAAAAAGGTAATTTAAGAGACTTAAAACTGAAAAGTTTAAGATA GTCACACTGAACTATATTAAAAAATCCACAGGGTGGTTGGAACTAGGCCTTATATTAAAGAGGCTAAAAATTGCAATA AGACCACAGGCTTTAAATATGGCTTTAAACTGTGAAAGGTGAAACTAGAATGAATAAAATCCTATAAATTTAAATCAA AAGAAAGAAACAAACTA/GAAATTAAAGTTAATATACAAGAATATGGTGGCCTGGATCTAGTGAACATATAGTAAAGA TAAAACAGAATATTTCTGAAAAATCCTGGAAAATCTTTTGGGCTAACCTGAAAACAGTATATTTGAAACTATTTTTAAA

Genome-Wide Association Study 550,000 SNPs on an array 2000 diseased individuals (colon cancer cases) and 2000 normal individuals Genotype all DNAs for 550,000 SNPs That is 2 billion genotyping!

GWAS on Type 2 Diabetes (Steinthorsdottir et al., 2007, Nature Genetics) CasesControls AA Aa aa Expected count for cases if AA is not associated with the disease. First, calculate the frequency of AA genotype in both cases and controls combined: freq = 3858/6669 = 57.85% For 1398 cases, we expect to see 1398*57.85%=809 individuals having genotype AA. CasesControls AA Aa aa

GWAS on Type 2 Diabetes The chi-square statistic is calculated by finding the difference between each observed and expected for each cell, squaring them, dividing each by the expected, and taking the sum of the results. ( )^2/809+( )^2/3049+… Compare the value to a standard chi-square distribution with degrees of freedom (# rows-1)*(# col -1) = 2. The p-value for this SNP is 6.772e-5.

Issues Too many SNPs! Identifying gene-gene and gene- environmental interactions are now possible.

Germline mutations account for only a small portion of cancer cases.

Summary The amount of the data that have been generated increases exponentially in the last few years. This creates a great demand on efficient and valid computational and statistical methods and tools for picking the needles from a haystack.