Linking Genetic Variation to Important Phenotypes

Slides:



Advertisements
Similar presentations
A quantitative trait locus not associated with cognitive ability in children: a failure to replicate Hill, L. et al.
Advertisements

CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Introduction to genomes & genome browsers
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Department of Biomedical Informatics Bioinformatics and Genetics Kun Huang Department of Biomedical Informatics OSUCCC Biomedical Informatics Shared Resource.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
CS177 Lecture 10 SNPs and Human Genetic Variation
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
A basic review of genetics Dr. Danny Chan Associate Professor Assistant Dean (Faculty of Medicine) Department of Biochemistry Department of Biochemistry.
Announcements: Proposal resubmission deadline 4/23 (Thursday).
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
An quick overview of human genetic linkage analysis
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Chapter 2 Genetic Variations. Introduction The human genome contains variations in base sequence from one individual to another. Some sequence variants.
Slide 1 of 24 Copyright Pearson Prentice Hall 12-4 Mutations 12–4 Mutations.
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Slide 1 of 24 VIII MUTATIONS Mutations Types of Mutations:
Analysis of Next Generation Sequence Data BIOST /06/2015.
End Show Slide 1 of 24 Copyright Pearson Prentice Hall 12-4 Mutations Outline 12–4: Mutations.
Slide 1 of 24 Copyright Pearson Prentice Hall Biology.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
KEY CONCEPT 8.5 Translation converts an mRNA message into a polypeptide, or protein.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Single Nucleotide Polymorphisms (SNPs
Common variation, GWAS & PLINK
Nucleotide variation in the human genome
Of Sea Urchins, Birds and Men
Genetic Testing for the Clinician
upstream vs. ORF binding and gene expression?
Global Variation in Copy Number in the Human Genome
School of Pharmacy, University of Nizwa
Gene Hunting: Design and statistics
BMI/CS 776 Spring 2018 Anthony Gitter
Mutations TSW identify and describe the various types of mutations and their effects.
Eukaryotic Gene Finding
Interpreting noncoding variants
Eliza Congdon, Russell A. Poldrack, Nelson B. Freimer  Neuron 
Mutations Learning Goal: To learn about what the causes, types and effects of mutations. Success Criteria: I know I am succeeding when I can… explain that.
Genomic alterations in breast cancer cell line MDA-MB-231.
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Exercise: Effect of the IL6R gene on IL-6R concentration
Mutations Section 6.2.
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
School of Pharmacy, University of Nizwa
Medical genomics BI420 Department of Biology, Boston College
Perspectives from Human Studies and Low Density Chip
BF528 - Genomic Variation and SNP Analysis
Medical genomics BI420 Department of Biology, Boston College
Copyright Pearson Prentice Hall
SNPs and CNPs By: David Wendel.
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Copyright Pearson Prentice Hall
Presentation transcript:

Linking Genetic Variation to Important Phenotypes BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by Mark Craven, Colin Dewey, and Anthony Gitter

Outline How does the genome vary between individuals? How do we identify associations between genetic variations and simple phenotypes/diseases? How do we identify associations between genetic variations and complex phenotypes/diseases?

Understanding Human Genetic Variation The “human genome” was determined by sequencing DNA from a small number of individuals (2001) The HapMap project (initiated in 2002) looked at polymorphisms in 270 individuals (Affymetrix GeneChip) The 1000 Genomes project (initiated in 2008) sequenced the genomes of 2500 individuals from diverse populations 23andMe genotyped its 1 millionth customer in 2015 Genomics England plans to sequence 100k whole genomes and link with medical records, 49k so far

Classes of Variants Single Nucleotide Polymorphisms (SNPs) Indels (insertions/deletions) Structural variants Formal definitions: https://www.snpedia.com/index.php/Glossary

Single Nucleotide Polymorphisms (SNPs) One nucleotide changes Variation occurs with some minimal frequency in a population Pronounced “snip” www.mdpi.com

Insertions and Deletions Black box: DNA template strand White box: newly replicated DNA Insertion: slippage inserts extra nucleotides Deletion: slippage excludes template nucleotides Forster et al. Proc. R. Soc. B 2015

Structural Variants Copy number variants (CNVs) Inversions Gain or loss or large genomic regions, even entire chromosomes Inversions DNA subsequence is reversed Translocations DNA subsequence is moved to a different chromosome

Genetic Recombination

Recombination Errors Lead to Copy Number Variants (CNVs)

1000 Genomes Project Project goal: produce a catalog of human variation down to variants that occur at >= 1% frequency over the genome

Understanding Associations Between Genetic Variation and Disease Genome-wide association study (GWAS) Gather some population of individuals Genotype each individual at polymorphic markers (usually SNPs) Test association between state at marker and some variable of interest (say disease) Adjust for multiple comparisons Phenotypes: observable traits

p = E-5 p = E-3 12

Wellcome Trust GWAS

Morning Person GWAS P = 5.0 × 10−8 Hu et al. Nature Communications 2016

Understanding Associations Between Genetic Variation and Disease International Cancer Genome Consortium Includes NIH’s The Cancer Genome Atlas Sequencing DNA from 500 tumor samples for each of 50 different cancers Goal is to distinguish drivers (mutations that cause and accelerate cancers) from passengers (mutations that are byproducts of cancer’s growth)

A Circos Plot

Some Cancer Genomes

Understanding Associations Between Genetic Variation and Complex Phenotypes Quantitative trait loci (QTL) mapping Gather some population of individuals Genotype each individual at polymorphic markers Map quantitative trait(s) of interest to chromosomal locations that seem to explain variation in trait

QTL Mapping Example

QTL Mapping Example QTL mapping of mouse blood pressure, heart rate [Sugiyama et al., Broman et al.] quantitative trait position in the genome Logarithm of Odds

QTL Example: Genotype-Tissue Expression Project (GTEx) Expression QTL (eQTL): traits are expression levels of various genes Map genotype to gene expression in different human tissues

QTL Example: GTEx https://www.genome.gov/27543767/

GWAS Versus QTL Both associate genotype with phenotype GWAS pertains to discrete phenotypes For example, disease status is binary QTL pertains to quantitative (continuous) phenotypes Height Gene expression Splicing events Metabolite abundance

Determining Association is Not Enough A simple case: CFTR (Cystic Fibrosis Transmembrane Conductance Regulator)

Many Measured SNPs Not in Coding Regions Genes encoding CD40 and CD40L with relative positions of the SNPs studied Chadha et al. Eur J Hum Genet 2005

Computational Problems Assembly and alignment of thousands of genomes Data structures to capture extensive variation Identifying functional roles of markers of interest (which genes/pathways does a mutation affect and how?) Identifying interactions in multi-allelic diseases (which combinations of mutations lead to a disease state?) Identifying genetic/environmental interactions that lead to disease Inferring network models that exploit all sources of evidence: genotype, expression, metabolic, etc. Detecting large structural variants