The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.

Slides:



Advertisements
Similar presentations
Combinatorial Algorithms for Haplotype Inference Pure Parsimony Dan Gusfield.
Advertisements

Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Note that the genetic map is different for men and women Recombination frequency is higher in meiosis in women.
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Efficient Algorithms for Genome-wide TagSNP Selection across Populations via the Linkage Disequilibrium Criterion Authors: Lan Liu, Yonghui Wu, Stefano.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
MALD Mapping by Admixture Linkage Disequilibrium.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Applying haplotype models to association study design Natalie Castellana June 7, 2005.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
A coalescent computational platform for tagging marker selection for clinical studies Gabor T. Marth Department of Biology, Boston College
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
Inference of Genealogies for Recombinant SNP Sequences in Populations Yufeng Wu Computer Science and Engineering Department University of Connecticut
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Simple Nucleotide.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Medical variations Gabor T. Marth Boston College Biology Department BI543 Fall 2013 February 5, 2013.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Conservation of genomic segments (haplotypes): The “HapMap” n In populations, it appears the the linear order of alleles (“haplotype”) is conserved in.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
CS177 Lecture 10 SNPs and Human Genetic Variation
SNPs and the Human Genome Prof. Sorin Istrail. A SNP is a position in a genome at which two or more different bases occur in the population, each with.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Genes in human populations n Population genetics: focus on allele frequencies (the “gene pool” = all the gametes in a big pot!) n Hardy-Weinberg calculations.
Encode variation analysis. Analysis goals Quantify genetic variation in ENCODE regions Detect selective constraint in ENCODE features Develop rules for.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Detection of positive selection in humane genome.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
1 Balanced Translocation detected by FISH. 2 Red- Chrom. 5 probe Green- Chrom. 8 probe.
The HapMap Project and Haploview
The International Consortium. The International HapMap Project.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
A genetic polymorphism in the Drosophila insulin receptor suggests adaptation to climate variation across continents Annalise Paaby a, Mark Blacket b,
Signals of natural selection in the HapMap project data The International HapMap Consortium Gil McVean Department of Statistics, Oxford University.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
NCSU Summer Institute of Statistical Genetics, Raleigh 2004: Genome Science Session 3: Genomic Variation.
The Haplotype Blocks Problems Wu Ling-Yun
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Common variation, GWAS & PLINK
MULTIPLE GENES AND QUANTITATIVE TRAITS
Complex disease and long-range regulation: Interpreting the GWAS using a Dual Colour Transgenesis Strategy in Zebrafish.
Population Genetics As we all have an interest in genomic epidemiology we are likely all either in the process of sampling and ananlysising genetic data.
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Recombination (Crossing Over)
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
MULTIPLE GENES AND QUANTITATIVE TRAITS
Genome-wide Association Studies
Linking Genetic Variation to Important Phenotypes
BI820 – Seminar in Quantitative and Computational Problems in Genomics
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
Presentation transcript:

The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis

The Human Encode project Sequencing the regions in multiple mammals. Identifying genes by conservation (twinscan), by promoter signatures, etc.. Identifying motifs by ChIP/Chip, and interspecific conservation Identifying motifs by phylogenetic shadowing Seeks to identify all functional genetic elements in 1% of the human genome by:

What ENCODE won’t do Identify the genes that are important in determining variation in risk of common complex diseases.

Low resolution of linkage methods Typical confidence in gene location is Mbp.

Equilibrium relation between LD and recombination rate E(r 2 )

Linkage disequilibrium is rare beyond 100 kb or so 533 regions, The SNP Consortium data

Beyond 500 kb, there is almost zero Linkage disequilibrium

…so observing LD means the sites are likely to be close together

Empirical observation of linkage disequilibrium Reich et al. (2001 Nature 411: ) |D’| kb

LD in the human genome has a “block” structure

The HapMap project Genotype 2.5 million SNPs across the genome in 90 individuals from each Europe, Asia and Africa. Identify regions of high and low linkage disequilibrium. Identify SNPs that are most informative for the neighboring regions in the genome. All of this is in the hope that HapMap SNPs will help to map genes for complex traits by association tests.

HapMap woes HapMap is a big gamble – no model system. Assumes variation in risk of complex diseases is driven by a few simple, common SNPs of large effect.

The intersection between human ENCODE and HapMap Resequence the ENCODE regions in 48 humans from the HapMap project. Genotype all SNPs discovered in all 270 HapMap individuals. Use the ENCODE regions to ask whether higher density HapMap would be more informative, and whether low density HapMap distorts the picture.

ENCODE needs to have measurements of functional variation Not feasible in human ENCODE study. But it would be feasible in flies. The Drosophila community has a long history of innovation in studies of the genetics of complex traits.

The role of variation in a Drosophila ENCODE project Nearly all functional sites will exhibit variation in sequence. Do they exhibit variation in function? Natural populations provide an important source of mutations for understanding function.

Advantages of a Drosophila ENCODE project Validation – we can do efficient experiments to test function. Flies have more complex phenotypes. Experimental studies are needed for the challenge of assembling gene regulatory networks. The Drosophila community is best prepared to deal with VARIATION at multiple levels.