BI820 – Seminar in Quantitative and Computational Problems in Genomics

Slides:



Advertisements
Similar presentations
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Advertisements

CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Bioinformatics for high-throughput DNA sequencing Gabor Marth Boston College Biology New grad student orientation Boston College September 8, 2009.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
A coalescent computational platform for tagging marker selection for clinical studies Gabor T. Marth Department of Biology, Boston College
General methods of SNP discovery: PolyBayes Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
Polymorphism Structure of the Human Genome Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
The informatics of SNPs and haplotypes Gabor T. Marth Department of Biology, Boston College Cold Spring Harbor Laboratory Advanced Bioinformatics.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
BI420 – Course information Web site: Instructor: Gabor Marth Teaching.
The informatics of SNPs and haplotypes Gabor T. Marth Department of Biology, Boston College Cold Spring Harbor Laboratory Advanced Bioinformatics.
Inference of Genealogies for Recombinant SNP Sequences in Populations Yufeng Wu Computer Science and Engineering Department University of Connecticut
Lecture X.X1. 2 The informatics of SNPs and Haplotypes Gabor T. Marth Department of Biology, Boston College
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Polymorphism discovery informatics Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
Sequence Variation Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Medical variations Gabor T. Marth Boston College Biology Department BI543 Fall 2013 February 5, 2013.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Announcements: Proposal resubmission deadline 4/23 (Thursday).
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
What is a SNP?. Lecture topics What is a SNP? What use are they? SNP discovery SNP genotyping Introduction to Linkage Disequilibrium.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Utility of Genotyping in Pharmaceutical Target (gene) Discovery and Drug Response Anne Westcott EST-Informatics.
Lecture 7.01 The informatics of SNPs and haplotypes Gabor T. Marth Department of Biology, Boston College CGDN Bioinformatics Workshop June.
The International Consortium. The International HapMap Project.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Single Nucleotide Polymorphisms (SNPs
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Common variation, GWAS & PLINK
Gil McVean Department of Statistics
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Of Sea Urchins, Birds and Men
Pharmacogenetics: Implications of race and ethnicity on defining genetic profiles for personalized medicine  Victor E. Ortega, MD, Deborah A. Meyers,
Human Cells Human genomics
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
High-resolution haplotype structure in the human genome
Genome sequencing informatics
Patterns of Linkage Disequilibrium in the Human Genome
Power to detect QTL Association
Statistical Modeling of Ancestral Processes
Discovery tools for human genetic variations
Databases BI420 – Introduction to Bioinformatics Gabor T. Marth
Genome organization and Bioinformatics
Incorporating changing population size into the coalescent
Caroline Durrant, Krina T. Zondervan, Lon R
Medical genomics BI420 Department of Biology, Boston College
Databases BI420 – Introduction to Bioinformatics Gabor T. Marth
Medical genomics BI420 Department of Biology, Boston College
Research for medical discovery at the Computational Genomics Laboratory at Boston College Biology Gabor T. Marth Department of Biology, Boston College.
Presentation transcript:

BI820 – Seminar in Quantitative and Computational Problems in Genomics Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA 02467

Sequence variations Human Genome Project produced a reference genome sequence that is 99.9% common to each human being sequence variations make our genetic makeup unique SNP Single-nucleotide polymorphisms (SNPs) are most abundant, but other types of variations exist and are important

Why do we care about variations? phenotypic differences inherited diseases demographic history

How do we find polymorphisms? look at multiple sequences from the same genome region diverse sequence resources can be used EST WGS BAC diversion: sequencing informatics

SNP discovery -- Methods Sequence clustering Cluster refinement Multiple alignment SNP detection

SNP discovery – Computer tools

SNP discovery – Mining Projects ~ 30,000 clones >CloneX ACGTTGCAACGT GTCAATGCTGCA >CloneY ACGTTGCAACGT GTCAATGCTGCA 25,901 clones (7,122 finished, 18,779 draft with basequality values) 21,020 clone overlaps (124,356 fragment overlaps) ACCTAGGAGACTGAACTTACTG 507,152 high-quality candidate SNPs (validation rate 83-96%) Marth et al., Nature Genetics 2001 ACCTAGGAGACCGAACTTACTG

SNP databases and characteristics access to variation data SNP properties reliability of information characterizing known polymorphic sites in sample collections – genotyping

Where do variations come from? sequence variations are the result of mutation events TAAAAAT TAACAAT TAAAAAT TAACAAT MRCA mutations are propagated down through generations TAAAAAT TAACAAT

Mutation rate higher mutation rate (µ) gives rise to more SNPS MRCA accgttatgtaga accgctatgtaga MRCA actgttatgtaga accgctatataga MRCA

Recombination accgttatgtaga accgttatgtaga accgttatgtaga accgttatgtaga

Demographic history large (effective) population size N small (effective) population size N different world populations have varying long-term effective population sizes (e.g. African N is larger than European)

Modeling history stationary collapse expansion bottleneck past present MD (simulation) AFS (direct form)

Ancestral inference modest but uninterrupted expansion bottleneck

The effects and signatures of selection selective mutations influence the genealogy itself; in the case of neutral mutations the processes of mutation and genealogy are decoupled

Allelic association and haplotype structure “linkage disequilibrium” “haplotype blocks”

Computer simulations: the Coalescent

? Medical utility? clinical phenotype molecular markers functional understanding

Mapping disease-causing loci genetic linkage association between allele and phenotype

Forensic applications