Outline to SNP bioinformatics lecture

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
A Lite Introduction to (Bioinformatics and) Comparative Genomics Chris Mueller August 10, 2004.
1 of 25 Sequence Variation in Ensembl. 2 of 25 Outline SNPs SNPs in Ensembl Linkage disequilibrium SNPs in BioMart DAS sources.
Fatchiyah, PhD Dept Biology UB Fatchiyah.lecture.ub.ac.id
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Single Nucleotide Polymorphisms Jennifer Lyon Eskind Biomedical Library May 1, 2009 CRC Workshop Series.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
FINAL EXAM: TAKE-HOME Assessment of Significance in Cancer Gene SNPs.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
SNP Resources: Finding SNPs, Databases and Data Extraction Debbie Nickerson NIEHS SNPs Workshop.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD Robert J. Livingston, PhD NIEHS Variation Workshop January 30-31, 2005.
Whole Genome Polymorphism Analysis of Regulatory Elements in Breast Cancer AAGTCGGTGATGATTGGGACTGCTCT[C/T]AACACAAGCGAGATGAAGAAACTGA Jacob Biesinger Dr.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
PolyPhen and SIFT: Tools for predicting functional effects of SNPs Epi 244 Spring 2009 Sam S. Oh.
Polymorphism discovery informatics Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
Sequence Variation Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD SeattleSNPs Variation Workshop March 20-21, 2006.
Genetic Variations Resources May 15, 2013 Ansuman Chattopadhyay, PhD, Head Molecular Biology Information Service Health Sciences Library System University.
Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.
Genome Variations & GWAS
DbSNP: the NCBI database of genetic variation S. T. Sherry, M.H. Ward, M. Kholodov, J. Baker, L. Phan, E. M. Smigielski and K. Sirotkin, Nucleic Acids.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Simple Nucleotide.
MES Genome Informatics I - Lecture VIII. Interpreting variants Sangwoo Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute,
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
A Lite Introduction to (Bioinformatics and) Comparative Genomics Chris Mueller November 18, 2004 Based on the Genomics in Biomedical Research course at.
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Development and Application of SNP markers in Genome of shrimp (Fenneropenaeus chinensis) Jianyong Zhang Marine Biology.
Korea BioInformation Center Byoung-Chul Kim
SeattleSNPs Variation Discovery Resource Materials prepared by: Mary E. Mangan, PhD Updated: Q Version 1.
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
1 of 32 Sequence Variation in Ensembl. 2 of 32 Outline SNPs SNPs in Ensembl Haplotypes & Linkage Disequilibrium SNPs in BioMart HapMap project Strain-specific.
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
What is a SNP?. Lecture topics What is a SNP? What use are they? SNP discovery SNP genotyping Introduction to Linkage Disequilibrium.
Overview of Bioinformatics 1 Module Denis Manley..
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
Single nucleotide polymorphisms and Large scale variation
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Research proposal 2009 信息技术会议 Bioinformatics Analysis & Identification of non-Synonymous SNPs in Candidate Genes for Ascites College of Animal Husbandry.
Notes: Human Genome (Right side page)
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
The Haplotype Blocks Problems Wu Ling-Yun
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Next Generation Sequencing
Consideration for Planning a Candidate Gene Association Study With TagSNPs Shehnaz K. Hussain, PhD, ScM Epidemiology 243: Molecular.
Sequence Alignments—part 2
School of Pharmacy, University of Nizwa
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Gene Hunting: Design and statistics
Linking Genetic Variation to Important Phenotypes
By Michael Fraczek and Caden Boyer
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
“TaqMan genotyping Assay’’
MUTATIONS.
School of Pharmacy, University of Nizwa
Ivan P. Gorlov, Olga Y. Gorlova, Shamil R. Sunyaev, Margaret R
Gene Safari (Biological Databases)
Presentation transcript:

Outline to SNP bioinformatics lecture Brief introduction SNPs in cell biology SNP discovery SNP assessment SNP databases SNPs in genome browsers

Single Nucleotide Polymorphisms Must be present in at least 1% of the population Most (90%) of the sequence variation between two genomes Two humans differ 0.1% 1/300 bp in the human genome Lower in coding regions 10 million in the human genome

Categories of SNPs Missense/Non-synonymous Nonsense Changes an amino acid About half of the SNPs in coding sequence Can alter function and or structure of the protein Cause of most monogenetic diseases Hemochromatosis (HFE) Cystic fibrosis (CFTR) Hemophilia (F8) Nonsense Introduces a stop codon Same consequences as non-synonymous

Categories of SNPs Synonymous Non-coding Does not alter the coding sequence May alter splicing Non-coding Can be located in promoter or regulatory regions Can impact the expression of the gene All SNPs can be used as markers

Use to cell biologist Association studies Causative SNPs Use SNPs as markers to find regions associated with phenotype Causative SNPs Altered protein Altered expression Regions of altered conservation between strains/species/individuals Evolutionary analyses Etc…

SNP discovery Discovery of SNPs usually from sequencing Discovery is based on separating sequencing errors from ’real’ differences and assessing the frequency in the sequenced population Separation of parologous sequences Validation, genotyping

SNP discovery resources Polybayes SNP discovery in redundant sequences Polyphred SNP discovery based on phred/phrap/consed NovoSNP Graphical identification of SNPs

Example: PolyPhred Detects heterozygotes from chromatograms Runs together with phred/phrap/consed Command line

SNP assessment Assess SNPs for functional effects Non-synonymous SNPs Conservation across species Amino acid properties Protein structure Transmembrane regions, signal peptides etc.

SNP assessment resources SIFT PolyPhen Pmut SNPs3D PANTHER PSEC TopoSNP MAPP Etc

Example: SIFT Sorting Intolerant From Tolerant Builds an alignment of similar sequences Calculates a score based on the aa in the alignment Takes the environment into account Takes the properties of the aa into account Does not use structure

SNP databases Maps of SNPs in human, mouse, etc Haplotype maps Functional SNPs Disease databases

SNP databases dbSNP F-SNP HGVBase PolyDoms OMIN Etc…

Example: dbSNP 50 million submissions 18 million clusters 7 million in genes 44 organisms 91 million SNPs submitted

dbSNP Search for SNPs, location, etc Information submitted on method, flanking sequence, alleles, population, sample size, validation etc Information computed on SNPs at same location including functional analysis, population diversity etc

SNPs in genome browsers Ensembl UCSC

Example: UCSC

HapMap Aim: a haplotype map of the human genome describing common patterns of sequence variation A haplotype map is based on alleles of SNPs close together are inherited together HapMap will identify which SNPs are informative in mapping, reducing the number of SNPs to genotype by a magnitude Populations from Asia, Europe and Africa 2nd generation map with over 3.1 million SNPs

Ng PC, Henikoff S. Predicting the effects of amino acid substitutions on protein function. Annu Rev Genomics Hum Genet. 2006;7:61-80. Review. Bhatti P, Church DM, Rutter JL, Struewing JP, Sigurdson AJ. Candidate single nucleotide polymorphism selection using publicly available tools: a guide for epidemiologists. Am J Epidemiol. 2006 Oct 15;164(8):794-804. Epub 2006 Aug 21. Clifford RJ, Edmonson MN, Nguyen C, Scherpbier T, Hu Y, Buetow KH. Bioinformatics tools for single nucleotide polymorphism discovery and analysis. Ann N Y Acad Sci. 2004 May;1020:101-9. Review. The International HapMap Consortium. A second generation human haplotype map of over 3.1 million SNPs. Nature 449, 851-861. 2007.