SeattleSNPs Variation Discovery Resource Materials prepared by: Mary E. Mangan, PhD www.openhelix.com Updated: Q1 2011 Version 1.

Slides:



Advertisements
Similar presentations
Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.
Advertisements

Efficient Algorithms for Genome-wide TagSNP Selection across Populations via the Linkage Disequilibrium Criterion Authors: Lan Liu, Yonghui Wu, Stefano.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Fatchiyah, PhD Dept Biology UB Fatchiyah.lecture.ub.ac.id
Outline to SNP bioinformatics lecture
Variation Workshop University of Washington March 20-21, 2006 Sponsored by the NHLBI.
SNP Resources: Finding SNPs Discovery and Databases Mark J. Rieder, PhD SeattleSNPs Workshop March 20-21, 2006.
Medical Resequencing Debbie Nickerson Department of Genome Sciences University of Washington.
Copyright OpenHelix. No use or reproduction without express written consent1 Organization of genomic data… Genome backbone: base position number sequence.
Copyright OpenHelix. No use or reproduction without express written consent1.
SNP Resources: Finding SNPs, Databases and Data Extraction Debbie Nickerson
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
Picking SNPs Application to Association Studies Dana Crawford, PhD SeattleSNPs PGA University of Washington March 20, 2006.
SNP Resources: Finding SNPs, Databases and Data Extraction Debbie Nickerson NIEHS SNPs Workshop.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD Robert J. Livingston, PhD NIEHS Variation Workshop January 30-31, 2005.
SNPs DNA differs between humans by 0.1%, (1 in 1300 bases) This means that you can map DNA variation to around 10,000,000 sites in the genome Almost all.
SNP Selection University of Louisville Center for Genetics and Molecular Medicine January 10, 2008 Dana Crawford, PhD Vanderbilt University Center for.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD SeattleSNPs Variation Workshop March 20-21, 2006.
Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.
Reading the Blueprint of Life
Copyright OpenHelix. No use or reproduction without express written consent1.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
Copyright OpenHelix. No use or reproduction without express written consent1.
UCSC Genome Browser 1. The Progress 2 Database and Tool Explosion : 230 databases and tools 1996 : first annual compilation of databases and tools.
Copyright OpenHelix. No use or reproduction without express written consent1.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Copyright OpenHelix. No use or reproduction without express written consent1.
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Copyright OpenHelix. No use or reproduction without express written consent1.
Introduction to the Gramene Genetic Diversity module 5/2010 Build #31.
1 of 32 Sequence Variation in Ensembl. 2 of 32 Outline SNPs SNPs in Ensembl Haplotypes & Linkage Disequilibrium SNPs in BioMart HapMap project Strain-specific.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
The UCSC Table Browser & Custom Tracks Advanced searching and discovery using the UCSC Table Browser and Custom Tracks Osvaldo Graña CNIO Bioinformatics.
Copyright OpenHelix. No use or reproduction without express written consent1.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
GVS: Genome Variation Server Materials prepared by: Warren C. Lathe, PhD Updated: Q Version 2.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Motivations to study human genetic variation
Copyright OpenHelix. No use or reproduction without express written consent1.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Notes: Human Genome (Right side page)
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Copyright OpenHelix. No use or reproduction without express written consent1.
Next Generation Sequencing
Of Sea Urchins, Birds and Men
Consideration for Planning a Candidate Gene Association Study With TagSNPs Shehnaz K. Hussain, PhD, ScM Epidemiology 243: Molecular.
A modest but significant effect of CGB5 gene promoter polymorphisms in modulating the risk of recurrent miscarriage  Kristiina Rull, M.D., Ph.D., Ole.
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
Presentation transcript:

SeattleSNPs Variation Discovery Resource Materials prepared by: Mary E. Mangan, PhD Updated: Q Version 1

Copyright OpenHelix. No use or reproduction without express written consent2 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent3 Introduction Human Genome Project: the “reference” sequence Variation among humans is informative Projects to identify variations have been launched From From GenBank: MapViewer: Ensembl: UCSC Genome Browser:

Copyright OpenHelix. No use or reproduction without express written consent4 SNPs: Single Nucleotide Polymorphisms SNP: Single Nucleotide Polymorphism SNPs may be: A single nucleotide change (A/G, as shown above) A small insertion or deletion (indels) SNPs may have no impact, or may cause disease SNPs can tell us about inheritance patterns Human ApoE gene segment, rs SNP GTACCGCGGCGC GTACCACGGCGC Reference sequence: Variant found: HIS ARG

Copyright OpenHelix. No use or reproduction without express written consent5 NHLBI Program for Genomic Applications NHLBI has special sub-programs, like PGA PGA mission: resources, reagents, educate, disseminate

Copyright OpenHelix. No use or reproduction without express written consent6 SeattleSNPs SeattleSNPs mission: identify, genotype, model SNPs Focus: inflammatory responses in humans Provides data and workshops, available to all Genotyping services also available SNP Discovery Candidate Gene Reqsequencing and Analysis SNP Genotyping Collaborative Genotyping Large-scale Association Studies SeattleSNPs Education Workshops Scientific Presenations

Copyright OpenHelix. No use or reproduction without express written consent7 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent8 SeattleSNPs Team Team is lead by Drs. Deborah Nickerson and Mark Rieder Many people contribute to providing the data, software and support; see publications ve&dopt=AbstractPlus&list_uids= &query_hl=3&itool=pubmed _docsum

Copyright OpenHelix. No use or reproduction without express written consent9 Candidate Gene Selection Select gene of interest Obtain longest sequenceRe-sequence genomic samples Heart, lung, blood research pathways of interest Search for longest gene model sequence Resequencing is performed

Copyright OpenHelix. No use or reproduction without express written consent10 Genomic Samples Obtained Genomic samples Find polymorphisms Provide data visualization, analysis and downloads Protocols: Obtain genomic samples (now using HapMap) Sequence samples, identify polymorphismsAssemble data for viewing and downloading

Copyright OpenHelix. No use or reproduction without express written consent11 Sequence each end of the fragment. Sequence Amplify DNA 5’3’ Customized software tools Primer design algorithm Custom LIMS and database to track all aspects of data production and quality Robotics used to automate sample handling Base-calling Quality determination Contig assembly Final quality determination Sequence viewing Polymorphism tagging Polymorphism reporting Individual genotyping Polymorphism detection PolyPhred Consed Analysis Phred Phrap Data publication to WWW Sequencing Production & Data Analysis Pipeline

Copyright OpenHelix. No use or reproduction without express written consent12 Re-sequencing Pipeline Gene design-automated primer picking software All approaches 2 kb upstream of first exon, 2 kb downstream of last exon Gene is < 25 kb - Full: complete re-sequencing Gene is > 25 kb i.e. exons, conserved non-coding sequences, and sampling across intron sequences Prior to amplification and re-sequencing, problematic GC-rich regions, alu repeats, polynucleotide tracts, and pseudogenes identified Sequence in base-pairs Mapping of PCR primers Mapping of Exons Mapping of PCRs

Copyright OpenHelix. No use or reproduction without express written consent13 Re-sequencing Pipeline Universal primer sequences standardize sequencing reaction conditions Standard dye terminator sequencing chemistry Optimized for reaction volume and dilution Automated sequencing capillary electrophoresis

Copyright OpenHelix. No use or reproduction without express written consent14 Data Analysis Sequence each end of the fragment. Sequence Amplify DNA 5’3’ Customized software tools Primer design algorithm Custom LIMS and database to track all aspects of data production and quality Robotics used to automate sample handling Base-calling Quality determination Contig assembly Final quality determination Sequence viewing Polymorphism tagging Polymorphism reporting Individual genotyping Polymorphism detection PolyPhred Consed Analysis Phred Phrap Data publication to WWW

Copyright OpenHelix. No use or reproduction without express written consent15 Polymorphism Identification and Analysis individuals in rows SNPSNP

Copyright OpenHelix. No use or reproduction without express written consent16 Homozygous C/C Heterozygous C/T Homozygous T/T Polymorphisms

Copyright OpenHelix. No use or reproduction without express written consent17 Program for Early Career Investigators Apply for free genotyping and analysis

Copyright OpenHelix. No use or reproduction without express written consent18 In This Tutorial We will examine how to find genes of interest We will explore and understand the data and displays

Copyright OpenHelix. No use or reproduction without express written consent19 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent20 Finding Data from SeattleSNPs 2 main strategies Whole site search Gene lists site search for anything Browsers: Firefox Safari Explorer

Copyright OpenHelix. No use or reproduction without express written consent21 Sequencing Resources for Data Access Access lists of genes Access summaries

Copyright OpenHelix. No use or reproduction without express written consent22 Finding Genes From the Sequenced Genes list Find genes and panel info

Copyright OpenHelix. No use or reproduction without express written consent23 Gene List Information Gene list options

Copyright OpenHelix. No use or reproduction without express written consent24 Coriell DNA Panels p1: Coriell CEPH/AA panel p2: Coriell HapMap European/African panel from HapMap Yoruba in Ibadan Nigeria CEPH European Ancestry Utah Centre d’Etude du Polymorphisme Humain African American DNA

Copyright OpenHelix. No use or reproduction without express written consent25 Panels Integrate with Other Data p1 panels integrate with Perlegen data Hinds et al. Science Whole-genome patterns of common DNA variation in three human populations. p2 panels integrate with HapMap data The International HapMap Consortium. A haplotype map of the human genome. Nature p1 DNA panel - Perlegen Integration (1.58 million SNPs) = SeattleSNPs (1/200 bp)= Perlegen SNPs (~1/3000 bp) p2 DNA panel - HapMap Integration (~3.5 million SNPs) = SeattleSNPs (1/200 bp)= HapMap SNPs (~1/1000 bp)

Copyright OpenHelix. No use or reproduction without express written consent26 Panels Integrate with Other Data SeattleSNPs provides much more density of SNPs p1 panels integrate with Perlegen data p2 panels integrate with HapMap data SeattleSNPs DataHapMap

Copyright OpenHelix. No use or reproduction without express written consent27 Gene List Information Gene list options

Copyright OpenHelix. No use or reproduction without express written consent28 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent29 SeattleSNPs Displays Gene-specific page Image of gene structure and SNPs Links to other resources and download Access gene-specific details and data gene, location

Copyright OpenHelix. No use or reproduction without express written consent30 Understanding SeattleSNPs Images Gene structure: exons, introns, UTRs coordinates based on their GenBank submission of this gene SNPs Controls for changing the view SeattleSNPs coordinates Gene structureSNPs change view SNP choices

Copyright OpenHelix. No use or reproduction without express written consent31 Gene-Specific Links Links to other resources UCSC Custom track shows the SeattleSNPs  custom tracks 

Copyright OpenHelix. No use or reproduction without express written consent32 SeattleSNPs Data Types Alox12: download all data Populations genotyped for this gene

Copyright OpenHelix. No use or reproduction without express written consent33 SeattleSNPs Data Types Documentation for all data types on the left Links to all the data on the right DocumentationData

Copyright OpenHelix. No use or reproduction without express written consent34 Mapping Data Mapping data types GenBank record for SeattleSNP coordinate system

Copyright OpenHelix. No use or reproduction without express written consent35 Mapping Data Mapping data types GenBank record for SeattleSNP coordinate system

Copyright OpenHelix. No use or reproduction without express written consent36 Genotyping Data Genotyping data for individuals individuals site of variation, 5’  3’

Copyright OpenHelix. No use or reproduction without express written consent37 Genotyping Data Genotyping data for individuals

Copyright OpenHelix. No use or reproduction without express written consent38 Linkage Data Linkage data “Tag” SNPs that can be used for genotyping See Carlson et al., Am. J. Hum. Genet., 74: , 2004

Copyright OpenHelix. No use or reproduction without express written consent39 LDSelect: Using LD to Pick tagSNPs LDSelect Uses SNP discovery data (not haplotypes) Finds all correlated SNPs to minimize the total number Maintains genetic diversity of locus Carlson et al. AJHG (2004)

Copyright OpenHelix. No use or reproduction without express written consent40 “…a unique combination of genetic markers present in a chromosome.” pg 57 in Hartl & Clark, 1997 Multi-SNP Correlations (aka Haplotypes)

Copyright OpenHelix. No use or reproduction without express written consent41 Haplotyping Data PHASE algorithm: infer haplotype statistically Stephens, et al. Am J Hum Genet PHASE: stephens/software.html

Copyright OpenHelix. No use or reproduction without express written consent42 Haplotyping Data Haplotyping data Visual Haplotype: upload your own data OR select gene from list upload data

Copyright OpenHelix. No use or reproduction without express written consent43 Haplotyping Data Visual Haplotype data output Alox12 individuals site of variation

Copyright OpenHelix. No use or reproduction without express written consent44 Predictive Analysis Predictive analysis on non-synonymous SNPs SIFT: Ng and Henikoff, Gen. Research, 12: , 2002 PolyPhen: Ramensky, et al., NAR 30:17: , 2002 Sorting Intolerant From Tolerant ( PolyPhen (

Copyright OpenHelix. No use or reproduction without express written consent45 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent46 Downloading SeattleSNPs Data Download all the data Can also download just 1 gene from the gene page Usage/Citation policy: all data subsets gene page

Copyright OpenHelix. No use or reproduction without express written consent47 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent48 Workshop Information Workshops offered in Seattle Slides, materials available

Copyright OpenHelix. No use or reproduction without express written consent49 Traveling Workshops Bring SeattleSNPs to your site

Copyright OpenHelix. No use or reproduction without express written consent50 Recorded Tutorial and Quick Reference Cards Recorded tutorial Download materials Order Quick Reference Cards

Copyright OpenHelix. No use or reproduction without express written consent51 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent52 Summary SeattleSNPs PGA program; focus on heart, lung, blood genes Genotypes, haplotypes, web access and downloads Educational resources

Copyright OpenHelix. No use or reproduction without express written consent53 SNP Data SeattleSNPs data available in several ways Other projects exist to identify SNP variations Project scope, population, and methods may vary dbSNP database +

Copyright OpenHelix. No use or reproduction without express written consent54 SeattleSNPs Agenda Introduction SeattleSNPs Process Basic Searches Understanding the Displays Downloads Education Summary Exercises

Copyright OpenHelix. No use or reproduction without express written consent55