SeattleSNPs Variation Discovery Resource Materials prepared by: Mary E. Mangan, PhD Updated: Q Version 1
Copyright OpenHelix. No use or reproduction without express written consent.
SeattleSNPs Agenda: Introduction; SeattleSNPs Process; Basic Searches; Understanding the Displays; Downloads; Education; Summary; Exercises
Introduction: The Human Genome Project produced the “reference” sequence. Variation among humans is informative. Projects to identify variations have been launched. Sources shown: GenBank; MapViewer; Ensembl; UCSC Genome Browser.
SNPs: Single Nucleotide Polymorphisms. A SNP may be a single nucleotide change (A/G, as in the example) or a small insertion or deletion (indel). SNPs may have no impact, or may cause disease. SNPs can tell us about inheritance patterns. Example: human ApoE gene segment, rs SNP; sequences GTACCGCGGCGC / GTACCACGGCGC (reference sequence / variant found); amino acids HIS / ARG.
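The ApoE example can be checked with a few lines of Python. This is a minimal sketch, not part of the original slides: the function names are hypothetical, and the reading-frame offset of 1 is an assumption, chosen only because it yields the Arg and His codons named on the slide. The transcript does not make clear which of the two sequences is the reference, so the code simply calls them `first` and `second`.

```python
# Two ApoE segments from the slide; which one is the reference is not
# unambiguous in the transcript, so they are named neutrally.
first = "GTACCGCGGCGC"
second = "GTACCACGGCGC"

# Only the two codons needed for this example (standard genetic code).
CODON_TO_AMINO_ACID = {"CGC": "Arg", "CAC": "His"}


def find_differences(seq_a, seq_b):
    """Return (0-based position, base in seq_a, base in seq_b) for each mismatch."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return [(i, a, b) for i, (a, b) in enumerate(zip(seq_a, seq_b)) if a != b]


def codon_at(seq, position, frame_offset):
    """Return the codon containing `position`, given an assumed frame offset."""
    start = position - ((position - frame_offset) % 3)
    return seq[start:start + 3]


differences = find_differences(first, second)
position = differences[0][0]

# ASSUMPTION: frame offset 1, picked because it reproduces the slide's Arg/His labels.
codon_first = codon_at(first, position, frame_offset=1)
codon_second = codon_at(second, position, frame_offset=1)

print(differences)
print(codon_first, CODON_TO_AMINO_ACID[codon_first])
print(codon_second, CODON_TO_AMINO_ACID[codon_second])
```

The two sequences differ at a single position (G versus A), which under the assumed frame changes an arginine codon to a histidine codon.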
NHLBI Program for Genomic Applications (PGA): NHLBI has special sub-programs, such as PGA. PGA mission: resources, reagents, educate, disseminate.
SeattleSNPs: Mission: identify, genotype, and model SNPs. Focus: inflammatory responses in humans. Provides data and workshops, available to all. Genotyping services are also available. Activities: SNP Discovery (Candidate Gene Resequencing and Analysis); SNP Genotyping (Collaborative Genotyping; Large-scale Association Studies); SeattleSNPs Education (Workshops; Scientific Presentations).
SeattleSNPs Team: The team is led by Drs. Deborah Nickerson and Mark Rieder. Many people contribute to providing the data, software, and support; see publications.
Candidate Gene Selection: Select gene of interest; obtain longest sequence; re-sequence genomic samples. Genes come from heart, lung, and blood research pathways of interest. Search for the longest gene model sequence. Resequencing is performed.
Genomic Samples Obtained: Genomic samples; find polymorphisms; provide data visualization, analysis, and downloads. Protocols: obtain genomic samples (now using HapMap); sequence samples and identify polymorphisms; assemble data for viewing and downloading.
Sequencing Production & Data Analysis Pipeline: Amplify DNA (5’ 3’); sequence each end of the fragment. Customized software tools: primer design algorithm; custom LIMS and database to track all aspects of data production and quality; robotics used to automate sample handling. Sequence steps: base-calling; quality determination; contig assembly; final quality determination; sequence viewing; polymorphism tagging; polymorphism reporting; individual genotyping; polymorphism detection. Tools: PolyPhred, Consed, Phred, Phrap. Analysis; data publication to WWW.
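The "quality determination" step relies on Phred-style base quality scores, which are conventionally defined as Q = -10 * log10(P), where P is the estimated probability that a base call is wrong. The sketch below only illustrates that standard definition; the function names are hypothetical, and the slides do not state which quality cutoffs SeattleSNPs applied.

```python
import math


def phred_to_error_probability(quality):
    """Convert a Phred quality score Q into an error probability P = 10^(-Q/10)."""
    return 10 ** (-quality / 10)


def error_probability_to_phred(probability):
    """Convert an error probability P into a Phred quality score Q = -10*log10(P)."""
    if not 0 < probability <= 1:
        raise ValueError("probability must be in (0, 1]")
    return -10 * math.log10(probability)


# A few conventional reference points on the Phred scale.
for q in (10, 20, 30, 40):
    print(q, phred_to_error_probability(q))
```

On this scale, each increase of 10 in the quality score corresponds to a tenfold drop in the estimated error probability.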
Re-sequencing Pipeline: Gene design uses automated primer-picking software. All approaches cover 2 kb upstream of the first exon and 2 kb downstream of the last exon. Gene is < 25 kb: Full (complete re-sequencing). Gene is > 25 kb: targeted regions, i.e. exons, conserved non-coding sequences, and sampling across intron sequences. Prior to amplification and re-sequencing, problematic GC-rich regions, Alu repeats, polynucleotide tracts, and pseudogenes are identified. Figure labels: sequence in base pairs; mapping of PCR primers; mapping of exons; mapping of PCRs.
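The decision rule on this slide can be sketched as a small function. This is an illustration under stated assumptions, not the SeattleSNPs software: the function name and return format are hypothetical, gene length is assumed to be measured from the start of the first exon to the end of the last exon, and a gene of exactly 25 kb (which the slide does not address) is treated as "targeted".

```python
FLANK_BP = 2_000              # 2 kb upstream and downstream, as stated on the slide
FULL_RESEQ_LIMIT_BP = 25_000  # genes shorter than this are re-sequenced in full


def plan_resequencing(first_exon_start, last_exon_end):
    """Return the flanked target region and the re-sequencing strategy for a gene."""
    if last_exon_end <= first_exon_start:
        raise ValueError("last_exon_end must be greater than first_exon_start")
    # ASSUMPTION: gene length is the span from the first exon to the last exon.
    gene_length = last_exon_end - first_exon_start
    strategy = "full" if gene_length < FULL_RESEQ_LIMIT_BP else "targeted"
    return {
        "region_start": max(0, first_exon_start - FLANK_BP),
        "region_end": last_exon_end + FLANK_BP,
        "gene_length": gene_length,
        "strategy": strategy,
    }


print(plan_resequencing(10_000, 20_000))  # a 10 kb gene
print(plan_resequencing(10_000, 50_000))  # a 40 kb gene
```

For the "targeted" case, a real pipeline would go on to select exons, conserved non-coding sequences, and intron samples within the region; that selection is outside the scope of this sketch.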
Re-sequencing Pipeline: Universal primer sequences standardize sequencing reaction conditions. Standard dye-terminator sequencing chemistry, optimized for reaction volume and dilution. Automated sequencing by capillary electrophoresis.
Data Analysis: Amplify DNA (5’ 3’); sequence each end of the fragment. Customized software tools: primer design algorithm; custom LIMS and database to track all aspects of data production and quality; robotics used to automate sample handling. Sequence steps: base-calling; quality determination; contig assembly; final quality determination; sequence viewing; polymorphism tagging; polymorphism reporting; individual genotyping; polymorphism detection. Tools: PolyPhred, Consed, Phred, Phrap. Analysis; data publication to WWW.
Polymorphism Identification and Analysis: Figure labels: individuals in rows; SNP; SNP.
Polymorphisms: Homozygous C/C; Heterozygous C/T; Homozygous T/T.
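The three genotype classes on this slide can be illustrated with a short sketch. The function names and the "A/B" string format for genotypes are assumptions made for this example, not a SeattleSNPs file format.

```python
from collections import Counter


def classify_genotype(genotype):
    """Label a diploid genotype such as 'C/T' as homozygous or heterozygous."""
    allele_1, allele_2 = genotype.split("/")
    return "homozygous" if allele_1 == allele_2 else "heterozygous"


def allele_frequencies(genotypes):
    """Count alleles across all individuals and return their relative frequencies."""
    counts = Counter(allele for g in genotypes for allele in g.split("/"))
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


# Hypothetical sample: one individual of each class shown on the slide.
sample = ["C/C", "C/T", "T/T"]
for genotype in sample:
    print(genotype, classify_genotype(genotype))
print(allele_frequencies(sample))
```

With one individual of each class, the C and T alleles each account for half of the six chromosomes sampled.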
Program for Early Career Investigators: Apply for free genotyping and analysis.
In This Tutorial: We will examine how to find genes of interest. We will explore and understand the data and displays.
Finding Data from SeattleSNPs: Two main strategies: whole-site search and gene lists. The site search can look for anything. Browsers: Firefox, Safari, Explorer.
Sequencing Resources for Data Access: Access lists of genes; access summaries.
Finding Genes: From the Sequenced Genes list, find genes and panel info.
Gene List Information: Gene list options.
Coriell DNA Panels: p1: Coriell CEPH/AA panel (CEPH: Centre d’Etude du Polymorphisme Humain; AA: African American DNA). p2: Coriell HapMap European/African panel from HapMap (CEPH European ancestry, Utah; Yoruba in Ibadan, Nigeria).
Panels Integrate with Other Data: p1 panels integrate with Perlegen data (Hinds et al., Science: Whole-genome patterns of common DNA variation in three human populations). p2 panels integrate with HapMap data (The International HapMap Consortium, Nature: A haplotype map of the human genome). p1 DNA panel, Perlegen integration (1.58 million SNPs): SeattleSNPs (1/200 bp); Perlegen SNPs (~1/3000 bp). p2 DNA panel, HapMap integration (~3.5 million SNPs): SeattleSNPs (1/200 bp); HapMap SNPs (~1/1000 bp).
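To make the density figures concrete, the arithmetic can be sketched as follows. The densities are the approximate values quoted on the slide; the function name and the 24 kb region length are hypothetical, chosen only to give round numbers.

```python
# Approximate SNP densities from the slide, expressed as base pairs per SNP.
BP_PER_SNP = {
    "SeattleSNPs": 200,
    "HapMap": 1000,
    "Perlegen": 3000,
}


def expected_snp_counts(region_length_bp):
    """Estimate how many SNPs each resource would report in a region of this size."""
    return {name: region_length_bp / bp for name, bp in BP_PER_SNP.items()}


# Hypothetical 24 kb region.
for name, count in expected_snp_counts(24_000).items():
    print(f"{name}: about {count:.0f} SNPs")
```

These are averages only; actual SNP counts for a given gene depend on the locus and the populations sampled.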
Panels Integrate with Other Data: SeattleSNPs provides a much higher density of SNPs. p1 panels integrate with Perlegen data; p2 panels integrate with HapMap data. Figure labels: SeattleSNPs data; HapMap.
SeattleSNPs Displays: Gene-specific page with an image of gene structure and SNPs, plus links to other resources and downloads. Access gene-specific details and data (gene, location).
Understanding SeattleSNPs Images: Gene structure: exons, introns, UTRs. SeattleSNPs coordinates are based on their GenBank submission of this gene. SNPs. Controls for changing the view. Figure labels: SeattleSNPs coordinates; gene structure; SNPs; change view; SNP choices.
Gene-Specific Links: Links to other resources. The UCSC Custom track link shows the SeattleSNPs custom tracks.
SeattleSNPs Data Types: Alox12: download all data. Populations genotyped for this gene.
SeattleSNPs Data Types: Documentation for all data types is on the left; links to all the data are on the right.
Mapping Data: Mapping data types. GenBank record for the SeattleSNPs coordinate system.
Genotyping Data: Genotyping data for individuals. Figure labels: individuals; site of variation, 5’ 3’.
Genotyping Data: Genotyping data for individuals.
Linkage Data: Linkage data. “Tag” SNPs that can be used for genotyping. See Carlson et al., Am. J. Hum. Genet., 74: , 2004.
LDSelect: Using LD to Pick tagSNPs. LDSelect uses SNP discovery data (not haplotypes), finds all correlated SNPs to minimize the total number, and maintains the genetic diversity of the locus. Carlson et al., AJHG (2004).
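The idea of grouping correlated SNPs so that fewer need to be genotyped can be sketched with a simple greedy binning over pairwise r² values. This is a simplified illustration, not the LDSelect program itself: the function name, the r² threshold of 0.8, and the SNP names and r² values are all hypothetical, and the slide does not specify any of them.

```python
def pick_tag_snps(snps, r2, threshold=0.8):
    """Greedily bin SNPs by pairwise r^2 and return (tag SNP, bin members) pairs.

    `r2` maps unordered SNP pairs, given as tuples, to r^2 values;
    pairs that are missing are treated as uncorrelated (r^2 = 0).
    """
    def correlated(a, b):
        return r2.get((a, b), r2.get((b, a), 0.0)) >= threshold

    remaining = list(snps)
    bins = []
    while remaining:
        # Pick the SNP correlated with the most other unbinned SNPs.
        tag = max(
            remaining,
            key=lambda s: sum(correlated(s, t) for t in remaining if t != s),
        )
        members = [tag] + [t for t in remaining if t != tag and correlated(tag, t)]
        bins.append((tag, members))
        remaining = [s for s in remaining if s not in members]
    return bins


# Hypothetical r^2 values for four SNPs.
example_r2 = {
    ("s1", "s2"): 0.95,
    ("s1", "s3"): 0.90,
    ("s2", "s3"): 0.85,
    ("s3", "s4"): 0.20,
}
for tag, members in pick_tag_snps(["s1", "s2", "s3", "s4"], example_r2):
    print(tag, members)
```

In this toy input, three SNPs fall into one bin and can be represented by a single tag SNP, while the fourth is uncorrelated with the rest and must be genotyped on its own, so the locus's diversity is still captured with two assays instead of four.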
Multi-SNP Correlations (aka Haplotypes): “…a unique combination of genetic markers present in a chromosome.” (p. 57 in Hartl & Clark, 1997)
Haplotyping Data: PHASE algorithm: infers haplotypes statistically. Stephens et al., Am J Hum Genet. PHASE: stephens/software.html
Haplotyping Data: Haplotyping data. Visual Haplotype: upload your own data OR select a gene from the list.
Haplotyping Data: Visual Haplotype data output for Alox12. Figure labels: individuals; site of variation.
Predictive Analysis: Predictive analysis on non-synonymous SNPs. SIFT (Sorting Intolerant From Tolerant): Ng and Henikoff, Genome Research, 12: , 2002. PolyPhen: Ramensky et al., NAR 30:17: , 2002.
Downloading SeattleSNPs Data: Download all the data. You can also download just one gene from the gene page. See the usage/citation policy. Figure labels: all data; subsets; gene page.
Workshop Information: Workshops offered in Seattle. Slides and materials available.
Traveling Workshops: Bring SeattleSNPs to your site.
Recorded Tutorial and Quick Reference Cards: Recorded tutorial; download materials; order Quick Reference Cards.
Summary: SeattleSNPs is a PGA program with a focus on heart, lung, and blood genes. Genotypes, haplotypes, web access, and downloads. Educational resources.
SNP Data: SeattleSNPs data are available in several ways. Other projects exist to identify SNP variations. Project scope, population, and methods may vary. dbSNP database.