Draft sequencing of 1,000 genomes to study the genetics of quantitative traits: data production Fabio Busonero1, Brendan J. Tarrier2, Elizabeth A. Ketterer2,

Slides:



Advertisements
Similar presentations
Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
Advertisements

Determination of Extra-Pair Fertilization and Inbreeding Using Microsatellite Genotyping in a Captive Population of Zebra Finches Lindsay Miller, Julia.
Bassem M. Shoucri Bio Sci D145 January 8, Why do we want to map genomes? Clinical applications! – Identify genes causing diseases – Treatment? Compare.
Cloning lab results Cloning the human genome Physical map of the chromosomes Genome sequencing Integrating physical and recombination maps Polymorphic.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
University of Oklahoma Genome Center4/14/12.
11 © 2009 PerkinElmer © 2010 PerkinElmer November 20, 2012 DNA Services Overview.
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
Reading the Blueprint of Life
Chromosome 16: PV92 PCR. What is PCR? DNA replication gone crazy in a tube!DNA replication gone crazy in a tube! Makes many copies of target sequence.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Donor / Embryo Genomics Patrick Blondin L’Alliance Boviteq AABP Embryo Transfer Seminar Montréal, 2012 Sept 19 th.
DNA Technology Chapter 20.
DNA Methylation mapping
Module 1 Section 1.3 DNA Technology
Manipulation of DNA. Restriction enzymes are used to cut DNA into smaller fragments. Different restriction enzymes recognize and cut different DNA sequences.
Advancing Science with DNA Sequence Metagenome definitions: a refresher course Natalia Ivanova MGM Workshop September 12, 2012.
Review from last week. The Making of a Plasmid Plasmid: - a small circular piece of extra-chromosomal bacterial DNA, able to replicate - bacteria exchange.
1. 2 VARIANTS OF PCR APPLICATIONS OF PCR MECHANICS OF PCR WHAT IS PCR? PRIMER DESIGN.
Addressing cryptic relatedness in candidate samples for 1KG James Nemesh Steve McCarroll 02/13/2012.
Sequencing Kristian Stevens Mark Crepeau Charis Cardeno Charles H. Langley University of California, Davis Evolution.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
HW2: exome sequencing and complex disease Jacquemin Jonathan de Bournonville Sébastien.
Molecular Markers CRITFC Genetics Workshop December 8, 2015.
GSVCaller – R-based computational framework for detection and annotation of short sequence variations in the human genome Vasily V. Grinev Associate Professor.
HRM REAL TIME PCR Presented by: Dadkhah Fahimeh SNP genotyping by HRM REAL TIME PCR.
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
All rights Reserved Cengage/NGL/South-Western © 2016.
Polymerase Chain Reaction (PCR). What’s the point of PCR? PCR, or the polymerase chain reaction, makes copies of a specific piece of DNA PCR allows you.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Library QA & QC Day 1, Video 3
Arun Kumar. B M.Sc 1st Year Biotechnology SSBS
From Reads to Results Exome-seq analysis at CCBR
Assay I HLA-DQ Alpha (A1) Haplotype. Purpose To determine which one of several known alleles is present at the HLA DQ α locus on each of an individual’s.
Molecular Markers Using DNA polymorphisms in plant breeding
DNA Sequencing Second generation techniques
Lesson: Sequence processing
Dewey et al. Presented By: Natasha Granneman & Christina Tran
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Nucleotide variation in the human genome
All rights Reserved Cengage/NGL/South-Western © 2016.
Comparison of WGA methods for genotyping fetal nucleated red blood cells for the application of non-invasive prenatal diagnosis Zhouwei Huang­­1, Angela.
Figure 1. Library preparation methods for highly degraded DNA
Introduction to RAD Acropora millepora.
All rights Reserved Cengage/NGL/South-Western © 2016.
Section 3: Gene Technologies in Detail
Introduction to NGS.
How are areas of DNA that don’t code for proteins (genes) used by our cells? How can we make use of these areas?
Lab 8: PTC Polymerase Chain Reaction Lab
Small RNA Sample Preparation
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Power to detect QTL Association
mRNA Sequencing Sample Preparation
With Genetic Analyzers from Applied Biosystems
Discovery tools for human genetic variations
Recombinant DNA Unit 12 Lesson 2.
Hybrid Capture and Next-Generation Sequencing Identify Viral Integration Sites from Formalin-Fixed, Paraffin-Embedded Tissue  Eric J. Duncavage, Vincent.
PRINCIPLES OF CROP PRODUCTION (3 CREDIT HOURS)
ChIP DNA Sample Preparation
Molecular Diagnosis of Autosomal Dominant Polycystic Kidney Disease Using Next- Generation Sequencing  Adrian Y. Tan, Alber Michaeel, Genyan Liu, Olivier.
Simulating Genetic Screening
Genetics of Human Cardiovascular Disease
By: Mandy Butler, Ying-Tsu Loh and Cheryl Ann Peterson
Restriction Fragment Length Polymorphism (RFLP)
High-throughput ts allele sequencing.
Genomic DNA Sample Preparation
Giorgio Sirugo, Kenneth K. Kidd  The American Journal of Human Genetics 
The MLPA assay and application to diagnosis of DGS
Genomic & RNA Profiling Core Facility
Presentation transcript:

Draft sequencing of 1,000 genomes to study the genetics of quantitative traits: data production Fabio Busonero1, Brendan J. Tarrier2, Elizabeth A. Ketterer2, Goncalo R. Abecasis1, and Christine A. Brennan2. 1Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA; 2DNA Sequencing Core, University of Michigan, Ann Arbor, Michigan, USA. Introduction In the past several years, genome-wide association studies have furthered our understanding of the genetics basis of blood lipid levels, which are key risk factors for the development of cardiovascular disease. The success of these studies resulted, in large part, from their ability to explore the genome in a comprehensive manner, systematically assessing the impact of common variation on the trait of interest. Here we propose to deploy high-throughput sequencing technologies to extend these systematic whole genome assessments and, to maximize our chances of success, we’ll focus our study on an isolated founder population in Sardinia (precisely volunteers enrolled in ProgeNIA study), which is ideal for the study of rare genetic variants. The proposed plan will allow us to evaluate the contribution of common (frequency>5.0%) and rare (frequency 0.5 - 5.0%) single nucleotide polymorphisms, short insertions and deletions, large copy number polymorphisms and other structural variants to blood levels of low density lipoprotein cholesterol (LDL-c), high density lipoprotein cholesterol (HDL-c) and triglycerides (TG), all of which are key risk factors for cardiovascular disease. Results – Milestones in sequences production Over the last 10 months, changes in reagent chemistry, efficiency of base-callers and fine regulation of template loaded onto the flowcell by titration, allowed to increase amount of data for run, as shown in the plots below. RTA-1.5.35 Jan 2010 RTA-1.6.32 Mar 2010 RTA-1.6.47 May 2010 OLB-1.8.0 Jul 2010 Sept 2010 Data Description – Samples ProgeNIA’s phenotyped individuals include: 6,400 avuncular pairs 4,935 sibling pairs 4,263 parent-offspring pairs 4,014 cousin pairs 686 grand-parent grand-child pairs Strategy to choose the 1,000 individuals to be sequenced (EXPLAIN): To sequence a minimal number of individuals and use their sequences together with Identity by descent (IBD) information to deduce the sequence of other individuals in the pedigree. We are currently using ExonPicks scoring, to pick individuals to be sequenced from our large pedigree collections, since we have genotypes chip information to determine IBD sharing in the pedigree. We have been picking: Trios (2 parents + 1 offspring) One Parent+two offspring families RTA-1.5.35 Jan 2010 RTA-1.6.32 Mar 2010 RTA-1.6.47 May 2010 OLB-1.8.0 Jul 2010 Sept 2010 1 ½ RTA-1.5.35 Jan 2010 RTA-1.6.32 Mar 2010 RTA-1.6.47 May 2010 OLB-1.8.0 Jul 2010 Sept 2010 1 ½ RTA-1.5.35 Jan 2010 RTA-1.6.32 Mar 2010 RTA-1.6.47 May 2010 OLB-1.8.0 Jul 2010 Sept 2010 aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa Methods – Whole-genome sequencing Genomic DNA libraries were generated as previously described (Quail et al., 2008) with minor modifications. Briefly, DNA (5 μg) from whole blood salting-out extraction was randomly fragmented by sonication and subsequently treated with a mixture of T4 DNA Polymerase, E. coli DNA polymerase I Klenow fragment, and T4 polynucleotide kinase to end-repair, blunt and phosphorylate ends; the repaired fragments were then 3’-adenylated using Klenow exo-DNA Polymerase. Adenylated fragments were ligated to paired-end adaptors, and fragments ranging from 400 to 500 bp were selected on agarose gel and extracted using the QIAquick gel extraction kit (Qiagen). Size-selected adaptor-modified DNA fragments were PCR-enriched using adapter-specific primers. Following amplification, samples were quantified and quality checked by Agilent Bioanalyzer or qPCR, and denatured to load onto the flowcell for cluster amplification. Sequencing of samples was performed on Genome Analyzer Iix (Illumina) by paired-end 2x120 bp run. Sequence basecalls were made with Illumina software. Sequenon fingerprinting Conclusions Actual sequencing capacity, 3 Illumina Genome Analyzers IIx, allows for about 400 GB of DNA sequence data to be generated each month, with each samples sequenced at an average of 3X depth, and more than 10 GB of high-quality mapped de-duplicated bases per lane in paired-end sequencing runs with 120 bp reads. …how many peoples sequenced so far …and lipid levels? …info about SNPs, dbSNPs,… References ………………………………………….