Candidate Gene Resource Steering Committee Meeting July 25, 2006.

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

Copyright © 2008, SAS Institute Inc. All rights reserved. Discovering Meaningful Patterns in Genomics Data with JMP Genomics Jordan Hiller JMP Genomics.
Genetic contributions to complex traits in a post genomewide era Nic Timpson ALSPAC – The first 21 years conference 2012.
The Diagnostic Laboratory ……the ideal system……. Molecular Genetics Diagnostic Laboratory Exciting area of medical pathology Need to continually up-date.
Genome Wide Association Study (GWAS) and Personalized Medicine
BST 775 Lecture PLINK – A Popular Toolset for GWAS
Which Phenotypes Can be Predicted from a Genome Wide Scan of Single Nucleotide Polymorphisms (SNPs): Ethnicity vs. Breast Cancer Mohsen Hajiloo, Russell.
Admixture in Horse Breeds Illustrated from Single Nucleotide Polymorphism Data César Torres, Yaniv Brandvain University of Minnesota, Department of Plant.
CaRE Center Informatics NHLBI CaRE Center Meeting Bethesda, MD July 25, 2006 Marcia Nizzari.
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Summary points with interesting prospects IRB – handling samples in different areas/countries Opportunities for young researchers Who reviews grants and.
Genetic Basis of Agronomic Traits Connecting Phenotype to Genotype Yu and Buckler (2006); Zhu et al. (2008) Traditional F2 QTL MappingAssociation Mapping.
CARe: The NHLBI’s Candidate Gene Association Resource.
Influencing Change in Research, Treatment Protocols, and New Drug Development.
INTRODUCTION Genome-wide association studies are now feasible. Measuring allele frequencies of pools of cases and controls, instead of between individuals,
Ingredients for a successful genome-wide association studies: A statistical view Scott Weiss and Christoph Lange Channing Laboratory Pulmonary and Critical.
Overview of SNP Genotyping Debbie Nickerson Department of Genome Sciences University of Washington
SNP chips Advanced Microarray Analysis Mark Reimers, Dept Biostatistics, VCU, Fall 2008.
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
SNPs DNA differs between humans by 0.1%, (1 in 1300 bases) This means that you can map DNA variation to around 10,000,000 sites in the genome Almost all.
Study Design Discussion The Ghost of Candidate Gene Past and the Ghost of Genome-wide Association Yet to Come Stephen S. Rich, Ph.D. Wake Forest University.
Missouri S&T Provost’s eFellows Program Angie Hammons, Manager S&T Educational Technology Cohort 2016.
Understanding Genetics of Schizophrenia
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
© 2008 LabKey Software Simplifying Scientific Data Management with LabKey Server January 29, 2009 Presenter: Peter Hussey,
Polymorphism and Variant Analysis Lab
ARC Biotechnology Platform: Sequencing for Game Genomics Dr Jasper Rees
Donor / Embryo Genomics Patrick Blondin L’Alliance Boviteq AABP Embryo Transfer Seminar Montréal, 2012 Sept 19 th.
Factors to Consider in Selecting a Genotyping Platform Elizabeth Pugh June 22, 2007.
Systematic reviews of genetic association studies Robert Walton Fiona Fong 15 March 2013.
Single Nucleotide Polymorphism
Future Use of Stored Samples & Data and the NIH Policy on GWAS and dbGaP NIAID/DAIDS Dione Washington, M.S. -- ProPEP Sudha Srinivasan, Ph.D.-- TRP Tanisha.
Data Type 1: Microarrays
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Collecting and Storing Sequences In the laboratory Heather Helm UPR Sequencing Facilities Manager.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Agenda Introduction to microarrays
Early Genetics and Lifecourse Epidemiology (EAGLE) consortium CM van Duijn.
Polymorphism & Variant Analysis Lab Saurabh Sinha Polymorphism and Variant Analysis Lab v1 | Saurabh Sinha 1 Powerpoint by Casey Hanson.
An Efficient Method of Generating Whole Genome Sequence for Thousands of Bulls Chuanyu Sun 1 and Paul M. VanRaden 2 1 National Association of Animal Breeders,
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Whole genome association studies Introduction and practical Boulder, March 2009.
10cM - Linkage Mapping Set v2 ABI Median intermarker distance: 4.7 Mb Mean intermarker distance: 5.6 Mb Mean genetic gap distance: 8.9 cM Average Heterozygosity.
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
Methods in genome wide association studies. Norú Moreno
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
Genotype Calling Jackson Pang Digvijay Singh Electrical Engineering, UCLA.
Polysubstance Abuse–Vulnerability Genes: Genome Scans for Association, Using 1,004 Subjects and 1,494 Single-Nucleotide Polymorphisms Uhl, Liu, Walther,
Developing an Electronic Dossier System for UCLA Faculty: Planning Proposal Presentation to Committee on IT Infrastructure November 30,
SHARe. What is WHI-SHARe? SHARe – stands for SNP Health Association Resource. It is an NHLBI program for genome-wide association studies (GWAS). GWAS.
Statistical Issues in Genetic Association Studies
George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Select Sires’
The International Consortium. The International HapMap Project.
GenABEL: an R package for Genome Wide Association Analysis
One view on integrating Genomics and Informatics into the Undergraduate Nursing Curriculum Prepared by Patti Brennan and Stephanie Gilbertson-White Presented.
Lectures 7 – Oct 19, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.
Analysis of the Positively Selected and Non-Positively Selected Non-Protein Coding Sequences of Chromosome 16 by Kyle Tretina, Dr. Pattle P. Pun INTRODUCTION.
Affymetrix User’s Group Meeting Boston, MA May 2005 Keynote Topics: 1. Human genome annotations: emergence of non-coding transcripts -tiling arrays: study.
A brief guide to sequencing Dr Gavin Band Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for Health.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
Million Veteran Program: Industry Day Genomic Data Processing and Storage Saiju Pyarajan, PhD and Philip Tsao, PhD Million Veteran Program: Industry Day.
Chapter 8 Environments, Alternatives, and Decisions.
A web portal for management of biological data and applications
Oleksiy Karpenko, Neil J. Bahroos Galaxy Community Conference Chicago
Content and Labeling of Tests Marketed as Clinical “Whole-Exome Sequencing” Perspectives from a cancer genetics clinician and clinical lab director Allen.
A Short Tutorial on Causal Network Modeling and Discovery
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
TOPMed Analysis Workshop Genetic Analysis Center Biostatistics Department University of Washington TOPMed Data Coordinating Center August 7-9, 2017 Introduction.
Presentation transcript:

Candidate Gene Resource Steering Committee Meeting July 25, 2006

Goals for Today Strengthen relationships among CARE investigators Define pilot project (phenotypes & SNPs) Establish principles of data release Discuss genotyping study design Select phenotypes to be analyzed

CARE Governance Steering committee –Representative of each CARE organization –Subcommittees : Data Release, Phenotypes, Study Design, Informatics, SNP Selection, DNA/Genotyping NHLBI staff NHLBI appointed oversight committee

CARE : timeline RFP released March 2005 Response submitted July 15, 2005 Awarded April 1, 2006 Four year award –Y1: Create DNA and phenotype database –Y2: Genotyping –Y3 / 4: Joint analysis and data distribution

Resources Provided by NHLBI $18.3M over 4 years to create a resource to relate genotype-phenotype across cohorts: –Create a consortium among CARE cohorts –Database DNA and phenotypes –Genotype a common set of SNPs across cohorts –Create software tools to enable joint analysis –Data distribution as per CARE data release policy –Project management and coordination -PM hired : Deb Farlow

Areas for Discussion Today Data Release Study Design Phenotypes NHLBI Current state of genotyping technology Presentation of informatics tools

Data release Data release policy to be established by CARE steering committee with NHLBI and local IRB’s Broad proposed secure, HIPAA compliant web architecture to implement this policy and to enable access-controlled environment for data sharing and analysis

Areas for Discussion Today Data Release Study Design Phenotypes NHLBI Current state of genotyping technology Presentation of informatics tools

Original CARE Study Design Candidate Gene Study –50,000 samples –average 10 SNPs/gene x 1700 genes = 17,000 SNPs –Requirement: $0.01 /genotype (fully loaded) Whole Genome Association Study –500 cases / 1,000 controls –At least 300,000 SNPs genome wide

Candidate gene study Targeted genotyping technology has remained stable : same price and throughput as in approved proposal Key issue: criteria for selecting 17,000 candidate gene-based SNPs – biological hypotheses

Developments since RFP Whole genome scans promise new hypotheses for candidate genes Evaluation of coverage / performance of whole genome arrays Price for whole genome genotyping technology has improved

Whole genome scanning SHARE will genotype 15,000 people from NHLBI cohorts (FHS and TBA) RFA for 4-5 whole genome scans GAIN, WTCCC, etc, etc Implication: hypotheses that could be confirmed and extended by CARE Challenge: timing doesn’t synch up well with original CARE timeline

Developments since RFP Whole genome scans promise new hypotheses for candidate genes Evaluation of coverage / performance of whole genome arrays Price for whole genome genotyping technology has improved

Coverage

Do they work? * from

Do They Work at High Scale? Recent Call Rate Data (at Broad) ProductChipsCall Rate Affy 500K12, % ILMN 317K % In-Process QC test HapMap sample vs Hap Map Avg=99.62% 7,947,748 comparisons

QC statistics: MS andT2D Scans

DM vs. BRLMM 2500 chips <5% of chips fail

MIP (20K)

WGAS: Then and Now Original Plan Product: Affymetrix 500K Total cost per sample: $1600 (chip+reagents+equipment+labor+IDC) Study Design: 500 cases / 1,000 controls Budget=$2,400,000

WGAS: Then and Now Now possible Product: Affymetrix 500K Total cost per sample: $530 (chip+reagents+equipment+labor+IDC) Study Design: 4,500 samples Budget=$2,400,000

WGAS: Then and Now January 2007 Product: Affymetrix 500K Total cost per sample: $410 (chip+reagents+equipment+labor+IDC) Study Design: 5,800 samples Budget=$2,400,000

In Summary SNPsSamplesCost 7/15/05500,000 1,500$2.4M 17,00050,000$8.5M 7/25/06500,000 4,500$2.4M 17,00050,000$8.5M 1/07500,000 5,800$2.4M `17,00050,000$8.5M

Conclusions: genotyping Targeted genotyping (custom set of candidate genes) $0.01 / gt Timing of candidate gene selection Improved cost and performance of whole genome $0.001 / gt

Areas for Discussion Today Data Release Study Design Phenotypes NHLBI Current state of genotyping technology Presentation of informatics tools

High Level Workflow – for CaRE Upload Samples, Peds, Individuals, Phenotypes Create Experiments (Samples x Features) Summarize/Filter PLINK Data Vault QC/Curate Results Design and Execute Experiments Project DB LIMS DBs BSP DB Association & Statistics Viewers Cohort’s Custom Algorithms, Viewers Web Services Data Compile Feature DB Analysis: Gene Pattern + CaRE analysis tools Production: BSP/GAP + CaRE enhancements

Designing a Pilot A trial run for DNA quality, genotyping, phenotype and joint analysis, and publication Scale and content of pilot to be refined, topic for today’s discussion sessions

A R E Our shared aspiration: the greatest genetic epidemiology experiment to date C C CSSCD

Technological Advance Current 500K assayNew 500K assay DNA

How? Smaller format BRLMM Sequence Variability (DNA Analysis) A/A B/B A/B Mismatch probes not needed Fewer probes needed Single format

No drop in Het Calls

Mendel Errors Per Plate Accuracy 99.4% Sty/Nsp : one family 25,000 errors

Coverage of Common Variants by Whole-genome Products Tag SNPs Affymetrix Mapping 500K GeneChip Illumina HumanHap300 BeadChip

Coverage Mostly Provided by Pairwise Correlations A A A T T T G G G T T T G G T G G G A A C A A C T T C T T C T T G T T G G G C C C C G G T T G G G G T T G G C C C C T T C C C C G G A A A A C C A A A A T T G G C C C C G G C C C C G G T T G G

Specified Multimarker Tests Improve Effective Coverage A A A T T T G G T G G G A A C A A C G G C C C C G G T T G G G G T T G G C C C C T T G G T T G G C C

Coverage of the genome

Other recent developments Whole genome scan planned in 9,000 FHS participants (SHARE) Other whole genome scans will be funded (recent NHLBI RFA)