J. B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 20705-2350, USA Use of NGS.

Slides:



Advertisements
Similar presentations
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
Advertisements

Genomic imputation and evaluation using 1074 high density Holstein genotypes P. M. VanRaden 1, D. J. Null 1 *, G.R. Wiggans 1, T.S. Sonstegard 2, E.E.
John B. Cole* and Paul M. VanRaden Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD , USA New applications.
Use of Quantitative Trait Loci (QTL) in Dairy Sire Selection Fabio Monteiro de Rezende Universidade Federal Rural de Pernambuco (UFRPE) - Brazil.
Wiggans, 2013RL meeting, Aug. 15 (1) Dr. George R. Wiggans, Acting Research Leader Bldg. 005, Room 306, BARC-West (main office);
But who will be the next GREAT one?. USA Bull Proofs * Bulls are ranked based upon their DAUGHTER’S (progeny) production and physical characteristics.
2002 Curt Van Tassell Gene Evaluation and Mapping Laboratory and Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville,
2007 Paul VanRaden 1, Curt Van Tassell 2, George Wiggans 1, Tad Sonstegard 2, Jeff O’Connell 1, Bob Schnabel 3, Jerry Taylor 3, and Flavio Schenkel 4,
Mating Programs Including Genomic Relationships and Dominance Effects Chuanyu Sun 1, Paul M. VanRaden 2, Jeff R. O'Connell 3 1 National Association of.
2007 Paul VanRaden 1, George Wiggans 1, Curt Van Tassell 2, Tad Sonstegard 2, Jeff O’Connell 1, Bob Schnabel 3, Jerry Taylor 3, and Flavio Schenkel 4,
WiggansARS Big Data Workshop – July 16, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville,
BickhartADSA Meeting(1) 2013 Tools to Exploit Sequence data to find new markers and Disease Loci in Cattle D. M. Bickhart, H. A. Lewin and G. E. Liu.
K.L. Parker Gaddis 1 and J.B. Cole 2,* 1 Department of Animal Sciences 2 Animal Genomics and Improvement University of Florida Laboratory Gainesville,
2007 J. B. Cole 1,*, P. M. VanRaden 1, J. R. O'Connell 3, C. P. Van Tassell 1,2, T. S. Sonstegard 2, R. D. Schnabel 4, J. F. Taylor 4, and G. R. Wiggans.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD G.R. WiggansAg Discovery.
2007 Paul VanRaden, Curt Van Tassell, George Wiggans, Tad Sonstegard, and Jeff O’Connell Animal Improvement Programs Laboratory and Bovine Functional Genomics.
Norway (1) 2005 Status of Dairy Cattle Breeding in the United States Dr. H. Duane Norman Animal Improvement Programs Laboratory Agricultural Research Service,
John B. Cole* and Paul M. VanRaden Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD
2007 Paul VanRaden and Mel Tooker Animal Improvement Programs Laboratory, USDA Agricultural Research Service, Beltsville, MD, USA
John B. Cole 1, Daniel J. Null *1, Chuanyu Sun 2, and Paul M. VanRaden 1 1 Animal Genomics and Improvement 2 Sexing Technologies Laboratory Navasota, TX.
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD , USA The use and.
2007 Paul VanRaden, George Wiggans, Jeff O’Connell, John Cole, Animal Improvement Programs Laboratory Tad Sonstegard, and Curt Van Tassell Bovine Functional.
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD New Tools for.
John B. Cole Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD Using.
2014 J.B. Cole, 1,* J.L. Hutchison, 1 D.J. Null, 1 P.M. VanRaden, 1 G.E. Liu, 1 S.G. Schroeder, 1 T.P. Smith, 2 T.S. Sonstegard, 1 C.P. Van Tassell, 1.
John B. Cole, Ph.D. Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD, USA The U.S. genetic.
2007 J.B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Overview.
Genetic correlations between first and later parity calving ease in a sire-maternal grandsire model G. R. Wiggans*, C. P. Van Tassell, J. B. Cole, and.
2005 Paul VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD An Example from Dairy.
Adjustment of selection index coefficients and polygenic variance to improve regressions and reliability of genomic evaluations P. M. VanRaden, J. R. Wright*,
2007 J.B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Genetic Evaluation.
Paul VanRaden, 1 Katie Olson, 2 Dan Null, 1 Mehdi Sargolzaei, 3 Marco Winters, 4 and Jan-Thijs van Kaam 5 1 Animal Improvement Programs Laboratory, ARS,
P1097: Candidate causative mutation on BTA18 associated with calving and conformation traits in Holstein bulls J.B. Cole, 1 J.L. Hutchison, 1 D.J. Null,
J. B. Cole * and P. M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD
Paul VanRaden and Melvin Tooker* Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD 2006.
Effects of dam’s dry period length on calf M. T. Kuhn,* J. L. Hutchison, and H. D. Norman Animal Improvement Programs Laboratory, Agricultural Research.
2007 Melvin Tooker Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD, USA
WiggansCDCB industry meeting – Sept. 29, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville,
G.R. Wiggans* and P.M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD AIPL Report.
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD New tools for.
2003 P.M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Genetic Evaluations.
Wiggans, 2014ASAS-ADSA-CSAS Joint Annual Meeting (1) G.R. Wiggans* 1, T.A. Cooper 1, P.M. VanRaden 1, D.J. Null 1, J.L. Hutchison 1, O.M. Meland 2, M.E.
J. B. Cole *, G. R. Wiggans, P. M. VanRaden, and R. H. Miller Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville,
2007 Paul VanRaden Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD, USA
WiggansARS Big Data Computing Workshop (1) 2013 George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville,
George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Select Sires’
2007 Paul VanRaden and Melvin Tooker* Animal Improvement Programs Laboratory 2010 Gains in reliability from combining subsets.
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
Fine Mapping and Discovery of Recessive Mutations that Cause Abortions in Dairy Cattle P. M. VanRaden 1, D. J. Null 1 *, T.S. Sonstegard 2, H.A. Adams.
G.R. Wiggans* 1, P.M. VanRaden 1, L.R. Bacheller 1, F.A. Ross, Jr. 1, M.E. Tooker 1, J.L. Hutchison 1, T.S. Sonstegard 2, and C.P. Van Tassell 1,2 1 Animal.
2006 Paul VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Predicting Genetic.
2007 John Cole, Paul VanRaden, George Wiggans, and Melvin Kuhn Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD,
2006 Paul VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD AIPL Contributions.
2007 Paul VanRaden Animal Improvement Programs Laboratory, USDA Agricultural Research Service, Beltsville, MD, USA 2008 New.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 2011 National Breeders.
CRI – Spanish update (1) 2010 Status of Dairy Cattle Breeding in the United States Dr. H. Duane Norman Animal Improvement Programs Laboratory Agricultural.
John B. Cole Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD What direction should.
2007 Paul VanRaden 1, Curt Van Tassell 2, George Wiggans 1, Tad Sonstegard 2, Bob Schnabel 3, Jerry Taylor 3, and Flavio Schenkel 4, Paul VanRaden 1, Curt.
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD What can we do.
Application of Genetic Markers to Dairy Cattle. Overview Traditional selection Genetic markers Granddaughter design Resource populations QTL identification.
2017 AGIL-AIP Update John B. Cole
A dairy calf DNA biobank for the discovery of new recessive genetic disorders John B. Cole Animal Genomics & Improvement Laboratory, Agricultural Research.
Gene editing algorithm
Distribution and Location of Genetic Effects for Dairy Traits
Where AIPL Fits In Agricultural Research Service (ARS) is the main research arm of USDA (8,000 employees with 2,000 scientists at >100 locations) Beltsville.
Perspectives from Human Studies and Low Density Chip
Using Haplotypes in Breeding Programs
Presentation transcript:

J. B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD , USA Use of NGS to identify the causal variant associated with a complex phenotype

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (2) Cole Overview l Why are we sequencing? l How did we select the animals to sequence? l What are the steps involved in the process? l What do you do with the reads once you have them? l Where are we now?

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (3) Cole Introduction l Several studies (Kuhn et al., 2003; Cole et al., 2007; Seidenspinner et al., 2009) have reported QTL on BTA 18 associated with dystocia l Bioinformatic analysis using SNP data has not identified the causal variant l Next generation sequencing (NGS) has recently been used to find causal variants for novel recessive disorders

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (4) Cole Chromosome 18 is different l Markers on chromosome 18 have large effects on several traits: w Dystocia and stillbirth: Sire and daughter calving ease and sire stillbirth w Conformation: rump width, stature, strength, and body depth w Efficiency: longevity and net merit l Large calves contribute to reduced lifetimes and decreased profitability

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (5) Cole Marker effects for dystocia complex AR-BFGL-NGS Cole et al., 2009 (J. Dairy Sci. 92:2931–2946) ARS-BFGL-NGS Cole et al., 2009 (J. Dairy Sci. 92:2931–2946)

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (6) Cole Correlations in dystocia complex

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (7) Cole The QTL also affects gestation length Maltecca et al Animal Genetics, 42:6,

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (8) Cole Overview of the dystocia complex l The key marker is ARS-BFGL-NGS at (rs ) 57,585,121 Mb on BTA18 l Intronic to SIGLEC12 (sialic acid binding Ig-like lectin 12) l Recent results indicate effects on gestation length (Maltecca et al., 2011) and calf birth weight (Cole et al., unpublished data)

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (9) Cole This is a gene-rich region

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (10) Cole Copy number variants are present l ARS-BFGL-NGS is flanked by CNV w There’s a loss and a gain to the left (8 SNP region) w There’s a gain to the right (10 SNP region) l This can result in assembly problems Hou et al Genomic characteristics of cattle copy number variations. BMC Genomics. 12:127.

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (11) Cole Where did this problem come from? 40,803 daughters

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (12) Cole What if we look at a different trait? l Cole et al. (2007) proposed the following mechanism: w SIGLEC12 may sequester circulating leptin w This increases gestation length w Calf birth weight (BW) is higher because of increased gestation length w Higher BW is associated with dystocia

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (13) Cole We don’t have birth weight data l Birth weights are not routinely recorded in the US l Collaborated with Hermann Swalve’s group to develop a selection index prediction of BW PTA l Performed GWAS and gene set enrichment analysis to search for interesting associations

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (14) Cole GWAS for birth weight PTA h Cole et al.(2013), unpublished data

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (15) Cole Are we measuring anything new? l Identified a SNP intronic to LHX4, which is associated with cow body weight and length (Ren et al., 2010, Mol. Bio. Reprod., 37: ). l 4 SNP in the QTL region on BTA 18 had large effects l Several other SNP with large effects intronic or adjacent to genes with unknown functions

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (16) Cole KEGG pathways for birth weight What does regulation of the actin cytoskeleton have to do with birth weight in cattle? That is, do these results make sense? Maybe…these pathways may be involved in establishment & maintenance of pregnancy, as well as coordination of growth and development. Cole et al.(2013), unpublished data

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (17) Cole Sequencing is becoming very affordable

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (18) Cole Sequencing successes at AIPL/BFGL l Simple loss-of-function mutations w APAF1 – Spontaneous abortions in Holstein cattle (Adams et al., 2012) w CWC15 – Early embryonic death in Jersey cattle (Sonstegard et al., 2013) w Weaver syndrome – Neurological degeneration and death in Brown Swiss cattle (McClure et al., 2013)

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (19) Cole Original pedigree-based design Bull A (1968) AA, SCE: 8 Bull B (1962) AA, SCE: 7 MGS Bull H (1989) Aa, SCE: 14 Bull I (1994) Aa, SCE: 18 Bull E (1982) Aa, SCE: 8 Bull F (1987) Aa, SCE: 15 Bull C (1975) AA, SCE: 8 δ = 10 Bull D (1968) ??, SCE: 7 MGS Bull E (1974) Aa, SCE: 10 MGS

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (20) Cole Modified pedigree & haplotype design Bull A (1968) AA, SCE: 8 Bull B (1962) AA, SCE: 7 MGS Bull H (1989) Aa, SCE: 14 Bull I (1994) Aa, SCE: 18 Bull E (1982) Aa, SCE: 8 Bull F (1987) Aa, SCE: 15 Bull C (1975) AA, SCE: 8 δ = 10 Bull E (1974) Aa, SCE: 10 MGS Bull J (2002) Aa, SCE: 6 Bull K (2002) Aa, SCE: 15 Bull J (2002) aa, SCE: 15 These bulls carry the haplotype with the largest, negative effect on SCE: Bull D (1968) ??, SCE: 7 Couldn’t obtain DNA:

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (21) Cole DNA Quality Control Molecular prep Sample Collection DNA Extraction Library Construction Library Quality Control

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (22) Cole Sample preparation time is substantial l DNA Extraction: ~12 hours (30 mins) l DNA QC: ~1-2 hours (1-2 hours) l Library Construction: 48 hours (12 hours) l Library QC: ~2-4 hours (1 hour) l Total: 3-4 days (15.5 hours)

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (23) Cole DNA quality

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (24) Cole Library quality

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (25) Cole Sequencing stage Illumina cBot: Preps DNA for sequencing Takes 4-5 hours Must be done 48 hours before Illumina HiSeq 2000: Does the sequencing Takes ~10-14 days for 100 x 100 Minimal hands-on time

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (26) Cole Anatomy of a flow cell l 8 lanes per flow cell w 3 columns per lane − 96 tiles per column l Each tile imaged 8 times w 1 from upper surface, 1 from lower l Approximately 300Gb of sequence per flow cell

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (27) Cole Sequencing by synthesis

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (28) Cole How many scientists does it take…

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (29) Cole Flowcell 1: Cluster densities Cluster densities from current HiSeq run finished 30 April 2013 (unpublished data):

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (30) Cole Flowcell 2: Cluster densities Cluster densities from current HiSeq run started 22 May 2013 (unpublished data):

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (31) Cole The Aftermath l Total Time (sample to sequence): w 3 weeks w That’s assuming nothing went wrong! w More realistic: months l Resulting Data w Large text files w ~300 gigabytes compressed l Analysis w Often underestimated w Can take months as well

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (32) Cole Variant detection Alignment against a reference genome Analysis is very disk I/O-intensive. Variant Detection Raw Sequencer Output Alignment to the Genome

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (33) Cole Computational Logistics l Desktop computers w Viable for single lanes w Long computation time l Servers are better w >100GB RAM and >16 processor cores l Cloud w Amazon Web Services w iAnimal/iPlant

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (34) Cole Storage considerations l What to save? w Raw data? w Processed results? l How much workspace? l Suggestions: w Workspace 10x compressed files w Save alignments w Backup REGULARLY!!!

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (35) Cole Why should you use a pipeline? Automates analysis Maximizes resource consumption Because post-docs aren’t cheap

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (36) Cole l Galaxy server l NextGene l Custom pipeline w Scripting languages w Open-source tools Many options for analysis pipelines

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (37) Cole Challenges l Annotation w This is a mess in the cow w The reference assembly may not be representative of all taurine cows l Validation w Doing functional genomics with large mammals is expensive – who pays? w When have we proven something?

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (38) Cole Conclusions l Sequencing is powerful, but presents many challenges l Computational requirements are substantial l We’re learning how much we don’t know about functional genomics in the cow l Validation remains a problem

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (39) Cole Acknowledgments l AIPL: Derek Bickhart, Dan Null, Paul VanRaden l BFGL: Reuben Anderson, Steve Schroeder, Tad Sonstegard, Curt Van Tassell

Animal Sciences Group, Wageningen UR Livestock Research, The Netherlands, 29 May 2013 (40) Cole Questions?