Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (1) J. R. O’Connell 1 and P. M. VanRaden 2 1 University of Maryland School of Medicine,

Slides:



Advertisements
Similar presentations
Analysis of imputed rare variants
Advertisements

2007 Paul VanRaden, Mel Tooker, and Nicolas Gengler Animal Improvement Programs Lab, Beltsville, MD, USA, and Gembloux Agricultural U., Belgium
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P
Added value of whole-genome sequence data to genomic predictions in dairy cattle Rianne van Binsbergen 1,2, Mario Calus 1, Chris Schrooten 3, Fred van.
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
Genomic imputation and evaluation using 1074 high density Holstein genotypes P. M. VanRaden 1, D. J. Null 1 *, G.R. Wiggans 1, T.S. Sonstegard 2, E.E.
Ingredients for a successful genome-wide association studies: A statistical view Scott Weiss and Christoph Lange Channing Laboratory Pulmonary and Critical.
2007 Paul VanRaden and Jeff O’Connell Animal Improvement Programs Lab, Beltsville, MD U MD College of Medicine, Baltimore, MD
Use of Quantitative Trait Loci (QTL) in Dairy Sire Selection Fabio Monteiro de Rezende Universidade Federal Rural de Pernambuco (UFRPE) - Brazil.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Computational Complexity The complexity of the MG model for a single SNP is determined by the complexity of the matrix operations in formulas used to iteratively.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
2007 Paul VanRaden 1, Curt Van Tassell 2, George Wiggans 1, Tad Sonstegard 2, Jeff O’Connell 1, Bob Schnabel 3, Jerry Taylor 3, and Flavio Schenkel 4,
Mating Programs Including Genomic Relationships and Dominance Effects
Mating Programs Including Genomic Relationships and Dominance Effects Chuanyu Sun 1, Paul M. VanRaden 2, Jeff R. O'Connell 3 1 National Association of.
Chuanyu Sun Paul VanRaden National Association of Animal breeders, USA Animal Improvement Programs Laboratory, USA Increasing long term response by selecting.
WiggansARS Big Data Workshop – July 16, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville,
K.L. Parker Gaddis 1 and J.B. Cole 2,* 1 Department of Animal Sciences 2 Animal Genomics and Improvement University of Florida Laboratory Gainesville,
2007 J. B. Cole 1,*, P. M. VanRaden 1, J. R. O'Connell 3, C. P. Van Tassell 1,2, T. S. Sonstegard 2, R. D. Schnabel 4, J. F. Taylor 4, and G. R. Wiggans.
Fine mapping QTLs using Recombinant-Inbred HS and In-Vitro HS William Valdar Jonathan Flint, Richard Mott Wellcome Trust Centre for Human Genetics.
2007 Paul VanRaden, Curt Van Tassell, George Wiggans, Tad Sonstegard, and Jeff O’Connell Animal Improvement Programs Laboratory and Bovine Functional Genomics.
Wiggans, th WCGALP (1) G.R. Wiggans*, T.A. Cooper, D.J. Null, and P.M. VanRaden Animal Genomics and Improvement Laboratory Agricultural Research.
An Efficient Method of Generating Whole Genome Sequence for Thousands of Bulls Chuanyu Sun 1 and Paul M. VanRaden 2 1 National Association of Animal Breeders,
2007 Paul VanRaden and Mel Tooker Animal Improvement Programs Laboratory, USDA Agricultural Research Service, Beltsville, MD, USA
John B. Cole 1, Daniel J. Null *1, Chuanyu Sun 2, and Paul M. VanRaden 1 1 Animal Genomics and Improvement 2 Sexing Technologies Laboratory Navasota, TX.
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD , USA The use and.
2007 Paul VanRaden Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2009 Mixing Different SNP Densities Mixing Different.
2007 Paul VanRaden, George Wiggans, Jeff O’Connell, John Cole, Animal Improvement Programs Laboratory Tad Sonstegard, and Curt Van Tassell Bovine Functional.
T. A. Cooper and G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD Council.
Adjustment of selection index coefficients and polygenic variance to improve regressions and reliability of genomic evaluations P. M. VanRaden, J. R. Wright*,
E XOME SEQUENCING AND COMPLEX DISEASE : practical aspects of rare variant association studies Alice Bouchoms Amaury Vanvinckenroye Maxime Legrand 1.
2007 Paul VanRaden Animal Improvement Programs Lab, USDA, Beltsville, MD, USA Pete Sullivan Canadian Dairy Network, Guelph, ON, Canada
Paul VanRaden, 1 Katie Olson, 2 Dan Null, 1 Mehdi Sargolzaei, 3 Marco Winters, 4 and Jan-Thijs van Kaam 5 1 Animal Improvement Programs Laboratory, ARS,
J. B. Cole * and P. M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD
2007 Melvin Tooker Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD, USA
WiggansCDCB industry meeting – Sept. 29, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville,
G.R. Wiggans* and P.M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD AIPL Report.
2007 Paul VanRaden Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD, USA
P. M. VanRaden and T. A. Cooper * Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD, USA
April 2010 (1) Prediction of Breed Composition & Multibreed Genomic Evaluations K. M. Olson and P. M. VanRaden.
2007 Paul VanRaden and Melvin Tooker* Animal Improvement Programs Laboratory 2010 Gains in reliability from combining subsets.
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
Paul VanRaden and Chuanyu Sun Animal Genomics and Improvement Lab USDA-ARS, Beltsville, MD, USA National Association of Animal Breeders Columbia, MO, USA.
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
P.M. VanRaden and D.M. Bickhart Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD, USA
What have we learned from sequencing efforts to date? Jared Decker Assistant Professor Beef Genetics Specialist Computational Genomics.
Multibreed Genomic Evaluation Using Purebred Dairy Cattle K. M. Olson* 1 and P. M. VanRaden 2 1 Department of Dairy Science Virginia Polytechnic and State.
2006 Paul VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD AIPL Contributions.
G.R. Wiggans,* 1 P.M. VanRaden, 1 T.A. Cooper, 1 C.P. Van Tassell, 2 T. Sonstegard, 2 and B. Simpson 3 1 Animal Improvement Programs and 2 Bovine Functional.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD G.R. WiggansADSA 18.
G.R. Wiggans 1, T. A. Cooper 1 *, K.M. Olson 2 and P.M. VanRaden 1 1 Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville,
2007 Paul VanRaden Animal Improvement Programs Laboratory, USDA Agricultural Research Service, Beltsville, MD, USA 2008 New.
Multibreed Genomic Evaluations in Purebred Dairy Cattle K. M. Olson 1 and P. M. VanRaden 2 1 National Association of Animal Breeders 2 AIPL, ARS, USDA.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD Select Sires‘ Holstein.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 2011 National Breeders.
G.R. Wiggans, T. A. Cooper* and P.M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD
2007 Paul VanRaden 1, Curt Van Tassell 2, George Wiggans 1, Tad Sonstegard 2, Bob Schnabel 3, Jerry Taylor 3, and Flavio Schenkel 4, Paul VanRaden 1, Curt.
Strategies to Incorporate Genomic Prediction Into Population-Wide Genetic Evaluations Nicolas Gengler 1,2 & Paul VanRaden 3 1 Animal Science.
My vision for dairy genomics
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Genome-wide Association Studies
The effect of using sequence data instead of a lower density SNP chip on a GWAS EAAP 2017; Tallinn, Estonia Sanne van den Berg, Roel Veerkamp, Fred van.
Beyond GWAS Erik Fransen.
Methods to compute reliabilities for genomic predictions of feed intake Paul VanRaden, Jana Hutchison, Bingjie Li, Erin Connor, and John Cole USDA, Agricultural.
Perspectives from Human Studies and Low Density Chip
Increased reliability of genetic evaluations for dairy cattle in the United States from use of genomic information Abstr.
Using Haplotypes in Breeding Programs
Development of Genomic GMACE
Presentation transcript:

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (1) J. R. O’Connell 1 and P. M. VanRaden 2 1 University of Maryland School of Medicine, Baltimore, MD, USA 2 Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD, USA Strategies to choose from millions of imputed sequence variants

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (2) Introduction l Whole-genome sequencing costs are rapidly declining l Large numbers of animals will be sequenced in the next few years l How will WGS be used in genomic evaluations?

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (3) Applications of WGS data l Develop reference panels for imputation using SNP chip l Discover QTLs l Discover rare and de novo mutations with deleterious effects for surveillance l Build better SNP chips

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (4) WGS implementation considerations l Cost of higher density SNP chips to industry l Imputation cost for weekly/monthly genomic evaluations l Redesigning chip incurs cost of generating new data for imputation w Build off the 60K chip

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (5) Project goal: Prepare for WGS l Develop programs to simulate genotypes and QTLs l Develop programs to process sequence data l Compare strategies for variant selection l Predict gains in reliability l Determine computational footprint for large populations

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (6) Data l 26,984 HOL bulls in U.S. reference population l 112,905 animals in the pedigree l 30 million simulated variants, 10,000 QTLs l 30 equal-length chromosomes (100 Mbases) l 3 different chip densities (HD, MD, LD) l 5 independent traits (same QTL locations)

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (7) Genotype simulation and pruning l Simulate 30 million genotypes plus QTLs in 1000 bulls using genosim.f90 l Remove loci with MAF 0.95 l Keep 500K loci near QTLs (genes) + 600K chip + 60K chip l 8,403,858 loci remain

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (8) QTL effect distribution (% variance) Top QTLs Trait Avg , ,

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (9) QTL correlation to 8M SNP set

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (10) Bull genotypes: Imputation findhap.f90 BullsGenotype densityVariants 1,000Sequenced pruned to8,403, High density600,000 24,863Medium density60, Low density12,000 26,984Total bulls, all imputed to8,403,858 17,896Phenotyped (old)DYD 9,088Validation (young)True BV

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (11) Computation required Step Proc- essors Time (hours) Memory (Gbyte) Disk (Gbyte) Simulate 30M Prune linkage Impute 8M

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (12) Study1: Select 25K from 600K chip l Compare 60K and 600K REL using same data l Choose best markers from 600K, add to 60K w Select largest 5,000 for each of 5 traits w Multiple regression: By effect size, effect variance w GWA: By p-value significance w Merge and remove duplicates, adding  23K

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (13) Results: REL from PA, 60K, 85K, 600K 60K + 25K TraitPA60KGWASizeVar600K Avg

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (14) Genome-wide association l Single SNP regression within mixed model that accounts for the polygenic effect using pedigree kinship w Y= Xb + SNP + a + e l Multiple testing adjustment leads to stringent cutoffs for significance l Focus on estimation rather than prediction

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (15) GWA trait 2 imputed QTLs <5% QTL < 10e-5

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (16) Study 2: Select sequence variants l 8M – search everywhere l 1M – search by bioinformatics using 2.5kb window on QTL l QTLs – perfect functional knowledge l Include 60K chip for imputation, but assign all or most prior variance to the selected variants w Selection of variants may affect optimal shape/ choice of prior distribution

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (17) REL from 1M, 60K+1M subset, QTLs 1 million near QTLsInclude QTLs Trait600K60K+25KAll 1M60K+10K10K Avg

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (18) Computation required Step Proc- essors Time (hours) Memory (Gbyte) Disk (Gbyte) Simulate 30M Prune linkage Impute 8M Predict 1M52220<1 Select 25K from 8M 300.5<1

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (19) Conclusions l Selection of variants requires several steps l Potential gains from sequence are large l Still requires very large reference population l Methods scalable to more sequences l Bioinformatics can narrow the search l GWA is scalable to larger data but multiple regression give higher reliability

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (20) Current directions l Evaluate several additional SNP selection criteria including priors for GWA l Apply methods to real data from 1000 Bulls Project l Evaluate bovine bioinformatics resources for functional annotation

Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (21) Acknowledgments l JRO supported by USDA SCA