WiggansARS Big Data Workshop – July 16, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville,

Slides:



Advertisements
Similar presentations
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
Advertisements

2006 George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD USDA Genetic.
Genomic imputation and evaluation using 1074 high density Holstein genotypes P. M. VanRaden 1, D. J. Null 1 *, G.R. Wiggans 1, T.S. Sonstegard 2, E.E.
John B. Cole* and Paul M. VanRaden Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD
Wiggans, 2014CDCB meeting – August 5 (1) G.R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD
George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD 2008 Genetic trends.
George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Select Sires’
2007 Paul VanRaden and Jeff O’Connell Animal Improvement Programs Lab, Beltsville, MD U MD College of Medicine, Baltimore, MD
J. B. Cole 1, P. D. Miller 2, and H. D. Norman 1 1 Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD 2 Department.
Wiggans, 2013RL meeting, Aug. 15 (1) Dr. George R. Wiggans, Acting Research Leader Bldg. 005, Room 306, BARC-West (main office);
But who will be the next GREAT one?. USA Bull Proofs * Bulls are ranked based upon their DAUGHTER’S (progeny) production and physical characteristics.
2007 Jana L. Hutchison Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD , USA
India Emerging Markets Conference, May 2009 (1) Leigh Walton Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville,
December 2014 Proof Changes
2001 ADSA annual meeting, July 2001 (1) Timeliness of progeny-testing through AI and percentage of bulls returned to service (abstract 1020) H.D. NORMAN,*
2007 J. B. Cole 1,*, P. M. VanRaden 1, J. R. O'Connell 3, C. P. Van Tassell 1,2, T. S. Sonstegard 2, R. D. Schnabel 4, J. F. Taylor 4, and G. R. Wiggans.
Wiggans, 2013Japanese Genomics Tour (1) Dr. George R. WiggansDr. H. Duane Norman Acting Research LeaderInterim Administrator Animal Improvement Programs.
WiggansANSC UMD(1) 2013 George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD , USA
George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD National Association.
Wiggans, 2013SRUC Imputation (1) Dr. George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD ,
 PTA mobility was highly correlated with udder composite.  PTA mobility showed a moderate, positive correlation with production, productive life, and.
Wiggans, th WCGALP (1) G.R. Wiggans*, T.A. Cooper, D.J. Null, and P.M. VanRaden Animal Genomics and Improvement Laboratory Agricultural Research.
2002 ADSA 2002 (HDN-1) H.D. NORMAN* ( ), R.H. MILLER, P.M. V AN RADEN, and J.R. WRIGHT Animal Improvement Programs.
Norway (1) 2005 Status of Dairy Cattle Breeding in the United States Dr. H. Duane Norman Animal Improvement Programs Laboratory Agricultural Research Service,
Bovine Genomics The Technology and its Applications Gerrit Kistemaker Chief Geneticist, Canadian Dairy Network (CDN) Many slides were created by.
AFGC Convention 2004 (1) 2004 Possibilities for Improving Dairy Cattle Performance Dr. H. Duane Norman Animal Improvement Programs Laboratory Agricultural.
2007 Paul VanRaden, George Wiggans, Jeff O’Connell, John Cole, Animal Improvement Programs Laboratory Tad Sonstegard, and Curt Van Tassell Bovine Functional.
G. R. Wiggans and P. M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD
2002 George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD USDA Dairy Goat.
John B. Cole, Ph.D. Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD, USA The U.S. genetic.
2007 J.B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Overview.
2005 Paul VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD An Example from Dairy.
2005 Paul VanRaden Animal Improvement Programs Laboratory, USDA Agricultural Research Service, Beltsville, MD, USA Selection for.
Genetic Evaluation of Lactation Persistency Estimated by Best Prediction for Ayrshire, Brown Swiss, Guernsey, and Milking Shorthorn Dairy Cattle J. B.
T. A. Cooper and G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD Council.
Adjustment of selection index coefficients and polygenic variance to improve regressions and reliability of genomic evaluations P. M. VanRaden, J. R. Wright*,
2003 Melvin Tooker, Paul VanRaden, Ashley Sanders, and George Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville,
2007 Melvin Tooker Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD, USA
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 2009 G.R. WiggansCouncil.
WiggansCDCB industry meeting – Sept. 29, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville,
G.R. Wiggans* and P.M. VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD AIPL Report.
2006 Mid-Atlantic Dairy Grazing Conference, 2006 (1) Is There a Need for Different Genetics in Dairy Grazing Systems? H. D. Norman, J. R. Wright, R. L.
XX International Grassland Conference 2005 (1) 2005 Genetic Alternatives for Dairy Producers who Practise Grazing H. D. Norman, J. R. Wright, R. L. Powell.
2007 Paul VanRaden Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD, USA
Norman, 2014ICAR / Interbull annual meeting, Berlin, Germany, May 20, 2014 (1) Dr. H. Duane Norman Interim Administrator Council on Dairy Cattle Breeding.
H.D. Norman, J.R. Wright, and R.H. Miller Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD, USA
WiggansARS Big Data Computing Workshop (1) 2013 George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville,
George R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Select Sires’
WiggansARPAS-DC meeting, Beltsville, MD – Dec. 9, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service,
P.M. VanRaden and D.M. Bickhart Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD, USA
2003 Paul VanRaden Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Genetic Evaluation.
Multi-trait, multi-breed conception rate evaluations P. M. VanRaden 1, J. R. Wright 1 *, C. Sun 2, J. L. Hutchison 1 and M. E. Tooker 1 1 Animal Genomics.
2003 P.M. VanRaden* and M.E. Tooker Animal Improvement Programs Laboratory Agricultural Research Service, USDA, Beltsville, MD Definition.
H.D. Norman* J.R. Wright, P.M. VanRaden, and M.T. Kuhn Animal Improvement Programs Laboratory Agricultural.
2007 John Cole, Paul VanRaden, George Wiggans, and Melvin Kuhn Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD,
2006 GEORGE R. WIGGANS Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, Maryland ,
2001 ADSA Indianapolis 2001 (1) Heterosis and Breed Differences for Yield and Somatic Cell Scores of US Dairy Cattle in the 1990’s. PAUL VANRADEN Animal.
2007 Paul VanRaden Animal Improvement Programs Laboratory, USDA Agricultural Research Service, Beltsville, MD, USA 2008 New.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD Select Sires‘ Holstein.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 2011 National Breeders.
CRI – Spanish update (1) 2010 Status of Dairy Cattle Breeding in the United States Dr. H. Duane Norman Animal Improvement Programs Laboratory Agricultural.
John B. Cole Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD 2016 AGIL Report.
2006 8th World Congress on Genetics Applied to Livestock Production (1) Trait Selection When Culling U.S. Holsteins H.D. Norman, J.L. Hutchison, J.R. Wright,
H.D. NORMAN,* R.L. POWELL, J.R. WRIGHT
My vision for dairy genomics
Correlations Among Measures of Dairy Cattle Fertility and Longevity
Where AIPL Fits In Agricultural Research Service (ARS) is the main research arm of USDA (8,000 employees with 2,000 scientists at >100 locations) Beltsville.
Percent of total breedings
Increased reliability of genetic evaluations for dairy cattle in the United States from use of genomic information Abstr.
Presentation transcript:

WiggansARS Big Data Workshop – July 16, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville, MD , USA Big data in support of genetic improvement of dairy cattle `

WiggansARS Big Data Workshop – July 16, 2015 (2) Mission l Genetic improvement of dairy cattle for economically important traits w Yield (milk, fat, and protein) w Conformation (overall and individual traits) w Longevity (productive life) w Fertility (conception and pregnancy rates) w Calving (dystocia and stillbirth) w Disease resistance (mastitis)

WiggansARS Big Data Workshop – July 16, 2015 (3) Data types l Identification information for animal, sire, and dam: w Name w ID number w Birth date l Animal genotypes from marker panels that that range from 2,900 to 777,962 markers w Breed w Herd w Country Courtesy of Illumina, Inc.

WiggansARS Big Data Workshop – July 16, 2015 (4) Data types (continued) l Records for milk yield, fat percentage, protein percentage, and somatic cell count (1/month) l Appraiser-assigned scores for  16 body and udder characteristics related to conformation (e.g., stature) l Breeding records that include indicator for conception success l Calving difficulty scores and stillbirth occurrences

WiggansARS Big Data Workshop – July 16, 2015 (5) Data amounts l Pedigree records:71,974,045 l Animal genotypes:1,035,590 l Lactation records (since 1960):132,629,200 l Daily yield records (since 1990):641,864,015 l Reproduction event records:176,559,035 l Calving difficulty scores:29,528,607 l Stillbirth scores:19,567,198

WiggansARS Big Data Workshop – July 16, 2015 (6) Computing environment l Computation server w 2.27 GHz CPU (32 cores, 64 threads) w 660 GB RAM w 2.7 TB local storage l Database server w 3.4 GHz CPU (12 cores, 24 threads) w 264 GB RAM w 1.3 TB local storage l Shared storage w 38 TB

WiggansARS Big Data Workshop – July 16, 2015 (7) Data management l Variable length segments for database rows to minimize space and overhead in identifying data l All marker genotypes for an animal stored each as a single byte in a character large object (CLOB) l All breedings and monthly milk yield and component information for a cow’s lactation stored in variable character data types

WiggansARS Big Data Workshop – July 16, 2015 (8) Programming languages lClC w Database interface including data editing l FORTRAN w Calculation of genetic merit estimates l SAS w Data preparation, checking, and delivery

WiggansARS Big Data Workshop – July 16, 2015 (9) Calculation schedule l Triannual genetic merit estimates from processed phenotypic data l Monthly genomic evaluations based on estimates of marker effects using genotypic data and triannual phenotype-based evaluations l Weekly evaluations using marker effect estimates from monthly evaluations

WiggansARS Big Data Workshop – July 16, 2015 (10) Transition to industry l Council on Dairy Cattle Breeding w Database maintenance w Calculation and distribution of genetic merit estimates w Interface with evaluation users and data suppliers l ARS w Research and development using data made available by Council

WiggansARS Big Data Workshop – July 16, 2015 (11) Research resource l Massive amount of genomic data  Location of causal genetic variants l Investigation of haplotypes never found in a homozygous state ÜDiscovery of chromosomal abnormalities resulting in early embryonic death l Investigation of sons of heterozygous sires ÜSpecific markers associated with differences between sons by haplotype

WiggansARS Big Data Workshop – July 16, 2015 (12) Genetic merit of marketed Holstein bulls Average gain: $19.42/year Average gain: $47.95/year Average gain: $87.49/year

WiggansARS Big Data Workshop – July 16, 2015 (13) Working with sequence data l Sequence available from 1000 Bull Genomes Project hosted in Australia l Project funded by industry to sequence over 200 bulls to create a haplotype library l A posteriori granddaughter design to locate chromosomal segments of interest from 71 bulls each with over 100 genotyped and progeny- tested sons

WiggansARS Big Data Workshop – July 16, 2015 (14) Imputing sequence data l Haplotype library supports imputation l Genotypes from genotyping chips can be imputed to full sequence l Lower accuracy of sequence data compared with chip genotypes accommodated by dealing in dosages to represent allele content l Findhap v4 (VanRaden) fast and more accurate than Beagle at low × coverage

WiggansARS Big Data Workshop – July 16, 2015 (15) Alignment of sequence data l Alignment – determining location of chromosomal segments provided by sequencer l Findmap – matches segment against library of haplotypes l Preserves low-frequency variants l Does not identify new variants l Uses a hash table to find variant enabling rapid processing

WiggansARS Big Data Workshop – July 16, 2015 (16) Accuracy of Findhap vs. Beagle* Sequence + HDImputed from HD ProgramDepthCorrectCorrelationCorrectCorrelation Findhap8× × × Beagle8× × × *250 bulls had sequence + HD; 250 others were imputed from HD

WiggansARS Big Data Workshop – July 16, 2015 (17) Data storage and backup l Disk storage being added w Compression option being investigated l Back up to tape with weekly submission to off- site storage l Expect to have internet 2 connection w Facilitate sharing of sequence data

WiggansARS Big Data Workshop – July 16, 2015 (18) Summary l Highly successful program leading to annual increases in genetic merit for production efficiency l Large database of phenotypic and genomic data provided by industry l Research projects to determine mechanism of genetic control of economically important traits l Data processing techniques developed so that rapid turnaround could be realized