What have we learned from sequencing efforts to date? Jared Decker Assistant Professor Beef Genetics Specialist Computational Genomics.

Slides:



Advertisements
Similar presentations
Reliable genomic evaluations across breeds and borders Sander de Roos CRV, the Netherlands.
Advertisements

Introduction to genomes & genome browsers
Major insights from the HGP on Nature (2001) 15 th Feb Vol 409 special issue; pgs 814 & )Gene content 2)Proteome content 3)SNP identification.
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
Bob Weaber, Ph.D. Cow-Calf Extension Specialist Assistant Professor Dept. of Animal Sciences and Industry
Speaker: HU Xue-Jia Supervisor: WU Yun-Dong Date: 19/12/2013.
Outline to SNP bioinformatics lecture
2007 Paul VanRaden and Jeff O’Connell Animal Improvement Programs Lab, Beltsville, MD U MD College of Medicine, Baltimore, MD
SNP database 張學偉 助理教授 高雄醫學大學 生物醫學暨環境生物學系. SNP = Single Nucleotide Polymorphism (read in SNiP)
FINAL EXAM: TAKE-HOME Assessment of Significance in Cancer Gene SNPs.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
Genome Browsers UCSC (Santa Cruz, California) and Ensembl (EBI, UK)
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
PolyPhen and SIFT: Tools for predicting functional effects of SNPs Epi 244 Spring 2009 Sam S. Oh.
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
Whole Genome Sequencing: Background and View of Current Efforts Larry Kuehn, Ph.D. Research Geneticist USDA, ARS, U.S. Meat Animal Research Center The.
Training, Validation, and Target Populations Training, Validation, and Target Populations Mark Thallman, Kristina Weber, Larry Kuehn, Warren Snelling,
Bob Weaber, Ph.D. Cow-Calf Extension Specialist Associate Professor Dept. of Animal Sciences and Industry
How Genomics is changing Business and Services of Associations Dr. Josef Pott, Weser-Ems-Union eG, Germany.
Introduction to Animal Breeding & Genomics Sinead McParland Teagasc, Moorepark, Ireland.
Genome Variations & GWAS
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
WiggansARS Big Data Workshop – July 16, 2015 (1) George R. Wiggans Animal Genomics and Improvement Laboratory Agricultural Research Service, USDA Beltsville,
ARC Biotechnology Platform: Sequencing for Game Genomics Dr Jasper Rees
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
1 Application of Molecular Technologies in Beef Production Dan W. Moser, Ph.D Department of Animal Sciences and Industry Kansas State University, Manhattan.
Identification and evaluation of causative genetic variants corresponding to a certain phenotype Xidan Li.
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
Bovine Genomics The Technology and its Applications Gerrit Kistemaker Chief Geneticist, Canadian Dairy Network (CDN) Many slides were created by.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Genomics. Finding True Genetic Merit 2 Dam EPD Sire EPD Pedigree Estimate EPD TRUE Progeny Difference Mendelian Sampling Effect Adapted from Dr. Bob Weaber.
2007 Melvin Tooker and Paul VanRaden Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2009 Happy Bulls, Happy Cows,
Online Mendelian Inheritance in Man (OMIM): What it is & What it can do for you Knowledge Management & Eskind Biomedical Library January 27, 2012 helen.
Use of DNA information in Genetic Programs.. Next Four Seminars John Pollak – DNA Tests and genetic Evaluations and sorting on genotypes. John Pollak.
Jeff O’ConnellInterbull annual meeting, Orlando, FL, July 2015 (1) J. R. O’Connell 1 and P. M. VanRaden 2 1 University of Maryland School of Medicine,
Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes.
POLYMORPHISM AND VARIANT ANALYSIS Saurabh Sinha, University of Illinois.
Geuvadis Analysis Meeting 16/02/2012 Micha Sammeth CNAG – Barcelona.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
April 2010 (1) Prediction of Breed Composition & Multibreed Genomic Evaluations K. M. Olson and P. M. VanRaden.
Update on USDA Fertility Project Megan Rolf Oklahoma State University 2015 NBCEC Brown Bagger.
12/16/14 StarterConnection/Exit: What is the true meaning of the word mutation? Are mutations bad / harmful? 12/16/14 Protein Synthesis Writing
Finding genes in the genome
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD G.R. WiggansADSA 18.
Whole genome selection and the 2000 bull project at USMARC Larry Kuehn Research Geneticist.
Single Nucleotide Polymorphisms (SNPs) By Amira Jhelum Rahul Shweta.
Research proposal 2009 信息技术会议 Bioinformatics Analysis & Identification of non-Synonymous SNPs in Candidate Genes for Ascites College of Animal Husbandry.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD Select Sires‘ Holstein.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 2011 National Breeders.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 2011 G.R. Wiggans DNA.
Bull Selection Strategies Using Genomic Estimated Breeding Values L. R. Schaeffer CGIL, University of Guelph ICAR-Interbull Meeting Niagara Falls, NY June.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Genome-Wides Association Studies (GWAS) Veryan Codd.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
? ? Individual 1Individual 2 1. Questions This is a pedigree for a disease involving a mutation within an imprinted gene. The disease manifests only when.
DNA Sire Identification Meat Animal Research Center Clay Center, NE
upstream vs. ORF binding and gene expression?
Use of DNA information in Genetic Programs.
Gene Hunting: Design and statistics
What are the Patterns Of Nucleotide Substitution Within Coding and
Beyond GWAS Erik Fransen.
Genome organization and Bioinformatics
Genomic Evaluations.
Nic’s genome contains 16,124 variants,
Pharmacogenomic variability and anaesthesia
Genomic Selection in Dairy Cattle
Increased reliability of genetic evaluations for dairy cattle in the United States from use of genomic information Abstr.
The Impact of the 1000 Bull Genomes Project and its Future
Presentation transcript:

What have we learned from sequencing efforts to date? Jared Decker Assistant Professor Beef Genetics Specialist Computational Genomics

By the July of 2015, costs $0.015 to sequence 1 million base pairs. September 2001, costs $5,292 to sequence 1 million base pairs.

Illumina HiSeq X Ten

Effective Population Size Time RecentHistorical Cattle Human

Where are we going next?

Imputation

Where are we going next? Imputation

Where are we going next? Imputation

Where are we going next? Imputation Analysis

Where are we going next? Imputation Analysis

16 GWAS on HD vs Imputed Sequence 1978 Holstein calves – BRD Case/Control Taylor Lab 11,641,785 SNPs 654,044 SNPs h 2 increases from 21% to 23% A lot more significant associations – new variants are discovered! Largest effect QTL changes

Mature Height 3,362 Animals FDR = 0.1 ( ____ ) GWAS on imputed data works nicely. Imputation with multiple breed reference set also works.

Where are we going next? Back to causal variants Led to genes by data not researchers opinion Continue to take very large numbers of genes and genomic regions into account

Causal Variant Successes Genetic defects Embryonic lethals Horned/Polled Coat Color

Causal Variant Successes Genetic defects Embryonic lethals Horned/Polled Coat Color

Causal Variant Successes Genetic defects Embryonic lethals Horned/Polled Coat Color

Causal Variants This is not easy! Genome structural variation discovery and genotyping Can Alkan, Bradley P. Coe & Evan E. Eichler Nature Reviews Genetics 12, (May 2011) doi: /nrg2958

DescriptionNumber of Variants Amino Acid Genomic106,634 Untranslated region Genomic137,581 Splice Genomic26,200 No homozygote Genomic1,016,064 Amino Acid RNA188,429 Untranslated region RNA50,061 Splice RNA279,906 No homozygote RNA158,999 Amino Acid 1kBulls32,756 Untranslated region 1kBulls19,407 Splice 1kBulls6,820 No homozygote 1kBulls6,271 2,029,128

Where are we going next? More types of SNP chips/panels Higher density panels focusing on functional variants for research – GGP-F250 Lower density panels focusing on functional variants for prediction Functional variants -> variants that influence protein sequence or abundance

DescriptionNumber Variants Imputation content33,730 Functional content193,503 Total variants227,233 from HD as part of imputation content31,835 from SNP50 as part of imputation content (subset of HD)22,183 from HD as part of functional content6,395 from SNP50 as part of functional content (subset of HD)443 Total HD38,230 Total SNP50 (subset of HD)22,626 NS Sift deleterious22,298 NS Sift tolerated48,994 NS no sift prediction49,627 Total Non-synonymous (NS) AA substitutions120,919 Frameshift indels1,265 In-Frame indels585 UTR20,402 Ensembl ncRNA (snoRNA, miRNA, snRNA, rRNA,Mt_tRNA, Mt_rRNA)1,573 Conserved non-coding elements4,081 Splice (not mutually exclusive)6,378 Genes with at least 1 variant23,059 Genes with no variant7,714

DescriptionNumber Variants Imputation content 33,730 Functional content193,503 Total variants227,233 from HD as part of imputation content31,835 from SNP50 as part of imputation content (subset of HD)22,183 from HD as part of functional content6,395 from SNP50 as part of functional content (subset of HD)443 Total HD38,230 Total SNP50 (subset of HD)22,626 NS Sift deleterious22,298 NS Sift tolerated48,994 NS no sift prediction49,627 Total Non-synonymous (NS) AA substitutions120,919 Frameshift indels1,265 In-Frame indels585 UTR20,402 Ensembl ncRNA (snoRNA, miRNA, snRNA, rRNA,Mt_tRNA, Mt_rRNA)1,573 Conserved non-coding elements4,081 Splice (not mutually exclusive)6,378 Genes with at least 1 variant23,059 Genes with no variant7,714

DescriptionNumber Variants Imputation content33,730 Functional content 193,503 Total variants227,233 from HD as part of imputation content31,835 from SNP50 as part of imputation content (subset of HD)22,183 from HD as part of functional content6,395 from SNP50 as part of functional content (subset of HD)443 Total HD38,230 Total SNP50 (subset of HD)22,626 NS Sift deleterious22,298 NS Sift tolerated48,994 NS no sift prediction49,627 Total Non-synonymous (NS) AA substitutions120,919 Frameshift indels1,265 In-Frame indels585 UTR20,402 Ensembl ncRNA (snoRNA, miRNA, snRNA, rRNA,Mt_tRNA, Mt_rRNA)1,573 Conserved non-coding elements4,081 Splice (not mutually exclusive)6,378 Genes with at least 1 variant23,059 Genes with no variant7,714

DescriptionNumber Variants Imputation content33,730 Functional content193,503 Total variants227,233 from HD as part of imputation content31,835 from SNP50 as part of imputation content (subset of HD)22,183 from HD as part of functional content6,395 from SNP50 as part of functional content (subset of HD)443 Total HD38,230 Total SNP50 (subset of HD)22,626 NS Sift deleterious22,298 NS Sift tolerated48,994 NS no sift prediction49,627 Total Non-synonymous (NS) AA substitutions 120,919 Frameshift indels1,265 In-Frame indels585 UTR20,402 Ensembl ncRNA (snoRNA, miRNA, snRNA, rRNA,Mt_tRNA, Mt_rRNA)1,573 Conserved non-coding elements4,081 Splice (not mutually exclusive)6,378 Genes with at least 1 variant23,059 Genes with no variant7,714

DescriptionNumber Variants Imputation content33,730 Functional content193,503 Total variants227,233 from HD as part of imputation content31,835 from SNP50 as part of imputation content (subset of HD)22,183 from HD as part of functional content6,395 from SNP50 as part of functional content (subset of HD)443 Total HD38,230 Total SNP50 (subset of HD)22,626 NS Sift deleterious22,298 NS Sift tolerated48,994 NS no sift prediction49,627 Total Non-synonymous (NS) AA substitutions120,919 Frameshift indels1,265 In-Frame indels585 UTR20,402 Ensembl ncRNA (snoRNA, miRNA, snRNA, rRNA,Mt_tRNA, Mt_rRNA)1,573 Conserved non-coding elements 4,081 Splice (not mutually exclusive)6,378 Genes with at least 1 variant23,059 Genes with no variant7,714

DescriptionNumber Variants Imputation content33,730 Functional content193,503 Total variants227,233 from HD as part of imputation content31,835 from SNP50 as part of imputation content (subset of HD)22,183 from HD as part of functional content6,395 from SNP50 as part of functional content (subset of HD)443 Total HD38,230 Total SNP50 (subset of HD)22,626 NS Sift deleterious22,298 NS Sift tolerated48,994 NS no sift prediction49,627 Total Non-synonymous (NS) AA substitutions120,919 Frameshift indels1,265 In-Frame indels585 UTR20,402 Ensembl ncRNA (snoRNA, miRNA, snRNA, rRNA,Mt_tRNA, Mt_rRNA)1,573 Conserved non-coding elements4,081 Splice (not mutually exclusive)6,378 Genes with at least 1 variant 23,059 Genes with no variant 7,714

Where are we going next? Then genomic predictions would be based on causal variants or closely linked variants Fewer variants on prediction panels leads to lower cost? Hope for multiple breed predictions? More precise predictions based using causal/closely linked

Where are we going next? Sequencing lots of animals! – Mostly influential sires 1000 Bull Consortium Individual groups – Support from breed associations

Where are we going next? Genomic “surveillance” of influential sires Breed associations sequences a sire when it reaches a certain number of progeny equivalents – 10,000? – 5% of annual registrations? Progeny equivalents – Progeny are 1 points – Grand progeny are 0.5 points – Great-grand progeny are 0.25 points

Where are we going next? Genomic “surveillance” of influential sires Early identification of possible functional variants Good and bad Variants responsible for embryonic lethals

Hype Cycle: DNA in Beef Breeding Time Visibility Technology Trigger Peak of Inflated Expectations Trough of Disillusionment Slope of Enlightenment Plateau of Productivity Genome is a complex system – Slope of Enlightenment might be a long climb!

Questions? A Steak in Genomics