Seefried, F., von Rohr P., Drögemüller C.

Slides:



Advertisements
Similar presentations
Mendel’s Laws.
Advertisements

Simple genetics example
Patterns of Heredity 4.1 Living things inherit traits in patterns. 4.2
Support vector machine
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
CharacterCrossNumber counted Shape of seed round x wrinkled 4832 round 0 wrinkled Shape of seed round x round 5474 round 1850 wrinkled Gregor Mendel got.
Genetics Made by: Nebo School District es/ppt/
Patterns of Heredity CHAPTER the BIG idea CHAPTER OUTLINE In sexual reproduction, genes are passed from parents to offspring in predictable patterns. Living.
Genetics: The Science of Heredity
From sequence data to genomic prediction
Exploring Genotypes and Phenotypes of Longhorn Cattle.
Day 1 In rabbits the allele for black coat color (B) is dominant over the allele for brown coat (b). What is the genotype ratio and phenotype ratio from.
Genetics Chapters 9 and 12.
Rutgers CS440, Fall 2003 Support vector machines Reading: Ch. 20, Sec. 6, AIMA 2 nd Ed.
Exploring Genotypes and Phenotypes of Longhorn Cattle Base Color.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Figure S1. Quantile-quantile plot in –log10 scale for the individual studies The red line represents concordance of observed and expected values. The shaded.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Genetics Review!. Mendel Dominance Inheritance MutationsPedigrees $100 $200 $300 $400 $500 FINAL JEOPARDY FINAL JEOPARDY.
CS177 Lecture 10 SNPs and Human Genetic Variation
John B. Cole 1, Daniel J. Null *1, Chuanyu Sun 2, and Paul M. VanRaden 1 1 Animal Genomics and Improvement 2 Sexing Technologies Laboratory Navasota, TX.
Supplemental Figure 1A. A small fraction of genes were mapped to >=20 SNPs. Supplemental Figure 1B. The density of distance from the position of an associated.
John B. Cole Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD AIPL Report.
Support Vector Machines and Gene Function Prediction Brown et al PNAS. CS 466 Saurabh Sinha.
CHAPTER 12: GENETICS.
Mammogram Analysis – Tumor classification - Geethapriya Raghavan.
Multibreed Genomic Evaluation Using Purebred Dairy Cattle K. M. Olson* 1 and P. M. VanRaden 2 1 Department of Dairy Science Virginia Polytechnic and State.
G.R. Wiggans,* 1 P.M. VanRaden, 1 T.A. Cooper, 1 C.P. Van Tassell, 2 T. Sonstegard, 2 and B. Simpson 3 1 Animal Improvement Programs and 2 Bovine Functional.
Whole genome selection and the 2000 bull project at USMARC Larry Kuehn Research Geneticist.
G.R. Wiggans 1, T. A. Cooper 1 *, K.M. Olson 2 and P.M. VanRaden 1 1 Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville,
 Reebop Activity SBI 3U February 19, Summary  What were some of the observable differences between our Reebop babies and Reebop parents?  What.
Exploring Genotypes and Phenotypes of Longhorn Cattle.
G.R. Wiggans Animal Improvement Programs Laboratory Agricultural Research Service, USDA Beltsville, MD 2011 National Breeders.
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
Monohybrid Problems. General Rules to solving genetic crosses using Punnett Squares. 1. Make key 2. Write Parent Genotypes 3. Perform cross using square.
Sequencing Brown Swiss Cattle: GWAS and Genomic Prediction based on sequence data Mirjam Frischknecht World Brown Swiss Congress, Mende Von.
071126_EAS56_0057_FC – lanes 1-8 read 2 b a _EAS56_0057_FC – lanes 1-8 read 1 Table S1. Summary tables for a read 1 and b read 2 of a.
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Variation and Inheritance
Mendelian Heredity (Fundamentals of Genetics) Chapter 9
Disease risk prediction
6. Kernel Regression.
Mammogram Analysis – Tumor classification
Punnett Squares pp
Practice Problems.
Exploring Mendelian Genetics
Studying the inheritance of two characters simultaneously
Fundamentals of Genetics
Support Vector Machines
Warm-Up –1/13 Describe the phenotypic results of Mendel’s cross between two purebred parents (the F1 generation). Describe the phenotypic results of.
The effect of using sequence data instead of a lower density SNP chip on a GWAS EAAP 2017; Tallinn, Estonia Sanne van den Berg, Roel Veerkamp, Fred van.
CHAPTER 12: GENETICS.
Genetics definitions Label each chromosome pair as homozygous dominant, homozygous recessive, or heterozygous with definitions Label dominant.
Canine hip dysplasia is predictable by genotyping
Preview Science Concepts Using Science Graphics Writing Skills.
Detecting variance-controlling QTL
Mendelian Inheritance
Genomic Evaluations.
Studying the inheritance of two characters simultaneously
Genetics Review!.
Genomic Selection in Dairy Cattle
Perspectives from Human Studies and Low Density Chip
Using Haplotypes in Breeding Programs
Monohybrid crosses.
Predicting the Inheritance of Alleles
A Definitive Haplotype Map as Determined by Genotyping Duplicated Haploid Genomes Finds a Predominant Haplotype Preference at Copy-Number Variation Events 
Kernel Methods for large-scale Genomics Data Analysis
TLR7 escape from X inactivation in immune cells.
Presentation transcript:

Seefried, F., von Rohr P., Drögemüller C. Genotype prediction for a structural variant in Brown Swiss cattle (BSW) using Illumina Beadchip data Seefried, F., von Rohr P., Drögemüller C. 68th EAAP Meeting, 28th August 2017, Tallin, EST

Colour sidedness in Brown Swiss Usually Brown Swiss (BSW) cattle is characterized by solid brown colour 2 hypopigemented colour phenotypes exist at low frequency Belt Colour sidedness (Cs)

Genetics of Cs Complex structural variant Serial translocation of chromosome segments between BTA6 and BTA29 (Durkin et al. 2012) Dominant inheritance Illumina Infinium assay data provide normalised signal intensities (Log R Ratio)

Log R Ratio Log R Ratio Map position BTA29 signal intensities for Wild-type (black) and heterozygous (red) Cs animals Log R Ratio Map position

Support Vector Machines Supervised classification model Classification of geno- / phenotypes True Cs-genotypes were available Log R Ratios from SNPs within translocated region Genotype data Linear kernel Radial kernel

Classification - Example f(x) non-linear f(x) linear Log R Ratio Map position

Data & Prediction accuracy Chip densities: Illumina 777K / 150K / GGPLD Validation strategy: leave-one-out CV Accuracy criteria: Proportion of number of correct predictions over no. of total sample size True positive proportion Coat Phenotype Cs - Genotype No. of animals Solid brown wt/wt 2003 Colour-sided wt/Cs 87 Cs/Cs 19

Cs-Genotype prediction Results Proportion (%) of correct predictions using Log R Ratio (imputed genotypes) Chip Cs-Genotype prediction Phenotype Prediction SVM Kernel Linear Radial 777K 99.8 (99.3) 99.7 (99.6) 100 99.9 150K 99.8 (99.9) 99.8 (99.1) 99.8 LD 95.8 97.2 97.1 97.7 True positive proportion (%) using Log R Ratio (imputed genotypes) 777K 94.6 (91.6) 92.5 (93.0) 100 150K 96.6 (98.9) 92.7 (80.6) 96.2 LD 3.5 66.7 43.5 66.3

Summary SVM seem to perform well for calling structural variants using signal intensities / genotype data Benefits in prediction accuracy were detected using genotypes compared to using Log R ratio Linear kernel outperformed radial kernel in accuracy Effect of chip density No. of SNPs Benefits / Limitations using Log R ratio data: No pedigree / imputation Complete callingrate required No. of SNPs / size of the structural variant

Thank you!

Log R ratio - Computation R intensity values result from polar transformation of signal intensity readouts R stands for copy number Theta represents the angle => Genotype Log R ratio Ratio between observed and expected intensities Logs because distribution properties better

Log R ratio - Diagram

Classification Distribute data plane into different regions according to data classes Example Two classes red and black Separate plan into red and black region Function f(x) describes boundary, can either be linear or non-linear Linear f(x) => Regression Non-linear f(x) => one possibility SVM