Published by Cora Baker. Modified over 9 years ago.
1
Population Approaches to Detecting and Genotyping Copy Number Variation Lachlan Coin July 2010
2
Outline: Population-haplotype approach to CNV detection and genotyping; application to SNP and CGH data; application to NGS sequence data
3
cnvHap approach to CNV discovery and genotyping. Coin et al., Nature Methods 7, 541-546 (2010)
4
Example of trained model
5
cnvHap models haploid CN transitions. Specify a per-base global transition rate matrix over haploid copy numbers 0 to 4 (one axis: copy number from; other axis: copy number to), with entries q_00, q_10, …. The rate matrix is multiplied by a position-specific scalar rate. Values are trained using EM, following the approach of Klosterman et al., used in Xrate for finding substitution rates.
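As a rough illustration of how a global rate matrix scaled by a position-specific rate could yield transition probabilities between adjacent positions, the sketch below assumes a continuous-time Markov chain, so that P = exp(Q · r · d). That interpretation, the toy matrix (only ±1 copy-number changes at a single made-up rate), and every function name are assumptions for illustration, not cnvHap's actual parameterization or trained values.

```python
def mat_mul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_exp(m, squarings=10, terms=12):
    """Approximate exp(m) by scaling and squaring with a truncated Taylor series."""
    n = len(m)
    scale = 2.0 ** squarings
    scaled = [[x / scale for x in row] for row in m]
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = [[x / k for x in row] for row in mat_mul(term, scaled)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


def toy_rate_matrix(n_states=5, rate=1e-6):
    """Hypothetical Q: only +/-1 copy-number changes, all at the same per-base rate."""
    q = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states):
        for j in (i - 1, i + 1):
            if 0 <= j < n_states:
                q[i][j] = rate
        q[i][i] = -sum(q[i])  # each row of a rate matrix sums to zero
    return q


def transition_probs(q, position_rate, distance_bp):
    """P = exp(Q * r * d): global rates scaled by a position-specific scalar r."""
    t = position_rate * distance_bp
    return mat_exp([[x * t for x in row] for row in q])


# Rows index the "from" copy number, columns the "to" copy number (0..4).
Q = toy_rate_matrix()
P = transition_probs(Q, position_rate=2.0, distance_bp=1000)
```

Under these assumptions each row of P sums to one, and a larger position-specific rate simply makes copy-number changes more likely over the same distance.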
6
cnvHap joint model of CNV + SNP haplotypes
7
Cluster positions modelled using a linear model. Model fitted using ridge regression, carried out at each iteration of the E-M algorithm.
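A minimal sketch of what one such ridge update might look like, assuming a single predictor (copy number), an unpenalized intercept, and per-sample weights standing in for E-step posterior probabilities. The predictor choice, the weights, and the penalty value are hypothetical; the slide does not specify the design matrix.

```python
def weighted_ridge_1d(x, y, w, lam):
    """Minimize sum_i w_i * (y_i - a - b * x_i)**2 + lam * b**2.

    Returns (a, b). The intercept a is not penalized, so it follows from
    the weighted means once the shrunken slope b is known.
    """
    sw = sum(w)
    xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
    if sxx + lam == 0:
        raise ValueError("predictor has no variance and lam is zero")
    b = sxy / (sxx + lam)
    a = ybar - b * xbar
    return a, b


# Toy M-step: cluster centre (intensity) as a linear function of copy number.
copy_number = [0, 1, 2, 3]
intensity = [0.5, 0.9, 1.3, 1.7]
weights = [1.0, 1.0, 1.0, 1.0]  # would be posterior weights from the E-step
a_ols, b_ols = weighted_ridge_1d(copy_number, intensity, weights, lam=0.0)
a_ridge, b_ridge = weighted_ridge_1d(copy_number, intensity, weights, lam=5.0)
```

The penalty shrinks the slope toward zero, which keeps cluster positions stable when few samples carry a given copy number.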
8
Using Illumina SNP arrays
9
Combined Illumina and Agilent arrays (panels: Illumina; Agilent)
10
Some CNVs exhibit shared structure
11
Improved CNV genotyping accuracy (plot: cumulative frequency of squared Pearson correlation)
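For reference, the squared Pearson correlation between estimated and reference copy numbers at one CNV can be computed as below. This is a generic sketch with made-up values; which reference genotypes the slide's comparison used is not stated here.

```python
def squared_pearson(x, y):
    """Squared Pearson correlation (r^2) between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy * sxy / (sxx * syy)


# Hypothetical copy-number calls for four samples at one CNV.
estimated = [1, 2, 3, 4]
reference = [2, 1, 4, 3]
r2 = squared_pearson(estimated, reference)
```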
12
A deletion at 16p11.2 in a patient with ‘extreme obesity’: estimated by aCGH to be 546kb-700kb; flanked by segmental duplication (>99% sequence identity); probably arises by NAHR, implying deletion is 739kb. Patient: BMI = 29.2 kg/m² at age 7½; learning difficulties, delayed speech. (Figure: aCGH log2 ratio along chromosome 16, 28.9 Mb to 30.7 Mb, band p11.2, with MLPA probes and segmental duplications marked.) RG Walters et al. Nature 463, 671-675 (2010) doi:10.1038/nature08727
13
16p11.2 deletions in obesity and population cohorts
Cohort | Lean/Normal weight | Obese
British extreme early-onset obesity (SCOOP) | - | 3/931
French child obesity case:control | 0/530 | 4/643
French adult obesity case:control | 0/669 | 4/705
Population cohorts (NFBC1966, CoLaus, EGPUT) | 1/6235 | 3/1592
Swedish discordant siblings | 0/140 | 2/159
French bariatric surgery patients | - | 2/141
Obesity: P = 5.8 × 10^-7, OR = 29.8 [3.9–225]. Morbid obesity: P = 6.4 × 10^-8, OR = 43.0 [5.6–329].
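To make the odds-ratio figures concrete, the sketch below computes a plain 2×2 odds ratio from carrier counts. It uses only the population-cohort row (3/1592 obese, 1/6235 lean/normal weight) and is not an attempt to reproduce the pooled OR or P values on the slide, whose exact pooling and test are not given here. The optional 0.5 continuity correction for zero cells is a common convention added as an assumption.

```python
def odds_ratio(case_carriers, case_total, control_carriers, control_total,
               correction=0.0):
    """Odds ratio for carrying the deletion in cases versus controls.

    `correction` is added to every cell; 0.5 is a common choice when a
    cell would otherwise be zero.
    """
    a = case_carriers + correction
    b = case_total - case_carriers + correction
    c = control_carriers + correction
    d = control_total - control_carriers + correction
    if b == 0 or c == 0:
        raise ValueError("zero cell: pass correction=0.5 or pool more data")
    return (a * d) / (b * c)


# Population cohorts row: obese 3/1592 versus lean/normal weight 1/6235.
or_population = odds_ratio(3, 1592, 1, 6235)

# A row with no carriers among controls needs the correction to be defined.
or_french_child = odds_ratio(4, 643, 0, 530, correction=0.5)
```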
14
Coverage affected by GC content
15
Regression model fit to correct for GC bias
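One simple way such a correction could work is sketched below: fit read depth against the GC fraction of each window, then rescale each window by the ratio of the overall mean depth to its predicted depth. The straight-line form and the multiplicative rescaling are assumptions; the slide does not say which regression model was used.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y ~ a + b * x; returns (a, b)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x has no variance")
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    return my - b * mx, b


def gc_correct(depth, gc):
    """Rescale each window's depth by mean depth / depth predicted from GC."""
    a, b = fit_line(gc, depth)
    mean_depth = sum(depth) / len(depth)
    corrected = []
    for d, g in zip(depth, gc):
        predicted = a + b * g
        if predicted <= 0:
            raise ValueError("non-positive predicted depth; model is a poor fit")
        corrected.append(d * mean_depth / predicted)
    return corrected


# Toy windows whose depth depends only on GC content.
gc_fraction = [0.3, 0.4, 0.5, 0.6]
raw_depth = [20.0, 25.0, 30.0, 35.0]
corrected_depth = gc_correct(raw_depth, gc_fraction)
```

In this toy case all variation is explained by GC, so the corrected depth is flat; real data would retain the residual signal, including any copy-number changes.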
16
Loess curves fit to remove residual spatial variation of coverage
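A bare-bones loess smoother is sketched below to show the idea: at each position, fit a weighted straight line to the nearest fraction of points using tricube weights, then subtract the fitted trend. The span (`frac`), the local-linear degree, and the choice to re-add the mean are illustrative assumptions, not the settings behind the slide.

```python
import math


def loess(x, y, frac=0.5):
    """Local linear regression with tricube weights, evaluated at each x."""
    n = len(x)
    k = min(n, max(3, math.ceil(frac * n)))  # number of neighbours per fit
    fitted = []
    for x0 in x:
        dist = [abs(xi - x0) for xi in x]
        h = max(sorted(dist)[k - 1], 1e-12)  # bandwidth: k-th nearest distance
        w = [(1.0 - min(d / h, 1.0) ** 3) ** 3 for d in dist]
        sw = sum(w)
        xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
        ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
        sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
        sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
        slope = sxy / sxx if sxx > 0 else 0.0
        fitted.append(ybar + slope * (x0 - xbar))
    return fitted


def remove_trend(x, y, frac=0.5):
    """Subtract the loess trend and restore the overall mean level."""
    trend = loess(x, y, frac)
    mean_y = sum(y) / len(y)
    return [yi - ti + mean_y for yi, ti in zip(y, trend)]


# Toy coverage with a purely linear spatial drift along the region.
position = [float(i) for i in range(10)]
coverage = [2.0 * p + 1.0 for p in position]
flattened = remove_trend(position, coverage)
```

The span controls the trade-off: a span that is too small would also smooth away genuine CNV signal, so it should be wide relative to the events of interest.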
17
Detecting CNVs with NGS data (plots: depth/haploid coverage; B-allele frequency)
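The two signals named on this slide can be computed from read counts roughly as follows. The function names and the idealized, noise-free cluster positions are assumptions for illustration.

```python
def relative_depth(depth, haploid_coverage):
    """Read depth in units of the expected coverage of one haploid copy."""
    if haploid_coverage <= 0:
        raise ValueError("haploid coverage must be positive")
    return depth / haploid_coverage


def b_allele_frequency(a_reads, b_reads):
    """Fraction of reads at a SNP carrying the B allele (None if uncovered)."""
    total = a_reads + b_reads
    return None if total == 0 else b_reads / total


def expected_baf_clusters(copy_number):
    """Ideal B-allele frequencies for a total copy number: k / copy_number."""
    if copy_number <= 0:
        return []
    return [k / copy_number for k in range(copy_number + 1)]


# A site with depth 45 where one haploid copy contributes about 15 reads,
# and 10 of 30 allele-informative reads carry the B allele.
cn_signal = relative_depth(45, 15)
baf = b_allele_frequency(20, 10)
clusters_cn3 = expected_baf_clusters(3)
```

Here both signals point the same way: relative depth near 3 and a B-allele frequency near 1/3 are what a three-copy genotype with one B allele would produce.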
18
NGS versus CGH data (panels: NGS data, chrom1:350mb-351mb; CGH data, chrom1:350mb-351mb)
19
NGS vs CGH data
20
Haplotype structure of deletion
21
NGS amplification (plot: depth/coverage)
22
With consistent break-points in population
23
Polyploid phasing and imputation (plots: imputation error rate; switch error rate)
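The two metrics can be sketched as follows. The imputation error here is a simple discordance rate over allele dosages, which applies at any ploidy. The switch error is shown only for the diploid case, because its generalization to ploidy > 2 is not defined on the slide; both definitions are assumptions about what was plotted.

```python
def imputation_error_rate(true_dosage, imputed_dosage):
    """Fraction of masked genotypes whose imputed allele dosage is wrong."""
    if len(true_dosage) != len(imputed_dosage) or not true_dosage:
        raise ValueError("need two non-empty sequences of equal length")
    wrong = sum(1 for t, i in zip(true_dosage, imputed_dosage) if t != i)
    return wrong / len(true_dosage)


def switch_error_rate(true_hap, inferred_hap, het_sites):
    """Diploid switch error: phase flips between consecutive het sites.

    true_hap and inferred_hap each hold one haplotype (0/1 alleles) of the
    same sample; het_sites lists the indices of heterozygous sites.
    """
    agrees = [true_hap[i] == inferred_hap[i] for i in het_sites]
    if len(agrees) < 2:
        return 0.0
    switches = sum(1 for a, b in zip(agrees, agrees[1:]) if a != b)
    return switches / (len(agrees) - 1)


# Toy example: one phase switch after the second of five het sites.
ser = switch_error_rate([0, 1, 0, 1, 1], [0, 1, 1, 0, 0], het_sites=range(5))

# Toy dosages at ploidy 3: one of four masked genotypes is imputed wrongly.
ier = imputation_error_rate([0, 1, 2, 3], [0, 1, 1, 3])
```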
24
Conclusions: Population-haplotype model enables joint CNV discovery and genotyping using array data. Preliminary results indicate this will also help using NGS data. Combining information from multiple platforms improves sensitivity. Imputation still works for ploidy > 2; phasing becomes more difficult.
25
Acknowledgements: Evangelos Bellos, Shu-Yi Su, Robin Walters, Julian Asher, Alex Blakemore, Adam de Smith, Phillipe Froguel, Julia El-Sayed Moustafa, David Balding (UCL), Rob Sladek (McGill)