Current Data And Future Analysis Thomas Wieland, Thomas Schwarzmayr and Tim M Strom Helmholtz Zentrum München Institute of Human Genetics Geneva, 16/04/12.

Slides:



Advertisements
Similar presentations
Vanderbilt Center for Quantitative Sciences Summer Institute Sequencing Analysis Yan Guo.
Advertisements

GBS & GWAS using the iPlant Discovery Environment
Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.
GENOMICS TERM PROJECT Assessment of Significance in a SNP.
Genetic Basis of Agronomic Traits Connecting Phenotype to Genotype Yu and Buckler (2006); Zhu et al. (2008) Traditional F2 QTL MappingAssociation Mapping.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Ruibin Xi Peking University School of Mathematical Sciences
From sequence data to genomic prediction
Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome Peng et al. Nature Biotechnology (2012) doi: /nbt.2122.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
Defining the Regulatory Potential of Highly Conserved Vertebrate Non-Exonic Elements Rachel Harte BME230.
PolyPhen and SIFT: Tools for predicting functional effects of SNPs Epi 244 Spring 2009 Sam S. Oh.
Vanderbilt Center for Quantitative Sciences Summer Institute Sequencing Analysis (DNA) Yan Guo.
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD SeattleSNPs Variation Workshop March 20-21, 2006.
High Throughput Sequencing
Supplementary slides. Mock-ups Exome overview Genomic coverage: lower quartile 1, median 23, upper quartile 35 Protocols: Aligner used: BWA v2.3 Reference.
The Phase 1 Variant Set and Future Developments
Copyright © 2011 Partek Incorporated. All rights reserved. Statistics Visualizations Annotations Start-to-Finish Analysis of Integrated Genomics.
Geuvadis RNAseq analysis at UNIGE Analysis plans
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
MES Genome Informatics I - Lecture VIII. Interpreting variants Sangwoo Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute,
GBS Bioinformatics Pipeline(s) Overview
Medical variations Gabor T. Marth Boston College Biology Department BI543 Fall 2013 February 5, 2013.
Loss-of-co-Homozygosity mapping and exome sequencing of a Syrian pedigree identified the candidate causal mutation associated with rheumatoid arthritis.
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Bioinformatics. Sequence information Mapping information Phenotypic information Literature Prediction programs -Gene prediction -Promotor prediction -Functional.
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Sample to Insight Alexander Kaplun, PhD Sep PGMD: a comprehensive pharmacogenomic database for personalized medicine and drug discovery.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
California Pacific Medical Center
Geuvadis achievements and contributions Robert Häsler, functional genomics.
HW2: exome sequencing and complex disease Jacquemin Jonathan de Bournonville Sébastien.
.1Sources of DNA and Sequencing Methods.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 2 Genome Assembly.
Copyright OpenHelix. No use or reproduction without express written consent1.
No reference available
Lectures 7 – Oct 19, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.
Moderní metody analýzy genomu - analýza Mgr. Nikola Tom Brno,
Personalized genomics
Introduction of the ChIP-seq pipeline Shigeki Nakagome November 16 th, 2015 Di Rienzo lab meeting.
Analysis of Next Generation Sequence Data BIOST /06/2015.
NCSU Summer Institute of Statistical Genetics, Raleigh 2004: Genome Science Session 3: Genomic Variation.
Canadian Bioinformatics Workshops
A brief guide to sequencing Dr Gavin Band Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for Health.
Introduction to Variant Analysis of Exome- and Amplicon sequencing data Lecture by: Date: Training: Extended version see: Dr. Christian Rausch 29 May 2015.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Date of download: 7/2/2016 Copyright © 2016 American Medical Association. All rights reserved. From: Clinical Interpretation and Implications of Whole-Genome.
From Reads to Results Exome-seq analysis at CCBR
Data mining. Sequence information Mapping information Phenotypic information Literature Prediction programs -Gene prediction -Promotor prediction -Functional.
Canadian Bioinformatics Workshops
Interpreting exomes and genomes: a beginner’s guide
Cancer Genomics Core Lab
Of Sea Urchins, Birds and Men
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Post-GWAS and Mechanistic Analyses
Reliable Identification of Genomic Variants from RNA-Seq Data
GersteinLab.org Overview
Gene Safari (Biological Databases)
BF528 - Whole Genome Sequencing and Genomic Variation
.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 3 Gene Prediction and Annotation 4 Genome Structure 5 Genome.
Presentation transcript:

Current Data And Future Analysis Thomas Wieland, Thomas Schwarzmayr and Tim M Strom Helmholtz Zentrum München Institute of Human Genetics Geneva, 16/04/12

Overview 1.Current Data – mRNA – miRNA – Imputation 2.Future Analysis – Variant Calling on mRNA Data

mRNA Our 48 samples have on average 55M reads 88% mapped (BWA; reads trimmed to 50bp) ~4% of aligned reads intersect lincRNA (from UCSC)

miRNA

31% (+- 13%) of reads overlapping with known miRNAs (UCSC)

Imputation info value: “a measure of the relative statistical information about the SNP allele frequency from the imputed data” (Marchini, J., & Howie, B. Nature reviews. Genetics, 2010)

Variant Calling on mRNA Data Illumina Pipeline Alignment BWAReference genome Variant calling SAMtools Variant filter SAMtools / custom Variant annotation UCSC gene tables dbSNP Database Candidate genes Linkage information Run statistics Inheritance model Base Quality Alignment Statistics Enrichment Statistics RefSeq genes lincRNA,miRNAs,...

Variant Calling on mRNA Data Variant calling SAMtools Variant filter SAMtools / custom Variant annotation UCSC gene tables dbSNP Database Candidate genes Linkage informationInheritance model RefSeq genes lincRNA,miRNAs,... pre-aligned mRNA files

Samples Case-Controls Homozygous-Heterozygous Type of variation SNV call quality dbSNP / HapMap Average heterozygozity

Exome database Genes Cases Controls Quality Annotation dbSNP HGMD 1000 genomes PolyPhen Prediction,...

Variant Calling on mRNA Data Possible analysis: mRNA calling vs. WG genotypes mRNA calling vs. Imputation RNA editing? Recent discussions about amount: “widespread” (M. Li et al., Science 333, 53, 2011) vs. “very few” (Schrider et al., Plos One, 2011) Can we find known sites from literature? (e.g. Li, J. B. et al. (2009). Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing. Science, 324(5931), )