RNA-SEQ IN PPMI Whole-Blood samples

Slides:



Advertisements
Similar presentations
Vanderbilt Center for Quantitative Sciences Summer Institute Sequencing Analysis Yan Guo.
Advertisements

Supplementary Figure 1. Somatic mutation spectrum # Substitutions # Substitutions per Mb b c a Repeats Pseudogenes Whole genome Splice sites Non-coding.
Before we start: Align sequence reads to the reference genome
NGS Analysis Using Galaxy
ARC Biotechnology Platform: Sequencing for Game Genomics Dr Jasper Rees
RNAseq analyses -- methods
Experimental validation. Integration of transcriptome and genome sequencing uncovers functional variation in human populations Tuuli Lappalainen et al.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.
Melissa Spear Phillips Lab Mentor: Rose Reynolds SPUR 2011 Evolution of Genetic Networks Controlling Cellular Stress Response and Longevity.
No reference available
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Canadian Bioinformatics Workshops
From Reads to Results Exome-seq analysis at CCBR
Million Veteran Program: Industry Day Genomic Data Processing and Storage Saiju Pyarajan, PhD and Philip Tsao, PhD Million Veteran Program: Industry Day.
GIAB: Genome reference material development resources for clinical sequencing Chunlin Xiao 1, Justin Zook 2, Shane Trask 1, Melissa Landrum 1, Marc Salit.
The regulation of Caspase 8 chIP-seq motifs mRNA expression DNA methylation.
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop.
SNP and Genomic analysis SNP/genomic signature Clinical sampling Personalized chemotherapy Personalized Targeted therapy Personalized RNA therapy Personalized.
Konstantin Okonechnikov Qualimap v2: advanced quality control of
Introductory RNA-seq Transcriptome Profiling
NGS File formats Raw data from various vendors => various formats
Canadian Bioinformatics Workshops
RNA Quantitation from RNAseq Data
Genomon a high-integrity pipeline for cancer genome and transcriptome sequence analysis Kenichi Chiba(1), Yuichi Shiraishi(1), Ai Okada(1), Hiroko.
Cancer Genomics Core Lab
Biases and their Effect on Biological Interpretation
Using RNA-seq data to improve gene annotation
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Gene expression from RNA-Seq
RNA-Seq analysis in R (Bioconductor)
exRNA Metadata Standards
Detect alternative splicing
Basics of Comparative Genomics
Week 5 Theory and application for setting up an RNA-Seq pipeline
S1 Supporting information Bioinformatic workflow and quality of the metrics Number of slides: 10.
Data challenges in the pharmaceutical industry
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Introductory RNA-Seq Transcriptome Profiling
Rod Eyles1, John Juma1, Morag Ferguson1, Trushar Shah1 1 IITA, Nairobi
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
Kallisto: near-optimal RNA seq quantification tool
THE PARKINSON’S PROGRESSION MARKERS INITIATIVE
Parkinson’s Progression Markers initiative
ppmi genetics Andy Singleton Laboratory of Neurogenetics, NIA, NIH
PPMI Beyond 2018.
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
Genomes and Their Evolution
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
PPMI – GOALS 2019 Ken Marek PPMI Annual Meeting May 3, 2018
Exploring and Understanding ChIP-Seq data
Focus of research group (I)
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
New Statistical Methods for Family-Based Sequencing Studies
Integrative Multi-omic Analysis of Human Platelet eQTLs Reveals Alternative Start Site in Mitofusin 2  Lukas M. Simon, Edward S. Chen, Leonard C. Edelstein,
TOPMed Analysis Workshop Genetic Analysis Center Biostatistics Department University of Washington TOPMed Data Coordinating Center August 7-9, 2017 Introduction.
Galaxy course EMC TraIT Nov 2014_Jenster
Baekgyu Kim, Kyowon Jeong, V. Narry Kim  Molecular Cell 
Basics of Comparative Genomics
GWAS-eQTL signal colocalisation methods
Quantitative analyses using RNA-seq data
Sequence Analysis - RNA-Seq 2
Introduction to RNA-Seq & Transcriptome Analysis
Differential Expression of RNA-Seq Data
The Technology and Biology of Single-Cell RNA Sequencing
RNA-Seq Data Analysis UND Genomics Core.
HCA Data Access Oct 3rd 2019.
Presentation transcript:

RNA-SEQ IN PPMI Whole-Blood samples Kendalll Craig, Keller, Cookson, Van Keuren-Jensen May 3rd, 2018

Overview »PI: Cookson, Craig, Keller, Van Keuren-Jensen »Background: Develop a comprehensive RNA resource from whole blood samples that can be easily accessed and utilized by the PD research community »Methods: Whole Transcriptome RNA-Seq from PaxGene whole blood »Results (brief with possible figures): Phase I will be released at the end of May »Next steps: Complete and release Phase II and small RNA-Seq for all samples. Complete data portal for easy/accessible gene look up »Relevance for PPMI: Resource for examining transcriptomic changes, their relationship to genetic and clinical variables, and their potential as PD biomarkers

RNA-SEQ: BEYOND TRANSCRIPT QUANTIFICATION Pre-spliced, Spliced, Strandedness Junctions/Isoforms Alternative start-sites Integrated to individual w/ DNA SNPs Allele specific expression Non-sense mediated decay, eQTLs

STUDY FLOW 1 HudsonAlpha Institute for Biotechnology 2 TGen/USC 3 PPMI-INFO LONI - Data Transferred to Basespace - Limited LIMs Info Data Generation -Initial Quality Control -Analysis Tables -Pilot Portal For Querying Quality Control, Analysis, Portal -Data Portal, -Production Data Management -Coordination Data Sharing/Unblinding BCL / FASTQ Folders BAMs/FASTQs Tables Pilot Study Main Study Long RNA Comparison of 4 Different Assays on 48 Samples Comparison across 2 sequencing platforms, and 3 sites Analysis of Depth vs. Species Detection Small RNA Comparison of 2 Different Assays Long RNA NovaSeq Sequencing 200M Reads/100M Read Pairs 4,623 Samples Kappa + NEB 27 Plates Completed, Topping off underway Small RNA Finishing QC for production

Samples Subjects Baseline 6 Months YR1 YR2 YR3 PD 426 285 365 361 360 196 179 184 171 166 Prodromal 60 61 56 SWEDD 64 53 57 LRRK2 Affected 118 135 108 LRRK2 unaffected 79 138 98 43 SNCA and GBA 200 Genetic Registry 417 86 Samples: Longitudinal 4,500+ Samples on 1,000+ Individuals

DATA AND ANALYSIS FLOW FASTQs BAMs Tables Initial Release Alignment STAR Genome Build/Transcripts Mirror TopMed RNA-SeQC Differential Analysis Multiple Comparisons DESeq2 FASTQs Each: ~10-12 Gbyte Phase 1: 25-30 Tbytes BAMs Phase 1: 25-30 Tbytes Tables Each: 1-5 Mb Study: <10 Gbytes Counts Gene,Transcript P-val, Fold Δ Gene,Transcript BAM Files Integrative Analysis +VCFs from WGS Variant Calling, QTLs Stats Reads.. VCFs WGS DNA FASTQ Files Descriptive Analysis Gene/Transcript quantification Salmon TPMs Gene,Transcript Tables: Sufficient for many uses. FASTQs/BAMs: Require shipping of hard-drives

PHASE 1 PROGRESS: Initial 2,047 Samples Ave. Read Pairs: 109 Mil or 218 Mil Reads Med. Read Pairs: 107 Mil. Over 90M: 1,944/2,437 Samples

Beyond Coding

Principal Component Analysis By Plate Quality Control Overview Principal Component Analysis By Plate Principal Component Analysis By % Percent Correct Strand ~ 30% of Variance

Data Exploration Portal Interactive Querying of Study Data Landing Page Study Overview Pilot Study Quality Control Study Detail Gene Query Allows exporting of CSV/XLS Allows visualization and comparison to other databases such as GTEX Crosslinked to Relevant Biology and Disease Relevance Portable/Secure Single public HTML, w/ JSON API Integrates via Node.js to OAUTH SSO MongoDB, BigQuery version HTML5/JS, D3.js Cross-platform Validated

Acknowledgements Ivo Violich Michelle Webb Eric Alsop, Ph.D. Elizabeth Hutchins, Ph.D. Karen Crawford, Ph.D. Art Toga, Ph.D. Shawn Levy, Ph.D. + Team