Accessing TCGA Data

Access Tiers of TCGA Data

Open Access
- Non-identifiable data
- Aggregate data
- Gene-level summaries
- Anyone can download; free to share

Controlled Access
- Identifiable data
- Raw sequence data
- Raw SNP data
- Must apply for access through dbGaP with an eRA Commons account (students and postdocs have access under their PI)
- Data cannot be shared across labs

https://gdc.cancer.gov/access-data/data-access-policies
https://gdc.cancer.gov/access-data/obtaining-access-controlled-data/registering-and-working-era-commons-and-dbgap

Genomic Data Commons (GDC) – https://gdc.cancer.gov
New harmonized data, mapped to hg38
Processed with different pipelines than prior TCGA data
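The GDC also exposes a REST API at https://api.gdc.cancer.gov. The following Python sketch shows how a query for open-access files from a single project might be assembled. The project ID TCGA-BRCA, the selected fields, and the helper names are illustrative assumptions rather than anything stated on the slides, and the function that actually contacts the server is defined but not called.

```python
import json
import urllib.parse
import urllib.request

GDC_FILES_ENDPOINT = "https://api.gdc.cancer.gov/files"


def build_open_access_query(project_id, size=10):
    """Return query parameters for open-access files in one GDC project."""
    # GDC filters are nested JSON: an "and" over two "in" conditions.
    filters = {
        "op": "and",
        "content": [
            {"op": "in",
             "content": {"field": "cases.project.project_id",
                         "value": [project_id]}},
            {"op": "in",
             "content": {"field": "access", "value": ["open"]}},
        ],
    }
    return {
        "filters": json.dumps(filters),
        "fields": "file_id,file_name,data_type,access",
        "format": "JSON",
        "size": str(size),
    }


def fetch_file_listing(params):
    """Send the query to the GDC and return the list of matching file records.

    Requires network access; not called in this sketch.
    """
    url = GDC_FILES_ENDPOINT + "?" + urllib.parse.urlencode(params)
    with urllib.request.urlopen(url, timeout=30) as response:
        return json.load(response)["data"]["hits"]


# Hypothetical example: the first five open-access files for TCGA-BRCA.
params = build_open_access_query("TCGA-BRCA", size=5)
print(params["filters"])
```

Changing the `access` value to `controlled` would list controlled-access files, but downloading those still requires the dbGaP authorization described above.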

Genomic Data Commons (GDC) – https://gdc.cancer.gov
Prior TCGA data from the DCC and CGHub
hg19-aligned data

FireBrowse – http://www.firebrowse.org

cBioPortal – www.cbioportal.org

TCGA Breast Cancer Genomic Data Sites
- https://tcga-data.nci.nih.gov/docs/publications/brca_2015/ (all open-access TCGA breast cancer data)
- https://lbg.unc.edu/~hoadley/BRCA.817.rsemg.uqnorm.counts.txt (expression values for all 20,000 genes)
- https://lbg.unc.edu/~hoadley/BRCA.817.rsemg.uqnorm.counts.intrinsic.txt (~2000-gene "classification list")
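Once one of the expression files listed above has been downloaded, it might be loaded along the following lines. This is only a sketch: it assumes the file is tab-delimited, with a header row of sample IDs and one row per gene whose first column is the gene identifier. The slides do not confirm this layout, so it should be checked against the real file. The function name and the toy data are hypothetical.

```python
import csv
import io


def read_expression_matrix(handle):
    """Parse a tab-delimited gene-by-sample table.

    Assumed layout (not confirmed by the slides): the first row holds
    sample IDs; each later row holds a gene ID followed by numeric values.
    Returns (samples, {gene: {sample: value}}).
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    samples = header[1:]  # first header cell labels the gene column
    matrix = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        gene, values = row[0], row[1:]
        matrix[gene] = {s: float(v) for s, v in zip(samples, values)}
    return samples, matrix


# Toy stand-in for a downloaded file; sample names and values are made up.
toy_text = "gene\tsampleA\tsampleB\nESR1\t10.5\t0.2\nERBB2\t3.0\t8.75\n"
samples, matrix = read_expression_matrix(io.StringIO(toy_text))
print(samples)
print(matrix["ESR1"]["sampleA"])
```

For a real file, the same function could be given a handle from `open(path, newline="")` in place of the `io.StringIO` object.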