CceHUB Sharing, Exploring and Analyzing Data An Environment for Collaborative Cancer Research clinical dataobservational & scientific data decision supportcomputation.

Slides:



Advertisements
Similar presentations
Common Instrument Middleware Architecture and Federation of Instrument Resources for X-ray Crystallography Rick McMullen Indiana University.
Advertisements

International Barcode Of Life Initiative
Enterprise Use Cases. Levels LevelDescriptionExamples 0 0aVerbal CommunicationNon-permanent, e.g. verbal communication 1Non-electronic dataMail, phone.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
DATABASES AT THE HUB NOW YOU CAN CREATE THEM YOURSELF! Ann Christine Catlin HUBbub 2013.
DATABASES AT THE HUB NOW YOU CAN CREATE THEM YOURSELF! Ann Christine Catlin Senior Research Scientist Rosen Center for Advanced Computing HUBbub 2013.
Arctic Observing Viewer a web mapping application for AON data collection sites
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS Ravi K Madduri University of Chicago and ANL.
Bindley Bioscience Center Vision: Nurture interactive communication and interdisciplinary discovery with flexible laboratory project spaces and an open.
The Changing Face of Research Anthony Beitz DART Integration Manager.
NCI’s Clinical Proteomic Technologies for Cancer: “Restructuring Proteomics to Succeed in Discovering Cancer Biomarkers” Joe.
Cancer Care Engineering Colorectal Cancer Gabriela Chiorean, M.D. May 27, 2011.
The CCE 5 th Annual Retreat Global Proteomics & Determination of Vitamin D Metabolites Update Jiri Adamec.
© 2008 LabKey Software Simplifying Scientific Data Management with LabKey Server January 29, 2009 Presenter: Peter Hussey,
SCIENCE-DRIVEN INFORMATICS FOR PCORI PPRN Kristen Anton UNC Chapel Hill/ White River Computing Dan Crichton White River Computing February 3, 2014.
Vitamin D and Cancer Dorothy Teegarden, Ph.D. Purdue University Professor and Associate Head for Research, Department of Nutrition Science Lead, Cancer.
CceHUB A Knowledge Discovery Environment for Cancer Care Engineering Research Ann Christine Catlin HUBzero Workshop November 7, 2008.
Consent2Share Linking Cohort Discovery to Consent David R Nelson MD Assistant Vice President for Research Professor of Medicine Director, Clinical and.
DOE Genomics: GTL Program IT Infrastructure Needs for Systems Biology David G. Thomassen Office of Biological and Environmental Research DOE Office of.
2007 GeneSpring MS GeneSpring for Metabolite BioMarker Analysis using Mass Spectrometry data Agilent Q-TOF VIP Visit Jan 16-17, 2007 Santa Clara, CA Thon.
Using the Purdue DB Technology to build simple on-demand data exploration tools Michael Grobe Pervasive Technology Institute Indiana University Hubbub.
Sage Bionetworks Mission Sage Bionetworks is a non-profit organization with a vision to create a “commons” where integrative bionetworks are evolved by.
Cancer Clinical Trial Suite (CCTS): An Introduction for Users A Tool Demonstration from caBIG™ Bill Dyer (NCI/Pyramed Research) June 2008.
CCE project update Metabolomics Raftery Group. Original Study 20 cancer, 28 normals and 14 with polyps NMR and GC-MS study.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Cancer Care Engineering: A Collaborative Transformational Project Indiana University School of Medicine Purdue University.
Project Funding & New Projects Cancer Care Engineering.
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
Mission: Seed, nurture, and execute large multi-disciplinary projects that involve applying computing, systems engineering, and information technology.
CceHUB An Environment for Collaborative Cancer Research Ann Christine Catlin CCE Annual Retreat May 26, 2010 clinical dataobservational & scientific data.
{Fez} a Hatrack for your Metadata Sam Wilson & Nathan Denny HUBbub – Sep. 30, 2014.
NanoHUB.org and HUBzero™ Platform for Reproducible Computational Experiments Michael McLennan Director and Chief Architect, Hub Technology Group and George.
CceHUB Sharing, Exploring and Analyzing Data An Environment for Collaborative Cancer Research clinical dataobservational & scientific data decision supportcomputation.
Min Zhang, MD PhD Purdue University Joint work with Yanzhu Lin, Dabao Zhang.
Valentina Di Francesco Senior Program Officer for Bioinformatics, Structural Genomics and Systems Biology Microbial Genomics.
ACGT: Open Grid Services for Improving Medical Knowledge Discovery Stelios G. Sfakianakis, FORTH.
Representing Flow Cytometry Experiments within FuGE Josef Spidlen 1, Peter Wilkinson 2, and Ryan Brinkman 1 1 BC Cancer Research Centre, Vancouver, BC,
Sage Bionetworks Mission Sage Bionetworks is a non-profit organization with a vision to create a “commons” where integrative bionetworks are evolved by.
Cancer Care Engineering Joe Pekny, PhD Visionary Marietta Harrison, PhD Worker Bee.
THE MURDOCK Study: A Rich Data Resource for Biomarker Discovery and Validation Brian D. Bennett 1, Jessica D. Tenenbaum 1, Victoria Christian 1, Melissa.
A collaborative tool for sequence annotation. Contact:
Introduction to caIntegrator caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.
CceHUB omicsknowledgebase Ann Christine Catlin 3 rd Annual Cancer Care Engineering Retreat June 20, 2008 An Environment for CCE Research.
Construction of Shanghai Life Science & Bio-technology Service Platform for Data Access and Sharing International Workshop on Strategies Presentation of.
Children’s Health Exposure Analysis Resource (CHEAR) CHEAR Center for Data Science Susan Teitelbaum, PhD November 4, 2015.
Data Management Support for Life Sciences or What can we do for the Life Sciences? Mourad Ouzzani
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
U.S. Department of the Interior U.S. Geological Survey Decision Support Tools and USGS Data Management Best Practices Cassandra Ladino USGS Chesapeake.
Joe Pekny, Professor Chemical Engineering Director, e-Enterprise Center Discovery Park Marietta Harrison, Professor Medicinal Chemistry & Molecular Pharmacology.
1 1 NOAA Office of Ocean Exploration End-to-End Data Management: A Success Story NOAA Tech Conference November 2005 Susan Gottfried National Coastal Data.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
The National Cancer Imaging Archive (NCIA) In Action: An Introduction for Users A Tool Demonstration from caBIG™ Carl Jaffe, MD NCI-Cancer Imaging Program.
The Bridge from Patient to Scientist Comparison: BioBank and Cancer Registry Data Source Distinct Patients Percent BioBank % Cancer Registry %
Cancer Care Engineering Colorectal Cancer Gabriela Chiorean, M.D. May 26, 2010.
Enhancements to Galaxy for delivering on NIH Commons
Biostatistics Resources for Clinical and Translational Research
Solutions to Clinical Data Visualization and Analysis
University of Chicago and ANL
Million Veteran Program Data Marts and Data Access
Upgrading the research performance in molecular medicine at the
Joseph JaJa, Mike Smorul, and Sangchul Song
Data challenges in the pharmaceutical industry
Workflows in archaeology & heritage sciences
Databases at the Hub Now you can Create them yourself!
Knowledge l Action l Impact
Golubev Alexandr, IMU Workshop, Chisinau 2018
Code Analysis, Repository and Modelling for e-Neuroscience
DEEDS A Platform for Sharing Data, Computing & Scientific Workflows
Code Analysis, Repository and Modelling for e-Neuroscience
Session 1: WELCOME AND INTRODUCTIONS
Presentation transcript:

cceHUB Sharing, Exploring and Analyzing Data An Environment for Collaborative Cancer Research clinical dataobservational & scientific data decision supportcomputation & visualization Ann Christine Catlin HUBbub

Master plan for the Cancer Care Engineering Colorectal Cancer Study 1.Blood Sample Acquisition Sample Processing, Annotation, Distribution Clinical Patient Data Collection IU Simon Cancer Center 2. OMIC Laboratory Analysis Data & Knowledge Acquisition Xu Lab Lipidomics IU School of Medicine Raftery Lab Metabolomics Purdue Regnier Lab Glycoproteomics Purdue Bindley Lab Global Proteomics Purdue Teegarden Lab Vitamin D Purdue Klaunig Lab Oxidative Stress IU School of Medicine 3. Predictive Modeling Data Synthesis & Analysis Knowledge Acquisition Zhang Group Integrative Models Purdue Sherer Population-based Models VA Hospital Chen Biological Network Models IUPUI 4. Visual Analytics Data Exploration & Analysis Knowledge Acquisition Ebert Group PURVAC Purdue 5.Iterative Feedback & Validation CCE Research Community molecular signatures for colorectal cancer that predict susceptibility, treatment response and ultimate treatment outcome

Sharing data, tools, analysis & knowledge sample collection clinical data collection lab data collection statistical & modeling tools lab analysis pipelines

A single portal : sharing data, tools, analysis & knowledge sample collection clinical data collection lab data collection statistical & modeling tools lab analysis pipelines cancer research groups worldwide

A single portal: sharing data, tools, analysis & knowledge a web environment with that supports data flow data sharing data analysis for the collaborating cancer research groups of CCE a web environment with that supports data flow data sharing data analysis for the collaborating cancer research groups of CCE

Support for clinical data clinical data laboratory analysis predictive modeling tools Clinical Research Team and Physicians Data contribution from clinical team Patients Diagnosis, Treatments, Surgeries, Lifestyle, Diet, Demographics, … Samples Collection, Processing, Protocols, Distribution, Tracking, … Automatic Metadata Processing Sample Data Patient Data Clinical Metadata cceHUB Database Data Workflow

Support for clinical data clinical data laboratory analysis predictive modeling tools Clinical Research Team Physicians Patients Diagnosis, Treatments, Surgeries, Lifestyle, Diet, Demographics, … Samples Collection, Processing, Protocols, Distribution, Tracking, … Automatic Metadata Processing Sample Data Patient Data Clinical Metadata cceHUB Database Data Flow Data contribution from clinical team nightly pull from hospital e-records patient data collection sample tracking data annotation clinical data archive blood sample bio-repository patient and sample linkage data viewing data search, filter & explore nightly pull from hospital e-records patient data collection sample tracking data annotation clinical data archive blood sample bio-repository patient and sample linkage data viewing data search, filter & explore Clinical Data Flow

Clinical and sample data collection & processing

Clinical data views

Clinical data : some stats Database Total Patients Diagnosis % Data Lifestyle % Data Cancer/Polyp Patients Treatment % Data patients240100%70%41 / 92100% First patient CCE001 enrolled on 04/02/2009 (the day cceHUB went live) Most recent patient CC285 enrolled on 02/15/2011 Most recent data : neoadjuvant chemoradiation treatment for patient CCE156 on 04/02/2011 Maximum patients enrolled on a single day 09/23/2009 = 9 # web-forms to track patient and sample data flow : 12 # accesses to clinical data viewer 04/02/2009 – 05/25/2010 : > 15,000 Database Total Samples Total Aliquots Sample Tracking Web-forms # instances cceHUB used to find missing aliquot samples sample processing sample transfer sample storage sample distribution 52 (we track sample barcodes, location, entry person, entry date)

Support for laboratory data clinical data laboratory analysis predictive modeling tools Research Labs Metabolomics, Lipidomics, Global Proteomics, Glycoproteomics, Vitamin D, Oxidative Stress, Genomics Clinical Data Lab Knowledge Base Repository Metadata cceHUB Database cceHUB Lab Instrument Data Repository Lab Workflow Knowledge Data Upload Sample-Dataset tracking Massive instrument- generated datasets

Research Labs Metabolomics, Lipidomics, Global Proteomics, Glycoproteomics, Vitamin D, Oxidative Stress, Genomics Support for laboratory data clinical data laboratory analysis predictive modeling tools Clinical Data Lab Knowledge Base Repository Metadata cceHUB Lab Instrument Data Repository Lab Workflow Knowledge Data Upload Sample-Dataset tracking Massive instrument- generated datasets cceHUB Database ”knowledge base” resources (protocols, sample preparation, instruments, standards, file formats, analysis) annotation for lab data files lab data files tracked to samples/patients data files upload with provenance metadata processing lab data collections data view & explore data access for analysis tools ”knowledge base” resources (protocols, sample preparation, instruments, standards, file formats, analysis) annotation for lab data files lab data files tracked to samples/patients data files upload with provenance metadata processing lab data collections data view & explore data access for analysis tools Laboratory Data Flow

Laboratory knowledge base

Laboratory data flow, standards, data annotations & upload

Laboratory data repository views

Laboratory data : some stats Lab#Samples Analyzed % Total Files/ Samples Uploaded Average File Size Analysis Tools at cceHUB Using cceHUB tools ? Bindley Biosciences Global Proteomics %193 files 193 samples 80MBDiscovery Pipeline Results Visualize/Compare 500 runs through discovery pipeline Teegarden Vitamin D %4 files 225 samples < 1MBVitamin D-Blood Draw - Clinical Data merge for SAS Yes, DataView Raftery Metabolomics GCGC-MS %230 files 230 samples 1 GBPeak classification and alignment GCGC-MS Visual Analytics Raftery Metabolomics NMR %1 file 110 samples < 1MB Xu Lipidomics %1 file 143 samples < 1MBLipidomics-BloodDraw- Clinical Data merge for SAS Yes, DataView Klaunig TEAC analysis %1 file 259 samples < 1MBTEAC-Blood Draw-Clinical Data merge for SAS Yes, DataView Klaunig Comet Assay %1 file 101 samples < 1MBCometAssay-Blood Draw- Clinical Data merge for SAS Yes DataView Klaunig Genotyping Assay --POCRE, MaCH genotype imputation (used by stat group on their own data) Regnier Glycoproteomics --

Support for modeling and analysis Modeling Groups Visual Analytics Data Synthesis & Analysis Knowledge Acquisition clinical data laboratory analysis predictive modeling tools Clinical Data Lab Knowledge Base Repository Metadata cceHUB Database cceHUB Lab Instrument Data Repository Data to Tools Data from Tools GCxGC MS Classification and Alignment LC-MS Discovery Pipeline: Spectrum Deconvolution

Tools to support data exploration and synthesis

Tools for physician decision support

Tool Results Collections and Analysis Browser

Collaboration for Cancer Care Engineering Research

ITMIG Global Prospective Database just underway …

Our technology extends to other cancer research workflows Using the HUB cyber infrastructure and cceHUB data technology to further collaborative Cancer Care Engineering Research