March 3, 2005 mBIRN All Hands Meeting Data Provenance Nicole Aucoin.

Slides:



Advertisements
Similar presentations
Interfacing processing and visualization tools: FIPS to Slicer3 and the QueryAtlas.
Advertisements

The Petroleum Registry of Alberta The Petroleum Registry of Alberta Energizing the flow of information Registry Information Session January 24, 2006.
System Design and Memory Limits. Problem  If you were integrating a feed of end of day stock price information (open, high, low, and closing price) for.
Motif Space Database Design Kiranjit Sidhu. 2 Outline  Schema Design  Content of Database  Functionality  Future Plans.
Sharepoint Portal Server Basics. Introduction Sharepoint server belongs to Microsoft family of servers Integrated suite of server capabilities Hosted.
4/20/2017.
CVSQL 2 The Design. System Overview System Components CVSQL Server –Three network interfaces –Modular data source provider framework –Decoupled SQL parsing.
Overview of Mini-Edit and other Tools Access DB Oracle DB You Need to Send Entries From Your Std To the Registry You Need to Get Back Updated Entries From.
Nutch Search Engine Tool. Nutch overview A full-fledged web search engine Functionalities of Nutch  Internet and Intranet crawling  Parsing different.
2007 Monthly Calendar You can print this template to use it as a wall calendar, or you can copy the page for any month to add it to your own presentation.
ClimDB/HydroDB (ClimHy) Integration ClimHy has been migrated from AND to LNO and will remain status quo in 2011 – Public page (
FIX Repository based Products Infrastructure for the infrastructure Presenter Kevin Houstoun.
RSR Ryan White HIV/AIDS Program Services Reporting System What’s New with the RSR 1.
Evaluation 101: After School Programs February 1, 2007 Region 3 After School Technical Assistance Center Conference.
Managing Monitoring Data from Many Sources A New Hampshire Experience Deb Soule Watershed Management Bureau New Hampshire Department of Environmental Services.
FBIRN AHM March 13-14, 2006 David B. Keator University of California, Irvine FBIRN NeuroInformatics Working Group Update.
GDT V5 Web Services. GDT V5 Web Services Doug Evans and Detlef Lexut GDT 2008 International User Conference August 10 – 13  Lake Las Vegas, Nevada GDT.
XML & Mediators Thitima Sirikangwalkul Wai Sum Mong April 10, 2003.
NMED 3850 A Advanced Online Design January 12, 2010 V. Mahadevan.
Copyrighted material John Tullis 10/17/2015 page 1 04/15/00 XML Part 3 John Tullis DePaul Instructor
Data Validation OPEN Development Conference September 19, 2008 Sushmita De Systems Analyst.
April 13 BEC Meeting BIRN Data Sharing Implementation From the BIRN DSTF Randy L. Gollub, Chair.
Implementing the XDS Infrastructure Bill Majurski IT Infrastructure National Institute of Standards and Technology.
Federated Database Set Up Greg Magsamen ITK478 SIA.
Clinical Measures Genotype Local Storage BIRN Rack SRB MCAT HID/ XNAT/ LONI DUP Calibration & Analysis Tools GRID Portal Mediator Institution A BIRN Rack.
2004 All Hands Meeting FBIRN 2005 – Database and Informatics Working Group David Keator.
WORD JUMBLE. Months of the year Word in jumbled form e r r f b u y a Word in jumbled form e r r f b u y a february Click for the answer Next Question.
The CERA2 Data Base Data input – Data output Hans Luthardt Model & Data/MPI-M, Hamburg Services and Facilities of DKRZ and Model & Data Hamburg,
IMPLEMENTING A SERVICE BUS ARCHITECTURE WITH BIZTALK 2009 AND THE ESB TOOLKIT 2.0 A Case Study.
Integrating QDEC with Slicer3 Click to add subtitle.
1 RMS Update - ERCOT May 14, Supporting Reports Section.
All Hands Meeting 2005 FBIRN Tools: 2005 Subtitle added here.
Neuroinformatics Working Group Update 10/26/2009 H Jeremy Bockholt.
Morphometry BIRN Semi-Automated Shape Analysis (SASHA) JHU (CIS): M. F. Beg, C. Ceritoglu, A. Kolasny, M. I. Miller, R. Yashinski MGH (NMR): B. Fischl;
Biomedical Informatics Research Network BIRN Workflow Portal.
FBIRN Use Case: Data Storage and Retrieval. User Query Results with standard descriptions in HIDB Results Images in SRB FIPS Result s FMRI Images Automated.
ICM – API Server & Forms Gary Ratcliffe.
Neuroimage Analysis Center An NCRR National Resource Center NAC Engineering Core Steve Pieper, Core PI SPL; Isomics, Inc.
2011 Calendar Important Dates/Events/Homework. SunSatFriThursWedTuesMon January
How to combine IRIS products Available APIs Examples of integrations Ole Andersen Senior Strategic Account Manager.
REQUIREMENTS GATHERING Moderators: M Miller Goals: To allow participants to provide feedback to the developers (BIRN-CC and test bed applications) of what.
Making Pages Dynamic Chapter 8 JavaScript for the WWW.
Biomedical Informatics Research Network The BIRN Architecture: An Overview Jeffrey S. Grethe, BIRN-CC 10/9/02 BIRN All Hands Meeting 2002.
July 2007 SundayMondayTuesdayWednesdayThursdayFridaySaturday
Biomedical Informatics Research Network Feature & Requirements of the BIRN Portal: Detail User Requirements Jeff Grethe, Steve Peltier October 9, 2002.
Data Provenance. Data Provenance Goals Replicate (re-apply) analyses Facilitate comparisons across workflows.
Sharepoint-Biztalk Integration with Multiple Transport protocols Jin Thakur
2003 All Hands Meeting FBIRN Complete the Data Analysis  Calibration and statistics issues Assess the variability Global measures ROI measures.
Provenance Work Plans and Deliverables October 2005  Data Provenance information in SRB and HID Test upload to SRB (March) Give DB working group formal.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
Biomedical Informatics Research Network BIRN Workflow Portal.
Integrating ArcSight with Enterprise Ticketing Systems
LHC T0/T1 networking meeting
Integrating ArcSight with Enterprise Ticketing Systems
An introduction to outlining in PowerPoint
Materials Engineering Product Data Management (ePDM)
Registry Information Session
Chlamydia Learning Collaborative
2300 (11PM) September 21 Blue line is meridian..
2/18/2019.
3D Slicer Version 3.0 Update for mBIRN
Use Cases Simple Machine Translation (using Rainbow)
Education and Training Statistics Working Group – 1-2 June 2017
Operational Update 1.
E W ©
2015 January February March April May June July August September
Unit 6 - XML Transformations
Presentation transcript:

March 3, 2005 mBIRN All Hands Meeting Data Provenance Nicole Aucoin

Outline  Introduction  Current state of the project Demo  Integration with SRB  Integration with HID  Instrumentation of Upload Scripts  Requirements Gathering  Work plans and deliverables

Introduction  Data provenance is… Tracking what changes the data  Data provenance is good for… Recreating research for validation Testing processes on new data sets Information recovery when changing formats

Current State of the Project  Processing tools have been updated Slicer, FreeSurfer, shape analysis tools  Document type definition created to specify xml output for upload  Raw text output can be parsed into xml files  Documentation moved onto the wiki

Data Provenance DTD

Demo  Converting a volume file from Freesurfer MGH format to COR format, viewing it in Slicer  Raw data provenance information is captured in a text file, amidst processing output  The raw data file is parsed and an xml file is produced

Sample Output File mri_convert GCC 05/02/08-14:47:44-GMT {$Id:} mri_convert.c,v /02/09 21:45:55 fischl Exp {$} slicerl.bwh.harvard.edu Linux nicole

Sample Output File con’t slicer2-linux-x86 GCC VTK TCL TK ITK 02/08/05-09:48:08-EST \{Id: Go.tcl,v /12/02 23:46:19 nicole Exp\} {} i686 Linux nicole

Demo details  Call convertandshow script, piping output to a raw file Call mri_convert with –all-info flag Use mri_convert to convert MGH to COR Load COR volume into Slicer, with –all-info flag  Call dataprov tcl script on raw file, piping output to an xml file

Integration with SRB  Upload of xml files via S commands  Associate xml files with derived data they are describing

Integration with HID  Use XML parsing tools to extract the information from the XML file XSLT/DOM/XPATH  Upload values to the appropriate places in the HID Integrate with the new schema

Instrumentation of Upload Processes  Scripts versus pipelines  BIRNDUP  fBIRN uploads

Requirements Gathering  From the Database group Location to upload xml file How to integrate with the HID How to query from HID on data provenance fields  From Developers Keep me informed when new tools are used  Survey will be sent out to BIRN sites every few months Help testing out information gathering and parsing on various operating systems Integration of query by HID into Portal

Work Plans and Deliverables October 2005  Data Provenance information in SRB and HID Test upload to SRB (March) Give DB working group formal request for new fields (March) Integrate into upload pipelines (April) Test out various xml parsers (June) Test upload to HID (September)  Query by Provenance in HID (October)  Add fields to DTD (compiler flags, data URI, data id, ?), and update the specification table (March)  Convert DTD to a schema (March)  (share tools/information with fBIRN) (ongoing)

Work Plans and Deliverables March 2006 Query by Provenance in HID via Portal (December) Wrappers for third party programs (January)  Matlab mex files  SPM  FSL  Contact vendors to obtain more information, ask them to add it First pass on a data provenance toolkit (March)