Download presentation
Presentation is loading. Please wait.
Published byGriselda Hampton Modified over 9 years ago
1
eCAS Curation John J. Tran
2
DATA “BLOB” –staging area ALVIN LIU WHI COLORADO SELDI, MALDI, & MISC PI BILL GRIZZLE Current Curation Process & Status * Working with DMCC -- Combining knowledge from ESIS & eCAS into user-friendly interface -- Strategically selection of relevant meta data requirements for eCAS curation interface * Curation process is manual * We’ve collected data from these sites * Holy Grail: semi-automated curation work flow from data to archive
3
eCAS high level system design dB & file storage policy repository eCAS system web interface batch cmd-line interface meta-data & policy manager
4
ECAS Policy and Catalog Management RDF Process CDEs Protocols Publications Biomarkers REPOS XML/MysSQL CURATION PROCESS Repository Manager A. Create/Update/Delete Repository = collection of dataset B. Manage/Update Data Set 1. Create CDEs if no CDEs exist 2. Create map CDEs Product Type Populate Data Elements in Data Set 1. Map to protocol ID (RDF/bmdb) 2. Map to publication 3. Map to biomarker 4. Set policy information for data set Curation Policy Builder CurationPolicy.XML Curation & Pre-loading Stage 1. Download Data to staging area 2. Define product composition (files organization) 3. Build policy file (CurationPolicy.XML) 4. Make policy file ready for use DATA “BLOB” –staging area DATA LOADING PROCESS ALVIN LIU DMCC BILL GRIZZLE ETC Load Data 1. Point to data local or remote 2. For each product in data set a. generate metadata b. validate metadata c. catalog product + metada d. write product metadata to repository e. hand transform data f. roll back on errors 3. Commit process Load & Manage Data EDRN ECAS ARCHIVE
5
Curation & Policy/FM Workflow loginmain menu 1. curate data 2. manage policy & metadata policy manager 1. list policies 2. add policy 3. policy wizard data manager 1. index data 2. search/modify data 3. add data policy wizard 1. build using existing templates 2. start from scratch list policy templates > define data elements input key-pair values elements define product types input values for product types define product type map map relationship b/w product types & elements
6
CURATION DOCUMENT FLOW Select An Action * List all CDEs * List all EDRN data set * Associate CDE/Data set Please select a repository * Core * Seldi * Maldi * Misc PI - Add new Repository breadcrumb: home > New Repository Creator: Description: breadcrumb: home > repo: create Repository: Core * List all CDEs * List all EDRN data set * Associate CDE/Data set breadcrumb: home > repo: core CDE: Core breadcrumb: home > repo: core > CDE IdNameDesc EDRN Data Set: Core breadcrumb: home > repo: core > Data Set IdNameDesc MAP: Core breadcrumb: home > repo: core > MAP IdParentDesc breadcrumb: home > repo: core > CDE > edit CDE : edit ID: Name: Description: breadcrumb: home > repo: core > EDRN Data Set > edit EDRN Data Set: edit ID: Name: Description: breadcrumb: home > repo: core > MAP > edit Associate CDE/Data set ID: Parent: Elements: abc… cde… CDEs Abc Def Hij javascript Metadata K/V K/V K/V javascript
7
Sneak Preview of Tool in the Pipe-line
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.