DEEDS A Platform for Sharing Data, Computing & Scientific Workflows

Slides:



Advertisements
Similar presentations
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
Advertisements

Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
Application Graphic design / svetagraphics.com 01 FRAMEWORK data service.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
DATABASES AT THE HUB NOW YOU CAN CREATE THEM YOURSELF! Ann Christine Catlin HUBbub 2013.
DATABASES AT THE HUB NOW YOU CAN CREATE THEM YOURSELF! Ann Christine Catlin Senior Research Scientist Rosen Center for Advanced Computing HUBbub 2013.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Graduate System for Management of Admissions, Alumni & Records Tracking (Grad SMAART) January 8, 2007 Office of Graduate Studies.
Using Web of Science as a Research Tool : Experience at HKUST Library Steve Yip Electronic Information Librarian.
Interpret Application Specifications
Cyberinfrastructure for Rapid Prototyping Capability Tomasz Haupt, Anand Kalyanasundaram, Igor Zhuk, Vamsi Goli Mississippi State University GeoResouces.
1 Components of A Successful Data Warehouse Chris Wheaton, Co-Founder, Client Advocate.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Your Interactive Guide to the Digital World Discovering Computers 2012.
What’s New for IT Professionals in Microsoft® SharePoint® Server 2013 (Day 2) Sayed Ali (MCTS, MCITP, MCT, MCSA, MCSE ) Senior SharePoint.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
The Creation of a Big Data Analysis Environment for Undergraduates in SUNY Presented by Jim Greenberg SUNY Oneonta on behalf of the SUNY wide team.
CceHUB A Knowledge Discovery Environment for Cancer Care Engineering Research Ann Christine Catlin HUBzero Workshop November 7, 2008.
Aid Management Platform (AMP) Introduction to AMP Tanzania, February 2009.
Search Server Index Search Server Index Somewhere There’s a PLACE for Us: Linking Fedora Digital Collections and Open Geoportal Eleta Exline, Thelma Thompson,
Project Proposal Interface Design Website Coding Website Testing & Launching Website Maintenance.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
1 Systems Development Cheryl Itkin SIMCorB Meeting RTP, NC June 29-30, 2000 SIMCorB Organization Policy Systems Outreach.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
First 5 LA Phase III Website Strategic Plan Presentation to Public Affairs Committee July 13, 2006.
NanoHUB.org and HUBzero™ Platform for Reproducible Computational Experiments Michael McLennan Director and Chief Architect, Hub Technology Group and George.
Slide 12.1 Chapter 12 Implementation. Slide 12.2 Learning outcomes Produce a plan to minimize the risks involved with the launch phase of an e-business.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Streamflow - Programming Model for Data Streaming in Scientific Workflows Chathura Herath.
GPO’s Federal Digital System December 10, 2009 U.S. Government Printing Office.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
CceHUB omicsknowledgebase Ann Christine Catlin 3 rd Annual Cancer Care Engineering Retreat June 20, 2008 An Environment for CCE Research.
Transition to Practice. We define “Transition to Practice” as making privacy tools and systems operational.
GT Research Data Project Team Original Charge: to investigate, evaluate, assess, and communicate Georgia Tech researchers’ data practices, processes, and.
MDL Information Systems, Inc. Powering the Process of Invention Donna del Rey Director, Business Planning
SciencePAD Open Software for Open Science Alberto Di Meglio – CERN.
Centre for Aerospace Systems Design & Engineering (CASDE), IIT Bombay Presentation to the 3 rd Meeting of Joint Policy Committee June 10 th 2002.
Enhancements to Galaxy for delivering on NIH Commons
Patrick Desbrow, CIO & VP of Engineering October 29, 2014
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
Discover. Analyze. Connect.
Jan 2016 Solar Lunar Data.
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
The Data Grid: Towards an architecture for Distributed Management
MATLAB Distributed, and Other Toolboxes
Joseph JaJa, Mike Smorul, and Sangchul Song
Tim Smith CERN Geneva, Switzerland
Components of A Successful Data Warehouse
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
VI-SEEM Data Repository
A Review of BSC Vocabulary
Databases at the Hub Now you can Create them yourself!
VI-SEEM Data Repository
Graduation Project Kick-off presentation - SET
HL7/College/University Internship Program
The Re3gistry software and the INSPIRE Registry
2017 Safety Group 1 – 5 Year Program Timeline Guide
Yearly Maintenance Process (for existing messages)
2009 TIMELINE PROJECT PLANNING 12 Months Example text Jan Feb March
4CeeD: Private Cloud and Data Cyber-Infrastructure for Scientific Instruments Steve Konstanty, Senior Research Programmer, CSL.
2016 Safety Group 1 – 5 Year Program Timeline Guide
Development Goals for Year 2
2012 Safety Group 1 – 5 Year Program Timeline Guide
Core 5: Training Randy Gollub, MD PhD Guido Gerig, PhD
2009 TIMELINE PROJECT PLANNING 12 Months Example text Jan Feb March
Presentation transcript:

DEEDS A Platform for Sharing Data, Computing & Scientific Workflows CIF21 DIBBs: EI: Digital Environment for Enabling Data-driven Science (DEEDS) PI: Ann Christine Catlin | co-PIs: Ashraful Alam, Joseph Francisco, Marisol Sepulveda, Connie Weaver Award #1724728 | Award Period August 2017 – July 2021 DEEDS “Use Cases” Workshop June 29 2018

Data Infrastructure Code Platform Input Files Dashboard Upload Output Spreadsheets Annotate Computing Workflows HPC Models Servers Interfaces Analytics Outcomes Sharing Access Figures Results Applications Reports Publication Exploration Reuse Dissemination Discovery

DEEDS DASHBOARD Users organize datasets by “cases” (experiments, subjects, specimens, sites, study units, research activities) to clarify how their investigation is conducted. This interpretive framework makes experimental design and dataset content easier for researchers to understand and use since Files & Data are connected to the activities that produced them. Users upload, annotate & classify files such as reports, figures, photos, device data, input, output, and any other files produced throughout the investigation. Interactive interfaces let users explore file collections, which are organized into mime-based categories for ease of use and discovery. Users define complex structured data models to describe properties, measurements, observations, and other data assembled throughout the investigation. Our “spreadsheets of spreadsheets” approach is used to upload, view, and operate interactively on multi-dimensional data tables. Users define and launch computing software, assign resources (e.g., HPC), select input, and trace execution. Output is automatically collected, annotated, classified, and uploaded to the dataset. Research workflows are captured end-to-end. Interactive interfaces join heterogenous dataset content so users can view, search, sort, filter, explore, download, compare, visualize, map the data in their dataset.

Environmental Science Electrical Engineering Performance modeling of solar PV systems Chemistry Joe Francisco University of Pennsylvania Optimization of molecular structure and determination of properties Nutrition Science Connie Weaver Purdue University Effect of berries on net calcium retention & biochemical markers of polyphenol and bone metabolism Marisol Sepúlveda Environmental Science Development of amphibian toxicity reference values for ecological risk assessment Ashraf Alam Electrical Engineering Post-doctoral Researcher Ross Hoehn Post-doctoral Researcher Kalina Hodges PhD Students Tahir Patel Reza Asadpour Xingshu Sun Post-doctoral Researcher Wes Flynn Research Laboratory Manager Sam Guffey

Research Computing Research Computing R&D Lead for DEEDS Project Computing Infrastructure Chandima Hewa Nadungodage Purdue University Steve Clark Purdue University Senior Software Engineers Sumudinie Fernando Guneshi Wickramaarachchi Computer Science PhD Student Andres Bejarano Computer Science Masters Students Paramesh Desigavinayagam Omkar Patil with support from the hubzero team Pascal Meunier Anthony Fuentes DEEDS platform architecture: sharing data, computing & scientific workflows

DEEDS implementation begins DEEDS TIMELINE AUG 2017 Research cases, input/ output files, computing code, HPC execution & workflow Solar PV EcoTox Research cases, data model for collected measurements, spreadsheets, workflow OCT 2017 JAN-FEB 2018 Berries, Quantum Research cases, input/output files, research computing, complex data models & collected measures, data & computing workflows All JUN 2018 Create datasets, evaluate DEEDS platform completeness & usability, testing, feedback DEEDS TEAMS DEEDS R&D Platform requirements, dashboard design, user interfaces design for cases & files, tools infrastructure, “submit” requirements , web component structure SEP 2017 DEEDS R&D Requirements analysis, dashboard feature prototyping, user interface design for data models, user interface prototyping, database & repository structure NOV-DEC 2017 Ongoing design & development, prototype platform, dashboard functionality for cases, files, data, & tools, dataset building& testing. Extensions, enhancements, fixes. MAR-JUN 2018 DEEDS R&D DEEDS implementation begins DEEDS version 1

DEEDS SCIENTIFIC DATASETS Create Organize Preserve Compute Share Explore Learn Educate Publish Reuse