A framework to support collaborative Velo: Knowledge Management for Collaborative (Science | Biology) Projects A framework to support collaborative 1
Scientific Knowledge Management (KM) Knowledge Management systematic strategy of creating, conserving, and sharing knowledge to increase performance and innovation Capabilities required for a collaborative scientific KM Platform Associating disparate information Questioning data and results Experimenting with data Sharing hypotheses, data, results 2
Velo Overview Velo supports common knowledge management needs across science domains Carbon sequestration Climate Modeling Bioinformatics Subsurface modeling ….. Easily customized to specific science needs Data types Analysis/simulation tools Pluggable, extensible architecture Robust and scalable – built on widely used open source technologies Built to support collaboration across multi- disciplinary teams 3 Metadata extraction Data Storage Access control Provenance Semantic markup Knowledge Management System
Knowledge Management in Velo Knowledge = data + models + results + provenance Scientific Data Manage empirical/observational/derived data used to set up and parameterize models Velo can be easily customized to handle different data types Models and Simulations Manage multiple versions of models and associated results Launch simulations and data analysis on HPC/cloud platforms Results Automatically retrieve and store outputs associated with specific model versions Incorporate visualizations of simulation/model outputs Provenance Automatically and manually create links between related inputs and outputs and computational processes 4
Velo: How it works Content Store Web Store Provenance Velo API Tool Integration Framework Web Client Desktop Client Collaborative Knowledge Management Framework
FSFA Demonstration 6
7
8
9
10
11
12
13
14
Carbon Sequestration Example 15
Velo: Data Management Ingest any data types into Velo Incorporate scripts and tools to visualize and analyze data Extensible programmatic framework for new data types Examples: Incorporating well bore data logs for subsurface modeling Managing genome data for bioinformatics 16
Models: Model Setup and Simulation Manage conceptual models Launch simulations on remote HPC platforms Extensible to incorporate tools for model creation Examples: Conceptual model worksheets for subsurface models Simulation launching Mesh visualization 17
Results: Management and Analysis Retrieve simulation results from execution platforms Automatically visualize results Framework for incorporating analysis and visualization tools Examples: Plots for climate simulation outputs Visualizing plume extents for contaminants 18
Tool History 1.Velo gives option to the user to record Inputs Outputs Control parameters 2.Automatically loads the last saved inputs in tool’s input form Current Development plan – Browse and re- run any earlier invocation 19
Provenance Ability to link related artifacts for forensic investigations Both manually and automatically Examples: Link input data sets to models Link conceptual model versions to results Associate comments and analyses to simulation outputs 20
Existing Velo-based Deployments 21 Velo FutureGen SimSeqiRESM Biology FSFA NEAMS Akuna Flexible Customizable Collaborative Scalable
Next Steps We’re keen to work with others To deploy Velo to support scientific communities To partner on proposals To collaborate on projects To enhance the technology We’ll open source the Velo technology mid-year Downloadable User documentation Programmer documentation 22