CyVerse Tools and Services

Presentation transcript:

CyVerse Tools and Services
Jason Williams – Education, Outreach, Training Lead
Cold Spring Harbor Laboratory
williams@cshl.edu
@JasonWilliamsNY

CyVerse vision
Transforming science through data-driven discovery
More than 40K users, PBs of data, and hundreds of publications, courses, and discoveries

CyVerse evolution (timeline figure; axis marks 2006, 2010, 2015, 2017; a "public launch" milestone is also marked)
iPlant 2008: Empowering a New Plant Biology
iPlant 2013: Cyberinfrastructure for Life Sciences (funding renewal)
CyVerse 2016: Transforming Science Through Data-Driven Discovery

CyVerse growth: user accounts

CyVerse growth: publications/acknowledgements

Community-focused cyberinfrastructure
Platforms, tools, datasets
Storage and compute
Training and support

CyVerse is built for data
Domains: microbial, plant, animal, biomedical, ecological
Data types: sequence, images, other datatypes

CyVerse product stack (diagram)
Layers: Ready-to-use Platforms, Extensible Services, Foundational Capabilities, Established CI Components
Axis labels: Ease of Use, Flexibility

Data Store
The resources you need to share and manage data with your lab, colleagues and community
Initial 100 GB allocation – TB allocations available
Automatic data backup
Easy upload/download and sharing
Focus here is on genomics data, but not restricted to genomics data

Data lifecycle support
Discovery: Data Commons Repository (DCR), Elasticsearch
Upload: Discovery Environment, iCommands, Cyberduck
Metadata: Add, delete, copy; metadata templates; bulk metadata
Publication: Data Commons Repository (DCR), NCBI-SRA
Analysis: Discovery Environment, Atmosphere, Agave API, BisQue, DNA Subway
Sharing: Community Data folders, Data Commons, quick share links
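As a reading aid, the stage-to-tool pairing on this slide can be written as a small lookup table. The sketch below is only an illustration in Python: the dictionary contents are read from the slide, while the names `LIFECYCLE` and `stages_for` are hypothetical and are not part of any CyVerse API.

```python
# Illustrative only: stage -> tools pairing as read from the slide.
# LIFECYCLE and stages_for are hypothetical names, not a CyVerse API.
LIFECYCLE = {
    "Discovery": ["Data Commons Repository (DCR)", "Elasticsearch"],
    "Upload": ["Discovery Environment", "iCommands", "Cyberduck"],
    "Metadata": ["Add, delete, copy", "metadata templates", "bulk metadata"],
    "Publication": ["Data Commons Repository (DCR)", "NCBI-SRA"],
    "Analysis": ["Discovery Environment", "Atmosphere", "Agave API",
                 "BisQue", "DNA Subway"],
    "Sharing": ["Community Data folders", "Data Commons", "quick share links"],
}


def stages_for(tool):
    """Return the lifecycle stages, in slide order, that list the given tool."""
    return [stage for stage, tools in LIFECYCLE.items() if tool in tools]


if __name__ == "__main__":
    print(stages_for("Discovery Environment"))
```

Looking a tool up this way shows which platforms span more than one stage: the Discovery Environment, for instance, is listed under both Upload and Analysis.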

Discovery Environment
Hundreds of bioinformatics Apps in an easy-to-use interface
A platform that can run almost any bioinformatics application
Seamlessly integrated with data and high performance computing
User extensible – add your own applications

Example Workflows (diagram; labels as extracted)
Workflow labels: Sequence Read Processing, HTProcess, Data Publication, SRA Submission, Data Commons, Assembly, Genome, Transcriptome, Variation Analysis, Genome Annotation, Association Pipeline, Validate Pipeline, RNA-Seq, Methylation
Platforms shown: Discovery Environment, Agave API, Atmosphere

Atmosphere
Cloud computing for the life sciences
Simple: Access to hundreds of virtual machine images
Flexible: Fully customize your software setup
Powerful: Integrated with CyVerse computing and data resources

On-demand Cloud (diagram)
CyVerse Cloud: Atmosphere Instance (virtual machine) = (Disk + CPU + Memory) + (Image)
Example instance address shown: 128.196.34.158
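The composition on this slide, instance = (Disk + CPU + Memory) + (Image), can be sketched as a pair of plain data types. This is a hypothetical Python model for illustration only: the class names, field names, sizes, and image name are assumptions and do not reflect Atmosphere's actual API or instance sizes. Only the address is taken from the slide.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resources:
    """The (Disk + CPU + Memory) part of an instance. Field names are assumed."""
    disk_gb: int
    cpus: int
    memory_gb: int


@dataclass(frozen=True)
class Instance:
    """A running virtual machine: a resource allocation plus a named image."""
    resources: Resources
    image: str    # label of the virtual machine image (hypothetical)
    address: str  # IP address at which the instance is reachable

    def describe(self) -> str:
        r = self.resources
        return (f"{self.image} @ {self.address}: "
                f"{r.cpus} CPU, {r.memory_gb} GB RAM, {r.disk_gb} GB disk")


# Sizes and image name are made up; the address is the one shown on the slide.
vm = Instance(Resources(disk_gb=60, cpus=2, memory_gb=8),
              image="example-image", address="128.196.34.158")
print(vm.describe())
```

Keeping the image separate from the hardware allocation mirrors the slide's point that the same image can, in principle, be launched on differently sized resources.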

Science APIs
Fully customize CyVerse resources
Science-as-a-service platform
Define your own compute and storage resources (local and CyVerse)
Build your own app store of scientific codes and workflows

API-enabled federation (diagram, "Powered by CyVerse")
Sites shown: Arizona, TACC, RENCI, CSHL, NASA

DNA Subway
Educational workflows for Genomes, DNA Barcoding, RNA-Seq
Commonly used bioinformatics tools in streamlined workflows
Teach important concepts in biology and bioinformatics
Inquiry-based experiments for novel discovery and publication of data

Support for Course-Based Research Experiences

BisQue
Image analysis, management, and metadata
Secure image storage, analysis, and data management
Integrate existing applications or create new ones
Custom visualization and image handling routines and APIs

Image and GxE-driven collaboration

Looking ahead
Division of Biological Infrastructure: $100 Million, 10-year investment; Year 9 of 10 (end date Sept 30, 2018)
Future Funding: “The NSF BIO Directorate and NSF leadership are pleased with the progress of the project, and will be inviting an application for continued funding to support advances in life science research.”
Discussions with Other Agencies and Foundations

Future-focused mission goals
Enable data-driven discovery: Enable “deep” data integration and analysis. Support sophisticated data expeditions defined by users or user groups.
Foster interoperability across computational resources and platforms: Deliver CyVerse as a self-contained platform to public and private sector entities. Encourage “Powered by CyVerse”. Align with other resources: Amazon, Google, NIH Commons, many other federal projects.
Train the next generation of data scientists: Develop a sophisticated workforce for academia and industry.

Looking ahead: Transitioning to Stampede2
This summer, the high-performance computing (HPC) systems used by CyVerse applications will transition to Stampede2.
Improved speed, memory, and overall performance
Longer wait times on these jobs until early fall

Looking ahead: Improved user support
Live chat feature to help you when you are stuck
Project-based interfaces to help you organize data, analyses, and collaborators for a more collaborative experience

Looking ahead: All-new CyVerse Learning Center
Improved, easier-to-navigate guides and tutorials
Organized through GitHub and Read the Docs, making it easier to contribute to our documentation or make your own

Looking ahead: Introducing SciApps
Streamlined workflows for the most common analysis needs
Extensible compute in an easy-to-navigate interface

CyVerse is a collaborative virtual organization
CyVerse Institutions
CyVerse UK