Joslynn Lee – Data Science Educator

Slides:



Advertisements
Similar presentations
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Advertisements

April 2009 OSG Grid School - RDU 1 Open Science Grid John McGee – Renaissance Computing Institute University of North Carolina, Chapel.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Assessment of Core Services provided to USLHC by OSG.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
IPlant Collaborative Powering a New Plant Biology iPlant Collaborative Powering a New Plant Biology.
Customized cloud platform for computing on your terms !
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Data to Discovery The iPlant Collaborative Community Cyberinfrastructure for Life Science Nirav Merchant iPlant / University.
Enabling Cloud and Grid Powered Image Phenotyping Nirav Merchant iPlant Collaborative
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Customized cloud platform for computing on your terms ! Nirav Merchant
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014.
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop – Part 2 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 29, 2015,
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iPlant Data Store.
The iPlant Collaborative Using iPlant for sharing, managing, and analyzing ecological data Ramona Walls Presented at ESA 2014 – Ignite session August 12,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Atmosphere.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop - Part 1 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 28, 2015,
Overview of Atmosphere
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iPlant Data Store – Managing Your ‘Big’ Data.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop BISQUE.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
IPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment Sriram Srinivasan.
Transforming Science Through Data-driven Discovery Genomics in Education University of Delaware – February 2016 Jason Williams, Education, Outreach, Training.
Transforming Science Through Data-driven Discovery Tools and Services Workshop Atmosphere Joslynn Lee – Data Science Educator Cold Spring Harbor Laboratory,
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
Transforming Science Through Data-driven Discovery Tools and Services Workshop Data Store – Managing your ‘Big’ Data Joslynn Lee, Ph.D. – Data Science.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
CyVerse Data Store Managing Your ‘Big’ Data. Welcome to the Data Store Manage and share your data across all CyVerse platforms.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Joslynn.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Atmosphere.
Joslynn S. Lee, PhD, Data Science Educator Cold Spring Harbor Laboratory, DNA Learning Center Transforming Science Through Data-driven Discovery.
Transforming Science Through Data-driven Discovery Bringing your Bioinformatics tools to CyVerse’s Discovery Environment using Docker Upendra Kumar Devisetty.
Sustaining the software capabilities long term Address Solutions as part of software. Act on “Hard challenges are not technical” bringing in the right.
Accessing the VI-SEEM infrastructure
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
Scaling Compute with R in CyVerse
Organizations Are Embracing New Opportunities
CyVerse Tools and Services
Tools and Services Workshop
Customized cloud platform for computing on your terms !
CyVerse Discovery Environment
INTAROS WP5 Data integration and management
Free Cloud Management Portal for Microsoft Azure Empowers Enterprise Users to Govern Their Cloud Spending and Optimize Cloud Usage and Planning MICROSOFT.
Tools and Services Workshop Overview of Atmosphere
JMC CGEMS SUMMER GENOMICS TRAINING WORKSHOPS
Tools and Services Workshop
Data uploading and sharing with CyVerse
EGI Webinar - Introduction -
Cyberinfrastructure for the Life Sciences
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
MCBIOS 2016 – University of Memphis, TN
Matthew Farmer Making Azure Integration Services Real
Presentation transcript:

Joslynn Lee – Data Science Educator CyVerse Overview Joslynn Lee – Data Science Educator DNA Learning Center, Cold Spring Harbor Laboratory jolee@cshl.edu

CyVerse evolution From plant science, to life science, and beyond… Transforming Science Through Data-Driven Discovery iPlant 2008 Empowering a New Plant Biology iPlant 2013 Cyberinfrastructure for Life Science Established by the U.S. National Science Foundation (NSF) in 2008 to develop cyberinfrastructure for life sciences research and democratize access to U.S. supercomputing capabilities. iPlant's original mission was to provide the cyberinfrastructure needed by the plant science research community to address Grand Challenge problems that could not be addressed with single-lab research funding iPlant developed a species-generic cyberinfrastructure platform that is now in high demand across research domains of many species. At the recommendation of the NSF, iPlant has extended its scope beyond plants. The broader life science (non-human research) community is quickly adopting iPlant's platform, expanding the user base, and leveraging additional domain knowledge and technical expertise to support the collaborative, while maintaining the founding principles and vision of the project.

We are funded by the National Science Foundation From plant science, to life science, and beyond… Directorate for Biological Sciences $100 Million in investment We are your colleagues and collaborators Freely available to the community Spur national/international collaboration Cite CyVerse: CyVerse.org/acknowledge-cite-cyverse DBI-0735191 and DBI-1265383

Transforming Science Through Data-Driven Discovery CyVerse evolution From plant science, to life science, and beyond… Vision: Transforming science through data-driven discovery Mission: Design, develop, deploy, and expand a national cyberinfrastructure for life science research, and train scientists in its use More than 30K users, PB of data, and hundreds of publications, courses, and discoveries CyVerse 2016 Transforming Science Through Data-Driven Discovery 1024 bytes  =  1 KB 1024 KB  =  1 MB 1024 MB  =  1 GB 1024 GB  =  1 TB 1024 TB  =  1 PB

What is cyberinfrastructure? CI provides solutions to the challenges of large-scale computational science were unapproachable because the computational requirements were too large, too complex, or simply unknown Platforms, tools, datasets, Storage and compute Training and support Software HPC People

CyVerse evolution From plant science, to life science, and beyond…

CyVerse supports all domains of life science From plant science, to life science, and beyond… Plant / Microbial Animal Biomedical Ecological/Climate CyVerse provides life scientists with powerful computational infrastructure to handle huge datasets and complex analyses

CyVerse supports all level of users User perspectives and potential applications Bench Scientist Bioinformatician Core Facilities Welch et al. 2013

CyVerse collaborators From plant science, to life science, and beyond… Arabidopsis Information Portal CyVerse collaborates to enable access to the solutions that work the best for you

CyVerse is a collaborative virtual organization CyVerse institutions From plant science, to life science, and beyond… CyVerse is a collaborative virtual organization

CyVerse products Ease of Use Flexibility From plant science, to life science, and beyond… Ready to use Platforms Extensible Services Ease of Use Flexibility Established CI Components Foundational Capabilities

CyVerse products Data Store Science APIs Discovery Environment Bisque From plant science, to life science, and beyond… Data Store Science APIs Discovery Environment Bisque Atmosphere DNA Subway

Data Store The resources you need to share and manage data with your lab, colleagues and community Initial 100 GB allocation – TB allocations available Automatic data backup Easy upload /download and sharing

Discovery Environment Hundreds of bioinformatics Apps in an easy-to-use interface user interface for access to the tools and computing resources Run existing bioinformatics software apps on CyVerse clusters or TACC supercomputers quickly, easily, and efficiently User extensible – add your own applications bioinformatics workflow—data management, analysis, sharing large datasets

Atmosphere Cloud computing for the life sciences Simple: One-click access to more than 200 virtual machine images Publish your own software suites, create your own work environments, and run the software for community use access the CyVerse’s core infrastructure resources, including high performance computing (HPC), grid computing environments

Science APIs (Application Programming Interfaces) Fully customize CyVerse resources Science-as-a-service platform Define your own compute, and storage resources (local and CyVerse) Build your own app store of scientific codes and workflows and share with anyone (developers and bioinformaticians)

Bisque Image analysis, management, and metadata Bio-Image Semantic Query User Environment Secure image storage, analysis, and data management Integrate existing applications or create new ones 100+ biological image formats

DNA Subway Educational workflows for Genomes, DNA Barcoding, RNA-Seq Commonly used bioinformatics tools in streamlined workflows Teach important concepts in biology and bioinformatics Inquiry-based experiments for novel discovery and publication of data