Transforming Science Through Data-driven Discovery Genomics in Education University of Delaware – February 2016 Jason Williams, Education, Outreach, Training.

Slides:



Advertisements
Similar presentations
 Preparing undergraduates to succeed in college and beyond in a bioinformatics-rich curriculum  Discussion of existing resources, opportunities, and.
Advertisements

The Golden Age of Biology DNA -> RNA -> Proteins -> Metabolites Genomics Technologies MECHANISMS OF LIFE Health Care Diagnostics Medicines Animal Products.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
1 iPlant Data Store (iDS) Supporting the Lifecycle of Data Nirav Merchant 1.
DNA Subway Green Line Overview. Growth of Sequence Read Archive (SRA) 2.2 Quadrillion bases Log Scale!
IPlant Collaborative Powering a New Plant Biology iPlant Collaborative Powering a New Plant Biology.
Development of Bioinformatics and its application on Biotechnology
Using DNA Subway in the Classroom Red Line Lesson Sketch.
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
Manifestations of a Code Genes, genomes, bioinformatics and cyberspace – and the promise they hold for biology education.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014.
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
Copyright © 2010 Pearson Education Inc. Lecture 01 – Genetics & Genomics: An Introduction Based on Chapter 1 – Genetics: An introduction.
Welcome to DNA Subway Classroom-friendly Bioinformatics.
I. Introduction and Red Line Education for Data-unlimited Science.
The iPlant Collaborative
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iPlant Data Store.
The iPlant Collaborative Using iPlant for sharing, managing, and analyzing ecological data Ramona Walls Presented at ESA 2014 – Ignite session August 12,
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
IPlant Genomics in Education
Bioinformatics Curriculum Issues, goals, curriculum.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iPlant Data Store – Managing Your ‘Big’ Data.
Build an Automated Workflow Visual Workflow Creator Discovery Environment.
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop …and Environments.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop BISQUE.
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment.
Introductory Phylogenetic Workflows in the Discovery Environment Sheldon McKay iPlant Collaborative, DNALC, Cold Spring Harbor Laboratory Feb 8, 2012.
RNA-Seq visualization with CummeRbund
Using DNA Subway in the Classroom Genome Annotation: Red Line.
Canadian Bioinformatics Workshops
Transforming Science Through Data-driven Discovery Tools and Services Workshop Atmosphere Joslynn Lee – Data Science Educator Cold Spring Harbor Laboratory,
Transforming Science Through Data-driven Discovery Tools and Services Workshop Data Store Overview.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
Transforming Science Through Data-driven Discovery Tools and Services Workshop Data Store – Managing your ‘Big’ Data Joslynn Lee, Ph.D. – Data Science.
Transforming Science Through Data-driven Discovery Tools and Services Workshop Data Store – Managing your ‘Big’ Data Joslynn Lee – Data Science Educator.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Joslynn.
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Atmosphere.
Joslynn S. Lee, PhD, Data Science Educator Cold Spring Harbor Laboratory, DNA Learning Center Transforming Science Through Data-driven Discovery.
Transforming Science Through Data-driven Discovery Bringing your Bioinformatics tools to CyVerse’s Discovery Environment using Docker Upendra Kumar Devisetty.
CyVerse Tools and Services
Tools and Services Workshop
Joslynn Lee – Data Science Educator
CyVerse Discovery Environment
JMC CGEMS SUMMER GENOMICS TRAINING WORKSHOPS
Overview Bioinformatics: Analyzing biological data using statistics, math modeling, and computer science BLAST = Basic Local Alignment Search Tool Input.
Genome organization and Bioinformatics
Cyberinfrastructure for the Life Sciences
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Transforming Science Through Data-driven Discovery Genomics in Education University of Delaware – February 2016 Jason Williams, Education, Outreach, Training Lead Joslynn Lee, Data Science Educator Cold Spring Harbor

CyVerse Evolution iPlant 2008 Empowering a New Plant Biology iPlant 2013 Cyberinfrastructure for Life Science CyVerse 2016 Transforming Science Through Data-Driven Discovery

We are funded by the National Science Foundation We are your colleagues and collaborators! $100 Million in investment Freely available to the community Spur national/international collaboration Cite CyVerse: CyVerse.org/acknowledge-cite-cyverse DBI and DBI CyVerse Evolution

CyVerse 2016 Transforming Science Through Data-Driven Discovery Vision: Transforming science through data-driven discovery Mission: Design, develop, deploy, and expand a national cyberinfrastructure for life science research, and train scientists in its use More than 30K users, PB of data, and hundreds of publications, courses, and discoveries

What is Cyberinfrastructure? Data storage Software High-performance computing People organized into systems that solve problems of size and scope that would not otherwise be solvable.

What is Cyberinfrastructure? Platforms, tools, datasets Storage and compute Training and support

CyVerse supports all domains of life science Plant / Microbial Animal Biomedical Ecological/Climate CyVerse is built for Data

CyVerse product stack Ready to use Platforms Foundational Capabilities Established CI Components Extensible Services Ease of Use Flexibility

Genomics in Education

Big data biology – Education and Research Image Credits: Genome sequencing costs: Oxford nanopore sequencer: Agricultural drone: Fitbit: 100K fold costs decrease in sequencing Hand-held sequencers Drones Biological sensors Biology is swimming in data

Big data biology – Too fast to keep up? “Essentially, all models are wrong, but some are useful” – George E.P. Box

Big data biology – Too fast to keep up?

1866 – Mendel publishes work on inheritance 1869 – DNA discovered 1915 – Hunt Morgan describes linkage and recombination 1953 – Structure of DNA described 1956 – Human chromosome number determined 1968 – First gene mapped to autosome 1977 – Dideoxy sequencing 1983 – PCR 1986 – Human Genome Project proposed

Big data biology – Too fast to keep up? 1993 – First MicroRNAs described 2003 – First ‘Gold Standard’ human genome sequence 2005 – First draft of human haplotype map (HapMap) 2007 – ENCODE project

Big data biology – Too fast to keep up?

Challenge – bringing students into the fold How do scientists share their data and make it publically available? How do scientists extract maximum value from the datasets they generate? How can students and educators (who will need to come to grips with data-intensive biology) be brought into the fold? ResearchEducation Students can work with the same data at the same time and with the same tools as research scientists.

Can you navigate the tools? What are your challenges in teaching bioinformatics in the classroom?

Take the Subway

DNA Subway Classroom friendly bioinformatics Faculty identified guiding requirements that shaped the development of CyVerse educational platforms: Mix lecture and lab – have a wet bench “hook” Student-scientist partnerships – someone has to care about the data Co-investigation – projects should potentially lead to publications Scale – platforms should support projects multiple classrooms can join.

DNA Subway Classroom friendly bioinformatics More than 13,000 users More than 28,000 student projects in 2015

DNA Subway Red Line: Genome annotation Red Line Analyze up to 150 KB of DNA sequence De novo gene prediction Construct evidence-based gene models Visualize genome sequence in browser

DNA Subway Yellow Line: Genome prospecting Yellow Line Analyze DNA or protein sequence Search plant genomes using TARGeT Explore gene duplications, transposons, and non-coding sequences not detectable in conventional BLAST searches

DNA Subway Blue Line: DNA barcoding, and phylogenetics Analyze DNA or protein sequence Search plant genomes using TARGeT Explore gene duplications, transposons, and non-coding sequences not detectable in conventional BLAST searches Blue Line

DNA Subway Green Line: Transcriptome analysis Green Line Examine RNA-Seq data for differential expression Use High-performance computing to analyze complete datasets Generate lists of genes and fold-changes; add results to Red Line projects

Transforming Science Through Data-driven Discovery Parker Antin Nirav Merchant Eric Lyons Matt Vaughn Doreen Ware Dave Micklos CyVerse is supported by the National Science Foundation under Grant No. DBI and DBI CyVerse Executive Team