Current GEP members The Genomics Education Partnership (GEP) began in 2006 with 16 members, and has grown steadily. GEP members represent a very diverse.

Slides:



Advertisements
Similar presentations
WHY CMS? WHY NOW? CONTENT MANAGEMENT SYSTEM. CMS OVERVIEW Why CMS? What is it? What are the benefits and how can it help me? Centralia College web content.
Advertisements

Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS Ravi K Madduri University of Chicago and ANL.
BIOCMS: Resource Integration and Web Application Framework for Bioinformatics DHUNDY R BASTOLA †, *, ANIL KHADKA †, MOHAMMAD SHAFIULLAH † AND HESHAM ALI.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
NGS Analysis Using Galaxy
GMOD in the Cloud Genome Informatics November 3, 2011 Scott Cain GMOD Project Coordinator Ontario Institute for Cancer Research
Enriching the Ontology for Biomedical Investigations (OBI) to Improve Its Suitability for Web Service Annotations Chaitanya Guttula, Alok Dhamanaskar,
Customized cloud platform for computing on your terms !
Bioinformatics Core Facility Ernesto Lowy February 2012.
Manifestations of a Code Genes, genomes, bioinformatics and cyberspace – and the promise they hold for biology education.
Genomics Virtual Lab: analyze your data with a mouse click Igor Makunin School of Agriculture and Food Sciences, UQ, April 8, 2015.
Computer Lab (I) Introduction of galaxy and UCSC genome browser.
Infrastructure clouds, microbial genomics, and the Cloud Virtual Resource project (CloVR) Sam Angiuoli
Enabling Cloud and Grid Powered Image Phenotyping Nirav Merchant iPlant Collaborative
Department of Health and Human Services National Institutes of Health National Center for Research Resources Division of Research Infrastructure Extending.
GEP CURRICULUM FOR BEGINNING STEM MAJORS GEP Alumni Consortium.
I hear and I forget. I see and I remember. I do and I understand. - Confucius PCAST recommendation #2: “Advocate and provide support for replacing standard.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
NGS data analysis CCM Seminar series Michael Liang:
Enabling Cloud and Grid Powered Image Phenotyping Martha Narro iPlant Collaborative Adapted.
-- Don Preuss NCBI/NLM/NIH
I. Introduction and Red Line Education for Data-unlimited Science.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
+ => Bioinformatics: from Sequence to Knowledge Outline: Introduction to bioinformatics The TAU Bioinformatics unit Useful bioinformatics issues and databases:
Sackler Medical School
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Nature Reviews/2012. Next-Generation Sequencing (NGS): Data Generation NGS will generate more broadly applicable data for various novel functional assays.
The UCSC Table Browser & Custom Tracks Advanced searching and discovery using the UCSC Table Browser and Custom Tracks Osvaldo Graña CNIO Bioinformatics.
IPlant Genomics in Education
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
Research Problem In one sentence, describe the problem that is the focus of your classroom research project about student learning: There is a growing.
The role of the National Agricultural Library in arthropod genomics research - implementing and developing tools for genomic data management Monica Poelchau.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Accessing and visualizing genomics data
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Transforming Science Through Data-driven Discovery Genomics in Education University of Delaware – February 2016 Jason Williams, Education, Outreach, Training.
The Genomics Education Partnership, 2011 Charles Hauser 1, Wilson Leung 3, Chris Shaffer 3, David Lopatto 2, Sarah Elgin 3 and faculty and students of.
Visualizing data from Galaxy
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Q Education Transform Resources. Intel® is Committed to Transforming Education for the Next Generation Intel supports education transformation 
Centralizing Bioinformatics Services: Analysis Pipelines, Opportunities, and Challenges with Large- scale –Omics, and other BigData High-Performance Computing.
Give Your Data the Edge A Scalable Data Delivery Platform
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
CyVerse Tools and Services
Tools and Services Workshop
University of Chicago and ANL
Joslynn Lee – Data Science Educator
CyVerse Discovery Environment
NGS Analysis Using Galaxy
Regulatory Genomics Lab
Computational Reasoning in High School Science and Math
JMC CGEMS SUMMER GENOMICS TRAINING WORKSHOPS
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Introduction to G-OnRamp
Cyberinfrastructure for the Life Sciences
Consortium for Entrepreneurship Education
Student Satisfaction Results
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Yating Liu July 2018 G-OnRamp workshop
Follow-up from last night: XSEDE credits
Harvard Web Publishing Web Publishing for the Harvard Community
Regulatory Genomics Lab
Regulatory Genomics Lab
Cloud for Development Data Storage, Data Analytics, Data Sharing
BCoN Data Integration Workshop, University of Kansas, Feb 13-14, 2018
Presentation transcript:

Current GEP members The Genomics Education Partnership (GEP) began in 2006 with 16 members, and has grown steadily. GEP members represent a very diverse group of schools, both public and private, large and small, with varying educational missions and diverse student populations. Currently there are > 100 affiliated schools; > 70 faculty/year are engaged in GEP research, and > 1,000 undergraduates participate each year. Faculty generally join by attending a one-week workshop at WUSTL. Shared work (done in summer Alumni Workshops) is organized on the GEP website (curriculum development, publications, etc.). We find that institutional characteristics have little correlation with student success, indicating that diverse students in diverse settings benefit from course-based research experiences of this type. Engaging Biologists with Big Data Using Interactive Genome Annotation Remi Marenco 1, Wilson Leung 2, Sarah C.R. Elgin 2 and Jeremy Goecks 1 1 George Washington University and 2 Washington University in St. Louis Project goal : combine two successful and long-running projects—the Genomics Education Partnership and the Galaxy Project—to create an integrated, Web-based, and scalable environment (G-OnRamp) that will enable biologists to utilize large genomic datasets for interactive annotation of any genome, an activity that can serve as an introduction and training for “big data” biomedical analyses. The Genomics Education Partnership (GEP) Primary goals: Incorporate genomics and bioinformatics into the undergraduate curriculum Engage undergraduates in genomics research Central organization: Hosts training workshops for GEP faculty / TAs Develops & maintains web framework for projects Hosts shared curriculum & assessment Student photos taken by GEP faculty Michael Rubin (University of Puerto Rico – Cayey) and Heather Eisler (University of the Cumberlands) Workflow Faculty members have collaboratively developed a variety of ways to use the GEP approach in their teaching: Short ( ∼ 10 hrs) modules in a genetics course Longer modules within molecular biology laboratory courses Stand-alone genomics lab courses Independent research studies Results produced by GEP students are reconciled and used in subsequent scientific publications [e.g., Leung et al. 2015, G3. 5(5):719-40]. Training benefits Students are challenged to analyze and evaluate available evidence (assembled on the GEP UCSC genome browser) to create optimal gene models, often in the face of contradictory evidence, & explore other genomic features (right). GEP students report substantial learning gains, which improve significantly with more time invested (bottom). GEP challenges can be addressed by Galaxy GEP provides an ideal use case for training scientists to work with big data, but there are several challenges that Galaxy can address and help to scale and improve GEP: GEP ChallengeGalaxy Feature to Address Challenge Difficult to set up and integrate GEP computational tools Automated installation and configuration Cannot be easily extended to organisms beyond Drosophila Develop approach to work with any organism and with multiple organisms Limited flexibility to add custom analyses and data into the curriculum Supports completely customizable workflows and analyses Difficult to collaborate on gene annotations and analysis workflows across physically distributed sites Web-based collaboration framework for sharing all Galaxy objects Contacts Jeremy Goecks Sarah CR Elgin GEP + Galaxy = G-OnRamp G-OnRamp Goals: Create a custom Galaxy server to power interactive annotation of any genome Provide an interactive, Web-based platform that can scale to support world-wide big data biomedical training through interactive genome annotation Foster the growth of the GEP and other educational communities to increase the participation of undergraduates and the broader scientific community in genomics research Workflows, tools, and visualizations will be agnostic to the organism: Facilitate the analyses and annotations of non-model organisms Ensure that G-OnRamp can reach as broad an audience as possible Validating G-OnRamp using GEP: GEP faculty will serve as beta testers to ensure that G-OnRamp meets real educational needs Provide continuous feedback to help guide the development of G-OnRamp Help test and revise curriculum and training materials during workshops Shaffer CD et al. 2014, CBE Life Sci Educ. 13(1): Public “draft” genomes Divide into overlapping student projects ( kb) Sequence and assembly improvement Optional wet bench experiment PCR/sequencing of gaps Evidence-based gene annotation Collect projects, compare and confirm annotations Reassemble into high quality annotated sequence Analyze and publish results Sequence Improvement Annotation Collect projects, compare and verify final consensus sequence Optional evidence-based TSS and motif annotation Mean scores Learning gain items in the SURE survey Q1 (1-10 hrs.) Q4 (>36 hrs.) SURE (Summer Research) Understanding the research process Ability to analyze data Independence Acknowledgements G-OnRamp supported by NIH Grant HG GEP supported by HHMI grant # , NSF grant # and WUSTL. Galaxy supported by NIH grant HG and GWU. UCSC Track Hubs in Galaxy Many datatypes supported: Bed, BigWig, Bam, Bed Simple Repeats, GFF3, GTF, and more coming soon TrackHub Datatype in any Galaxy: Give your data to Hub Archive Creator, and visualize them on the UCSC Track Hub PR pending for Galaxy Access your Track Hub files from anywhere: Use the Hub Archive Creator in Galaxy Download the Track Hub structure Move it to any server you want, the way you want G-OnRamp available today in Beta: On our servers Get the workflow, give your data and visualize in the UCSC Track Hub Feedback welcome at Future plans Extend the G-OnRamp workflow for analysis of functional genomic data: ChIP-seq for finding transcription factor binding sites DNase-seq/ATAC-seq for finding open chromatin regions Bisulfite sequencing to identify methylated sites Develop integrated and interactive Web-based tools and visualizations for: Viewing annotation evidence Distributed and collaborative annotations, including reconciling annotations from multiple individuals Make it easy for individuals to use and install G-OnRamp on: Public national cybercomputing infrastructure such as CyVerse, XSEDE, and JetStream Commercial cloud computing platforms such as Amazon Virtual machines on local computers using preinstalled packages GEP gene annotation workflow for Galaxy for *any* genome We have developed a comprehensive Galaxy workflow that produces multiple complementary datasets to facilitate the annotation of any genome: Gene prediction models from several different predictors Expression data from RNA-Seq (i.e. read coverage, splice junctions, assembled transcripts) Homology results from BLAST Repetitive regions such as transposons, simple repeats, low complexity repeats, and tandem repeats Homology Repeat Regions RNA-seq Gene Predictions Year joined