{ Web Apollo A Web-based Genomics Annotation Editing Platform Ed Lee, Gregg Helt, Justin Reese, Monica Munoz-Torres*, Christopher Childers, Rob Buels,

Slides:



Advertisements
Similar presentations
Trellis DAS/2 Server Framework Gregg Helt. DAS/2 Overview Same goal and overall strategy as DAS1 – HTTP transport, URL queries, XML responses – RESTful.
Advertisements

Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov.
Genome Browsers Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
15 Chapter 15 Web Database Development Database Systems: Design, Implementation, and Management, Fifth Edition, Rob and Coronel.
A Web-based Collaboratory for Supporting Environmental Science Research Xiaorong Xiang Yingping Huang Greg Madey Department of Computer Science and Engineering.
14 1 Chapter 14 Database Connectivity and Web Development Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Chapter 14 Database Connectivity and Web Technologies
Charlie Crocker Vice President Farallon Geographics, Inc. An Overview of Internet Mapping Technology.
Web-based Query & Reporting System for Software User Consultant Richard Knowles Rutgers University Electrical & Computer Engineering Mentors: Amy Chen.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Version Enterprise Architect Redefines Modeling in 2006 An Agile and Scalable modeling solution Provides Full Lifecycle.
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
NGS Analysis Using Galaxy
GenSAS: Genome Sequence Annotation Server, a Tool for Online Annotation and Curation Dorrie Main, Taein Lee, Ping Zheng, Sook Jung, Stephen P. Ficklin,
Create with SharePoint 2010 Jen Dodd Sr. Solutions Consultant
GMOD in the Cloud Genome Informatics November 3, 2011 Scott Cain GMOD Project Coordinator Ontario Institute for Cancer Research
INTRODUCTION TO WEB DATABASE PROGRAMMING
Architecture Of ASP.NET. What is ASP?  Server-side scripting technology.  Files containing HTML and scripting code.  Access via HTTP requests.  Scripting.
Research sponsored by Mathematics, Information and Computational Sciences Office U.S. Department of Energy Al Geist Jens Schwidder David Jung Computer.
Customized cloud platform for computing on your terms !
Other User Interface Highlights Configurable Undo/Redo History: transcript based, each transcript has a separate history for Undo/Redo Real-time updating:
Geneva, 30 October 2009 Giuseppe Sindoni, Istat, Italy An online system for multi-channel, register-based census data collection.
1 UTGB Shell An Open-Source Browser Framework for the Integration of Biological Data Taro L. Saito, Shin Sasaki, Budrul Ahsan and.
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
Comparative Genomics Tools in GMOD GMOD.org Dave Clements 1, Sheldon McKay 2, Ken Youns-Clark 2, Ben Faga 3, Scott Cain 4, and the GMOD Consortium 1 National.
Fundamentals of Database Chapter 7 Database Technologies.
The Hymenoptera Genome Database (HGD, is an informatics resource supporting genomics of hymenopteran insect species. It currently.
History tracking, including browsing of an annotation's edit history and full undo/redo functions Real-time updating: edits in one client are instantly.
WebApollo: A Web-Based Sequence Annotation Editor for Community Annotation Ed Lee, Gregg Helt, Nomi Harris, Mitch Skinner, Christopher Childers, Justin.
WebApollo extending JBrowse to support DAS & genomic annotation editing Gregg Helt, Ed Lee, Nomi Harris, Mitch Skinner, Suzanna Lewis, Ian Holmes Lawrence.
© 2008 IBM Corporation ® IBM Cognos Business Viewpoint Miguel Garcia - Solutions Architect.
HyperContent 2.0 Common Solutions Group September 21, 2005 Alex Vigdor, Columbia University.
Three’s a crowd-source: Observations on Collaborative Genome Annotation. Monica Munoz-Torres, PhD via Suzanna Lewis Biocurator & Bioinformatics Analyst.
GMOD: Managing Genomic Data from Emerging Model Organisms Dave Clements 1, Hilmar Lapp 1, Brian Osborne 2, Todd J. Vision 1 1 National Evolutionary Synthesis.
Apollo Future Plans Nomi Harris, BDGP/FlyBase GMOD Meeting, Cambridge April 27, 2004.
CERN - IT Department CH-1211 Genève 23 Switzerland t DB Development Tools Benthic SQL Developer Application Express WLCG Service Reliability.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Got genom e? Community Meetings GMOD.org The GMOD community meets semi- annually to discuss GMOD components, best practices,
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
1 MSCS 237 Overview of web technologies (A specific type of distributed systems)
The New Website of the Gene Ontology Consortium Seth Carbon Chris Mungall, PhD Monica Munoz-Torres, PhD Genomics Division,
BRUDNO LAB: A WHIRLWIND TOUR Marc Fiume Department of Computer Science University of Toronto.
Tengcha – generic middleware for retrieving data from Chado Justin Reese GMOD Meeting April 5, 2012.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GMOD Meeting August 6-7, 2009 Oxford, UK Scott Cain, PhD. GMOD Project Coordinator Ontario Institute for Cancer Research
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
14 1 Chapter 14 Web Database Development Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Web Technologies Lecture 8 Server side web. Client Side vs. Server Side Web Client-side code executes on the end-user's computer, usually within a web.
Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Implementation of SCENS Yan Zhao. Current Status Current implementation is web-based –
Introduction and Principles Web Server Scripting.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
Executive Overview. Software modeling is essential, because it is the map that guides your developers. Additionally: Modeling Software  Visual information.
The Bovine Genome Database Abstract The Bovine Genome Database (BGD, facilitates the integration of bovine genomic data. BGD is.
The Virtual Observatory and Ecological Informatics System (VOEIS): Using RESTful architecture and an extensible data model to provide a unique data management.
BUSINESS SENSITIVE 1 SAAW - Sequence Annotation and Analysis Workshop Boyu Yang and Gene Godbold Battelle Memorial Institute, Charlottesville Operations.
Visualizing data from Galaxy
Copyright c 2004 OSIsoft Inc. All rights reserved. Visualizing Performance Management Managing Information with RtPortal Gregg Le Blanc - OSIsoft Brian.
Canadian Bioinformatics Workshops
Why Create a PGDB? Perform pathway analyses as part of a genome project Analyze omics data Create a central public information resource for the organism,
Genome Browsers.
Database Driven Websites
The Re3gistry software and the INSPIRE Registry
The Celera Genome Browser: A Tool for Visualizing and Annotating the Human Genome
Ensembl Genome Repository.
got genome? Community Meetings Databases Training GMOD.org
Database Connectivity and Web Development
Yating Liu July 2018 G-OnRamp workshop
Presentation transcript:

{ Web Apollo A Web-based Genomics Annotation Editing Platform Ed Lee, Gregg Helt, Justin Reese, Monica Munoz-Torres*, Christopher Childers, Rob Buels, Lincoln Stein, Ian Holmes, Christine Elsik, Suzanna Lewis Biocuration 2013 | Cambridge, UK Lawrence Berkeley National Laboratory, Joint Genome Institute, for the US Department of Energy at UCB

 The first real-time, collaborative genomics annotation editor on the Web  Easy-to-use environment for multiple, distributed users to review, update, and share genome feature markups Web Apollo is:

The need for an updated tool Assembly Manual annotation Experimental validation Automated Annotation Requires optimized genome visualization and editing tools More researchers involved Cheaper sequencing More genomes being sequenced High throughput RNA-seq and improved automated annotation (more assembly errors) (lack of gold standard gene structure training data) The democratization of genome-scale sequencing calls for a new kind of annotation editing tool.

 Allows:  Access to computational analysis & experimental evidence  Manual curation  Includes:  Intuitive and varied tools  Compatibility with GMOD  Is:  Widely used (initially designed for centralized, resource-rich projects). Desktop Apollo

 BUT…  Requires Apollo Download & Chado Install  Annotation saved locally, in flat files; no support for sharing  One annotator at a time Desktop Apollo

 Annotations saved directly to a centralized database  Java Web Start downloaded Apollo software more transparently  BUT…  Must load all data for a region at once  Edits from other users not visible without reloading  Potential issues with stale annotation data  Needs Java Installation Java Web Start Apollo, an Improvement

 No downloads required Web Apollo: Collaborative Annotation  Web-based  Annotations saved to centralized database  Edit server mediates multiple user edits  Uses dynamic (lazy) data loading: only the region of interest  Real-time annotation updates  Customizable to meet researchers’ needs: rules, appearance, etc.  Supports User Authentication & Authorization:  Read, Edit, Review, Complete, Publish (Export) annotations  Automatically promote tracks

BAM BigWig GFF3 VCF* Web Apollo JBrowse visualization (Javascript) Apollo Edit Operations & User Management Trellis Data Broker (Java) JSON Static Data Generation Pipeline (Perl) Server-side Data Service Annotation Editing Engine (Java) Berkeley DB temporary store User Management User Interface Data Sources Analysis Pipelines - BAM - BED - BigWig - GFF3 - MAKER output* Data Repositories Chado MySQL DAS servers Annotation Exports Chado GFF3 FASTA Permanent store Annotators(Javascript) Web Apollo Architecture

 Plug-in to JBrowse  Javascript genome annotation browser  Fast and responsive  Highly interactive  Visit P.93 Web-based Client

 Extensions of JBrowse track features:  GUI for editing annotations  2 new kinds of tracks:  annotation editing  sequence alteration editing  Selection of features & sub-features  Dragging  Edge-matching  Communicates with annotation editing engine and data providing service.  Sends ‘Edit’ operations to the server, lets it decide what to do, server makes the ‘Edit’, pushes back to all clients * Web-based Client

 The server:  Java servlet  GBOL data model: object model & API, based on the Chado schema  The editing logic is in the server:  selects longest ORF as CDS  flags non-canonical splice sites  Plug-in architecture for sequence alignment searches: BLAT  Uses BerkeleyDB  Stores Annotations, Edits, History  Supports Real Time Collaboration Annotation Editing Engine

Server-side Data Service

 Trellis  A data broker with plug-in architecture for both output formats and back-end data stores  Web Apollo support is implemented as plug-in that outputs JSON format  Also has output plug-ins for GFF3 & BED  On the back-end, we implemented 3 plug-ins for:  UCSC MySQL genome database  Chado  DAS servers (e.g.: Ensembl)

Further customization

 Ability to annotate regulatory regions & features  Collapsing and expanding tracks  Sticky ‘User Annotations’ track  Genome slicing: annotating across contigs  Folding of intronic space Future Enhancements

 Release   Demo Site   At GMOD  Releases & Demo

 Web Client and Static Data Generation Pipeline   Annotation editing server    Trellis Data access server  Source Code (BSD License)

 To all our users & contributors! Especially:  Code: Mitch Skinner, Nomi Harris, Thomas Down, Carson Holt.  Feedback: Sue Brown, Sanjay Chellapilla, Daniel Ence, Juergen Gadau, Nicolae Herndon, Elisabeth Huguet, Carolyn Lawrence, Sasha Mikheyev, Barry Moore, Jan Oettler, Xiang Qin, Lukas Schrader, Kim Worley, Mark Yandell, Jing-Jiang Zhou. File reformatting: Anna Bennett.  To our funding agencies:  NIH: NIGMS and NHGRI.  DOE: Office of the Director, Office of Science, Office of Basic Energy Sciences. Thanks