Other User Interface Highlights Configurable Undo/Redo History: transcript based, each transcript has a separate history for Undo/Redo Real-time updating:

Slides:

Advertisements

Similar presentations

MICHAEL MARINO CSC 101 Whats New in Office Office Live Workspace 3 new things about Office Live Workspace are: Anywhere Access Store Microsoft.

Advertisements

A Toolbox for Blackboard Tim Roberts

Sharpdesk Overview Desktop Composer Search Imaging

Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov.

The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.

Genome Browsers Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.

BIOCMS: Resource Integration and Web Application Framework for Bioinformatics DHUNDY R BASTOLA †, *, ANIL KHADKA †, MOHAMMAD SHAFIULLAH † AND HESHAM ALI.

Computer Science 101 Web Access to Databases Overview of Web Access to Databases.

Maintaining and Updating Windows Server 2008

Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.

NGS Analysis Using Galaxy

Chapter 11 Adding Media and Interactivity. Flash is a software program that allows you to create low-bandwidth, high-quality animations and interactive.

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall.

Linux Operations and Administration

Computing for Bioinformatics Introduction to databases What is a database? Database system components Data types DBMS architectures DBMS systems available.

Architecture Of ASP.NET. What is ASP?  Server-side scripting technology.  Files containing HTML and scripting code.  Access via HTTP requests.  Scripting.

Customized cloud platform for computing on your terms !

Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.

{ Web Apollo A Web-based Genomics Annotation Editing Platform Ed Lee, Gregg Helt, Justin Reese, Monica Munoz-Torres*, Christopher Childers, Rob Buels,

Title: GeneWiz browser: An Interactive Tool for Visualizing Sequenced Chromosomes By Peter F. Hallin, Hans-Henrik Stærfeldt, Eva Rotenberg, Tim T. Binnewies,

Office Live Workspace Visio 2007 Outlook 2007 Groove 2007 Access 2007 Excel 2007 Word 2007.

London April 2005 London April 2005 Creating Eyeblaster Ads The Rich Media Platform The Rich Media Platform Eyeblaster.

Advanced Level Course. Site Extras Site Extras consist of four categories: Stationeries Site Trash Designs Components.

London April 2005 London April 2005 Creating Eyeblaster Ads The Rich Media Platform The Rich Media Platform Eyeblaster.

New Features in Release 9.2 (July 27, 2009). 2 Release 9.2 New Features Updated Shopping Experience Home/Shop page Shop at the top search New Hosted Supplier.

Fundamentals of Database Chapter 7 Database Technologies.

Tutorial 121 Creating a New Web Forms Page You will find that creating Web Forms is similar to creating traditional Windows applications in Visual Basic.

The Hymenoptera Genome Database (HGD, is an informatics resource supporting genomics of hymenopteran insect species. It currently.

Session 1 SESSION 1 Working with Dreamweaver 8.0.

History tracking, including browsing of an annotation's edit history and full undo/redo functions Real-time updating: edits in one client are instantly.

GUI For A Virtual Pipeline Simulation Testbed By, Revathi Manni Ranganathan Major Professor: Dr.Virgil Wallentine.

Use cases for Tools at the Bovine Genome Database Apollo and Bovine QTL viewer.

Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.

WebApollo: A Web-Based Sequence Annotation Editor for Community Annotation Ed Lee, Gregg Helt, Nomi Harris, Mitch Skinner, Christopher Childers, Justin.

WebApollo extending JBrowse to support DAS & genomic annotation editing Gregg Helt, Ed Lee, Nomi Harris, Mitch Skinner, Suzanna Lewis, Ian Holmes Lawrence.

CHAPTER TEN AUTHORING.

Three’s a crowd-source: Observations on Collaborative Genome Annotation. Monica Munoz-Torres, PhD via Suzanna Lewis Biocurator & Bioinformatics Analyst.

Apollo Future Plans Nomi Harris, BDGP/FlyBase GMOD Meeting, Cambridge April 27, 2004.

Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,

Browsing the Genome Using Genome Browsers to Visualize and Mine Data.

WebSphere Portal Technical Conference U.S Creating Rich Internet (AJAX) Applications with WebSphere Portlet Factory.

The New Website of the Gene Ontology Consortium Seth Carbon Chris Mungall, PhD Monica Munoz-Torres, PhD Genomics Division,

Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.

Tengcha – generic middleware for retrieving data from Chado Justin Reese GMOD Meeting April 5, 2012.

Sackler Medical School

NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.

9 Systems Analysis and Design in a Changing World, Fourth Edition.

Annotator Interface Sharon Diskin GUS 3.0 Workshop June 18-21, 2002.

EBI is an Outstation of the European Molecular Biology Laboratory. Gautier Koscielny VectorBase Meeting 08 Feburary 2012, EBI VectorBase Text Search Engine.

Web Apollo Resources at the National Agricultural Library Christopher Childers NAL ARS USDA i5k.nal.usda.gov.

August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.

A guided tour of Ensembl This quick tour will give you an outline view of what Ensembl is all about. You will learn: –Why we need Ensembl –What is in the.

Protein sequence databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen This also includes old material from my thesis

UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.

Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.

Welcome to the combined BLAST and Genome Browser Tutorial.

The Bovine Genome Database Abstract The Bovine Genome Database (BGD, facilitates the integration of bovine genomic data. BGD is.

TRACKSTER &CIRCSTER DEMO Slides: /g/funcgen/trainings/visualization/Demos/Trackster+Circster.ppt Galaxy: Galaxy Dev:

Maintaining and Updating Windows Server 2008 Lesson 8.

BUSINESS SENSITIVE 1 SAAW - Sequence Annotation and Analysis Workshop Boyu Yang and Gene Godbold Battelle Memorial Institute, Charlottesville Operations.

Wednesday NI Vision Sessions

Visualizing data from Galaxy

Basics of Genome Annotation Daniel Standage Biology Department Indiana University.

Canadian Bioinformatics Workshops

1 RIC 2009 Symbolic Nuclear Analysis Package - SNAP version 1.0: Features and Applications Chester Gingrich RES/DSA/CDB 3/12/09.

Web Apollo/JBrowse • JBrowse is a web based genome browser

N. Capp, E. Krome, I. Obeid and J. Picone

The Celera Genome Browser: A Tool for Visualizing and Annotating the Human Genome

Yating Liu July 2018 G-OnRamp workshop

Presentation transcript:

Other User Interface Highlights Configurable Undo/Redo History: transcript based, each transcript has a separate history for Undo/Redo Real-time updating: edits in one client are instantly pushed to all other clients Edge Matching: when annotation is selected, any other annotations with edges that match coordinates of selected annotation are highlighted. Tracking of non-canonical splice sites in curated annotations Ability to add comments, either chosen from a pre-defined set of comments or as freeform text. Can set start of translation for a transcript or let server determine automatically Convenient management of user login, authentication, and edit permissions Two-stage curation process: edit, then publish Summary As technical advances make sequencing faster and cheaper, genomic annotation efforts must adapt to keep pace. The upward trend in the number of genome sequencing projects means there will be a larger reliance on contributions from domain specialists. Thus the curation environment is shifting from a traditional centralized model, in which all curators for a given genome project share the same physical location, to a geographically dispersed community annotation model—which requires new tools to support community annotation efforts. WebApollo was designed to provide an easy to use, web-based environment that allows multiple distributed users to edit and share sequence annotations. Curators and investigators from the bee genome research community are currently beta-testing WebApollo. Their curation efforts, findings and interactions will dramatically upgrade the quality of the annotation data for the genomes of honeybee (Apis mellifera), and two bumble bees (Bombus impatiens and Bombus terrestris), which will lead to a better understanding of the biology of these social insects. WebApollo is comprised of three components: a web-based client, a server-side annotation editing engine, and a server-side service that provides the client with data from different source databases. All three software components are open source and freely available. The web-based client is designed as an extension to JBrowse, a JavaScript-based genome browser that provides a fast, highly interactive interface for the visualization of genomic data. This JBrowse extension provides the gestures needed for editing annotations, such as dragging and dropping features to create new annotations of genes, transcripts and other genomic elements, dragging to change exon boundaries of existing annotations, and using context-specific menus to modify features. It has support for deep sequencing visualization (e.g., BAM data). The extension also connects to the annotation-editing service and the data-providing services. The server-side annotation-editing engine is written in Java. It handles all the necessary logic for editing and deals with the complexities of modifications in a biological context, where a single change can have multiple cascading effects (e.g., when splitting or merging transcripts). Edits are stored persistently in the server, allowing users to quickly recover their data in the event of unexpected browser or server crashes. The server provides synchronized updates over multiple browser instances, so that every edit is immediately visible to all users who are viewing or editing the same region. It offers multiple levels of user accessibility, allowing project owners to decide with whom to share their work, and whether to allow read-only or both read and write access. The server-side service that provides data to the client is built on top of Trellis, a Distributed Annotation System (DAS) server framework. It sends JBrowse-supported JavaScript Object Notation (JSON) data, rather than the more verbose DAS XML. We developed Trellis plugins to access data from the UCSC MySQL genome database, Ensembl DAS services, and GMOD Chado databases. WebApollo will be publicly released in the 4rth quarter of 2012 Public demo at More info at WebApollo: A Web-based Sequence Annotation Editor for Distributed Community Annotation WebApollo: A Web-based Sequence Annotation Editor for Distributed Community Annotation Ed Lee 1, Gregg Helt 1, Nomi Harris 1, Robert Buels 3, Christopher Childers 2, Justin Reese 2, Mónica Muñoz-Torres 2, Christine Elsik 2, Ian Holmes 3, and Suzanna Lewis 1 1) Berkeley Bioinformatics Open-source Projects, Lawrence Berkeley National Laboratory, Berkeley, CA; 2) Georgetown University, Washington DC; 3) University of California at Berkeley, Berkeley, CA Shown below is WebApollo displaying tracks of genomic annotations along a small region of a scaffold from Honey Bee (Apis Mellifera) genome assembly 4.5. In the top track are in-progress gene models being created and edited in WebApollo. Results from various gene prediction programs are shown in yellow & orange. Results from MAKER, an analysis pipeline that builds consensus results from a number of the other computational analyses, is displayed in teal. The Official Gene Set, created using GLEAN, is shown in green. Transcripts from the NCBI RefSeq database are shown in pink. Aligned ESTs are shown in purple Results from high throughput sequencing (RNA-Seq) are displayed as assembled contigs (using exonerate) in dark blue, and as coverage graphs for two different experiments at the bottom. This region illustrates the problem with relying solely on computational analyses for determining gene structures. Of the gene prediction results, no program is in complete agreement with any other in calling intron-exon boundaries across the region. MAKER, the Official Gene Set, and RefSeq also disagree. Results like these are common, and curators and tools to enable curation are needed to manually resolve these differences in order to create a more accurate gene set for the sequenced organism. WebApollo allows curators to build gene models via an intuitive drag- and-drop user interface. Curators can create an initial annotation based on any computational result, then add or delete exons, extend exons, and merge or split transcripts. Manual Curation of Gene Structures: a crucial component of genome analysis Curating Genomic Sequence Alterations Genome assemblies with lower sequencing coverage can often have small errors in the assembled genomic sequence. These are often first spotted by curators, when the errors cause problems with gene structures. To enable curation in the presence of genome sequencing errors, WebApollo allows curators to annotate genomic sequence insertions, deletions, and substitutions. These annotations do not alter the underlying assembly on the server. However they are taken into account when determining the sequence of an annotation. The above image shows the results of a series of sequence alteration editing operations in WebApollo. The top panel shows no sequence alterations, but the transcript annotation (in blue) is flagged with an orange exclamation icon which indicates that the curated intron-exon junction does not follow the canonical splice site pattern of having a "GT" immediately 3' of the junction. In the second panel a curator has examined this issue and determined that a base was mis-called in the assembly, and has therefore added a substitution annotation (shown in yellow), substituting a "T" for a "C". This change immediately triggers removal of the non-canonical warning icon, since with the substitution the splice junction now has the canonical "GT". In the third panel a curator has created a sequence insertion annotation (shown in green) upstream of the splice, and for the transcript annotation this leads to a stop codon which truncates the CDS. In the last panel a sequence deletion annotation has been created (shown in red), which causes a frame shift for the annotation transcript, resulting in reversal of the CDS truncation. This work was supported by the National Institutes of Health grant numbers 5R01GM from the National Institute of General Medical Sciences; and by the Director, Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231 Near-Future Plans Test with additional genomes before public release. Better documentation of both user interface and server components Add ability to search for a region by sequence residues Loading data directly from GFF3 files, both remotely and from user's local machine. Enable curation of additional annotation types (e.g. tRNAs) Enable adding DB-xrefs (e.g. for GO functional annotation) User track configuration to set annotation colors, height, etc. Hierarchical organization of tracks Public release in fourth quarter of 2012 Data Sources BAM Files high-throughput sequencing results BAM Files high-throughput sequencing results Wiggle Files Wiggle Files Trellis Data Broker GMOD Chado Postgres Databases (read only) UCSC MySQL Genome Database UCSC MySQL Genome Database Distributed Annotation System (DAS) Servers WebApollo Editing Server BerkeleyDB Temporary Store User Management Database User Management Database GMOD Chado Persistent Store Jbrowse File Pre-processor Jbrowse File Pre-processor GFF3 Files GFF3 Files Taking Advantage of Deep RNA Sequencing Taking Advantage of Deep RNA Sequencing Incorporation of high-throughput RNA sequencing data is rapidly becoming a requirement for thorough genome structure curation. WebApollo can display high-throughput RNA sequencing data in different ways, as shown here. The genomic coverage plot in dark blue was generated by server-side pre-processing of RNA-seq aligned reads, and shows for each base position in the genome how many aligned reads cover that position. WebApollo can also display individual aligned reads as shown in teal. Due to the large number of reads that can be generated by a single RNA-seq experiment, the aligned reads are typically stored in a binary file format (BAM) for efficient storage and retrieval. WebApollo is capable of using BAM file indexing to efficiently access only the slices of the BAM file needed for the current view. This allows WebApollo to directly load aligned reads from BAM files on any standard web server. The example shown highlights the importance of utilizing deep RNA sequencing for curation. A previously unknown alternatively spliced form of an annotated gene is indicated by a number of RNA seq reads that skip one particular exon. This example also highlights the usefulness of selection edge-matching (in red). Edit Publish