The EuPathDB / GUS-WDK Search Strategy System


Cristina Aurrecoechea¹, Brian P. Brunk², Steve Fischer², Xin Gao², Omar S. Harb², Mark Heiges¹, Jessica C. Kissinger¹, Eileen T. Kraemer¹, Cary Pennington¹, David S. Roos², Chris Ross¹, Christian J. Stoeckert² & Charles Treatman²
¹ Univ. Georgia, Athens GA, & ² Univ. Pennsylvania, Philadelphia PA

User perspectives on Strategies
- Computer-human interaction (CHI) studies during prototyping drove the design and showed high user enthusiasm.
- Usage statistics show a 3-fold increase in the use of Booleans in the two months since release.
- User feedback has been very positive.

WDK Implementation
- Runs on any relational database schema.
- Model: configured by you in XML. Abstracts the database to high-level Records (Genes, ORFs, etc.). Also specifies queries and returned columns. Automated sanity testing. Can talk to processes (e.g., BLAST) via a web services framework.
- View: Tomcat, JSP, tag library, JavaScript, Ajax, CSS. You embed JSP tags in your site and style them with CSS.
- Controller: Struts.

WDK Upcoming features
- Add genes to a "basket" to generate a report, add to a strategy as a step, or send to a tool (e.g., multiple sequence alignment).
- Web services access to queries.
- Assign weights to results from individual steps for improved filtering.
- Transform a set of one type into another type based on genome span relations.

The EuPathDB suite of genome database web sites recently introduced a graphical search interface that motivates users to undertake dynamic computational experiments, exploring relationships across datasets to identify biologically meaningful genes and other entities. For example, users seeking novel therapeutic targets may wish to prioritize putative enzymes that distinguish pathogens from their hosts and are expressed during appropriate developmental stages.
Strategies are initiated by running one of 80+ queries, and extended by adding further searches, linked via Boolean operators represented graphically as Venn diagrams. Sub-strategies allow modular construction and tree structures, and searches may be extended using filters (e.g., by strain or species) and transforms (e.g., orthologs). A graphical display makes the overall logic obvious and facilitates revision of individual steps, with changes propagated forward through the strategy. Users may name and save their strategies, creating protocols that can be shared with colleagues. (See, e.g., …)

The strategy system has been subjected to extensive usability studies and deployed on all EuPathDB databases (CryptoDB, GiardiaDB, PlasmoDB, ToxoDB, TrichDB and TriTrypDB). Although these sites have offered text-based Boolean operations for many years, usability analysis indicated that most users were not taking full advantage of that feature. Following release of the graphical Search Strategy system, the number of searches per visit increased dramatically. Response from our user community has been extremely positive, as investigators have discovered the power of combining datasets and making dynamic adjustments to define optimal parameters and highlight biologically relevant relationships. With the accelerating growth in diversity and scale of available datasets, the potential for exploiting interrelationships increases dramatically, and we expect this interface to have a significant impact in bringing "genomic thinking" to a broad audience. This system was developed using the GUS Web Development Kit (WDK), a schema-independent middleware system for generating genomics websites.
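The strategy logic described above (searches combined by Boolean operators, nested sub-strategies, transforms, and revisions that propagate forward) can be sketched as a small tree of steps. This is only an illustrative sketch, not the WDK implementation: the `Step` class, the `SEARCH_RESULTS` and `ORTHOLOGS` tables, the search names, and all gene IDs are hypothetical, and results are plain in-memory sets rather than database queries.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Set

# Hypothetical search results; gene IDs are invented for illustration.
SEARCH_RESULTS: Dict[str, Set[str]] = {
    "transferases": {"geneA", "geneB", "geneC"},
    "kinase_activity": {"geneC", "geneD"},
    "expr_24h": {"geneC"},
    "expr_trophozoites": {"geneD", "geneE"},
}

# Hypothetical ortholog table used by the transform step.
ORTHOLOGS: Dict[str, Set[str]] = {
    "geneC": {"orthC1", "orthC2"},
    "geneD": {"orthD1"},
}


@dataclass
class Step:
    """One node of a strategy tree: a search, a Boolean combination, or a transform."""
    kind: str                       # "search", "union", "intersect" or "orthologs"
    search: Optional[str] = None    # name of the search (leaf steps only)
    left: Optional["Step"] = None   # first input step
    right: Optional["Step"] = None  # second input step (Boolean steps only)

    def result(self) -> Set[str]:
        # Results are recomputed on every call, so a revised step
        # automatically affects every step downstream of it.
        if self.kind == "search":
            return set(SEARCH_RESULTS[self.search])
        if self.kind == "orthologs":
            genes = self.left.result()
            return set().union(*(ORTHOLOGS.get(g, set()) for g in genes))
        a, b = self.left.result(), self.right.result()
        if self.kind == "union":
            return a | b
        if self.kind == "intersect":
            return a & b
        raise ValueError(f"unknown step kind: {self.kind}")


def build_example_strategy():
    """Build (transferases UNION kinases) INTERSECT nested, then transform to orthologs."""
    first = Step("search", search="transferases")
    second = Step("search", search="kinase_activity")
    combined = Step("union", left=first, right=second)
    # A nested sub-strategy is simply another subtree.
    nested = Step(
        "union",
        left=Step("search", search="expr_24h"),
        right=Step("search", search="expr_trophozoites"),
    )
    filtered = Step("intersect", left=combined, right=nested)
    final = Step("orthologs", left=filtered)
    return second, final


if __name__ == "__main__":
    second, final = build_example_strategy()
    print(sorted(final.result()))
    # Revise an early step; the final result changes without rebuilding the tree.
    second.search = "expr_24h"
    print(sorted(final.result()))
```

In this sketch, propagation falls out of recomputing each result on demand. A production system would more plausibly cache step results and invalidate them when an upstream step is revised; that caching is an assumption here and is not shown.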
The EuPathDB suite of databases covers genomic and functional genomics datasets for a variety of eukaryotic pathogens. Shown here is PlasmoDB, which covers the genus Plasmodium, including P. falciparum, the malaria parasite.

Use Case
Use data in PlasmoDB to find parasite (Plasmodium) drug target genes. This panel shows a schematic of a strategy, using queries and Booleans. The actual strategy is built below.
  Transferases (E.C.)
  [union] Kinase activity (GO)
  [intersect]
  [intersect] present in Haemosporida, not Mammals
  [intersect] not under diversifying selection (SNPs)
  [transform] orthology to any Plasmodium genes

It's Easy to Build a Strategy…
1. Run a query (choose from menu).
2. Add a step (another query).
3. Add more steps…

Build a Strategy
Revise steps at any time. Changes propagate forward.

…Strategies are Powerful
- A strategy can integrate data from genome annotation, expression, SNPs, proteomics, etc.
- Nest strategies to add complexity.
- View results from all or any species.
- Use orthology to transform results to other species.
- Download customized reports of results.
- Choose from many available columns. Sort and move columns.
- Dynamically revise, add or delete steps.
- Save and browse strategies.
- … a strategy link to colleagues.

[Architecture diagram labels: Genomics Database; WDK Engine; Query Cache; Genomics Data Denormalized for Query Speed; User Login and Search History; WDK Model (XML); WDK Model (Java Objects); WDK Query Engine (Java); Web Services Framework; JavaBeans (JSP compatible); JSP Tag Library; Struts controller; WDK Sanity Test]

Challenge: exploit the power of integrated genome annotation, expression data, proteomics data, SNPs, etc.
Solution: Strategies… A Graphical Query Interface for Genomics Databases

Nested Strategy (#)
  P.f. transcript expr. at 24 hours +/- 8
  [union] P.f. transcript expr. in Trophozoites
  [union] P.f. protein expr. in Trophozoites

Different types of strategies: Genes, Isolates, SNPs, Transcript assemblies, Chromosomes, Array Elements, ORFs, etc.

[Diagram legend (color key not preserved): You provide; WDK provides; Optional. Diagram labels: Strategies; Web Dev Kit (WDK); JSP and CSS; Processes (e.g., BLAST)]

EuPathDB is an NIAID Bioinformatics Resource Center. Supported by NIAID Contract No. HHSN C and The Bill & Melinda Gates Foundation.