The CIPRES Portal: Current Status and Future Plans

Slides:



Advertisements
Similar presentations
Building Portals to access Grid Middleware National Technical University of Athens Konstantinos Dolkas, On behalf of Andreas Menychtas.
Advertisements

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Enabling Phylogenetic Research via the CIPRES Science Gateway Wayne Pfeiffer.
ProActive Task Manager Component for SEGL Parameter Sweeping Natalia Currle-Linde and Wasseim Alzouabi High Performance Computing Center Stuttgart (HLRS),
St Testing, Simulation and Monitoring (actually mostly simulation) Stephen Hillier Joint Meeting, Mainz, June 2001.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Biology.sdsc.edu CIPRes in Kepler: An integrative workflow package for streamlining phylogenetic data analyses Zhijie Guan 1, Alex Borchers 1, Timothy.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Creating the CIPRES Science Gateway for Inference of Large Phylogenetic Trees Mark A. Miller San Diego Supercomputer Center.
The BioBox Initiative: Bio-ClusterGrid Gilbert Thomas Associate Engineer Sun APSTC – Asia Pacific Science & Technology Center.
Christopher Jeffers August 2012
Cloud Usage Overview The IBM SmartCloud Enterprise infrastructure provides an API and a GUI to the users. This is being used by the CloudBroker Platform.
COMP 410 & Sky.NET May 2 nd, What is COMP 410? Forming an independent company The customer The planning Learning teamwork.
Cluster Computing Applications for Bioinformatics Thurs., Aug. 9, 2007 Introduction to cluster computing Working with Linux operating systems Overview.
CountryData Development Improving the collation, availability and dissemination of development indicators (including the MDGs) Nairobi, 27 November 2013.
COMP 6471 Software Design Methodologies Winter 2006 Dr Greg Butler
CSCI 6962: Server-side Design and Programming Introduction to Java Server Faces.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
SEMESTER PROJECT PRESENTATION CS 6030 – Bioinformatics Instructor Dr.Elise de Doncker Chandana Guduru Jason Eric Johnson.
Authors Project Database Handler The project database handler dbCCP4i is a small server program that handles interactions between the job database and.
1 PyMOL Evolutionary Trace Viewer 1.1 Lichtarge Lab Sept. 13, 2010.
Wenjing Wu Computer Center, Institute of High Energy Physics Chinese Academy of Sciences, Beijing BOINC workshop 2013.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
SEE-GRID-SCI The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no.
EMBOSS over a Grid 1. 1st EELA Grid School December 4th of 2006 Eduardo MURRIETA LEON Romualdo ZAYAS-LAGUNAS Pierre-Alain BRANGER Jérôme VERLEYEN Roberto.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation NEES TeraGrid.
Database search. Overview : 1. FastA : is suitable for protein sequence searching 2. BLAST : is suitable for DNA, RNA, protein sequence searching.
From basic Concepts to Advanced applications Molecular Evolution & Phylogeny By Ofir Cohen The Bioinformatics Unit G.S. Wise Faculty of Life Science Tel.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
The National Center for Genomic Analysis Support: creating a national cyberinfrastructure environment for genomics researchers. William Barnett, Thomas.
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Hybrid MPI/Pthreads Parallelization of the RAxML Phylogenetics Code Wayne Pfeiffer.
Biomedical and Bioscience Gateway to National Cyberinfrastructure John McGee Renaissance Computing Institute
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Copyright OpenHelix. No use or reproduction without express written consent1.
Zach Miller Computer Sciences Department University of Wisconsin-Madison Supporting the Computation Needs.
PROTEIN IDENTIFIER IAN ROBERTS JOSEPH INFANTI NICOLE FERRARO.
Elsevier Operative Techniques - Netter Process Flow
Nithyamoorthy S Core Mind Technologies
StoRM: a SRM solution for disk based storage systems
The CIPRES Science Gateway: Enabling High-Impact Science for Phylogenetics Researchers with Limited Resources Mark Miller, Wayne Pfeiffer, and Terri.
CUAHSI HIS Sharing hydrologic data
Shaowen Wang1, 2, Yan Liu1, 2, Nancy Wilkins-Diehr3, Stuart Martin4,5
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
An easier path? Customizing a “Global Solution”
IBM Cognos Analytics Administrator V11 C Questions Answers
Skill Based Assessment
Skill Based Assessment - Entity Framework -
Skill Based Assessment
SRA Submission Pipeline
Skill Based Assessment
Advanced Invitations.
Dev Test on Windows Azure Solution in a Box
Development of the Nanoconfinement Science Gateway
Soo Park and Janine Aquino
Understanding the Visual IDE
Module 01 ETICS Overview ETICS Online Tutorials
Code Analysis, Repository and Modelling for e-Neuroscience
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
Explore Evolution: Instrument for Analysis
Wide Area Workload Management Work Package DATAGRID project
Chapter 13 The Data Warehouse
Basic Local Alignment Search Tool (BLAST)
Rob Straight, Principal Product Manager
DATABASE DESIGN & DEVELOPMENT
Code Analysis, Repository and Modelling for e-Neuroscience
REST Services Data and tools on the Web have been exposed in both WSDL and REST. Taverna provides a custom processor for accessing REST services Peter.
Chapter 5 SNMP Management
Chapter 5 SNMP Management
Introduction to the SHIWA Simulation Platform EGI User Forum,
Overview of Computer system
Presentation transcript:

The CIPRES Portal: Current Status and Future Plans Mark A. Miller Principal Investigator, Biology San Diego Supercomputer Center

CIPRES Software Libraries were created to enable fine-grained communication between programs

CIPRES Portal V 1.0 was created to expose these libraries to the user community

The CIPRES Portal V 1.X parses uploaded input files and provides users with appropriate tool selections for the data set. Results are stored temporarily for download.

Portal 1 Tools Portal 1 Features PAUP (limited options) RAxML +bootstrapping; (“Black box” options) GARLI (limited options) MrBayes (limited options) ClustalW RecIDCM3 boosting for PAUP and RAxML ReST accessible Supports Nexus, Phylip, and Hennig86 formats. Portal 1 Features

Problem CORBA architecture has a very high overhead for adding new data types/services This makes it hard to expose to new command line options, even when they are already available from the command line tool.

CIPRES Portal V 2: New design goals Decrease the overhead by creating a new architecture that can still have “knowledge” about the data, but the data is treated as text strings, and not as CORBA objects. Add new user requested features: Access to most or all native command line options Add new tools more quickly Provide personal user space for storing results

CIPRES Portal V 2.0 was built on a generic portal architecture called The Workbench Framework

All command line parameters can be set.

All command line parameters can be set.

With release of Portal v 2.0: Rate of job submission doubles Rate of Garli use increases 9-fold Rate of MrBayes use increases 5-fold

To expose Command Line Tools quickly, the Workbench Framework uses the PISE XML standard…. <?xml version="1.0" encoding="ISO-8859-1" ?> <!DOCTYPE pise SYSTEM "http://www.phylo.org/dev/rami/PARSER/pise.dtd" [ <!ENTITY nucdbs SYSTEM "http://www.phylo.org/dev/rami/XMLDIR/nucdbs.xml"> <!ENTITY protdbs SYSTEM "http://www.phylo.org/dev/rami/XMLDIR/protdbs.xml"> <!ENTITY blastDBpath SYSTEM "http://www.phylo.org/dev/rami/XMLDIR/blastDBpath.xml"> <!ENTITY fastaDBpath SYSTEM "http://www.phylo.org/dev/rami/XMLDIR/fastaDBpath.xml"> <!ENTITY blocksDBpath SYSTEM "http://www.phylo.org/dev/rami/XMLDIR/blocksDBpath.xml"> <!ENTITY nucDBfasta SYSTEM "http://www.phylo.org/dev/rami/XMLDIR/nucDBfasta.xml"> <!ENTITY protDBfasta SYSTEM "http://www.phylo.org/dev/rami/XMLDIR/protDBfasta.xml"> ]> <pise> <head> <title>TFASTY</title> <version>34t10d3</version> <description>Compare PS to Translated NS Or NS-DB</description> <authors>W. Pearson</authors> <reference>Pearson, W. R. (1999) Flexible sequence similarity searching with the FASTA3 program package. Methods in Molecular Biology</reference> <reference>W. R. Pearson and D. J. Lipman (1988), Improved Tools for Biological Sequence Analysis, PNAS 85:2444-2448</reference> <reference> W. R. Pearson (1998) Empirical statistical estimates for sequence similarity searches. In J. Mol. Biol. 276:71-84</reference> <reference>Pearson, W. R. (1996) Effective protein sequence comparison. In Meth. Enz., R. F. Doolittle, ed. (San Diego: Academic Press) 266:227-258</reference> <category>Protein Sequence</category>

The webtooldev server was created so anyone can create and test new interfaces…

The scalable interface generator allows us to add new tools quickly…

Problem Many users cannot complete their jobs in the 72 hour limit

Under current development…..connect the portal to scalable resources Workbench Framework The TeraGrid resources are faster, and they are scalable. The CIPRES Portal has been awarded 400,000 Tergrid cpu hours, the CIPRES Portal is now a TeraGrid Science Gateway project. We are building the infrastructure to support this access. We have implemented parallel MrBayes, Garli, and RAxML. We are working with developers to deploy restart options, so if a job times out, it can be restarted. TeraGrid (group allocation)