Download presentation
Presentation is loading. Please wait.
Published byMyron Wilcox Modified over 6 years ago
1
The CIPRES Portal: Current Status and Future Plans
Mark A. Miller Principal Investigator, Biology San Diego Supercomputer Center
2
CIPRES Software Libraries were created to enable fine-grained communication between programs
3
CIPRES Portal V 1.0 was created to expose these libraries to the user community
4
The CIPRES Portal V 1.X parses uploaded input files and provides users with
appropriate tool selections for the data set. Results are stored temporarily for download.
5
Portal 1 Tools Portal 1 Features PAUP (limited options)
RAxML +bootstrapping; (“Black box” options) GARLI (limited options) MrBayes (limited options) ClustalW RecIDCM3 boosting for PAUP and RAxML ReST accessible Supports Nexus, Phylip, and Hennig86 formats. Portal 1 Features
6
Problem CORBA architecture has a very high overhead for adding new data types/services This makes it hard to expose to new command line options, even when they are already available from the command line tool.
7
CIPRES Portal V 2: New design goals
Decrease the overhead by creating a new architecture that can still have “knowledge” about the data, but the data is treated as text strings, and not as CORBA objects. Add new user requested features: Access to most or all native command line options Add new tools more quickly Provide personal user space for storing results
8
CIPRES Portal V 2.0 was built on a generic portal architecture called
The Workbench Framework
11
All command line parameters can be set.
12
All command line parameters can be set.
13
With release of Portal v 2.0:
Rate of job submission doubles Rate of Garli use increases 9-fold Rate of MrBayes use increases 5-fold
14
To expose Command Line Tools quickly, the Workbench Framework uses the PISE XML standard….
<?xml version="1.0" encoding="ISO " ?> <!DOCTYPE pise SYSTEM " [ <!ENTITY nucdbs SYSTEM " <!ENTITY protdbs SYSTEM " <!ENTITY blastDBpath SYSTEM " <!ENTITY fastaDBpath SYSTEM " <!ENTITY blocksDBpath SYSTEM " <!ENTITY nucDBfasta SYSTEM " <!ENTITY protDBfasta SYSTEM " ]> <pise> <head> <title>TFASTY</title> <version>34t10d3</version> <description>Compare PS to Translated NS Or NS-DB</description> <authors>W. Pearson</authors> <reference>Pearson, W. R. (1999) Flexible sequence similarity searching with the FASTA3 program package. Methods in Molecular Biology</reference> <reference>W. R. Pearson and D. J. Lipman (1988), Improved Tools for Biological Sequence Analysis, PNAS 85: </reference> <reference> W. R. Pearson (1998) Empirical statistical estimates for sequence similarity searches. In J. Mol. Biol. 276:71-84</reference> <reference>Pearson, W. R. (1996) Effective protein sequence comparison. In Meth. Enz., R. F. Doolittle, ed. (San Diego: Academic Press) 266: </reference> <category>Protein Sequence</category>
15
The webtooldev server was created so anyone can create and test new interfaces…
16
The scalable interface generator allows us to add new tools quickly…
17
Problem Many users cannot complete their jobs in the 72 hour limit
18
Under current development…..connect the portal to scalable resources
Workbench Framework The TeraGrid resources are faster, and they are scalable. The CIPRES Portal has been awarded 400,000 Tergrid cpu hours, the CIPRES Portal is now a TeraGrid Science Gateway project. We are building the infrastructure to support this access. We have implemented parallel MrBayes, Garli, and RAxML. We are working with developers to deploy restart options, so if a job times out, it can be restarted. TeraGrid (group allocation)
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.