CICC Web services and Issues Jungkee (Jake) Kim Community Grids Laboratory.

Presentation transcript:

CICC Web services and Issues Jungkee (Jake) Kim Community Grids Laboratory

CICC Web Services I

BCI Clustering
  – Provides Barnard Chemical Information (BCI) clustering packages
  – A module of the workflow for HTS data organization and flagging
  – Status:
      • Added URL output support to the previous solid prototype (multi-user, durable)
      • Taverna Beanshell scripting for data format adjusting (e.g. filtering out the header part that lists column names)
  – To do: Evaluating the URI (URL) based workflow design

ToxTree
  – Estimates toxic hazard by applying a decision tree approach
  – A module of the workflow for HTS data organization and flagging
  – Status: A test prototype producing the level of toxicity, with a brief or verbose explanation, for a SMILES structure
  – To do: Refining the Web service for cluster input and external property support
  – The Taverna Beanshell scripting for data merging is not used in some modules
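The data format adjusting step mentioned above (dropping the header line that lists column names) could look roughly like the following. This is a minimal sketch in plain Java, which Beanshell can also interpret; the class name `HeaderFilter`, the method name `stripHeader`, and the tab-delimited sample data are assumptions for illustration, not taken from the actual CICC workflow.

```java
class HeaderFilter {
    /**
     * Returns the text without its first line, which is assumed
     * to be the header listing the column names.
     */
    static String stripHeader(String table) {
        int newline = table.indexOf('\n');
        // No line break: the input is only a header (or empty), so nothing remains.
        return newline < 0 ? "" : table.substring(newline + 1);
    }

    public static void main(String[] args) {
        // Hypothetical clustering output: one header row, then data rows.
        String clusterOutput = "SMILES\tCluster\nCCO\t1\nc1ccccc1\t2\n";
        System.out.print(stripHeader(clusterOutput));
    }
}
```

In a Taverna Beanshell processor the same logic would typically read from an input port variable and assign the result to an output port variable rather than use a `main` method.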

CICC Web Services II

Workflow for HTS data organization and flagging
  – Demonstrates how screening data can be flagged and organized for human analysis
  – Status: Individual modules, except the visualization, are in prototype
  – To do:
      • Defining at least an XML schema or DTD for the workflow data (at most an ontology)
      • Redefining the current workflow model to reflect the new features of Taverna 1.4: support for complex data structures and the provenance plugin

Other Planned Web Services
  – Open Source Chemistry Analysis Routines (OSCAR)
      • Extracts chemical information from text and produces an XML instance highlighting the chemical information
      • A module of the PMR workflow
      • Status: OSCAR3 is available and works fine as a Java application
      • To do: Studying XML instances for extracting chemical names
  – InfoChem's SPRESI Web Service
      • Provides access to the SPRESI molecule database
      • Status: Perl scripts for accessing the SPRESI Web Service
      • To do: Developing a Web service wrapper to utilize InfoChem's SPRESI Web Service
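As a rough illustration of the OSCAR "to do" item (extracting chemical names from an XML instance), the sketch below collects the text of every element with a given tag name using the standard Java DOM API. The tag name `chem` and the sample document are hypothetical; the actual markup produced by OSCAR3 is not described in these slides and would need to be checked against real output.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

class ChemicalNameExtractor {
    /** Returns the text content of every element named tagName, in document order. */
    static List<String> extract(String xml, String tagName) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            NodeList nodes = doc.getElementsByTagName(tagName);
            List<String> names = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                names.add(nodes.item(i).getTextContent().trim());
            }
            return names;
        } catch (Exception e) {
            // Malformed XML or parser setup failure: report as a bad argument.
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    public static void main(String[] args) {
        // Hypothetical annotated text; "chem" is an assumed tag name.
        String xml = "<p>Add <chem>ethanol</chem> to <chem>benzene</chem>.</p>";
        System.out.println(extract(xml, "chem"));
    }
}
```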

BCI Clustering URL Service Methods

Service Method | Description | Input | URL Output
--- | --- | --- | ---
makebitsURLGenerate | Generate fingerprints from a SMILES structure | SMIstring | Fingerprint and program output
divkmURLGenerate | Cluster fingerprints with Divkmeans | SCNstring | DKM data and program output
smile2dkmURL | Makebits + divkm | SMIstring | All SMI, DKM and std. outputs
optclusURLGenerate | Generate the best levels in a hierarchy | SMIstring, DKMstring | Best data and program output
rnnclusURLGenerate | Extract individual cluster partitions | SMIstring, DKMstring | New partition and std. output
smile2ClusterPartitionedURL | Generate a new SMILES structure w/ extra col. | SMIstring | All intermediate data and output
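The table describes smile2dkmURL as "Makebits + divkm", that is, the fingerprint step followed by the clustering step. A minimal sketch of that chaining is shown below. Each step is modelled as a function from one string (for example a data URL) to another; the class name `Smile2DkmSketch`, the stub steps, and the file names are assumptions, and the sketch returns only the final result, whereas the real service is listed as returning all SMI, DKM and standard outputs.

```java
import java.util.function.Function;

class Smile2DkmSketch {
    /** Chains a fingerprint step and a clustering step into one call. */
    static Function<String, String> compose(Function<String, String> makebits,
                                            Function<String, String> divkm) {
        // The output of makebits becomes the input of divkm.
        return makebits.andThen(divkm);
    }

    public static void main(String[] args) {
        // Stub steps standing in for the remote Web service calls (hypothetical).
        Function<String, String> makebits = smi -> smi.replace(".smi", ".scn");
        Function<String, String> divkm = scn -> scn.replace(".scn", ".dkm");

        String result = compose(makebits, divkm).apply("compounds.smi");
        System.out.println(result);
    }
}
```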

Workflow for smile2ClusterPartitionedURL

Workflow for Toxic Hazard in Verbose

Diagram of Workflow2 (legend: Input/Output, Web Services, Beanshell Scripting)