Grids for Chemical Informatics Randall Bramley, Geoffrey Fox, Dennis Gannon, Beth Plale Computer Science, Informatics, Physics Pervasive Technology Laboratories.

Slides:



Advertisements
Similar presentations
Common Instrument Middleware Architecture and Federation of Instrument Resources for X-ray Crystallography Rick McMullen Indiana University.
Advertisements

LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Pulan Yu School of Informatics Indiana University Bloomington Web service based Varuna.Net.
CVRG Presenter Disclosure Information Tahsin Kurc, PhD Center for Comprehensive Informatics Emory University CardioVascular Research Grid Core Infrastructure.
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
1 Challenges and New Trends in Data Intensive Science Panel at Data-aware Distributed Computing (DADC) Workshop HPDC Boston June Geoffrey Fox Community.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
As computer network experiments increase in complexity and size, it becomes increasingly difficult to fully understand the circumstances under which a.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
Cyberinfrastructure Supporting Social Science Cyberinfrastructure Workshop October Chicago Geoffrey Fox
Help!!! Some Future Semantic Grid Activities CrisisGrid and ServoGrid PTLIU Laboratory for Community Grids Geoffrey Fox Computer Science, Informatics,
V. Chandrasekar (CSU), Mike Daniels (NCAR), Sara Graves (UAH), Branko Kerkez (Michigan), Frank Vernon (USCD) Integrating Real-time Data into the EarthCube.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
INFSO-RI Enabling Grids for E-sciencE FloodGrid application Ladislav Hluchy, Viet D. Tran Institute of Informatics, SAS Slovakia.
Grid Computing for Real World Applications Suresh Marru Indiana University 5th October 2005 OSCER OU.
Education and Grid Services Geoffrey Fox Professor of Computer Science, Informatics, Physics Community Grids Laboratory Pervasive Technology Laboratories.
18:15:32Service Oriented Cyberinfrastructure Lab, Grid Deployments Saul Rioja Link to presentation on wiki.
Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois Shannon Hastings Department of Biomedical Informatics Ohio State University.
Future Grid Future Grid User Portal Marlon Pierce Indiana University.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Geospatial Systems Architecture Todd Bacastow. GIS Evolution
CyberInfrastructure to Support Scientific Exploration and Collaboration Dennis Gannon (based on work with many collaborators, most notably Beth Plale )
OpenQuake Infomall ACES Meeting Maui May Geoffrey Fox
SAN DIEGO SUPERCOMPUTER CENTER NUCRI Advisory Board Meeting November 9, 2006 Science Gateways on the TeraGrid Nancy Wilkins-Diehr TeraGrid Area Director.
material assembled from the web pages at
Grid Portals Dennis Gannon Marlon Pierce Indiana University.
Crystal25 Hunter Valley, Australia, 11 April 2007 Crystal25 Hunter Valley, Australia, 11 April 2007 JAINIS (JCU and Indiana Instrument Services): A Grid.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure SRB + Web Services = Datagrid Management System (DGMS) Arcot.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Geospatial Systems Architecture Todd Bacastow. Views of a System Architecture Enterprise Information Computational Engineering Technology.
Renaissance Computing Institute: An Overview Lavanya Ramakrishnan, John McGee, Alan Blatecky, Daniel A. Reed Renaissance Computing Institute.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Sponsored by the National Science Foundation A New Approach for Using Web Services, Grids and Virtual Organizations in Mesoscale Meteorology.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
Applications and Requirements for Scientific Workflow Introduction May NSF Geoffrey Fox Indiana University.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar,
November Geoffrey Fox Community Grids Lab Indiana University Net-Centric Sensor Grids.
NCSA Alliance Portal Expedition Demo Marlon Pierce, Greg Daues, Gopi Kandaswamy, and Liang Fang Supercomputing 2004 Pittsburg, PA.
Remarks on OGSA and OGSI e-Science All Hands Meeting September Geoffrey Fox, Indiana University.
Indiana University School of Informatics The LEAD Gateway Dennis Gannon, Beth Plale, Suresh Marru, Marcus Christie School of Informatics Indiana University.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
Geospatial Systems Architecture
Biomedical and Bioscience Gateway to National Cyberinfrastructure John McGee Renaissance Computing Institute
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
HPC in the Cloud – Clearing the Mist or Lost in the Fog Panel at SC11 Seattle November Geoffrey Fox
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
OGCE Workflow and LEAD Overview Suresh Marru, Marlon Pierce September 2009.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
CIMA and Semantic Interoperability for Networked Instruments and Sensors Donald F. (Rick) McMullen Pervasive Technology Labs at Indiana University
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
LEAD Project Discussion Presented by: Emma Buneci for CPS 296.2: Self-Managing Systems Source for many slides: Kelvin Droegemeier, Year 2 site visit presentation.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
1 Web Service Information Systems and Applications GGF16 Semantic Grid Workshop Athens Greece February Geoffrey Fox Computer Science, Informatics,
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
1 Building Gateways to Grid Capabilities Dennis Gannon (with collaborator Beth Plale) Department of Computer Science School of Informatics Indiana University.
Recap: introduction to e-science
iSERVOGrid Architecture Working Group Brisbane Australia June
Some remarks on Portals and Web Services
Cyberinfrastructure and PolarGrid
Status of Grids for HEP and HENP
Chemical Informatics and Cyberinfrastructure Collaboratory
Presentation transcript:

Grids for Chemical Informatics Randall Bramley, Geoffrey Fox, Dennis Gannon, Beth Plale Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana University Bloomington IN 47401

What is a Grid? Name borrowed from the power grid. The concept: A ubiquitous information & computation resource A definition a network of compute and data resources that has been supplemented with a layer of services that provide uniform and secure access to a set of applications of interest to a distributed community of users. Grids may be wide-area or enterprise

Scientific Challenges The current and future generations of scientific problems are: Data Oriented Increasingly stream based. Often need petabyte archives In need of on-demand computing resources Conducted by geographically distributed teams of specialists Who don’t want to become experts in grid computing. On-Demand Storm predictions Streaming Observations Forecast Model Data Mining Storms Forming

Information/Knowledge Grids Distributed (10’s to 1000’s) of data sources (instruments, file systems, curated databases …) Data Deluge: 1 (now) to 100’s petabytes/year (2012) Moore’s law for Sensors Possible filters assigned dynamically (on-demand) Run image processing algorithm on telescope image Run Gene sequencing algorithm on compiled data Needs decision support front end with “what-if” simulations Metadata (provenance) critical to annotate data Integrate across experiments as in multi-wavelength astronomy Data Deluge comes from pixels/year available

Internet Scale Distributed Services Grids use Internet technology to manage sets of network connected resources Classic Web: independent one-to-one access to individual resources Grids integrate together and manage multiple Internet- connected resources: People, Sensors, computers, data systems Grids are built on top of commodity web service technology with broad industry support Organization can be explicit as in TeraGrid which federates many supercomputers; CrisisGrid which federates first responders, commanders, sensors, GIS, (Tsunami) simulations, science/public data Organization can be implicit such as curated databases and simulation resources that “harmonize a community”

The Architecture of Gateway Grids The Users Desktop. Gateway Services Grid Portal Server Grid Portal Server Physical Resource Layer Core Grid Services Proxy Certificate Server / vault Proxy Certificate Server / vault Application Events Resource Broker User Metadata Catalog User Metadata Catalog Replica Mgmt Application Workflow Application Workflow App. Resource catalogs App. Resource catalogs Application Deployment Application Deployment Execution Management Execution Management Information Services Information Services Self Management Self Management Data Services Data Services Resource Management Resource Management Security Services Security Services OGSA-like Layer

Let’s look at a few real examples (about a dozen … many more exist!)

BIRN – Biomedical Information

Mesoscale Meteorology NSF LEAD project - making the tools that are needed to make accurate predictions of tornados and hurricanes. - Data exploration and Grid workflow

Workflow in the LEAD Grid Katrina output

Renci Bio Gateway Providing access to biotechnology tools running on a back-end Grid. - leverage state-wide investment in bioinformatics - undergraduate & graduate education, faculty research - another portal soon: national evolutionary synthesis center

Flood Modeling Large-scale flooding along Brays Bayou in central Houston triggered by heavy rainfall during Tropical Storm Allison (June 9, 2001) caused more than $2 billion of damage. University of Texas TACC Center for Research in Water Resources ORNL Purdue Gordon Wells, UT; David Maidment, UT; Budhu Bhaduri, ORNL, Gilbert Rochon, Purdue

X-Ray Crystallography

SERVOGrid

SERVOGrid Requirements Seamless Access to Data repositories and large scale computers Integration of multiple data sources including sensors, databases, file systems with analysis system Including filtered OGSA-DAI (Grid database access) Rich meta-data generation and access with SERVOGrid specific Schema extending openGIS (Geography as a Web service) standards and using Semantic Grid Portals with component model for user interfaces and web control of all capabilities Collaboration to support world-wide work Basic Grid tools: workflow and notification NOT metacomputing

Database Analysis and Visualization Portal Repositories Federated Databases Data Filter Services Field Trip Data Streaming Data Sensors ? Discovery Services SERVOGrid Research Simulations ResearchEducation Customization Services From Research to Education Education Grid Computer Farm Grid of Grids: Research Grid and Education Grid GIS Grid Sensor Grid Database Grid Compute Grid

Google maps can be integrated with Web Feature Service Archives to filter and browse seismic records. Integrating Archived Web Feature Services and Google Maps

MyGrid - Bioinformatics

ABC The Williams Workflows A: Identification of overlapping sequence B: Characterisation of nucleotide sequence C: Characterisation of protein sequence

Physical Network Discovery Metadata BioInformatics Grid Chemical Informatics Grid … Domain Specific Grids/Services … Data Access/Storage SecurityWorkflowMessagingManagement Information/Knowledge Instrument/Sensor Compute/Supercomputer MIS Core Low Level Grid Services Application Services Policy M(B,C)IS is Molecular (Bio, Chem) Information System supporting specific metadata (CML, CellML, SBML) and physical representations HTS Tools Quantum Calculations CIS Sequencing Tools Biocomplexity Simulations BIS Portals Collaboration Services

Comments on Grid Components Support GT4 and WS-I+(+); Support Java and.NET Portals – all services will have a portlet interface Compute Grid -- This is some sort of Condor Grid (as used by Cambridge) Supercomputer Grid -- (extended) TeraGrid Workflow, Metadata, Information Management – learn from Taverna, link with BPEL style workflow, link with other Semantic Grid/metadata services Instruments – learn from CIMA/Reciprocal Net, compare with Sensors in LEAD/SERVOGrid MIS/CIS – See if idea sensible – in any case need CML, LSID, Molecular visualization Application Services – Need a wizard. Support “filters” (Wild) and loosely coupled simulations (Baik) Data – Link to PubChem and Bioinformatics – link to Baik database Discovery – Extended UDDI Security – review any special requirements and status of PubChem, caBIG, myGrid etc, Collaboration, Management, Messaging, Policy -- nothing special needed