Ian Foster Computation Institute Argonne National Lab & University of Chicago Education in the Science 2.0 Era.

2 “Web 2.0”
- Software as services
  - Data- & computation-rich network services
- Services as platforms
  - Easy composition of services to create new capabilities (“mashups”), which may themselves be made accessible as new services
- Enabled by massive infrastructure buildout
  - Google projected to spend $1.5B on computers, networks, and real estate in 2006
  - Dozens of others are spending substantially
- Paid for by advertising
Declan Butler, Nature

3 Science 2.0: E.g., Virtual Observatories
[Figure labels: User, Gateway, Data Archives, Analysis tools, Discovery tools. Figure: S. G. Djorgovski]

4 Science 2.0
People create services (data or functions) … which I discover and use … & maybe compose to create a new function … and then publish as a new service.
→ I find “someone else” to host services, so I don’t have to become an expert in operating services & computers!
→ I hope that this “someone else” can manage security, reliability, scalability, …!!
“Service-Oriented Science”, Science, 2005
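The discover, compose, and publish cycle on this slide can be sketched in a few lines of Python. This is only an illustration: the in-memory `registry`, the `publish`, `discover`, and `compose` helpers, and the toy unit-conversion services are all hypothetical stand-ins, not part of any Grid or Web service toolkit named in the talk.

```python
from typing import Callable, Dict

# Hypothetical in-memory registry standing in for a real service directory.
Service = Callable[[float], float]
registry: Dict[str, Service] = {}


def publish(name: str, service: Service) -> None:
    """Make a service discoverable under a name."""
    registry[name] = service


def discover(name: str) -> Service:
    """Look up a previously published service by name."""
    return registry[name]


def compose(first: Service, second: Service) -> Service:
    """Build a new service that feeds the output of `first` into `second`."""
    return lambda x: second(first(x))


# Two services that "other people" created (toy unit conversions).
publish("celsius_to_kelvin", lambda c: c + 273.15)
publish("kelvin_to_rankine", lambda k: k * 9 / 5)

# Discover both, compose them, and publish the result as a new service.
publish(
    "celsius_to_rankine",
    compose(discover("celsius_to_kelvin"), discover("kelvin_to_rankine")),
)
```

The composed function is itself registered, so a later user can discover it without knowing it was built from two others. A real deployment would also need the hosting, security, and reliability concerns the slide hands to “someone else”.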

5 Education and Science 2.0
1) Services as subject
  - Teach how to discover, apply, build services
  - Produce a legacy of educational services
2) Services as content
  - Use services to teach specific content areas
  - Produce a legacy of educational materials
3) Services as enabler
  - Outsource the mundane & expensive, so educators can focus on education
  - Produce a legacy of infrastructure services

6 Services as Subject
- Students learn how to discover & invoke services to address specific problems
- Then how to build & publish new services
- Opportunities for both individual creativity & for intra- and inter-college collaboration

7 Services as Content
- Develop educational materials that leverage remote services, e.g.:
  - Virtual Observatory (astronomy)
  - caBIG (cancer biology)
  - Physics (Quarknet, I2U2)
  - NanoHub (nanotechnology)
  - TeraGrid “Science Gateways”
- Sponsor development of new content services & associated educational materials
  - Offer to host the resulting services (see next item)

8 Interactive educational projects
- Students use real data
- Groundbreaking research in classroom
- Full lesson plans for teachers
SkyServer

9 Cancer Bioinformatics Grid
[Figure labels: Data (uchicago.edu); Analytic (osu.edu); Analytic (duke.edu); BPEL Engine; <BPEL Workflow Doc>; <Workflow Inputs>; <Workflow Results>]
caBIG: BPEL work: Ravi Madduri et al.
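As a rough illustration of what a workflow engine does with the pieces named on this slide, the Python sketch below runs workflow inputs through a chain of services and returns the results. It is not BPEL and does not use any caBIG interface. The `run_workflow` helper, the linear ordering of the steps, and the toy functions attached to each host name are assumptions made for the example; the slide does not specify them.

```python
from typing import Callable, List, Tuple

# A step pairs a host label with a function standing in for a remote service.
Step = Tuple[str, Callable[[list], list]]


def run_workflow(steps: List[Step], inputs: list) -> Tuple[list, List[str]]:
    """Run each step on the previous step's output.

    Returns the final results and a trace of the hosts visited, in order.
    """
    data, trace = inputs, []
    for host, service in steps:
        data = service(data)
        trace.append(host)
    return data, trace


# Host names come from the slide; the functions are hypothetical toy stand-ins.
steps: List[Step] = [
    # Data service: look up a record for each requested id.
    ("uchicago.edu", lambda ids: [{"id": i, "value": i * 10} for i in ids]),
    # First analytic service: keep records at or above a threshold.
    ("osu.edu", lambda rows: [r for r in rows if r["value"] >= 20]),
    # Second analytic service: reduce the remaining records to one total.
    ("duke.edu", lambda rows: [sum(r["value"] for r in rows)]),
]

results, trace = run_workflow(steps, [1, 2, 3])
```

In a real deployment each function would be a remote call, and the workflow document, rather than a Python list, would describe the order of steps.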

10 Earth System Grid
- Climate simulation data
  - Per-collection control
  - Different user classes
  - Server-side processing
- Implementation (GT)
  - Portal-based User Registration (PURSE)
  - PKI, SAML assertions
  - GridFTP, GRAM, SRM
- >2000 users
- >100 TB downloaded
DOE OASCR

11 Science Gateways: E.g., Biology
- Public PUMA Knowledge Base: information about proteins analyzed against ~2 million gene sequences
- Back Office Analysis on Grid: millions of BLAST, BLOCKS, etc., on OSG and TeraGrid
Natalia Maltsev et al.

12 Services as Enabler
- E.g., significant obstacle to teaching parallel computing is a lack of parallel computers!
  → Create infrastructure that allows remote sites to host “virtual clusters” for many colleges
  → Any college can teach parallel computing
- Similarly for other specialized resources
  - E.g., database systems, scientific software, network testbeds, …
- Needs:
  - Hosting infrastructure for virtual resources
  - A library of configured virtual resources
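The two needs listed on this slide, a hosting infrastructure and a library of configured virtual resources, can be pictured with a small Python model. Everything here is hypothetical: the `Template` and `HostingSite` classes, the template names, and the node counts are invented for illustration and do not describe any actual virtual-cluster system.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class Template:
    """A preconfigured virtual resource (hypothetical fields)."""
    name: str
    nodes: int


@dataclass
class HostingSite:
    """A remote site with a fixed pool of nodes to lend to colleges."""
    total_nodes: int
    leases: Dict[str, Template] = field(default_factory=dict)

    def free_nodes(self) -> int:
        """Nodes not currently leased to any college."""
        return self.total_nodes - sum(t.nodes for t in self.leases.values())

    def provision(self, college: str, template: Template) -> bool:
        """Lease a virtual cluster to a college if capacity allows."""
        if college in self.leases or template.nodes > self.free_nodes():
            return False
        self.leases[college] = template
        return True

    def release(self, college: str) -> None:
        """Return a college's nodes to the pool (no-op if it holds none)."""
        self.leases.pop(college, None)


# The "library of configured virtual resources" (names and sizes invented).
LIBRARY = {
    "mpi-intro": Template("mpi-intro", nodes=8),
    "db-lab": Template("db-lab", nodes=2),
}

site = HostingSite(total_nodes=16)
ok_a = site.provision("college-a", LIBRARY["mpi-intro"])
ok_b = site.provision("college-b", LIBRARY["mpi-intro"])
ok_c = site.provision("college-c", LIBRARY["mpi-intro"])  # pool is exhausted
```

One site serves several colleges from a shared pool, and a college that releases its lease frees capacity for the next course. Scheduling, authentication, and the virtualization layer itself are left out of this sketch.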

13 Education and Science 2.0
1) Services as subject
  - Teach how to discover, apply, build services
  - Produce a legacy of educational services
2) Services as content
  - Use services to teach specific content areas
  - Produce a legacy of educational materials
3) Services as enabler
  - Outsource the mundane & expensive, so educators can focus on education
  - Produce a legacy of infrastructure services