University of Illinois at Urbana-Champaign, National Center for Supercomputing Applications. Towards Truly Ubiquitous Cyberinfrastructure. LAGrid 07. Jim Myers.

Presentation transcript:

University of Illinois at Urbana-Champaign, National Center for Supercomputing Applications
Towards Truly Ubiquitous Cyberinfrastructure
LAGrid 07
Jim Myers, Associate Director for Cyberenvironments and Technologies, National Center for Supercomputing Applications (NCSA), University of Illinois at Urbana-Champaign

National Center for Supercomputing Applications Cyber-resources Innovative Systems Communities and Applications Cyberenvironments

Outline
What's Changing in Science?
What Role should Cyberinfrastructure (CI) play?
What Do Ubiquitous (and Persistent) mean for CI Development?
Designing for Ubiquity
Some Examples
Conclusions

How is Science Changing?
Quantitative Modeling and Simulation
Better Data (e.g. Higher Signal to Noise)
More Data (e.g. High Throughput)
– Closer ties between research and application
– Investigation of subtle, non-linear, multi-dimensional phenomena
– Statistical analysis of complex systems

The Research Process: It's just the Scientific Method…

The Research Process: With Experimental Design…
F_g ~ m
Conceptual, Logical, Physical
Assumptions, Reference Data, Controls…
Reduction, Statistics, Analysis of Alternatives…

The Research Process: And Multiple, Coupled Objectives…
Scientific Instrument Method
F_g ~ m
High-speed camera

The Research Process: And Community Processes…
Scientific Instrument Method
Collaboration, Reference Data Curation, Model Validation, Sub-discipline Creation, Best-practice Dissemination, Application, Education, …

The Research Process: And It's No Longer F_g ~ m…
Scientific Instrument Method
Non-linear, high-dimensional, coupled, multi-scale phenomena

Amdahl's Law for Scientific Progress:
Data discovery, Translation, Experiment setup, Group coordination, Tool integration, Training, Feature Extraction, Data interpretation, Acceptance of new models/tools, Dissemination of best practices, Interdisciplinary communication
Data production, Processing power, Data transfer/storage !
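The slide's title borrows Amdahl's Law: if only a fraction p of the total work is accelerated by a factor s, the overall speedup is 1 / ((1 - p) + p / s). A minimal Python sketch of that formula follows. The 20% figure for the share of a research project spent on data production, processing, and transfer is purely an illustrative assumption, not a number from the talk.

```python
def amdahl_speedup(accelerated_fraction: float, factor: float) -> float:
    """Overall speedup when only part of the work is made `factor` times faster."""
    if not 0.0 <= accelerated_fraction <= 1.0:
        raise ValueError("accelerated_fraction must be between 0 and 1")
    if factor <= 0:
        raise ValueError("factor must be positive")
    remaining = 1.0 - accelerated_fraction        # work that is not accelerated
    return 1.0 / (remaining + accelerated_fraction / factor)


# Hypothetical split: 20% of project time is compute, storage, and transfer;
# the other 80% is discovery, coordination, training, interpretation, etc.
COMPUTE_SHARE = 0.2
for factor in (10, 100, 1000):
    print(factor, round(amdahl_speedup(COMPUTE_SHARE, factor), 4))
```

Under that assumed split, the overall speedup can never exceed 1 / (1 - 0.2) = 1.25 however fast the hardware gets, which is the point the slide makes about the remaining activities.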

What's Needed to Support the Research Lifecycle?
Discover Mine Translate Reference Extract Experiment Design Annotation Provenance Gap Analysis Reference Data Publish Share Coordinate Curate Validate Relate 1 2 Valid Range Project Execution Engineering Views Standards / Best practice Sensor Data Algorithms / Services

Consider a Spherical Cow…
There is a class of bovine-related problems for which shape is not important
Yet shape is clearly needed in a general cow model
Should we reach consensus here?
Is there one best way to map volume to height?
Moo! ACME Trucking

Key Issues for Ubiquitous & Persistent CI
CI must be built before the parts are done
It must be evolvable by independent parties
It must enable coordination without central control
It must allow science to evolve / progress
– No fixed domain model
Researchers/educators must be able to work in multiple communities/value chains (across CI projects)
It must convey knowledge as well as tools to end users
It must align the interests of CI funders, developers, providers, users, …

Can this be done?

Yes!
Design Principles for loosely coupled, scalable (not scaled) systems and organizations
Agile, community/science-driven development processes over longer-term community/science-driven design
…e-Science, Semantic Grid, Web 2.0… …intelligence at the edges…

Key Cyberenvironment Design Concepts
Explicit Representations Separating How from What:
– Content (metadata, global IDs, …)
– Process (workflow, provenance, …)
– Virtual Organizations (policies, resources, semantics, translation)
– GUI Integration (portals, rich clients, …)
– …

MAEViz – an Example Cyberenvironment (Consequence-Based Risk Management for Seismic Events)
Mid-America Earthquake Center
Engineering View of MAE Center Research
Portal-based Collaboration Environment
Distributed Data/metadata Sources
Multi-disciplinary Collaboration
Hazard Definition, Inventory Selection, Fragility Models, Damage Prediction, Decision Support

Content Management
Whatever thing we are talking about, we want
– To know its type,
– Have descriptive information so we can find and categorize it,
– Be able to version it,
– Specify who owns and can access it,
– Define its relationships to other things,
– Manage copies of it / know when you have it,
– Be able to translate it,
– Dynamically add new information we learn about it,
– …
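The wish list above can be read as the fields and operations of a single content record. The Python sketch below is one possible reading, not the design of any system named in the talk: the class name `ContentItem`, its fields, and the example values are all hypothetical, and copy management and translation are left out for brevity.

```python
from dataclasses import dataclass, field
from uuid import uuid4


@dataclass
class ContentItem:
    """One managed 'thing': typed, described, owned, versioned, and linked."""
    content_type: str                                  # "know its type"
    owner: str                                         # "who owns it"
    metadata: dict = field(default_factory=dict)       # descriptive information
    readers: set = field(default_factory=set)          # "who can access it"
    relations: list = field(default_factory=list)      # (predicate, target id) pairs
    versions: list = field(default_factory=list)       # payload history, oldest first
    item_id: str = field(default_factory=lambda: f"urn:uuid:{uuid4()}")  # global ID

    def add_version(self, payload: bytes) -> int:
        """Store a new payload and return its 1-based version number."""
        self.versions.append(payload)
        return len(self.versions)

    def annotate(self, key: str, value: str) -> None:
        """Dynamically add new information learned about the item."""
        self.metadata[key] = value

    def relate(self, predicate: str, other: "ContentItem") -> None:
        """Record a relationship to another item by its global ID."""
        self.relations.append((predicate, other.item_id))

    def can_read(self, user: str) -> bool:
        return user == self.owner or user in self.readers


# Hypothetical usage: a damage estimate derived from a building inventory.
inventory = ContentItem("building-inventory", owner="alice")
damage = ContentItem("damage-estimate", owner="alice", readers={"bob"})
damage.add_version(b"run 1")
damage.add_version(b"run 2")
damage.relate("derivedFrom", inventory)
damage.annotate("hazard", "scenario earthquake")
```

Keeping relationships as (predicate, global ID) pairs means items can refer to each other across repositories without sharing a fixed schema.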

Content Aware
ARKs, DOI, LSID
WebDAV, JCR, RDF, SAM, Tupelo
Desktop, Secure Enterprise Data, Public Reference Data
Data/Metadata

Process Management Framework
Workflow description as a means of communicating experiment protocol
– Actors built as modules, web services, grid jobs…
– Process execution managed through direct calls, service calls, data transfer, events, manual processes, …
Workflow generated by applications, by example, graphically, or discovered from provenance
Execution performed using an engine with appropriate speed, reliability, availability of modules, etc.
Workflow templates and provenance records treated as sharable content (versioned, compared, documented, …)
Process descriptions captured at multiple levels of detail (scientific, mathematical, engineering, debugging, …)
Community Provenance and Process extend across workflows
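To make the pairing of workflow and provenance concrete, here is a minimal Python sketch: a workflow is a list of declared steps, a tiny engine runs them, and every run leaves a provenance record that can be queried afterwards. The names `Step`, `run_workflow`, and `lineage` are invented for this illustration and do not correspond to any engine mentioned in the talk.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Step:
    """A declared workflow step: what it reads, what it produces, who does it."""
    name: str
    actor: Callable[..., Any]   # could equally wrap a web service or grid job
    inputs: tuple               # names of the values the actor reads
    output: str                 # name of the value the actor produces


def run_workflow(steps, initial):
    """Run steps in order; return the final values and a provenance log."""
    values = dict(initial)
    provenance = []
    for step in steps:
        args = [values[name] for name in step.inputs]
        values[step.output] = step.actor(*args)
        provenance.append(
            {"step": step.name, "used": list(step.inputs), "generated": step.output}
        )
    return values, provenance


def lineage(provenance, name):
    """Names of every value that `name` was directly or indirectly derived from."""
    producers = {record["generated"]: record["used"] for record in provenance}
    seen, stack = set(), [name]
    while stack:
        for parent in producers.get(stack.pop(), []):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


# Hypothetical two-step protocol: y = f2(z), then x = f(y).
steps = [
    Step("f2", lambda z: z * 2, ("z",), "y"),
    Step("f", lambda y: y + 1, ("y",), "x"),
]
values, provenance = run_workflow(steps, {"z": 3})
ancestors_of_x = lineage(provenance, "x")
```

Because the steps are plain data, the same description can be versioned and shared like any other content, and the provenance log can be stored alongside it.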

Process Management
Workflow Creation
Hierarchical Workflow
Application Interface
Provenance
Workflow-by-Example
X = f(y), Y = f2(z)
Scripting

Process Aware
Workflow, Provenance, RDF
Discover Process Capture Execute Report

Virtual Organizations
Grid/portal concept for managing
– Single sign-on security
– access control policies
– toolsets and views
– data sources
– processes and results
– resource pools
– vocabularies and models
– …
Tools query VO manager to configure themselves based on VO context/policies/preferences
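The last point, tools asking a VO manager how to configure themselves, can be sketched in a few lines of Python. Everything here is an assumption made for illustration: the `VOManager` and `DataBrowser` classes, the setting names, and the in-memory storage stand in for whatever grid or portal service would really hold this information.

```python
class VOManager:
    """Toy in-memory registry of virtual organizations (VOs)."""

    def __init__(self):
        self._settings = {}   # VO name -> settings dict
        self._members = {}    # VO name -> set of user names

    def register(self, vo, members, **settings):
        self._members[vo] = set(members)
        self._settings[vo] = dict(settings)

    def context_for(self, vo, user):
        """Return a copy of the VO's settings, but only to a member of that VO."""
        if user not in self._members.get(vo, set()):
            raise PermissionError(f"{user!r} is not a member of {vo!r}")
        return dict(self._settings[vo])


class DataBrowser:
    """A tool that configures itself from VO context instead of hard-coding it."""

    def __init__(self, manager, vo, user):
        context = manager.context_for(vo, user)
        self.data_sources = context.get("data_sources", [])
        self.vocabulary = context.get("vocabulary", "default")


# Hypothetical VO and settings.
manager = VOManager()
manager.register(
    "seismic-risk",
    members={"alice"},
    data_sources=["inventory-db"],
    vocabulary="seismic-terms",
)
browser = DataBrowser(manager, "seismic-risk", "alice")
```

The same tool started in a different VO would pick up different data sources and vocabularies without any code change, which is the decoupling the slide describes.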

Pluggable User Interfaces
Portlet/Rich-Client concept, broadened to include VO configuration of
– Content sources
– Events
– Workflow/Provenance repositories
– Data models/ontologies
– Translations
Portal technologies: JSR 168, Teamlets, WSRP, JSR 286, …
Rich Clients: Eclipse/OSGi, JSR 170, 283, …

Group Aware
Collaboratory, Portal, …
Plan, Coordinate, Share, Compare
Wiki, Task List, Chat, Document Repository, Scenario Repository, Training Materials, SSO

Dynamic
Plug-ins, WSRP, Provenance
Eclipse RCP, Workflow, Data, GIS
MAEviz Plug-in Framework
Auto-update, New Third-Party Analyses
Compare, Contrast, Validate

Rich, VO-oriented plug-in mechanism
Third-party Plug-in
Adds to menu
Adds to interface
Adds to workflow
Adds to provenance
Joins Security Context
Maps data model
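A plug-in mechanism of this kind can be pictured as a host that exposes a few extension points and lets each plug-in contribute to them. The Python sketch below covers only three of the listed points (menu, workflow, provenance). The talk places MAEviz on Eclipse RCP, whose plug-in model is far richer; `PluginHost` and `ThirdPartyAnalysis` are hypothetical names for this illustration only.

```python
class PluginHost:
    """Minimal host exposing menu, workflow, and provenance extension points."""

    def __init__(self):
        self.menu = []              # labels shown to the user
        self.workflow_steps = {}    # step name -> callable
        self.provenance = []        # (event, plug-in name) records

    def install(self, plugin):
        """Let the plug-in register its contributions, and record that it did."""
        plugin.contribute(self)
        self.provenance.append(("installed", plugin.name))


class ThirdPartyAnalysis:
    """A hypothetical third-party plug-in that adds one analysis step."""

    name = "example-analysis"

    def contribute(self, host):
        host.menu.append("Run example analysis")
        host.workflow_steps[self.name] = self.run

    def run(self, values):
        # Placeholder analysis: the arithmetic mean of the inputs.
        return sum(values) / len(values)


host = PluginHost()
host.install(ThirdPartyAnalysis())
result = host.workflow_steps["example-analysis"]([1, 2, 3])
```

The host never names a specific plug-in, so new third-party analyses can be added without changing the host's code.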

Environmental Observatories
Rely on advances in:
– sensors and sensor networks at intensively instrumented sites shared by the research community
– cyberinfrastructure with high bandwidth to connect the sites, data repositories, and researchers into collaboratories
– distributed modeling platforms
From USGS

Observatories as a Community Focus

Environmental Observatory Processes
Diagram labels: Sensors; Data Products; Derived Data Products; Storage; QA/QC; Archive; Operations/Expt. Design; Cache; Knowledge Store; Community Provisioning; Community Coordination/Knowledge Creation; Events; Model Dev/Validation; Research & Education Projects; Observatory Operation and Evolution; On-demand Services and HPC; Third-party Resources; Data Access; Documentation; Coordination; Recommendations

Ubiquity = Supporting Scientific Discourse
Cyberenvironments represent rethinking current practice to create CI
– That is enabling rather than stifling
– That evolves as fast as research evolves
– That connects research and practice
– That empowers individuals to contribute new resources
– That can be ubiquitous and persistent
– That enables resource repurposing to address new questions
– That opens new career paths for CI developers, data scientists, systems engineers, …

Cyberinfrastructure Challenges
How can CI increase the productivity and competitiveness of the scientific community?
How can CI developers enhance their capacity to respond to user needs more rapidly and more effectively?
How should CI technical design and organizational structures change to enable solutions at scale – as a ubiquitous, persistent infrastructure for science and engineering research and education?

Mosaic and Cyberenvironments
Mosaic
– By the early 1990s, the internet had a wealth of resources, but they were inaccessible to most scientists
– Individual publishing
– Browsing versus retrieving
– See Web The Machine is Us/ing Us
Cyberenvironments
– By the early 2000s, the internet and grid had a wealth of interactive resources, but they were inaccessible to most scientists
– Individual information models
– Fusion versus gathering

Acknowledgments
NCSA CET Staff
NCSA Collaborators
CI Community
National Science Foundation/State of Illinois/ONR
Mathematical, Information and Computational Sciences Division of the Office of Science
… and Thank You