Presentation is loading. Please wait.

Presentation is loading. Please wait.

Collaborative eScience: Evolving Approaches Charles Severance NCeSS eCollaboration Workshop June 28, 2006 This material is Copyright Creative Commons Attribution.

Similar presentations


Presentation on theme: "Collaborative eScience: Evolving Approaches Charles Severance NCeSS eCollaboration Workshop June 28, 2006 This material is Copyright Creative Commons Attribution."— Presentation transcript:

1 Collaborative eScience: Evolving Approaches Charles Severance NCeSS eCollaboration Workshop June 28, 2006 This material is Copyright Creative Commons Attribution 2.5

2 Outline A look back at the past 15 years Putting the “collab” in Collaborative eScience The current tools of Collaborative eScience –Collaboration –Portals –Repository Some Refections A “future” eScience Case Study Some sections are hidden to fit into 30 minutes…

3 The Founding Concepts Scientific Domain Groups of People Common User Interface Data Sharing –In the moment –Long-term Experimental Equipment Compute Visualization

4 Over 15 Years of Collaborative eScience 20001991 - 19992001200220032004200520062007 UARC/SPARC SakaiWorktoolsCHEF OGCE Grid Portal NEESGrid Globus Tool Kit NEESIT SCIGate ?

5 What was SPARC? Before UARC..

6 What was SPARC? UARC/ SPARC

7 2/2001 600 users 800 data sources

8 SPARC Software (1991-2001) Written from scratch –No Middleware –No Portal Technology Three rewrites over 10 years –NextStep (Version 1) –Java Applets with server support (Version 2) –Browser based - kind of like a portal (Version 3) At the end, in 2001 - it was ready for another rewrite

9 Keys to SPARC Success Ten years of solid funding –Team consistency –Long enough to make and recover from “mistakes” Long term relationship between IT folks and scientists - evolved over time - relationship was “grey” Software rewritten several times over life of project based on evolving user needs and experience with each version of the program Portion of effort was invested in evaluation of usability - feedback to developers

10 After SPARC: Now What? Getting people together is an important part of collaborative eScience –WorkTools - Based on Lotus Notes –CHEF - Collaborative framework - Based on Java and Jetspeed –Sakai - Collaboration and Learning Environment - Java Critical point: Collaborative software is only one component of eScience UM Focus: Building reusable user interface technologies for the people part of collaborative eScience

11 WorkTools Over 9000 users (2000 active) at the end of 2003 WorkTools - The “organic” single-server approach - if you build it (and give away free acounts), they will come…

12 CompreHensive CollaborativE Framework (CHEF) Fall 2001: CHEF Development begins –Generalized extensible framework for building collaboratories Funded internally at UM All JAVA - Open Source –Jakarta Jetspeed Portal –Jakarta Tomcat Servlet Container –Jakarta Turbine Service Container Build community of developers through workshops and outreach

13 CHEF Applications CourseTools Next Generation WorkTools Next Generation NEESGrid NSF National Middleware Grid Portal

14 NEESGrid

15 NEESGrid - The Equipment Network for Earthquake Engineering Simulation NSF Funded. NCSA, ANL, USC/ISI, UM, USC, Berkeley, MSU

16 CHEF-Based NEESGrid Software

17 Overall Data Modeling Efforts NEES Site ASite CSite B Equipment People Experiments Trials EquipmentPeople ExperimentsTrials Data Tsnumai Specimen Shake Table Specimen Geotech Specimen Centrifuge Specimen UnitsSensors Descriptions Site Specifications Database Project Description Domain Specific models Common Elements Data / Observations

18 DT Main System PTZ/ USB Still Capture DT Client BT848 Video Frames DT Client Capturing Video and Data Camera Control Gateway DAQ Data Capture DT Client Simulation Coordinator Site A Site B

19 DT Main System Data Monitoring Tools Still Image / Camera Control ~ <> ^ ^ <> Camera Control Gateway Creare viewers Still image camera control Thumb- nail

20 Sakai CHEF and NEES - hidden

21 Sakai General Collaborative Tools Announcements Assignments Blog Chat Room Threaded Discussion Drop Box Email Archive Message Of The Day News/RSS Preferences Resources Schedule Web Content Wiki Worksite Setup WebDAV

22 Sakai: Product Placement Collaboration and eResearch Teaching and Learning

23 NMI / OGCE www.ogce.org NSF National Middleware Initiative Indiana, UTexas, ANL, UM, NCSA

24 Chalk Talk:School of Portals (2004) OGCE 1.1 XCAT NEES 3.0 GridPort NEES 1.1 GridPort 3 Sakai uPortal CHEF OGCE 1.2 ? OGCE 2 Jetspeed Alliance GridPort 2 CompetitionCollaboration Convergence GridSphere

25 Chalk Talk:School of eScience Portals (2006) OGCE 1.1 XCAT GridPort NEES 1.1 GridPort 3 Sakai uPortal CHEF OGCE 2 Jetspeed Alliance GridPort 2 CompetitionCollaboration Convergence GridSphere SciGate ? SciDoc

26 Atlas Portal GatewayDesktop Gateway Applications and Users ITERCMS Gateway Technologies Services and Components Resources SRB Petascale Compute ClarensIdentitySecurityOpalMetaData Petascale Data SciGate Production Integration and Administration Sakai Globus BlueGeneORNL… Management Components ControlExperimentSimulation KnowledgeStore …Process … … Configure: Atlas Portal Experiment Process Control Knowledge Store Sakai SRB Opal Clarens Metadata Configure: ITER Portal Experiment Process Control Knowledge Store Sakai SRB Opal Clarens Metadata

27 Some Reflections

28 Computer Scientists like to “stay in their box” Many of the technology solutions work well in their “initial domain” Once an eScience team “adopts” a technology (often step 1) their further progress (and focus) is limited by the technology. Agility is very important in the early phases of eScience Builders of components *must* make their component as interoperable as possible

29 Sakai and Web 2.0 Web 2.0 is about making sure data is available in some form beyond just displayed in the Sakai Tool Set. – Formats RSS Resource Description Framework (RDF) HTML – Protocols RSS / getData / SOAP / REST – Consuming Applications Portals Google delic.io.us

30 Sakai Data Interoperability... interoperability and data portability are key elements... EnterpriseDirectory StudentInformation AuthoringEnvironment PersonalLearningEnvironment PortalEnvironment CollaboarationEnvironment ContentManagement LMS Systems DataRepository

31 Sakai Data Interoperability... interoperability and data portability are key elements... AuthoringEnvironment PersonalEnvironment PortalEnvironment CollaboarationEnvironment ContentManagement LMS Systems DataRepository Identity ACL

32 HTML REST WebDav SOAP iCal SPARQL RSS CalDav Collaboration and Learning WebDav SOAP Collaboration and Learning Current Sakai Future Sakai Sakai “Swiss Army Knife”

33 Interoperability at the UI RSS, ATOM, RDF, SOAP, REST, HTML The SOAUI

34 Reflecting on Middleware and Virtual Organizations (slides not hidden)

35 Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Where is the Middleware? “..composing and orchestrating many technologies…” “..interoperability is key…” Identity ACL

36 Middleware Collaborative Tools Shared ComputeData Sources Data RepositoryPortal Technology Knowledge Tools Identity ACL Is Middleware The Universal Connector?

37 Shared ComputeData Sources Data RepositoryPortal Technology Knowledge Tools Identity ACL The Universal Connectors tcp/iphttp/https web services Collaborative Tools

38 Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Is Middleware “inside” each application? Identity ACL

39 Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Middleware is simply another component - used as needed Middleware Identity ACL

40 Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Identity and Access Control: A very important function of Middleware Middleware Identity ACL Lets Talk about This

41 Chalk Talk:Identity and Access Control CAS Shibboleth Kerberos Globus CompetitionCollaboration Convergence LDAP PubCookie K.X509 MyProxy ?? GridShib Cosign ??? Identity ACL

42 Identity and ACL: Goal State One server - one software distribution Virtual Organization Software Supports all protocols –Globus Certificate Authority –Shibboleth –LDAP –MyProxy –Kerberos Who will do this? Who will fund this? Who can get these competitors to cooperate?

43 AUTHN/AUTHZ Meetings

44 My eScience Fantasy

45 The pre-requisites My net worth is $5B (I give myself grants) I encounter some tech-savvy scientists in a field who are using technology to do world-class research… They have never been visited by any other computer scientist… They are working in groups of 1-30 geographically distributed around the world They all work on a beach with Internet2 connections and wide-open wireless and favourable exchange rates

46 A B D E Vol 4 Vol 3 Vol 2 Vol 1 F C Compute Data Models Tutorials Experiments Remote Observation eDocuments

47 Step 1: Visit The Scientists Understand what they are doing and how they are doing it? Ask them how they would like to improve it. Show each application to other scientists. Ask the other scientists how they would improve it. Help each group improve their work - help them using whatever technology they are currently using

48 Step 2: Add some technology Install the super-multi-protocol Virtual Organization software and provide a NOC for the VO software - identity and simple attributes - make sure the VO is easy to use! Install Sakai - point it at the VO software for identity add icon at the top of Sakai Give each scientist an account in the VO Give each effort in the field a site within Sakai

49 Heart Study Collaboratory Login My WorkspaceABCDEOpen Forum Home Chat Resources Tutorials Site B Mail List Live Meetings

50 Step 2: Use the VO For those who want to protect their information, help them add SSO to their sites, backed by the VO service Since it is multi-protocol - likely there will be no modification of the underlying science code - only a server configuration change Identity ACL

51 A B D E Vol 4 Vol 3 Vol 2 Vol 1 F C Compute Data Models Tutorials Experiments Remote Observation eDocuments Identity ACL Heart Study Collaboratory Login My WorkspaceABCDEOpen Forum Home Chat Resources Tutorials Site B Mail List Live Meetings

52 Step 4: Unique Identifier Service Come up with a way for any member of the VO to “get” a unique identifier Demand some information (build a little data model) –Person’s name and organization (implicit from request) –What kind of thing this will represent (experiment, document, image series) –Simple description –Keyword/value extensions Build an simple way request and retrieve these through a simple web service - capture implicit metadata from request (when, IP address, etc). Make sure it works from perl! Encourage community to start marking “stuff” with these identifiers in their stovepipes

53 Step 5: Data Models Begin to work with subsets of the field to try to find common data models across stovepipes Start simple - use very simple RDF - human readable Broaden / deepen model slowly - explore variations Define simple file-system pattern for storing metadata associated with a file and/or a directory

54 Step 6: A Backup-Style Repo Build a data repository which will function as a backup Basic idea - each time you get identifier - this enables backup space - any data and/or metadata can be uploaded under that particular identifier and left in the repository Make the repo multi-protocol, FTP, DAV, Web-Service with attachments, GridFTP, etc. Make it so there can be a network of cooperating repositories

55 A B D E Vol 4 Vol 3 Vol 2 Vol 1 F C Compute Data Models Tutorials Experiments Remote Observation eDocuments Identity ACL Heart Study Collaboratory Login My WorkspaceABCDEOpen Forum Home Chat Resources Tutorials Site B Mail List Live Meetings GUID Service Central Repo Local Repo Local Repo

56 Year 4 and on… Once the basic stovepipes have been “brought in from the cold” and made part of a community with no harm, the next steps are to begin to work “cross- stovepipe” –Evolve data models to be far richer with many variants –Build value added tools that are aware of the data models and are usable across stovepipes Teach the community to build and share tools - gently encourage development standards - Java / JSR-168 perhaps Most important: Always listen to the users

57 Science at the center of eScience Connect Enhance Data Models Data Storage New Tools New Approaches Priority Science Scientists … start at the center and work outwards… … apply technology when the users will see it as a “win” … Communicate New Technologies Repositories

58 Conclusion Many years ago, eScience had science as its main focus Custom approaches resulted in too many unique solutions Computer scientists began a search for the “magic bullet” - each group found a different magic bullet Each group now competes for mind share (and funding) to be the “one true” magic bullet

59 Conclusion (cont) One way to solve the “many competing technologies” solution is to form “super groups” which unify the technologies No single technology gets to claim “they are the one” (Middleware cannot be “in the middle” because then it gets “in the way”) Each technology needs to become a drop-in service/component which is available for use only when appropriate Once we can get past looking at the technologies as the main focus, we get back to science as the main focus

60 Lets remember why we started this whole field in the first place… Scientific Domain Groups of People Common User Interface Data Sharing –In the moment –Long-term Experimental Equipment Compute Visualization To download www.dr-chuck.com “Chuck’s Talks”


Download ppt "Collaborative eScience: Evolving Approaches Charles Severance NCeSS eCollaboration Workshop June 28, 2006 This material is Copyright Creative Commons Attribution."

Similar presentations


Ads by Google