Collaborative eScience: Evolving Approaches Charles Severance Rutgers CyberInfrastructure Meeting April 4, 2006 www.dr-chuck.com.

Slides:



Advertisements
Similar presentations
Using the Collaborative Tools in NEESgrid Charles Severance University of Michigan.
Advertisements

SOFTWARE PRESENTATION ODMS (OPEN SOURCE DOCUMENT MANAGEMENT SYSTEM)
Objectives Moodle is an online learning environment where instructors & their students interact. In this workshop you will learn: 1.Configure system requirements.
Using the Sakai Collaborative Toolkit in eScience Applications Charles Severance Sakai Chief Architect October 3, 2005 GGF-15.
NEESGrid: Early Experiences in CyberInfrastructure Charles Severance University of Michigan Sakai Project, NEESGrid Project, OGCE Project
Sakai: Building the framework for collaborative E-Research Charles Severance Sakai Chief Architect
Using Collaborative Tools in NEESGrid Charles Severance University of Michigan NEESGrid System Integration Team
Web Server Administration
NEES Grid Data Overview Comments to Charles Severance
NEESGrid - A Collaborative Portal Study Charles Severance University of Michigan NEESGrid SI Team
Collaborative eScience: Evolving Approaches Charles Severance NCeSS eCollaboration Workshop June 28, 2006 This material is Copyright Creative Commons Attribution.
Using the Sakai Collaborative Toolkit in e-Research Applications Charles Severance, Joseph Hardin, Glenn Golden University of Michigan - Sakai Project,
Catania Science Gateway Framework Motivations, architecture, features Catania, 09/06/2014Riccardo Rotondo
From Your Course Syllabus to Earthquake Engineering: Collaboration using the CHEF Framework Charles Severance University of Michigan.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
New School Websites Teacher Pages. Visit the SCUSD Website for videos tutorials: For more information.
Section 6.1 Explain the development of operating systems Differentiate between operating systems Section 6.2 Demonstrate knowledge of basic GUI components.
Grid Computing, B. Wilkinson, a.1 Grid Portals.
SAKAI February What is SAKAI? Sakai ≠ Course Management System Sakai = Collaboration & Learning Environment.
About Dynamic Sites (Front End / Back End Implementations) by Janssen & Associates Affordable Website Solutions for Individuals and Small Businesses.
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
© 2011 Delmar, Cengage Learning Chapter 7 Managing a Web Server and Files.
HPC club presentation A proposal for a campus-wide research grid Barry Wilkinson Department of Computer Science UNC-Charlotte Dec. 2, 2005.
Chapter 8 The Internet: A Resource for All of Us.
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
Grid Computing, B. Wilkinson, b.1 National Science Foundation Middleware Initiative (NMI) Started in 2001 initially over 3 years “to create and deploy.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
Sakai Update VRE Workshop Charles Severance Sakai Chief Architect June 22, 2005.
CHEF II / Sakai Architecture. CHEF II Changes uPortal replaces Jetspeed –jsr 168 portlet, servlet compliant Spring replaces Turbine component framework.
National Center for Supercomputing Applications NCSA OPIE Presentation November 2000.
Teaching with SmartSite An Introduction Steve Faith.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
NSF Middleware Initiative Renee Woodten Frost Assistant Director, Middleware Initiatives Internet2 NSF Middleware Initiative.
Neil Witheridge APAN29 Sydney February 2010 ARCS Authorisation Services Neil Witheridge Manager, ARCS Authorisation Services APAN29, Sydney, February 2010.
Fisheries Oceanography Collaboration Software Donald Denbo NOAA/PMEL-UW/JISAO Presented by Nancy Soreide NOAA/PMEL AMS 2002/IIPS 10.3.
August 2003 At A Glance VMOC-CE is an application framework that facilitates real- time, remote cooperative work among geographically dispersed mission.
NanoHUB.org and HUBzero™ Platform for Reproducible Computational Experiments Michael McLennan Director and Chief Architect, Hub Technology Group and George.
Collaborative eScience: Evolving Approaches Charles Severance Executive Director, Sakai Foundation Shaping Collaboration 2006 Geneva, Switzerland December.
Holding slide prior to starting show. A Portlet Interface for Computational Electromagnetics on the Grid Maria Lin and David Walker Cardiff University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
CSC350: Learning Management Systems COMSATS Institute of Information Technology (Virtual Campus)
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
The Sakai JSR-168 Portlet (Version 2) Charles Severance December 17, 2005.
Www2.computer.org Web Publishing Training Leo Wadsworth, Staff Manager April 2008.
1 Gateways. 2 The Role of Gateways  Generally associated with primary sites in ESG-CET  Provides a community-facing web presence  Can be branded as.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Sakai / uPortal / JSR-286 BOF Charles Severance. Questions What do people want? Who wants this so badly to work on it?
WHAT IS E-COMMERCE? E-COMMERCE is a online service that helps the seller/buyer complete their transaction through a secure server. Throughout the past.
Sakai eScience/eResearch Austin Sakai Conference December 8, 2005 Joseph Hardin Stephanie Teasley Chuck Severance KYOU / sakai Boundary, Situation.
Active Directory. Computers in organizations Computers are linked together for communication and sharing of resources There is always a need to administer.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 15 Creating Collaborative Partnerships.
R e D R e S S Portals, Desktop Applications and Distributed Services for e-Research - Updated vision - Rob Crouchley and Rob Allan With thanks to Chuck.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
PIER USER TRAINING 1 Class Date and Location. Instructor Name Instructor Contact Info 2 PIER User Training 1.
Planning an Active Directory Deployment Lesson 1.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Learning Management System. Introduction Software application or Web-based technology used to plan, implement, and assess a specific learning process.
Tutorial 1 Getting Started with Adobe Dreamweaver CS5.
NEESGrid - A Collaborative Portal Study
OGCE OGCE The Open Grid Computing Environments Collaboratory
System And Application Software
Using the Sakai Collaborative Toolkit in eScience Applications
Portals, Desktop Applications and Distributed Services for e-Research
The Sakai Project and Partnership
The Internet and Electronic mail
Collaborative eScience: Evolving Approaches
Presentation transcript:

Collaborative eScience: Evolving Approaches Charles Severance Rutgers CyberInfrastructure Meeting April 4,

Outline A look back at the past 15 years Putting the “collab” in Collaborative eScience The current tools of Collaborative eScience –Collaboration –Portals –Repository Reflecting on 15 years of Experience –What is wrong with Middleware? –Authorization and Authentication - Are we there yet? A “future” eScience Case Study

The Founding Concepts Scientific Domain Groups of People Common User Interface Data Sharing –In the moment –Long-term Experimental Equipment Compute Visualization

Over 15 Years of Collaborative eScience UARC/SPARC SakaiWorktoolsCHEF OGCE Grid Portal NEESGrid Globus Tool Kit NEESIT SCIGate ?

What was SPARC? Before UARC..

What was SPARC? UARC/ SPARC

2/ users 800 data sources

SPARC Software Written from scratch –No Middleware –No Portal Technology Three rewrites over 10 years –NextStep –Java Applets with server support –Browser based - kind of like a portal At the end, in it was ready for another rewrite

Keys to SPARC Success Ten years of solid funding –Team consistency –Long enough to learn from “mistakes” Long term relationship between IT folks and scientists - evolved over time - relationship was “grey” Software rewritten several times over life of project based on evolving user needs and experience with each version of the program Portion of effort was invested in evaluation of usability - feedback to developers

After SPARC: Now What? Getting people together is an important part of collaborative eScience –WorkTools - Based on Lotus Notes –CHEF - Collaborative framework - Based on Java and Jetspeed –Sakai - Collaboration and Learning Environment - Java Critical point: Collaborative software is only one component of eScience UM Focus: Building reusable user interface technologies for the people part of collaborative eScience

WorkTools Over 9000 users (2000 active) at the end of 2003 WorkTools - The “organic” single-server approach - if you build it (and give away free acounts), they will come…

CompreHensive CollaborativE Framework (CHEF) Fall 2001: CHEF Development begins –Generalized extensible framework for building collaboratories Funded internally at UM All JAVA - Open Source –Jakarta Jetspeed Portal –Jakarta Tomcat Servlet Container –Jakarta Turbine Service Container Build community of developers through workshops and outreach

CHEF Applications CourseTools Next Generation WorkTools Next Generation NEESGrid NSF National Middleware Grid Portal

NEESGrid - The Equipment Network for Earthquake Engineering Simulation NSF Funded. NCSA, ANL, USC/ISI, UM, USC, Berkeley, MSU

CHEF-Based NEESGrid Software

Overall Data Modeling Efforts NEES Site ASite CSite B Equipment People Experiments Trials EquipmentPeople ExperimentsTrials Data Tsnumai Specimen Shake Table Specimen Geotech Specimen Centrifuge Specimen UnitsSensors Descriptions Site Specifications Database Project Description Domain Specific models Common Elements Data / Observations

DT Main System PTZ/ USB Still Capture DT Client BT848 Video Frames DT Client Capturing Video and Data Camera Control Gateway DAQ Data Capture DT Client Simulation Coordinator Site A Site B

DT Main System Data Monitoring Tools Still Image / Camera Control ~ <> ^ ^ <> Camera Control Gateway Creare viewers Still image camera control Thumb- nail

What’s in a name? Sakai is named after Hiroyuki Sakai of the Food Channel Television program “Iron Chef”. Hiroyuki is renowned for his fusion of French and Japanese cuisine.

Sakai General Collaborative Tools Announcements Assignments Chat Room Threaded Discussion Drop Box Archive Message Of The Day News/RSS Preferences Resources Schedule Web Content Worksite Setup WebDAV

Requirements Overlap Physics Research Collaboration Earthquake Research Collaboration Teaching and Learning Grid Computing Visualization Data Repository Large Data Libraries Quizzes Grading Tools Syllabus SCORM Chat Discussion Resources

Sakai: Product Placement Collaboration and eResearch Teaching and Learning

Additional General Collaboration Tools Under Development Wiki based on Radeox Blog Shared Display Shared Whiteboard Multicast Audio Multicast Video These are works-in-progress by members of the Sakai eResearch community. There are no dates for release.

NMI / OGCE NSF National Middleware Initiative Indiana, UTexas, ANL, UM, NCSA

Chalk Talk:School of Portals (2004) OGCE 1.1 XCAT NEES 3.0 GridPort NEES 1.1 GridPort 3 Sakai uPortal CHEF OGCE 1.2 ? OGCE 2 Jetspeed Alliance GridPort 2 CompetitionCollaboration Convergence GridSphere

Chalk Talk:School of eScience Portals (2006) OGCE 1.1 XCAT GridPort NEES 1.1 GridPort 3 Sakai uPortal CHEF OGCE 2 Jetspeed Alliance GridPort 2 CompetitionCollaboration Convergence GridSphere SciGate ? SciDoc

Atlas Portal GatewayDesktop Gateway Applications and Users ITERCMS Gateway Technologies Services and Components Resources SRB Petascale Compute ClarensIdentity Security OpalMetaData Petascale Data SciGate Production Integration and Administration Sakai Globus BlueGeneORNL… Management Components Control ExperimentSimulation KnowledgeStore … Process … … Configure: Atlas Portal Experiment Process Control Knowledge Store Sakai SRB Opal Clarens Metadata Configure: ITER Portal Experiment Process Control Knowledge Store Sakai SRB Opal Clarens Metadata

The Ecology of Collaborative eScience

Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Scope of Collaborative E-Science “..composing and orchestrating many technologies…” “..interoperability is key…” Identity ACL

User Interface for Collaborative E- Science Portals are an excellent technology for building a federated user interface across these disparate components using standards like JSR-168. Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Identity ACL

Portals may only be an intermediate step in the process.. Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Identity ACL Desktop Applications

Focus of Sakai Activity in eScience Sakai is focused primarily on integration with portals and working closely with data repositories. Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Identity ACL Discuss First

Collaboration.vs. Portal Basic organization is about the thing it represents - Teragrid, NVO Site customization is based on the resource owners Sometimes there is an individual customization aspect Many small rectangles to provide a great deal of information on a single screen Portals think of rectangles operating independently - like windows Think “Dashboard” Basic organization is about the shape of the people and groups Customization based on the “group leaders” New groups form quickly and organically Doing one thing at a time - chat, upload - perhaps multiple active windows on a desktop Very interactive Think of navigation as picking a tool or switching from one class to another Think “Application”

Sakai Portlet Version 0.2 Tree View Gallery View Proxy portlets Source in SVN Configurable via properties file Announcements (sakai.announcements) Assignments (sakai.assignment) Chat Room (sakai.chat) Discussion (sakai.discussion) Gradebook (sakai.gradebook.tool) Archive (sakai.mailbox) Membership (sakai.membership) Message Forums (sakai.messageforums) Preferences Tool (sakai.preferences) Presentation (sakai.presentation) Profile (sakai.profile) Resources (sakai.resources) Wiki (sakai.rwiki) Tests & Quizzes (sakai.samigo) Roster (sakai.site.roster) Schedule (sakai.schedule) Site Info (sakai.siteinfo) Syllabus (sakai.syllabus)

Sakai JSR-168 Portlet Web Services are used to login to Sakai establish a session and retrieve a list of Sakai Sites, Pages, and Tools The portlet is 100% stock JSR-168 –Works in Pluto, uPortal, and GridSphere

Three Variations Display the Sakai gallery - all of Sakai except for the login and branding. Retrieve the hierarchy of sites, pages and tools display in a tree view with the portlet and show selected tools/pages in an iframe within the portlet Proxy tool placement for a particular Sakai tool such as sakai.preferences

Sakai Gallery View

How Gallery Works uPortal, Pluto, or GridSphere Sakai Web Svcs Charon Portal Sakai Portlet Login /portal/gallery

Sakai Tree View

How Tree View Works uPortal, Pluto, or GridSphere Sakai Web Svcs Charon Portal Sakai Portlet Login ToolList /portal/page/FF96

Sakai Proxy Tool

Proxy Tool Selection

How Proxy Portlet Works uPortal, Pluto, or GridSphere Sakai Web Svcs Charon Portal Sakai Portlet Login SiteList /portal/page/FF96 1 2

SakaiSite.getToolsDom My Workspace ~csev af54f077-42d e3-59c158af2a9a Home b7b19ad f0-3a964cd20f77 Message of the Day sakai.motd b6b-e74e-40eb-80cb c My Workspace Information sakai.iframe.myworkspace New WS method is upwards compatible with getSitesDom

Sakai Repository Integration Approach

Focus of Sakai Activity in eScience Sakai is focused primarily on integration with portals and working closely with data repositories. Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Identity ACL D i s c u s s N o w

Collaboration.vs. Repository Many different systems may be active at the same time Systems evolve, improve, and are often replaced every few years Systems focused on the dynamic needs of users and applications Thousands of simultaneous online users Performance tuning Must be very easy to use; almost unnoticeable Used informally hundreds of times per day per user Think “ ” Generally one system for the area Long term strategic choice for institution System focused on accessing, indexing, curation, and storage Millions of high quality objects properly indexed Data and metadata quality Must enforce standards and workflow to insure data quality Most use is very purposeful: search, publish, add value Think “Library”

Inbound Object Flow Ingest Create and use in native form Prepare for storage Data Model Store Curate, convert, update and maintain over time IndexLens Search View Reuse DRSakai The DR establishes a data model for “site” objects. The CLE hands sites to the DR. The DR may have to do “model” or content cleanup before completing the ingest process. The lens or disseminator understands the data model and is capable of rendering the objects. The lens is part of the DR. Preparation for storage may include cleanup, conversion, copyright clearance, and other workflow steps.

Outbound Object Flow Data Model IndexLens Search View Reuse DR Sakai Sakai can find and re-use objects in the repository. Data Model Lens ViewSearch Reuse

Sakai and Repositories Going Forward Instead of solving the problem by creating a single DR technology that is a superset - which might take years Focus on data portability between systems - reduce the impedance mismatch (or needed conversion between systems) RDF enables object portability across systems, languages, and technologies

Sakai Repository Approach Move Sakai and other Collaboration systems toward RDF –Experiment with using RDF as native storage format –High Performance RDF - Fedora testing - 180M tuples - complex queries - 70ms Move data repositories toward RDF –Move from schema-based stovepipe objects to OWL/RDF based objects with referential integrity –Explore dimensions of portability of disseminator / lenses - this is an important research area Get started immediately….

Fedora Images

Some Reflections

Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Where is the Middleware? “..composing and orchestrating many technologies…” “..interoperability is key…” Identity ACL

Middleware Collaborative Tools Shared ComputeData Sources Data RepositoryPortal Technology Knowledge Tools Identity ACL Is Middleware The Universal Connector?

Collaborative Tools Shared ComputeData Sources Data RepositoryPortal Technology Knowledge Tools Identity ACL The Universal Connectors tcp/iphttp/https web services

Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Is Middleware “inside” each application? Identity ACL

Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Middleware is simply another component - used as needed Middleware Identity ACL

Collaborative Tools Shared Compute Data Sources Data Repository Portal Technology Knowledge Tools Identity and Access Control: A very important function of Middleware Middleware Identity ACL Lets Talk about This

Chalk Talk:Identity and Access Control CAS Shibboleth Kerberos Globus CompetitionCollaboration Convergence LDAP PubCookie K.X509 MyProxy ?? GridShib Cosign ??? Identity ACL

Identity and ACL: Goal State One server - one software distribution Virtual Organization Software Supports all protocols –Globus Certificate Authority –Shibboleth –LDAP –MyProxy –Kerberos Who will do this? Who will fund this? Who can get these competitors to cooperate?

AUTHN/AUTHZ Meetings

My eScience Fantasy

The pre-requisites My net worth is $5B (I give myself grants) I encounter some tech-savvy scientists in a field who are using technology to do world-class research… They have never been visited by any other computer scientist… They are working in groups of 1-30 geographically distributed around the world They all work on a beach with Internet2 connections and wide-open wireless and favourable exchange rates

A B D E Vol 4 Vol 3 Vol 2 Vol 1 F C Compute Data Models Tutorials Experiments Remote Observation eDocuments

Step 1: Visit The Scientists Understand what they are doing and how they are doing it? Ask them how they would like to improve it. Show each application to other scientists. Ask the other scientists how they would improve it. Help each group improve their work - help them using whatever technology they are currently using

Step 2: Add some technology Install the super-multi-protocol Virtual Organization software and provide a NOC for the VO software - identity and simple attributes Install Sakai - point it at the VO software for identity add icon at the top of Sakai Give each scientist an account in the VO Give each effort in the field a site within Sakai

Heart Study Collaboratory Login My WorkspaceABCDEOpen Forum Home Chat Resources Tutorials Site B Mail List Live Meetings

Step 2: Use the VO For those who want to protect their information, help them add SSO to their sites, backed by the VO service Since it is multi-protocol - likely there will be no modification of the underlying science code - only a server configuration change Identity ACL

A B D E Vol 4 Vol 3 Vol 2 Vol 1 F C Compute Data Models Tutorials Experiments Remote Observation eDocuments Identity ACL Heart Study Collaboratory Login My WorkspaceABCDEOpen Forum Home Chat Resources Tutorials Site B Mail List Live Meetings

Step 4: Unique Identifier Service Come up with a way for any member of the VO to “get” a unique identifier Demand some information (build a little data model) –Person’s name and organization (implicit from request) –What kind of thing this will represent (experiment, document, image series) –Simple description –Keyword/value extensions Build an simple way request and retrieve these through a simple web service - capture implicit metadata from request (when, IP address, etc). Make sure it works from perl! Encourage community to start marking “stuff” with these identifiers in their stovepipes

Step 5: Data Models Begin to work with subsets of the field to try to find common data models across stovepipes Start simple - use very simple RDF - human readable Broaden / deepen model slowly - explore variations Define simple file-system pattern for storing metadata associated with a file and/or a directory

Step 6: A Backup-Style Repo Build a data repository which will function as a backup Basic idea - each time you get identifier - this enables backup space - any data and/or metadata can be uploaded under that particular identifier and left in the repository Make the repo multi-protocol, FTP, DAV, Web-Service with attachments, GridFTP, etc. Make it so there can be a network of cooperating repositories

A B D E Vol 4 Vol 3 Vol 2 Vol 1 F C Compute Data Models Tutorials Experiments Remote Observation eDocuments Identity ACL Heart Study Collaboratory Login My WorkspaceABCDEOpen Forum Home Chat Resources Tutorials Site B Mail List Live Meetings GUID Service Central Repo Local Repo Local Repo

Year 4 and on… Once the basic stovepipes have been “brought in from the cold” and made part of a community with no harm, the next steps are to begin to work “cross- stovepipe” –Evolve data models to be far richer with many variants –Build value added tools that are aware of the data models and are usable across stovepipes Teach the community to build and share tools - gently encourage development standards - Java / JSR-168 perhaps Most important: Always listen to the users

Science at the center of eScience Connect Enhance Data Models Data Storage New Tools New Approaches Priority Science Scientists … start at the center and work outwards… … apply technology when the users will see it as a “win” … Communicate New Technologies Repositories

Conclusion Many years ago, eScience had science as its main focus Custom approaches resulted in too many unique solutions Computer scientists began a search for the “magic bullet” - each group found a different magic bullet Each group now competes for mind share (and funding) to be the “one true” magic bullet

Conclusion (cont) One way to solve the “many competing technologies” solution is to form “super groups” which unify the technologies No single technology gets to claim “they are the one” (Middleware is not “in the middle”) Each technology needs to become a drop-in service/component which is available for use only when appropriate Once we can get past looking at the technologies as the main focus, we get back to science as the main focus

Lets remember why we started this whole field in the first place… Scientific Domain Groups of People Common User Interface Data Sharing –In the moment –Long-term Experimental Equipment Compute Visualization To download “Chuck’s Talks”