Science Studio: Remote Access to Scientific Experiments using a Comprehensive Management Tool.

Slides:



Advertisements
Similar presentations
E. Matias Canadian Light Source CLS Beamlines & ScienceStudio.
Advertisements

C3.ca in Atlantic Canada Virendra Bhavsar Director, Advanced Computational Research Laboratory (ACRL) Faculty of Computer Science University of New Brunswick.
Introduction Main technologies: core written in Java embedded Jython interpreter code managed using the Eclipse plugin framework client program uses Eclipse.
UWO Nanofabrication Facility and Science Studio. Facility to be hooked into Science Studio: Western Nanofabrication Facility, University of Western Ontario.
E. Matias D. Chabot, D. Maxwell, D. Medrano (CLS) C. Armstrong (IBM) M. Fuller, S. McIntryre (University of Western Ontario) Remote Access at the CLS.
Tom Sheridan IT Director Gas Technology Institute (GTI)
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
E. Matias Canadian Light Source June 2009 – Presentation to - NSLS II EFD Division CLS Beamline Controls.
Web 2.0 Elder Matias CLS – What Is Web 2.0?  In plain English …. Automating tedious tasks using web technology Tools to help people and software.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 16 Designing.
User Office Status CANARIE Site Visit July, 2009.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
1.Database plan 2.Information systems plan 3.Technology plan 4.Business strategy plan 5.Enterprise analysis Which of the following serves as a road map.
Canadian Light Source Elder Matias (CLS) EPICS April 09 Collaboration Meeting.
New Task Group CRIS Architecture & Development Maximilian Stempfhuber RWTH Aachen University Library
Computer Concepts 2014 Chapter 7 The Web and .
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Creating Business Workflow Using SharePoint Designer 2007 Presented by Tarek Ghazali IT Technical Specialist Microsoft SQL Server MVP Microsoft SQL Server.
Submitted by: Madeeha Khalid Sana Nisar Ambreen Tabassum.
Chapter 9 Elements of Systems Design
A Scalable Application Architecture for composing News Portals on the Internet Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta Famagusta.
Trimble Connected Community
12-CRS-0106 REVISED 8 FEB 2013 EPICS Collaboration Meeting 2013 CSS An integrated development and runtime environment for ITER plant system local controls.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
Geneva, 30 October 2009 Giuseppe Sindoni, Istat, Italy An online system for multi-channel, register-based census data collection.
Web Application Beamline Control Module Very Sensitive Elemental and Structural Probe Employing Radiation from a Synchrotron (VESPERS) The VESPERS beamline.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
Science Studio and ANISE (Jan 2010 update) Elder Matias Canadian Light Source.
Beate Danielsen, Health Information Solutions Soora Wi, Kaiser Permanente Eileen Walsh, Kaiser Permanente.
Fundamentals of Database Chapter 7 Database Technologies.
Science Studio CANARIE Review July 7, Team: People and Orgs Dionisio Medrano Dylan Maxwell *Elder Matias *Lavina Carter Dong Liu Chris Armstrong.
Lecture 9: Chapter 9 Architectural Design
Managed by UT-Battelle for the Department of Energy 1 Integrated Catalogue (ICAT) Auto Update System Presented by Jessica Feng Research Alliance in Math.
GumTree Feature Overview Tony Lam Data Acquisition Team Bragg Institute eScience Workshop 2006.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Of Remote Beamlines, Micro-diffraction and HP Network Computing VESPERS X ray Beamline Capabilities: Micro-diffraction/fluorescence User Base: Earth and.
© 2007 IBM Corporation SOA on your terms and our expertise Software WebSphere Process Server and Portal Integration Overview.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Original Requirements for Science Studio : (1)Convenient control of all aspects of an X ray fluorescence (XRF) facility: visible sample, easy sample manipulation,
Science Studio Web Access to EPICS The CLS Experience Elder Matias Canadian Light Source.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Slide 1 Service-centric Software Engineering. Slide 2 Objectives To explain the notion of a reusable service, based on web service standards, that provides.
Simplified Experiment Submit Proposal Results Excited Users Do Expt Data Analysis Feedback.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
1 ANISE: Active Network for Information from Synchrotron Experiments “Active” means near-instantaneous stream processing of complex data during transfer.
NSF Middleware Initiative Purpose To design, develop, deploy and support a set of reusable, expandable set of middleware functions and services that benefit.
A computer contains two major sets of tools, software and hardware. Software is generally divided into Systems software and Applications software. Systems.
XRD data analysis software development. Outline  Background  Reasons for change  Conversion challenges  Status 2.
Thomas Gutberlet HZB User Coordination NMI3-II Neutron scattering and Muon spectroscopy Integrated Initiative WP5 Integrated User Access.
Science Studio and ANISE (Jan 2010 update) Elder Matias Canadian Light Source.
V7 Foundation Series Vignette Education Services.
Diamond Light Source Data Acquisition System Paul Gibbons.
LetItFlow Architecture Specification Project Meeting Vienna, – Victor Carmocanu SIVECO Romania.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
12. DISTRIBUTED WEB-BASED SYSTEMS Nov SUSMITHA KOTA KRANTHI KOYA LIANG YI.
Open Source distributed document DB for an enterprise
Remote Access at the CLS
Web Engineering.
CHAPTER 3 Architectures for Distributed Systems
Chapter 16 Designing Distributed and Internet Systems
Patrick Dreher Research Scientist & Associate Director
Presentation transcript:

Science Studio: Remote Access to Scientific Experiments using a Comprehensive Management Tool

2 User Access to Synchrotrons  Who is the community that will use your platform?  Synchrotrons are electron storage rings that emit high intensity photons that are used for experiments by a large scientific community (tens of thousands worldwide).  Access is normally granted for single periods of 1-3 days in a half- year cycle.  What couldn’t your community do without the platform?  Physical distances and episodic access prevent rapid scientific progress and limit scientific collaboration.  Why was that a problem or limitation?  Governments worldwide have invested >$2B in these facilities, yet the scientific outcomes could be optimised.

User Access to Synchrotrons  What middleware was needed to resolve the limitations? Workflow management Engine for the User Office Web Portal for remote data access (during and post experiment) Enterprise Service Bus and SOA to integrate internal and external data analysis services  How do your plans meet the needs Users will have frequent remote access to the VESPERS beamline at the Canadian Light Source under conditions where many collaborators can participate in the experiment. 3

Science Studio serves three purposes: Management of all aspects of a scientific experiment including data storage, collaboration with others, processing of data; Control of, or interaction with, remote experiments on the CLSI VESPERS Beamline and UWO Nanofabrication Laboratory and User Services (sample management, scheduling, peer review, user training) 4

5 Team: People and Orgs  Remote Control  User Services  System Deployment  Integration  System Architecture  System Requirements  Testing  Data Analysis/Grid Computing  User Office Software  Scientific Workflow Engines

6 Team: People and Orgs Dionisio Medrano Dylan Maxwell Daron Chabot Elder Matias Chris Armstrong John Haley Mike Bauer Stewart McIntyre Marina Suominen Fuller Jinhui Qin Nathaniel Sherry Yuhong Yan Zahid Anwar Ludeng (Eric) Zhao Dan Ni Yaofeng Xu

System Architecture Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP 1. VESPERS Beamline 2. EPICS control system 3. Beamline Control Module (BCM) 4. Web Application 5. Database 6. File Storage 7. Web Interface

VESPERS Beamline  VESPERS — Very Sensitive Elemental and Structural Probe Employing Radiation from a Synchrotron  A bending magnet beamline on sector 6 at the Canadian Light Source synchrotron in Saskatoon, Saskatchewan.  A hard x-ray microprobe with an energy range of 6 to 30keV.  Techniques: X-Ray Fluorescence (XRF) & X-Ray Diffraction (XRD) Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP

VESPERS Endstation CCD Detector (XRD) Microscope MCA Detector (XRF)Sample

EPICS Low-level Control System  EPICS — Experimental Physics and Industrial Control System  The standard control system at the CLS.  EPICS consists of a network of Input-Output Controls (IOCs) which are connected to directly to devices.  An IOC provides many Process Variables (PVs) which relate to either an input or output from a device and have a unique name.  Channel Access (CA) is used to read or write to any PV without knowing which IOC provides the PV.  More than 50,000 PVs in the CLS control system. Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP

Beamline Control Module (BCM)  The BCM provides a high-level interface to the low-level control system (EPICS).  Logical and physical separation of business logic and control logic.  Virtual device abstraction that provides independence from low-level control system.  Virtual devices can be logically organized into a device hierarchy.  Basic devices can be combined to build more functional devices.  Communication with external applications using two message queues (ActiveMQ). Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP

Web Application  A J2EE Servlet application that provides a web-based interface Science Studio.  Tools: Spring (MVC), iBATIS (ORM), JSecurity (Apache Ki), Apache Tomcat  Divided into two parts: the Core application and the VESPERS beamline application.  Core application is responsible for providing access to the business objects.  VESPERS application is responsible for remote control of the VESPERS beamline. Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP

Database  Metadata associated with the operation of a remote controlled beamline and the organization of experimental data collected on that beamline.  A project is the top level organizational unit and is associated with a project team.  A session defines a period of time allocated to a project team to conduct experiments.  An experiment relates a sample and the technique being applied to that sample.  A scan records the location of the acquired experimental data. Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP

Database Schema person project_person project_role project session laboratory sampleexperimentscan techniqueinstrument Instrument_technque facility

Experimental Data Storage  Experimental data is stored at the CLS.  Common directory structure shared with other beamlines.  A large data storage facility is now operational at the University of Saskatchewan as part of WestGrid. Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP

VESPERS Web Interface  Rich web interface to Science Studio and the VESPERS beamline.  Designed to be used over commodity broadband internet.  Developed for the Firefox web browser without any additional plugins or extensions.  Known to work with other browsers, but requires the Canvas HTML tag.  AJAX is used for the VESPERS interface to provide device values in pseudo real time.  ExtJS, a JavaScript framework, provides many advanced GUI elements. Web Application Beamline Control Module DB SAN JMSCA VESPERS HTTP

Beamline Setup

Experiment Setup

XRF (X-Ray Fluorescence)

Beamline Hutch Cameras

Experimental Data Viewer

 X-Ray Fluorescence (XRF): Reveals Elemental Composition  Characteristic Element Lines Selected and Mapped Over a 2D Scan Area  S: Kα  Cr: Kα & Cr: Kβ  Fe: Kα & Fe: Kβ  Ni: Kα & Ni: Kβ  2D Maps Generated for Selected Elemental Lines

 X-Ray Diffraction (XRD): Reveals Structural Information  Peak Fitting and Indexing of Image Set to Create a Grain Orientation Map Peak Search  Old IDL Programme – Matched Peak  New C Programme – Matched Peak  New C Programme – Expected Peak  The XRD Indexing programme examines the locations of peaks in an image in order to determine the kind of lattice structure the samples constituent atoms are arranged in. Shown here are the results of an older indexing programme written in IDL, and the new indexing programme, written in C. The new indexing programme is proving to be more versatile, and more reliable than the old programme, often indexing sets of data that the old programme failed with.  Grain Orientations Indexing Process

User Office Workflow Goal: Many tasks in proposal & sample management at CLS To develop a workflow management system that manages ordering of tasks e.g. (training  before  shipping) Tracks manual as well as SS task progression  Mar 6-month cycle CLS call for proposals Proposal submission To CLS CLS gathers proposals CLS reviews proposals CLS grants scientist Beamline time  cientist packs sample I wonder if CLS received my sample yet? Scientist must complete Online SS training CLS health & safety inspection Many other tasks Perform Experiment Return Sample Take Survey …

User office Workflow Status  Workflow Management Engine  Beamline User  User Office  Task :Training  Completed  Notify  Approved  Notify  Record Progress Features Open source Petri-nets based Direct support for workflow control flow patterns Ability to interact with web services declared in WSDL Relies on XML standards e.g. XPath and XQuery for data & doesn’t use proprietary languages Architecture System Core: YAWL engine. Engine instantiates specifications designed using YAWL designer. managed by the YAWL repository Environment composed of YAWL services inspired by “web services” paradigm, end-users, applications, and organizations are all services in YAWL.

Screenshot: User Training Test Creation

Screenshot: User Survey Taking Page

Screenshot: User Survey Edit Page

Screenshot: Workflow Sample Management

Screenshot: Workflow Call for Proposals

User Office Workflow Example  Prototype Implementation 1. CLS issues a call for proposals and gives deadline 2. Beamline users submit proposals 3. User Office administrator ends registration or extends deadline 4. User Office administrator assigns proposals to user office reviewers 5. Reviewers look at proposals and rank them 6. User Office looks at ranking and chooses the proposals to accept 7. Accepted proposals contact persons are notified 8. Beamline User completes training (web service) 9. After training is completed (simulated by a delay) the CLS is notified

Scheduling Module  Goal: To automate the review process and the method by which beam time is allocated and scheduled to users depending on the access mechanism chosen by the user and the stage of operation (construction, commissioning or operation) of the beamline.  Side effects: Facilitate the management of cycles, runs and modes of operation Use automatic scheduling to handle more scheduling conditions and constraints than human beings are able to handle manually and identify optimal solutions.

Scheduling Module Features Users Submit proposals Integer Programming and Heuristic Algorithm Schedule INPUT: SEARCH AND CONSTRAINT SATISFIABILITY: OUTPUT: Beamlines2 Experiments3 Release Times[1,1,2] Deadlines[8,15,5] Weights[4,5,1] Processing Times[10,4,3] Eligibility[[0,1,0],[1,0,1]] CONSTRAINTS 1. One beamline per experiment 2. Start time after release time 3. Only eligible beamlines can be selected. 7. No overlap of experiment per beamline

34 Comparison to Current State of the Art User Access to Synchrotrons Synchrotrons access is normally granted for single periods of 1-3 days in a half-year cycle. But…  Episodic access requires careful pooling of samples  Little or no time for analysis of data during the experimental access.  “Reprise” experiments difficult to schedule  Travel is expensive and inconvenient  User control and records of experiments at the site is sometimes limited. The network would encourage common data formats and protocols leading to closer collaboration

35 Next Steps Access to VESPERS /XRF2.0 available to UWO users by Fall User training module in use at CLS by July 2009 VESPERS/Diffraction 1.0 available by UWO Feb 2010 All user scheduling in place by April 2010 VESPERS available to internet users end of 2010