CLAs Reconstruction and Analysis Physics Data Processing with SOA based Framework Vardan Gyurjyan on behalf of Clas12 software group.

Slides:



Advertisements
Similar presentations
웹 서비스 개요.
Advertisements

MicroKernel Pattern Presented by Sahibzada Sami ud din Kashif Khurshid.
Ch:8 Design Concepts S.W Design should have following quality attribute: Functionality Usability Reliability Performance Supportability (extensibility,
COM vs. CORBA.
Chapter 13 Review Questions
Programming Paradigms and languages
IT INFRASTRUCTURE AND EMERGING TECHNOLOGIES
The road to reliable, autonomous distributed systems
I.1 Distributed Systems Prof. Dr. Alexander Schill Dresden Technical University Computer Networks Dept.
Introduction To System Analysis and Design
CSC-8530: Distributed Systems Christopher Salembier 28-Oct-2009.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
Phones OFF Please Operating System Introduction Parminder Singh Kang Home:
CS 550 Amoeba-A Distributed Operation System by Saie M Mulay.
Ch 12 Distributed Systems Architectures
Software Issues Derived from Dr. Fawcett’s Slides Phil Pratt-Szeliga Fall 2009.
Chapter 3 Software Two major types of software
TECHWARZ. (Multiplexed Information and Computing Service)  Multics was an extremely influential early time-sharing operating system.  Goal: Develop.
Platform as a Service (PaaS)
Introduction to Databases Transparencies 1. ©Pearson Education 2009 Objectives Common uses of database systems. Meaning of the term database. Meaning.
Software Development Concepts ITEC Software Development Software Development refers to all that is involved between the conception of the desired.
SOA, BPM, BPEL, jBPM.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Computer System Architectures Computer System Software
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
German National Research Center for Information Technology Research Institute for Computer Architecture and Software Technology German National Research.
Thomas Jefferson National Accelerator Facility Page 1 Clas12 Reconstruction and Analysis Framework V. Gyurjyan S. Mancilla.
Quality Attributes of Web Software Applications – Jeff Offutt By Julia Erdman SE 510 October 8, 2003.
CSC271 Database Systems Lecture # 4.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
COM vs. CORBA Computer Science at Azusa Pacific University September 19, 2015 Azusa Pacific University, Azusa, CA 91702, Tel: (800) Department.
CSE 303 – Software Design and Architecture
DISTRIBUTED COMPUTING
1b.1 Types of Parallel Computers Two principal approaches: Shared memory multiprocessor Distributed memory multicomputer ITCS 4/5145 Parallel Programming,
CS 390- Unix Programming Environment CS 390 Unix Programming Environment Topics to be covered: Distributed Computing Fundamentals.
International Telecommunication Union Geneva, 9(pm)-10 February 2009 ITU-T Security Standardization on Mobile Web Services Lee, Jae Seung Special Fellow,
DCE (distributed computing environment) DCE (distributed computing environment)
Introduction To System Analysis and Design
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Thomas Jefferson National Accelerator Facility Page 1 Clas12 Reconstruction and Analysis Framework V. Gyurjyan S. Mancilla.
Service Oriented Architecture CCT355H5 Professor Michael Jones Suezan Makkar.
Chapter 5 McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
Computer Science in Context Evangelos E. Milios Professor and Graduate Coordinator Faculty of Computer Science Dalhousie University.
© 2013, published by Flat World Knowledge Chapter 10 Understanding Software: A Primer for Managers 10-1.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
© FPT SOFTWARE – TRAINING MATERIAL – Internal use 04e-BM/NS/HDCV/FSOFT v2/3 JSP Application Models.
V Gyurjyan, D Abbott, J Carbonneau, G Gilfoyle, D Heddle, G Heyes, S Paul, C Timmer, D Weygand V. Gyurjyan JLAB data acquisition and analysis group.
The JANA Reconstruction Framework David Lawrence - JLab May 25, /25/101JANA - Lawrence - CLAS12 Software Workshop.
Lecture VIII: Software Architecture
Application of Design Patterns to Geometric Decompositions V. Balaji, Thomas L. Clune, Robert W. Numrich and Brice T. Womack.
Introduction to Visual Basic. NET,. NET Framework and Visual Studio
Enterprise Architectures
Clouds , Grids and Clusters
SOA (Service Oriented Architecture)
Distribution and components
Design and Maintenance of Web Applications in J2EE
Constructing a system with multiple computers or processors
Software models - Software Architecture Design Patterns
Multithreaded Programming
Outline Chapter 2 (cont) OS Design OS structure
Clas Reconstruction and Analyses framework
Physics data processing with SOA
Introduction to SOA and Web Services
CLAs Reconstruction and Analysis
Presentation transcript:

CLAs Reconstruction and Analysis Physics Data Processing with SOA based Framework Vardan Gyurjyan on behalf of Clas12 software group

Outline  Problem statement  SOA based framework as a solution  Current status of the ClaRA project  Future plans  Conclusion October 5, 2015V. Gyurjyan

Computing power CMOS Technology Single Chip Integration Node/Rack Integration Network Integration October 5, 2015V. Gyurjyan

Integration Chip 4 Cores Computer Card 1Chip 13.6GF/s Mode Card 32 Computer Cards 435 GF/s Rack 32 Mode Cards 13.9 TF/s IBM Blue Gene 72 Racks 1PF/s October 5, 2015V. Gyurjyan

Network Evolution User Traffic 2x / 12months Router Capacity 2.2x / 18months Moore’s Law 2x / 18 months Network Capacity 2x / 7 months CAT4 10Mbps 10base-T CAT5 100Mbps 100base-T CAT5e 1Gbps 1000base-T 2003 – CAT6 10Gbps 2007 – CAT7 100Gbps October 5, 2015V. Gyurjyan

High Performance Computing Trends 1. Exponential growth in processor performance (coming to an end) 2. Power cost = System cost: invention required 3. Growth in level of parallelism (near term solution) October 5, 2015V. Gyurjyan

IBM Approach – Path to Petascale  Multiple modest cores on a single chip rather than one high- performance processor  Watts/FLOP will not improve much from future technologies.  Linux environment and MPI (standard messaging interface) October 5, 2015V. Gyurjyan

"The Network is the Computer." John Gage October 5, 2015V. Gyurjyan

Specifics of the Offline Software  Lifetime of the software >= lifetime of the experiment.  Collaborative nature of the development.  Coexistence of parallel running applications for the single experiment.  Unprecedented scale and complexity of the physics computing environment  Physics computing environment must keep up with fast growing computing technologies  Large worldwide user base. October 5, 2015V. Gyurjyan

PDP (Physics Data Processing) Application Conventional vs. parallel/distributed October 5, 2015V. Gyurjyan

Running Conventional Software Application Copy checkout Give up Configure Compile Fix errors Run Modified? Complain yes no yesno ok October 5, 2015V. Gyurjyan

Programming Errors  Compile time  Program does not compile.  Compiler reports a “best guess” of the problem  Undeclared variables or functions  Missing semicolon or brace  Typos  Missing files or libraries  Type ambiguities  Run time  Executable crashes or has unexpected behavior  May not appear for all conditions or all data sets  Uninitialized variables  Memory errors  Numeric errors  Type errors in print statements  Closing a NULL file pointer  Accessing a NULL pointer  Variables out of scope October 5, 2015V. Gyurjyan

Challenges of the Conventional Approach  Difficult to organize and coordinate activities  Difficult to maintain  Inevitable fragmentation of the software  Poor scalability  Computing skills are required to use physics data processing applications October 5, 2015V. Gyurjyan

ABC B A A B C CLAS 6 CLAS 12 A+B << C C: requires a few or no programming skills October 5, 2015V. Gyurjyan

One way to eat an elephant A bite at a time October 5, 2015V. Gyurjyan

Where we start?  Each bite is a clear, simple, single purpose application, developed by group B member.  Group A, with a tight collaboration with group B and C shall control and manage the process, never loosing maniacal focus on a big picture (elephant). Define a piece of a big problem Understand the problem Distill the problem to its essence solutionTest October 5, 2015V. Gyurjyan

“Things should be made as simple as possible, but not simpler.” Albert Einstein October 5, 2015V. Gyurjyan

Language and Architecture Evolution Structured and Procedural programming Object Oriented programming Assembly Language Service Oriented programming October 5, 2015V. Gyurjyan

SOA  SOA promotes the goal of separating service users from the service implementation.  Style of building reliable systems that deliver functionality as services  Loose coupling between interacting services  Directories and addressing mechanisms are at the center of SOA. Program Arbitrary format Service Standard format Complex Specialized, simple October 5, 2015V. Gyurjyan

Attributes of Services  Well defined, easy-to-use, somewhat standardized interface  Self-contained with no visible dependencies to other services  (almost) Always available but idle until requests come  Location transparency  Easily accessible and usable readily, no “integration” required  New services can be offered by combining existing services  Quantifiable quality of service October 5, 2015V. Gyurjyan

Service Interface  Standard message based  Highly Polymorphic  Intent is enough  Implementation can be changed in ways that do not break all the service consumers October 5, 2015V. Gyurjyan

Service Orientation is scalable  End users can consume and combine a lot of services since they don’t have to know or “learn” how the services are made.  Service providers (A+B) can offer their services to a lot more consumers by optimizing  The user interface  Access  Implementations October 5, 2015V. Gyurjyan

“On Demand” Physics Data Processing  Use software as you need  Much lower setup time, forget about  Installation  Implementation  Training  Maintenance  Scalable and effective usage of resources  Parallelism (CPU, Storage, Bandwidth…) October 5, 2015V. Gyurjyan

What is ClaRA?  Framework that Implements SOA.  Service development environment.  Toolbox of generic physics data processing services.  Network distributed platform.  The “Glue”, binding together services into an algorithmic data analysis application. October 5, 2015V. Gyurjyan

Design criteria  Framework service shall be simple to use and easy to learn.  Framework service should be customizable to be able to adapt to the different data processing tasks.  Framework shall provide context sensitive help and assistance, with many real world physics data processing application examples.  Framework shall provide ready to use services, encapsulating essential functionalities of the physics data processing system.  Services shall be reusable and easily replaceable.  Physics data processing application design and implementation shall require a few or no programming skills.  Neither specific computing environment, nor compiling shall be necessary to build and run physics data processing application.  Framework shall provide graphical environment for physics data processing application development.  Frameworks platform shall be network distributed, and shall have temporal continuity.  The new system shall provide World Wide Web access to the services for remote configuration and execution of the data processing applications. The necessary security considerations must be addressed. October 5, 2015V. Gyurjyan

Data and Algorithm  Framework advocates clear separation between:  a) data and algorithm  b) transient and persistent data  Methods in the data object will be limited to manipulations of the internal data members only.  Algorithm will process one type of data and generate data objects of a different type. Algorithm Data October 5, 2015V. Gyurjyan

Persistent and Transient Data  Physics algorithm objects should not use data objects directly in the persistent storage.  Transient data storage as a means of communication between physics algorithms.  Two different optimization criteria for applications using persistent and transient data.  Being independent from the persistent storage technology. October 5, 2015V. Gyurjyan

Data Object categories Data EventDetectorStatistical October 5, 2015V. Gyurjyan

ClaRA Platform Front-End Container Normative Service Service Container node-1 Service Container node-2 Service Container node-3 Service Container node-N cMsg SCC WWW Web Service 1 Web Service 2 Web Service 3 Web Service N SOAP Users CMSG SOAP October 5, 2015V. Gyurjyan

Current Status Geometry Service Geometry Service Magnetic Field Map Service Magnetic Field Map Service GEMC Service GEMC Service Tracking Service Tracking Service bCNU Service bCNU Service Event Data Service Event Data Service ClaRA cMsg Platform Thin Clients WWW ClaRA WebServices Platform Math Service Stat Service Probability Service Geometry Service Matrices Service October 5, 2015V. Gyurjyan

Examples  EVIO event producer and EVIO event consumer services (C++).  data producer and data consumer services. C examples use cMsg payload (ASCII).  C++ geometry service client example  Java geometry service client example  Web services JSP clients October 5, 2015V. Gyurjyan

Tracking composite application Transient data Space- point maker Coarse track finder Cluster Analyzer Ambiguity solver Track fitter Histogram builder Histogram builder Persistent data ClaRA cMsg Platform Thin Clients October 5, 2015V. Gyurjyan

Tracking application service decomposition DetectorData EvtData StatData TransientEvtData TransientDetData TransientStatData Track candidates Resolved Tracks Space Points Raw Data Final Tracks SpacePointFormation CoarseTrackFinder SeadMaker VertexFinder ClusterAnalyzer AmbiguitySolver TrackFitter TrackScoring Supervisor start retrieve record retrieve record retrieve record retrieve Transient Storage Tracking State machine October 5, 2015V. Gyurjyan

Performance measurements October 5, 2015V. Gyurjyan