Distributed Data Management Architecture for Embedded Computing

Slides:



Advertisements
Similar presentations
National University Community Research Institute (NUCRI) NU Community Research Institute (NUCRI) HASTAC (higher education)/HASS grid National School Board.
Advertisements

San Diego Supercomputer Center & National Partnership for Advance Computational Infrastructure Storage Resource Broker Reagan W. Moore San Diego Supercomputer.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Open problems How can I acquire desired RT data? How can I discover/access desired RT data? How can I share/integrate the data? Can I integrate dynamically?
High Performance Wireless Research and Education Network
The Storage Resource Broker and.
Peter Berrisford RAL – Data Management Group SRB Services.
An interdisciplinary collaboration University of California, San Diego
Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids Reagan W. Moore San Diego Supercomputer Center.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids, Digital Libraries and Persistent Archives Reagan.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
San Diego Supercomputer CenterNational Partnership for Advanced Computational Infrastructure1 Grid Based Solutions for Distributed Data Management Reagan.
Complex Systems Engineering for the Global information Grid Bob Marcus
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Sensor Networks: Next Generation Problems Frank Vernon Scripps Institution of Oceanography University of California at San Diego SAMSI Sensor Network Workshop.
Applying Data Grids to Support Distributed Data Management Storage Resource Broker Reagan W. Moore Ian Fisk Bing Zhu University of California, San Diego.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
Modern Data Management Overview Storage Resource Broker Reagan W. Moore
ABSTRACT Real-time systems applied to seismic data acquisition, asynchronous processing, and data archiving tasks have clearly demonstrated their utility.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
National Partnership for Advanced Computational Infrastructure Digital Library Architecture Reagan Moore Chaitan Baru Amarnath Gupta George Kremenek Bertram.
Jan Storage Resource Broker Managing Distributed Data in a Grid A discussion of a paper published by a group of researchers at the San Diego Supercomputer.
Rule-Based Data Management Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar {moore, schroede, mwan, {moore, schroede, mwan,
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Grid Services/SRB/SRM & Practical Hai-Ning Wu Academia Sinica Grid Computing.
Chinese activity on inter-Grid test between SRB and SIG Guoqing Li, RSGS/NRSCC & Carol Song, RCAC/Purdue University Presented at WGISS-23, Hanoi 23 May.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
Crystal25 Hunter Valley, Australia, 11 April 2007 Crystal25 Hunter Valley, Australia, 11 April 2007 JAINIS (JCU and Indiana Instrument Services): A Grid.
Distributed Data Management Architecture for Embedded Computing Hans-Werner Braun *,Todd Hansen *, Kent Lindquist, Bertram Ludäscher *, John Orcutt †,
Rule-Based Programming for VORBs Bertram Ludaescher Arcot Rajasekar Data and Knowledge Systems San Diego Supercomputer Center U.C. San Diego.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure SRB + Web Services = Datagrid Management System (DGMS) Arcot.
Rule-Based Preservation Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar Richard Marciano {moore, schroede, mwan, sekar,
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Archive for the NSDL Reagan W. Moore Charlie Cowart.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
SRB 1 & iRODS 2 Arcot Rajasekar Reagan Moore Mike Wan SDSC/UCSD Pathways to OOI-CI CyberData Architecture 1 Storage Resource Broker 2 integrated Rule Oriented.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
SAN DIEGO SUPERCOMPUTER CENTER By: Roman Olschanowsky An Introduction to the.
Michael Doherty RAL UK e-Science AHM 2-4 September 2003 SRB in Action.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
1 HPEC'02 Distributed Data Management Architecture for Embedded Computing The Problem: –Integrated real-time management of large, distributed, heterogeneous.
Biomedical Informatics Research Network The Storage Resource Broker & Integration with NMI Middleware Arcot Rajasekar, BIRN-CC SDSC October 9th 2002 BIRN.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Interlib Technology Integration Reagan.
SDSC Storage Resource Broker & Meta-data Catalog SRB Archives HPSS, ADSM, UniTree, DMF Databases DB2, Oracle, Sybase File Systems Unix, NT, Mac OSX Application.
RBNB DataTurbine (Ring Buffered Network Bus ) Released under Apache 2.0 Open Source License Solution for accessing both streaming and static data, from.
National Archives and Records Administration1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Using iRODS with the EnginFrame Grid Portal into the GRIDA3 project Francesco Locunto Marco Piras Matteo Vocale.
Clouds , Grids and Clusters
Collection Based Persistent Archives
Problem: Ecological data needed to address critical questions are dispersed, heterogeneous, and complex Solution: An internet-based mechanism to discover,
GRID COMPUTING PRESENTED BY : Richa Chaudhary.
Video and Sensor Network Architecture and Displays
Arcot Rajasekar Michael Wan Reagan Moore (sekar, mwan,
Interlib Technology Integration
Growing importance of metadata for synthetics: Calculating and Sharing Synthetic Seismic Data Dogan Seber University of California, San Diego San Diego.
Distributed Data Management Architecture for Embedded Computing
DL4-Met Hydroclimate Data Logger
VORB Virtual Object Ring Buffers
What is a Grid? Grid - describes many different models
Presentation transcript:

Distributed Data Management Architecture for Embedded Computing Hans-Werner Braun*,Todd Hansen*, Kent Lindquist, Bertram Ludäscher*, John Orcutt†, Arcot Rajasekar*, Frank Vernon† *San Diego Supercomputer Center , †Scripps Institution of Oceanography, University of California, San Diego Abstract. Real-time data management is faced with the problems of disseminating large collections of data to users and applications, providing a collaborative environment for analyzing and performing analysis-intensive computing, and at a basic level with problems of providing access, managing, curating, storing and moving large quantities of real-time data. The evolution of the data grid technologies provides solutions to these problems by developing middleware that can seamlessly integrate multiple data resources and provide a uniform method of accessing data across a virtual network space. The “Grid” is a term used for defining a variety of notions linking multiple computational resources such as people, computers and data [Foster & Kesselman 1999]. The term “Data Grid” has come to denote a network of storage resources, from archival systems, to caches, to databases, that are linked in the physical world across a distributed network using common interfaces. Examples of implementations include the physics community [PPDG 1999, Hoschek 2001, GriPhyN 2000, NVO 2001], biomedical applications (BIRN 2001), ecological sciences [KNB 1999] and earthquake and multi-sensor systems [NEES 2000], etc. In this paper we describe a real-time data grid architecture, called the VORB, that we are developing as part of an NSF-funded project called ROADNet [ROADNet 2001]. Because of rapidly evolving capabilities for observing the Earth, especially with network of sensors providing continuous real-time data, data volumes will reach petabytes in scales. Moreover, with emerging technological capabilities it will be possible to deploy multiplicity of sensor networks that can generate data spanning many disciplines. Autonomous field sensors Seismic, oceanic, climate, ecological, …, video, audio, … Real-time Data Acquisition: ANZA Seismic Network (1981-present): 13 Broadband Stations, 3 Borehole Strong Motion Arrays, 5 Infrasound Stations, 1 Bridge Monitoring System; Kyrgyz Seismic Network (1991-present): 10 Broadband Stations; IRIS PASSCAL Transportable Array (1997-Present): 15 - 60 Broadband and Short Period Stations; IDA Global Seismic Network (~1990 -Present): 38 Broadband Stations The Problem: Integrated real-time management of large, distributed, heterogeneous data streams from sensor networks and embedded systems … e.g. integration: handling of heterogeneous data from multiple (signal) domains, interfacing with sensors and data loggers to acquire streams real-time processing: monitor streams of seismic data, detect events, trigger computations and other actions distribution: federate multi-domain sensor and embedded system networks for transparent access persistent large-scale archival and querying of “consolidated” data packets High Performance Wireless Research Network (HPWREN) High performance backbone network: 45Mbps duplex point-to-point links, backbone nodes at quality locations, network performance monitors at backbone sites; High speed access links: hard to reach areas, typically 45Mbps or 802.11radios, point-to-point or point-to-multipoint

Distributed Data Management Architecture for Embedded Computing Hans-Werner Braun*,Todd Hansen*, Kent Lindquist, Bertram Ludäscher*, John Orcutt†, Arcot Rajasekar*, Frank Vernon† *San Diego Supercomputer Center , †Scripps Institution of Oceanography, University of California, San Diego . Datalogger type Usage Video cameras for visual analysis (wild-life migration, river monitoring, tide monitors) Campbell data logger weather,strain monitor, general purpose sensor interface and logging Handar weather logger Solinst Stream level conductivity,temp , etc ADCP (Acoustic Doppler Current Profiler) Measure ocean currents   Ashtech GPS RTCM corrections for Differential GPS CODAR Surface Current Profile Radar Reftek, Quanterra, Kinemetrics Broadband seismic and strong motion sensors Sensor data integration Sensor Configurations: with & without data loggers Antelope / ORB Real-time System VORB = SRB + ORB + AR apply Storage Resource Broker (SRB) Data Grid Technology and Adaptive Rule (AR) based programming to federate real-time ORB systems Data Grid Technology (SRB) collaborative access to distributed heterogeneous data, single sign-on authentication and seamless authorization,data scaling to Petabytes and 100s of millions of files, data replication, etc. Current Event Detection Logic: ORB-detect, ORB-trigger, … specific to seismic event detection SRB Archives HPSS, ADSM, UniTree, DMF Databases DB2, Oracle, Sybase File Systems Unix, NT, Mac OSX Application C, C++, Linux I/O Unix Shell Dublin Core Resource, User Defined Meta-data Remote Proxies DataCutter Third-party copy Java, NT Browsers Web Prolog Predicate MCAT HRM VORB Cat SrcORB Client/App. VORB Rule Engine Event-Condition- Action Rules SRC Events USR Events Condition Evaluation Archive ORB VORB Active Rule System: Programmable rule-based active database logic flexible and seamlessly extensible to other domains References Foster, I., and Kesselman, C., (1999) “The Grid: Blueprint for a New Computing Infrastructure,” Morgan Kaufmann. PPDG, (1999) “The Particle Physics Data Grid”, (http://www.ppdg.net/). Hoschek, W., Jaen-Martinez, J., Samar, A., Stockinger, H., and Stockinger, K. (2000) “Data Management in an International Data Grid Project,” IEEE/ACM International Workshop on Grid Computing Grid'2000, Bangalore, India 17-20 December 2000. GriPhyN, (2000) “The Grid Physics Network”, (http://www.griphyn.org/proj-desc1.0.html ). NEES, (2000) “Network for Earthquake Engineering Simulation”, (http://www.eng.nsf.gov/nees/ ). KNB, (1999) “The Knowledge Network for Biocomplexity”, (http://knb.ecoinformatics.org/ ) NVO, (2001) “National Virtual Observatory”, (http://www.srl.caltech.edu/nvo/ ). RoadNet, (2001) “Wireless Access and Real-time Data Management”, (http://roadnet.ucsd.edu/ ). Harvey, D., D. Quinlan, F. Vernon, and R. Hansen (1998), “ORB: A New Real-Time Data Exchange and Seismic Processing System”, Seis. Res. Lett. 69, 165. Harvey, D., F. Vernon, G. Pavlis, R. Hansen, K. Lindquist, D. Quinlan, and M. Harkins(1998), “Real-Time Integration of Seismic Data from the IRIS PASSCAL Braodband Array, Regional Seismic Networks, and the Global Seismic Network,” EOS Trans. AGU 79, F567. Processing System”, Seis. Res. Lett. 69, 165. A-Amri, M.S., and A.M. Al-Amri. (1999) “Configuration of the Siesmographic Networks in Saudi Arabia,” Seis. Res. Lett. 70, 322-331. Von Seggern, D., G. Biasi, and K. Smith, (2000), Network Operations Transitions to Antelope at the Nevada Seismological Laboratory,” Seis. Res. Lett. 71, 444. Vernon F., and T. Wallace (1999), “Virtual Seismic Networks,” IRIS Newsletter, 18, 7-8. Vernon F., (2000) “Overview of the USARRAY Real-Time Systems”, EOS Trans AGU 81, S318. Lausen, G., B. Ludaescher, and W. May. (1998) “On Active Deductive Databases: The Statelog Approach,” In Transactions and Change in Logic Databases, H. Decker, B Freitag, M Kifer, A Voronkov, eds. LNCS 1472, 1998. Ludaescher, B., P. Mukhopadhyay, Y. Papakonstantinou. (2002) “A Transducer-Based XML Query Processor”, Intl. Conference on Very Large Databases (VLDB), 2002. Virtual Object Ring Buffer (VORB) Architecture Overview VORB online data