Implementation of a streaming database management system on a Blue Gene architecture for measurement data processing. Erik Zeitler Uppsala data base lab.

Slides:



Advertisements
Similar presentations
PARMON A Comprehensive Cluster Monitoring System PARMON Team Centre for Development of Advanced Computing, Bangalore, India Contact: Rajkumar Buyya
Advertisements

Clusters, Grids and their applications in Physics David Barnes (Astro) Lyle Winton (EPP)
An Overview of the Computer System
HPCC Mid-Morning Break High Performance Computing on a GPU cluster Dirk Colbry, Ph.D. Research Specialist Institute for Cyber Enabled Discovery.
ASKAP Central Processor: Design and Implementation Calibration and Imaging Workshop 2014 ASTRONOMY AND SPACE SCIENCE Ben Humphreys | ASKAP Software and.
Prepared by:Mohammed Ayesh
What Can You do With BTM? Business Transaction Management touches the following disciplines:  Performance Management  Application Management  Capacity.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. The Power of Data Driven Triggering DAQ Topology.
Only Sky is a limit, So lets the real job start S. Pogrebenko, , &
Basics of Operating Systems March 4, 2001 Adapted from Operating Systems Lecture Notes, Copyright 1997 Martin C. Rinard.
Infiniband enables scalable Real Application Clusters – Update Spring 2008 Sumanta Chatterjee, Oracle Richard Frank, Oracle.
31 January 2007Craig E. Ward1 Large-Scale Simulation Experimentation and Analysis Database Programming Using Java.
LIGO-G E ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech Argonne, May 10, 2004.
Yavor Todorov. Introduction How it works OS level checkpointing Application level checkpointing CPR for parallel programing CPR functionality References.
SensIT PI Meeting, January 15-17, Self-Organizing Sensor Networks: Efficient Distributed Mechanisms Alvin S. Lim Computer Science and Software Engineering.
Version 4.0. Objectives Describe how networks impact our daily lives. Describe the role of data networking in the human network. Identify the key components.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
So, Jung-ki Distributed Computing System LAB School of Computer Science and Engineering Seoul National University Implementation of Package Management.
What is Ubiquity™? ubiquity (noun): the state of being, or seeming to be, everywhere at once Ubiquity™ is a powerful building management system that.
Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.
1 100 SKA stations (2020 ) Projets avec SKA. 2 Telescope Project (~2020) for a giant radiotelescope in the centimetre-metre range one square kilometre.
An RFI Mitigation Strategy for the Allen Telescope Array Geoffrey C. Bower UC Berkeley.
PARMON A Comprehensive Cluster Monitoring System A Single System Image Case Study Developer: PARMON Team Centre for Development of Advanced Computing,
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
COMPTUER CLUSTERING WITH LINUX-ON-CD Robert Ibershoff Computer Electronic Networking.
The Murchison Wide Field Array Murchison, ~300 km from Geraldton.
© 2010 IBM Corporation IBM InfoSphere Streams Enabling a smarter planet Roger Rea InfoSphere Streams Product Manager Sept 15, 2010.
LOFAR project Astroparticle Physics workshop 26 April 2004.
Rensselaer Why not change the world? Rensselaer Why not change the world? 1.
Beowulf Cluster Jon Green Jay Hutchinson Scott Hussey Mentor: Hongchi Shi.
LIGO-G9900XX-00-M ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech.
14 Aug 08DOE Review John Huth ATLAS Computing at Harvard John Huth.
St.Petersburg state university computing centre and the 1st results in the DC for ALICE V.Bychkov, G.Feofilov, Yu.Galyuck, A.Zarochensev, V.I.Zolotarev.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
© ASTRON On the Fly LOFAR Station Correlator André W. Gunst.
1 CMPE 511 HIGH PERFORMANCE COMPUTING CLUSTERS Dilek Demirel İşçi.
6/4/ Key components of the computer Classification of computers based on Purposes Classification of computers based on Signals Classification of.
Systems and Communications1 CERN Openlab – 10/11 December 2013 ESRF: IT Challenges.
TASHKENT UNIVERSITY OF INFORMATION TECHNOLOGIES Lesson №18 Telecommunication software design for analyzing and control packets on the networks by using.
Integrating New Capabilities into NetPIPE Dave Turner, Adam Oline, Xuehua Chen, and Troy Benjegerdes Scalable Computing Laboratory of Ames Laboratory This.
Next Generation Operating Systems Zeljko Susnjar, Cisco CTG June 2015.
LOFAR The Low Frequency Array Shep Doeleman LOFAR Group.
Large Area Surveys - I Large area surveys can answer fundamental questions about the distribution of gas in galaxy clusters, how gas cycles in and out.
7. Grid Computing Systems and Resource Management
Introduction to the new mainframe © Copyright IBM Corp., All rights reserved. 1 Main Frame Computing Objectives Explain why data resides on mainframe.
EURO-VO: GRID and VO Lofar Information System Design OmegaCEN Kapteyn Institute TARGET- Computing Center University Groningen Garching, 10 April 2008 Lofar.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Lofar Information System on GRID A.N.Belikov. Lofar Long Term Archive Prototypes: EGEE Astro-WISE Requirements to data storage Tiers Astro-WISE adaptation.
1 Next Generation Correlators, June 26 th −29 th, 2006 The LOFAR Blue Gene/L Correlator Stichting ASTRON (Netherlands Foundation for Research in Astronomy)
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
What we DO need to make Desktop Grids a Success in Practice Michela Taufer UCSD - TSRI.
Background Computer System Architectures Computer System Software.
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
Ben Stappers (ASTRON) LOFAR 101 Pizza Lunch. LOFAR Technical Concept LOFAR is a very large distributed radio telescope: LOFAR is a very large distributed.
High Altitude Long Operation
Chapter 16 Client/Server Computing Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
BLUE GENE Sunitha M. Jenarius. What is Blue Gene A massively parallel supercomputer using tens of thousands of embedded PowerPC processors supporting.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
1 ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) Netherlands Institute for Radio Astronomy Astronomy at ASTRON George Heald.
The Science Data Processor and Regional Centre Overview Paul Alexander UK Science Director the SKA Organisation Leader the Science Data Processor Consortium.
Grid Optical Burst Switched Networks
Clouds , Grids and Clusters
Appro Xtreme-X Supercomputers
GlassFish in the Real World
Статус ГРИД-кластера ИЯФ СО РАН.
LOFAR Beam Formed (BF) Data & Pipeline Overview September 9, 2010
BlueGene/L Supercomputer
Microsoft Virtual Academy
Performance And Scalability In Oracle9i And SQL Server 2000
Analytics, BI & Data Integration
Presentation transcript:

Implementation of a streaming database management system on a Blue Gene architecture for measurement data processing. Erik Zeitler Uppsala data base lab

Looking out into space: Use large radio telescopes!

Problem: Size matters We have hit the limit

Use many large radio telescopes? Augment the measurements using signal processing  They act together as a HUGE telescope Look in one direction only Expensive…

Solution Use a huge amount of small antennas This enables new scientific applications (and challenges) } Broad band Multi direction receivers

Scientific applications Re-ionization epoch the 1 st 10 5 years – hydrogen forming Deep Extragalactic Surveys To boldly go… Transient Sources All-sky surveys of –gamma bursts –flare stars –supernovae Ultra High Energy Cosmic Rays Pulsars

Antennas, antennas, antennas… Broad band radio receiver 80…300 MHz, 3 dimensions Produces 0.9 Gbps raw data Central site + 20 outstations located within a circular area, diameter 350 km  13  10 3 antennas

System overview Antennas Basic beam forming FPGAs Network GbE, 10GbE Central Processing facility Linux clusters, IBM Blue Gene/L Off line analysis PCs, workstations, Blue Gene

System overview

Central processing tasks FFT Signal correlation Calibration RFI mitigation (noise from human activities) Stratosphere plasma Subtracting known objects Transient analysis Peak detection

Computing challenges Multiple incoming data streams 20 Tbps Multiple experiments Complex computations Demand for rapid reconfiguration of computing systems Use case: On-line transient analysis

Central processing facilities On line processing Linux cluster (buffering) Light weight BG/L (beam) 6 racks  6144 compute nodes + 96 I/O nodes Off-line processing Linux clusters, SAN, GRID, …

Blue Gene Dataflow supercomputer LLNL installation: 64 racks (65536 CPUs)  70 TFLOPS on the size of a tennis court

BG/L architecture I/O node: 2x Linux Each I/O node coordinates 64 compute nodes 512 MB RAM Compute node: 2x Single threaded light weight OS Typically: –1 CPU for computation –1 CPU for communication 512 MB RAM

User agent

UDBL project Implement a very high performance stream database manager based on AmosII DB kernel ( Utilize the BG/L computing environment for scalable data stream queries involving user-defined computations Implement specialized query optimization: Planning BG/L node configuration for given stream queries Re-configuration when interesting phenomena occur

This far (after 4 months) Implementing primitives for data ~ Computation Aggregation Communication Fusion Proof of concept cases Signal processing Peak detection Stream join Benchmark Based on real LOFAR/LOIS data Performance analysis for stream databases

A simple example gnuplot(peakdetect(vector_elements(wina gg(vector_elements(readlofarvectorfile( "temp.DAT")),256,256))));

Other application areas Other space physics research areas projects at IRFU Network traffic analysis Financial (stock market) information Content analysis of streaming media

Questions?