DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.

Slides:



Advertisements
Similar presentations
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Advertisements

Peter Berrisford RAL – Data Management Group SRB Services.
Data Management Expert Panel - WP2. WP2 Overview.
Database System Concepts and Architecture
Database Architectures and the Web
Enterprise Web Architecture and Performance Shennon Shen & Scott Carey --- Plumtree Software Inc.
11© 2011 Hitachi Data Systems. All rights reserved. HITACHI DATA DISCOVERY FOR MICROSOFT® SHAREPOINT ® SOLUTION SCALING YOUR SHAREPOINT ENVIRONMENT PRESENTER.
Database Software File Management Systems Database Management Systems.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
Microsoft Virtual Server 2005 Product Overview Mikael Nyström – TrueSec AB MVP Windows Server – Setup/Deployment Mikael Nyström – TrueSec AB MVP Windows.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Overview of Database Languages and Architectures.
1 © Prentice Hall, 2002 The Client/Server Database Environment.
Centralized and Client/Server Architecture and Classification of DBMS
Client-Server Processing and Distributed Databases
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
1: IntroductionData Management & Engineering1 Course Overview: CS 395T Semantic Web, Ontologies and Cloud Databases Daniel P. Miranker Objectives: Get.
The Client/Server Database Environment
Word Wide Cache Distributed Caching for the Distributed Enterprise.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
SAP R/3 System: Client Server Overview (Buck-Emden & Galimow, 1998) Dr. K. Palaniappan.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
Database Architecture Introduction to Databases. The Nature of Data Un-structured Semi-structured Structured.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
Software Software essential is coded programs that perform a serious of algorithms. Instructions loaded into primary memory (RAM) from secondary storage.
2. Database System Concepts and Architecture
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Lecture On Introduction (DBMS) By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
(C) 2008 Clusterpoint(C) 2008 ClusterPoint Ltd. Empowering You to Manage and Drive Down Database Costs April 17, 2009 Gints Ernestsons, CEO © 2009 Clusterpoint.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Computer Software Chapter 4.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Elmasri and Navathe, Fundamentals of Database Systems, Fourth Edition Copyright © 2004 Pearson Education, Inc. Slide 2-1 Data Models Data Model: A set.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
1 Database Management Systems (DBMS). 2 Database Management Systems (DBMS) n Overview of: ä Database Management Components ä Database Systems Architecture.
INNOV-10 Progress® Event Engine™ Technical Overview Prashant Thumma Principal Software Engineer.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
Database Systems Lecture 1. In this Lecture Course Information Databases and Database Systems Some History The Relational Model.
Tom Meyer, Iowa State SCT/Pixel Online Workshop June, 2001 CORBA Common Object Request Broker Architecture.
25 April Unified Cryptologic Architecture: A Framework for a Service Based Architecture Unified Cryptologic Architecture: A Framework for a Service.
NeST: Network Storage John Bent, Venkateshwaran V Miron Livny, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
The Storage Resource Broker and.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
1 Case Study: Business Intelligence & Customer Data Customer Support Web-based Dashboard VP Marketing SQL XSLT XML Data Grid Customer Data Customer Order.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
WP2: Data Management Gavin McCance University of Glasgow.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Chapter 1 Characterization of Distributed Systems
Databases and DBMSs Todd S. Bacastow January 2005.
Chapter 9: The Client/Server Database Environment
Database System Concepts and Architecture
The Client/Server Database Environment
The Client/Server Database Environment
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Data, Databases, and DBMSs
Database Environment Transparencies
Database System Architectures
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets A.Chervenak, I.Foster, C.Kesselman, C.Salisbury,
Presentation transcript:

DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer

Copyright  Objectivity, Inc DMW20043/16/04 AGENDA Introduction Issues and Approaches Summary & Resources

Objectivity, Inc. & Objectivity/DB

Copyright  Objectivity, Inc DMW20043/16/04 Objectivity Corporate Information Object Database Management for: Data intensive applications that manipulate complex data High throughput systems Very large volumes of data Main Markets Government Scientific Telecommunications Engineering Manufacturing Complex IT Product Highlights High Performance with complex data Scalability and High Availability Fully Distributed Interoperability -C++, Java, Smalltalk, SQL and XML -Linux, LynxOS, Unix and Windows Productivity -Eclipse IDE -Eliminates the object to DB mapping layer

Copyright  Objectivity, Inc DMW20043/16/04 SCALABILITY Data Volume Terabytes [BaBar] Throughput – Ingested 32 Terabytes per Day [Benchmark] In a recent benchmark with Objectivity/DB running on 64 Irix processors (600 MHz), CXFS and a 100 Terabyte SAN we achieved: An ingest rate of 32 Terabytes per day (input, correlate and commit) Simultaneous queries from 32 processors running at near to 100% CPU capacity Simultaneous movement and deletion of aged data to a long term repository Simultaneous Users – 100s of Thousands [SprintPCS]

Issues and Approaches

Copyright  Objectivity, Inc DMW20043/16/04 ISSUES Describing complex data Exponentially increasing data volumes Sharing data across sites Querying huge datasets Cost of Ownership

Copyright  Objectivity, Inc DMW20043/16/04 DESCRIBING COMPLEX DATA Approaches: Old Way -Definitions buried in header files -Language-specific schema language (DDL/SQL) Current Approaches -Unified Modeling Language [UML] -XML Trends -Java Database Objects [JDO] -Grid Database Access and Integration Services -Higher level schemas and ONTOLOGIES

Copyright  Objectivity, Inc DMW20043/16/04 DATA VOLUMES Approaches: Old Way -Keep data in compressed files and index them in a DBMS -Proprietary tape archives Current Approaches -Store everything in an ODBMS (lower overheads than an RDBMS) -Hierarchical storage systems (HPSS etc.) Trends -Solid State Disks at the front end, commodity disks at the back end -Heterogeneous Storage Area Networks [SAN], e.g. CXFS -Fiber Optic processor-to-SAN switches -Grid enablement (totally distributed archives)

Copyright  Objectivity, Inc DMW20043/16/04 SHARING DATA ACROSS SITES Approaches: Old Way -Transfer files/disks/tapes -Filesystem or no security Current Approaches -Distributed databases and the World Wide Web -High bandwidth networks -Authentication and secure transport layers Trends -Grid enablement -Federated databases -Ultra-high bandwidth networks and remote replication -Flexible, localized security mechanisms

Copyright  Objectivity, Inc DMW20043/16/04 Distributed Federations A2 Replica of A A Organization X Organization Y User X1 User X2 User X3 User Y1 A3 Replica of A

Copyright  Objectivity, Inc DMW20043/16/04 Distributed Federations A2 Replica of A A Organization X Organization Y User X1 Mobile and Detached User X2 User X3 User Y1 A3 Replica of A

Copyright  Objectivity, Inc DMW20043/16/04 QUERYING HUGE DATASETS Approaches: Old Way -Hold metadata (indexes and relationships) in a searchable file Current Approaches -Hold metadata in a RDBMS and data in files -Hold metadata and data in an ODBMS Trends -Adaptations of text search engines -Distributed Parallel Query Engines -Specialized search accelerators

Copyright  Objectivity, Inc DMW20043/16/04 Current Architecture Queries run synchronously within the client Networking & Event Managers Storage & Transaction Managers Query & Index Managers Object & Schema Managers Language Interfaces APPLICATION DBA ToolsLock Server Data “Page” Server Mass Storage Data “Page” Server

Copyright  Objectivity, Inc DMW20043/16/04 Parallel Query Engine [PQE] Queries run asynchronously and in parallel, either locally or distributed Networking & Event Managers Storage & Transaction Managers Query & Index Managers Object & Schema Managers Language Interfaces APPLICATION DBA Tools Lock Server Data “Page” Servers PQE

Copyright  Objectivity, Inc DMW20043/16/04 PQE and Search Accelerator Queries run asynchronously and in parallel, but with Predicate Management within the Search Accelerator Networking & Event Managers Storage & Transaction Managers Query Manager Object & Schema Managers Language Interfaces APPLICATION DBA Tools Lock Server Data Servers PQE FPGA & RAM Search Accelerator

Copyright  Objectivity, Inc DMW20043/16/04 COST OF OWNERSHIP Approaches: Old Way -Build It Yourself (many hidden costs) -Run It Yourself Current Approaches -Use Commercial Off The Shelf [COTS] software -Open Source -Commodity hardware & tiered storage Trends -Heterogeneous storage -Grid Enablement -Resource and Skill Brokers (Future)

SUMMARY

Copyright  Objectivity, Inc DMW20043/16/04 SUMMARY Database languages are still evolving Data throughput and system latency times are decreasing Sharing data across sites still presents many challenges Querying vast datasets will become faster and cheaper Software vendors are wrestling with Open Source issues Startup costs are still high, but the trends are downward Grid enablement will help Keep working on the Standards!

Copyright  Objectivity, Inc DMW20043/16/04 RESOURCES Technical Overview Data Sheets and White Papers Free downloadable Java and C++ evaluation software and tutorials Global Grid Forum ANY QUESTIONS?