Some Ideas for a Revised Requirement List Dirk Duellmann.

Slides:



Advertisements
Similar presentations
Object Persistency & Data Handling Session C - Summary Object Persistency & Data Handling Session C - Summary Dirk Duellmann.
Advertisements

OO databases 1 Object Oriented databases. OO databases 2 Developing OODBMS - motivation motivation more and more application areas require systems that.
Chapter 10: Designing Databases
Peer-to-Peer (P2P) Distributed Storage 1Dennis Kafura – CS5204 – Operating Systems.
CS-550: Distributed File Systems [SiS]1 Resource Management in Distributed Systems: Distributed File Systems.
Using DSVM to Implement a Distributed File System Ramon Lawrence Dept. of Computer Science
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Approaches to EJB Replication. Overview J2EE architecture –EJB, components, services Replication –Clustering, container, application Conclusions –Advantages.
D. Düllmann - IT/DB LCG - POOL Project1 POOL Release Plan for 2003 Dirk Düllmann LCG Application Area Meeting, 5 th March 2003.
SOCATS STL-based Object Caching And Transport System.
1 Principles of Reliable Distributed Systems Tutorial 12: Frangipani Spring 2009 Alex Shraer.
File System Implementation
Overview Distributed vs. decentralized Why distributed databases
Wide-area cooperative storage with CFS
Object-Oriented Methods: Database Technology An introduction.
CSE 490dp Resource Control Robert Grimm. Problems How to access resources? –Basic usage tracking How to measure resource consumption? –Accounting How.
DISTRIBUTED COMPUTING
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
Introduction to Databases Transparencies 1. ©Pearson Education 2009 Objectives Common uses of database systems. Meaning of the term database. Meaning.
MIS 710 Module 0 Database fundamentals Arijit Sengupta.
Distributed File Systems Concepts & Overview. Goals and Criteria Goal: present to a user a coherent, efficient, and manageable system for long-term data.
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Database System Concepts and Architecture Lecture # 2 21 June 2012 National University of Computer and Emerging Sciences.
Object Databases as Data Stores for HEP Dirk Düllmann IT/ASD & RD45.
Replication & EJB Graham Morgan. EJB goals Ease development of applications –Hide low-level details such as transactions. Provide framework defining the.
Enterprise Java Beans Java for the Enterprise Server-based platform for Enterprise Applications Designed for “medium-to-large scale business, enterprise-wide.
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
Cloud Use Cases, Required Standards, and Roadmaps Excerpts From Cloud Computing Use Cases White Paper
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
OS2- Sem ; R. Jalili Introduction Chapter 1.
Kyung Hee University 1/41 Introduction Chapter 1.
Distributed DBMSs- Concept and Design Jing Luo CS 157B Dr. Lee Fall, 2003.
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
1 Database Management Systems (DBMS). 2 Database Management Systems (DBMS) n Overview of: ä Database Management Components ä Database Systems Architecture.
Eduardo Gutarra Velez. Outline Distributed Filesystems Motivation Google Filesystem Architecture The Metadata Consistency Model File Mutation.
3-Tier Architecture Chandrasekaran Rajagopalan Cs /01/99.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
The POOL Persistency Framework POOL Project Review Introduction & Overview Dirk Düllmann, IT-DB & LCG-POOL LCG Application Area Internal Review October.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition File System Implementation.
Presentation 3: Designing Distributed Objects. Ingeniørhøjskolen i Århus Slide 2 af 14 Outline Assumed students are knowledgeable about OOP principles.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Espresso - a Feasibility Study of a Scalable, Performant ODBMS Dirk Duellmann CERN IT/DB and RD45 n Aim of this Study n Architectural Overview n Espresso.
D. Duellmann - IT/DB LCG - POOL Project1 The LCG Pool Project and ROOT I/O Dirk Duellmann What is Pool? Component Breakdown Status and Plans.
D. Duellmann - IT/DB LCG - POOL Project1 The LCG Dictionary and POOL Dirk Duellmann.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
1 10 Systems Analysis and Design in a Changing World, 2 nd Edition, Satzinger, Jackson, & Burd Chapter 10 Designing Databases.
Chapter Five Distributed file systems. 2 Contents Distributed file system design Distributed file system implementation Trends in distributed file systems.
Postgraduate Module Enterprise Database Systems Technological Educational Institution of Larisa in collaboration with Staffordshire University Larisa
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
Jean-Philippe Baud, IT-GD, CERN November 2007
Database Management.
(on behalf of the POOL team)
Database Management:.
Open Source distributed document DB for an enterprise
Hybrid Cloud Architecture for Software-as-a-Service Provider to Achieve Higher Privacy and Decrease Securiity Concerns about Cloud Computing P. Reinhold.
Persistency Author: Youhei Morita.
POOL: Component Overview and use of the File Catalog
File System Implementation
Software Design and Architecture
The Client/Server Database Environment
POOL persistency framework for LHC
Dirk Düllmann CERN Openlab storage workshop 17th March 2003
Advanced Operating Systems
Introduction to Databases Transparencies
Lecture 1: Multi-tier Architecture Overview
Introduction To Distributed Systems
An Interactive Browser For BaBar Databases
Presentation transcript:

Some Ideas for a Revised Requirement List Dirk Duellmann

D. Duellmann Storage Manager Requirements –Volume - Address space sufficient for LHC data stores n at least 100PB total store size n assuming a maximum size of a single file of GB –Location Transparency - Heterogeneous and distributed access n transparent access independent I/O servers or client platform –need initial list of compiler platform combinations for clients & servers n no explicit host or filenames for readers –Navigation - Between arbitrary data objects n scalable performant access –e.g. lookup time depends less than linearly on the store size n inter-file and inter-host navigation –e.g. to support streaming of a single event into different physical files on multiple hosts. –Consistency & Robustness - Do we need Transactions? n Consistent concurrent updates to shared data n Support for crash recovery for db & application meta data

D. Duellmann Meta Data Requirements n Private Schema & Data - Users may change schema or catalogue of their private data without compromising the consistency of shared data n Scalable Meta Data - Database schema and catalogue implementation should not constrain concurrency or scalability n Schema Consistency - Enforce consistency between application classes, store schema and persistent objects –Ease extraction of data subsets for testing n We need more flexibility than any other typical ODBMS applications!

D. Duellmann Data Replication & Import/Export n Support for consistent replicas of subsets of data –to increase availability and performance n Which access control model ? n read-only - nobody writes after initial distribution –anybody may read, no locks required n ownership - only owner is allowed to write/update –remote readers may get stale (but consistent) data  random access - any process may write if it gets a lock –may require WAN locks on many machines n Which latency is acceptable for re-syncronisation? n after some high-level transaction –e.g. complete set of updates producing a new calibration, event collection, simulation run …  after each DB transaction –may require frequent WAN data access on many machines

D. Duellmann Language Binding Requirements n Language Support - at least for C++ and JAVA –Complete OO support including C++ templates and polymorphism (virtual functions) –Language interoperability (for a well defined subset of language constructs?)  Trade-off between  Risk for Experiment Code - Insulation against change of vendor  Transparency for End Users - As simple to use as transient data  Flexibility - More modern binding: e.g. activate/de-activate functionality  Performance - e.g. I/O on demand

D. Duellmann Language Binding Requirements n This system is overconstrained! –Multiple Solution exist for end-user interface –Standards Compliant Binding n ODMG binding - or at least ODMG like navigation model –direct use of binding & C++ pointers to persistent data (CMS) –Complete End-User Insulation n Experiment specific insulation layer - usually coupled to one specific application framework –complete conversion into transient objects (BaBar) –on demand conversion into transient objects using smart pointers (LHCb) n Should we split and define two Interfaces? –more flexible, performant, exposed lower level –more easy, customised, insulated higher level n Would a common insulation layer be possible/help at all? n Need to start a requirement definition group soon!