STAR COLLABORATION MEETING July 2004 Michael DePhillips 1 STAR DBs Near And Long Term Plans.

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

DB2 Tools Pertemuan 3 Matakuliah: T0413 Tahun: 2009.
Chapter 5: Server Hardware and Availability. Hardware Reliability and LAN The more reliable a component, the more expensive it is. Server hardware is.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 External Sorting Chapter 11.
Toolbox Mirror -Overview Effective Distributed Learning.
 Need for a new processing platform (BigData)  Origin of Hadoop  What is Hadoop & what it is not ?  Hadoop architecture  Hadoop components (Common/HDFS/MapReduce)
Database Software File Management Systems Database Management Systems.
DBA Meeting December Supporting the MINOS MySQL Database at FNAL Nick West.
Memory Management April 28, 2000 Instructor: Gary Kimura.
VTS INNOVATOR SERIES Real Problems, Real solutions.
F Fermilab Database Experience in Run II Fermilab Run II Database Requirements Online databases are maintained at each experiment and are critical for.
Backup and Recovery Part 1.
Event Viewer Was of getting to event viewer Go to –Start –Control Panel, –Administrative Tools –Event Viewer Go to –Start.
Star (Traditional) Database Tasks & MySQL 1. Database Types & Operation Issues 2. Server & Database deployments 3. Tools with MySQL 4. Data definition.
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
Shuei MEG review meeting, 2 July MEG Software Status MEG Software Group Framework Large Prototype software updates Database ROME Monte Carlo.
Online Database Support Experiences Diana Bonham, Dennis Box, Anil Kumar, Julie Trumbo, Nelly Stanfield.
Copyright ®xSpring Pte Ltd, All rights reserved Versions DateVersionDescriptionAuthor May First version. Modified from Enterprise edition.NBL.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
CMPE 421 Parallel Computer Architecture
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Goodbye rows and tables, hello documents and collections.
MySQL. Dept. of Computing Science, University of Aberdeen2 In this lecture you will learn The main subsystems in MySQL architecture The different storage.
MySQL and GRID Gabriele Carcassi STAR Collaboration 6 May Proposal.
1 Moshe Shadmon ScaleDB Scaling MySQL in the Cloud.
1 Alice DAQ Configuration DB
NoSQL Databases Oracle - Berkeley DB. Content A brief intro to NoSQL About Berkeley Db About our application.
Copyright 2006 MySQL AB The World’s Most Popular Open Source Database MySQL Cluster: An introduction Geert Vanderkelen MySQL AB.
1099 Why Use InterBase? Bill Todd The Database Group, Inc.
Lecture Topics: 11/17 Page tables TLBs Virtual memory flat page tables
Memory Management – Page 1 of 49CSCI 4717 – Computer Architecture Memory Management Uni-program – memory split into two parts –One for Operating System.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 External Sorting Chapter 13.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
Database Management Systems, R. Ramakrishnan and J. Gehrke 1 External Sorting Chapter 13.
A Brief Documentation.  Provides basic information about connection, server, and client.
Week 7 : Chapter 7 Agenda SQL 710 Maintenance Plan:
U:/msu/course/cse/103 Day 18, Slide 1 CSE 103 Day 18 If you are not enrolled in CSE 103, please save and log out by 10:10.
Online Databases General Functions Database Categories Tools Conditions & Configurations DB Technical Issues Activities of Interest & Need R. Jeff Porter.
The Million Point PI System – PI Server 3.4 The Million Point PI System PI Server 3.4 Jon Peterson Rulik Perla Denis Vacher.
SQL/Lesson 7/Slide 1 of 32 Implementing Indexes Objectives In this lesson, you will learn to: * Create a clustered index * Create a nonclustered index.
Chap 7: Consistency and Replication
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Online monitor for L2 CAL upgrade Giorgio Cortiana Outline: Hardware Monitoring New Clusters Monitoring
Performance Testing Test Complete. Performance testing and its sub categories Performance testing is performed, to determine how fast some aspect of a.
IMS 4212: Database Implementation 1 Dr. Lawrence West, Management Dept., University of Central Florida Physical Database Implementation—Topics.
Maintenance Practices. Goal  Automate the necessary DBA chores to put organizations on the path of having healthier, consistent and more trustworthy.
Introduction to Database Systems1 External Sorting Query Processing: Topic 0.
Oracle Business Intelligence Foundation – Testing and Deploying OBI Repository.
Log Shipping, Mirroring, Replication and Clustering Which should I use? That depends on a few questions we must ask the user. We will go over these questions.
SQL Basics Review Reviewing what we’ve learned so far…….
 Project Team: Suzana Vaserman David Fleish Moran Zafir Tzvika Stein  Academic adviser: Dr. Mayer Goldberg  Technical adviser: Mr. Guy Wiener.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
DBMS ● What are they? ● Why used ● Examples? – Oracle – Access – MySQL – Postgres – SQLServer – Sqlite.
MIKADO – Generation of ISO – SeaDataNet metadata files
Understanding Data Storage
Managing Multi-User Databases
Database application MySQL Database and PhpMyAdmin
SQL Server Monitoring Overview
Cloud based Open Source Backup/Restore Tool
Hustle and Bustle of SQL Pages
Database Administration for the Non-DBA
CSCI206 - Computer Organization & Programming
Scaling and Performance
What’s new in SQL Server 2016 Availability Groups
Database administration
Presentation transcript:

STAR COLLABORATION MEETING July 2004 Michael DePhillips 1 STAR DBs Near And Long Term Plans

2 STAR COLLABORATION MEETING July 2004Michael DePhillips Run 4 OnlineStats Data Taken 300 million events taken (this includes in addition to physics, fast detector, test runs, bad runs, etc.) 300 million events taken (this includes in addition to physics, fast detector, test runs, bad runs, etc.) Space ~63 gig + nightly backups and log files + nightly backups and log files MySql Health – No problems In fact – one table had > 9 million rows In fact – one table had > 9 million rows

3 STAR COLLABORATION MEETING July 2004Michael DePhillips Plans for Run 5 Hardware Upgrades (onlsun1) Production Machines should be isolated Restructure of the eventTag db (DAQ ?) 50 gig 50 gig Further integrate bufferbox Fold in old – evp (another E450) No single point of failure No single point of failure Quasi –fault tolerance Quasi –fault tolerance Additional backup/ redundancy Additional backup/ redundancy Gives an addition platform for “some” development Gives an addition platform for “some” development

4 STAR COLLABORATION MEETING July 2004Michael DePhillips Long Term Planning Although, hardware upgrades are sometimes effective… they are seldom the best answer for long term planning Caveat – STAR DB servers stay somewhat current with available technology. Caveat – STAR DB servers stay somewhat current with available technology. Scalability is essential – so we need a programmatic and algorithmic changes DAQ1000 max data 1kHz, this is still under investigation DAQ1000 max data 1kHz, this is still under investigation

5 STAR COLLABORATION MEETING July 2004Michael DePhillips ONLINE - Long Term – So Far ONLINE - Long Term – So Far Repository for Discarded Suns (E450s) I like this machine, very robust and stable I like this machine, very robust and stable More sophisticated “cluster” Incremental minor upgrades. This can still be expensive and we could eventually go the way of DAQ and get some cheap Linux boxes This can still be expensive and we could eventually go the way of DAQ and get some cheap Linux boxes Make use of our online Linux “cluster”

6 STAR COLLABORATION MEETING July 2004Michael DePhillips OFFLINE – Configuration Master/Slave Replication Master (all writes (inserts and updates)) Master (all writes (inserts and updates))robinson.star.bnl.gov Slaves (all reads (selects…)) Slaves (all reads (selects…)) Two DNS Round-Robins db.star.bnl.gov – 4 machines – used primarily for production db.star.bnl.gov – 4 machines – used primarily for production dbx.star.bnl.gov – 2 machines – used primarily for analysis dbx.star.bnl.gov – 2 machines – used primarily for analysis Six offsite slaves Six offsite slaves

7 STAR COLLABORATION MEETING July 2004Michael DePhillips What’s New With MySQL Major upgrade to Worked well Worked well Added some new administrative features Added some new administrative features is now beta – will be production by ’05 so I’ve begun to test it. New table type - “cluster”.

8 STAR COLLABORATION MEETING July 2004Michael DePhillips OFFLINE - Issues Performance, Performance, Performance The Usual Suspects Are: Database Configuration Database Configuration Usage (queries – and programmatic algos) Usage (queries – and programmatic algos) Database design Database design Hardware Hardware

9 STAR COLLABORATION MEETING July 2004Michael DePhillips OFFLINE – Near Term (done) HARDWARE Upgrade For dbx.star.bnl.gov swapped out one For dbx.star.bnl.gov swapped out one 2-P3 1.4 / 1 gig – memory 2-P3 1.4 / 1 gig – memory for for two Zeon HT / 3 gig - memory two Zeon HT / 3 gig - memory it was time and it HELPED! it was time and it HELPED!

10 STAR COLLABORATION MEETING July 2004Michael DePhillips Configuration We are presently optimized as far as how out dbs are set up Indexes, buffers etc… Indexes, buffers etc… Actual queries are optimal Actual queries are optimal With the upgrade came the query cache Remembers an executed query and caches its results Remembers an executed query and caches its results Next time the server sees ‘exact’ same query, query is not executed, data is returned from the cache. Next time the server sees ‘exact’ same query, query is not executed, data is returned from the cache.

11 STAR COLLABORATION MEETING July 2004Michael DePhillips Query 1 - Tables When run directly on the server these queries run at 100hz or better. The cost of the API and network reduces this to ~58hz which means ~30 seconds to complete this pass which means ~30 seconds to complete this pass Although 30 seconds is not a large amount of time, the information returned very rarely changes. Repeated queries of unchanged data, is always a “red flag” pointing to potential areas to optimize.

12 STAR COLLABORATION MEETING July 2004Michael DePhillips Query 2 - data Real Completion Time (seconds) CPU Completion Time (milliseconds) CPU Completion Time (milliseconds) SVT 57 (average of ten runs) 5 (average of ten runs) SVT 57 (average of ten runs) 5 (average of ten runs) Huge configuration – thousands of empty tables to fill EMC EMC Large amount of data stored as blobs TOF TOF TRG TRG TRACKER TRACKER

13 STAR COLLABORATION MEETING July 2004Michael DePhillips Long Term Not a lot of low hanging fruit There are two passes First rarely changes First rarely changes heap table heap table cache cache Rethink our design less blobs less blobs smaller trees smaller trees New Server Technology distributed computing distributed computing MySql Cluster MySql Cluster

14 STAR COLLABORATION MEETING July 2004Michael DePhillips Service Task Complete THANK YOU! Dmitry Arkhipkin and Julia Zoulkarneeva Database Query Tool Will be available off the Database Page

15 STAR COLLABORATION MEETING July 2004Michael DePhillips FAQs In doubt please ask ? Where do I connect for analysis? Check your configuration file Check your configuration file dbServers.xml in your home directory dbx.star.bnl.gov dbx.star.bnl.gov DO NOT connect to robinson (Please) DO NOT connect to robinson (Please) TIME STAMPS amp.html amp.html amp.html amp.html

16 STAR COLLABORATION MEETING July 2004Michael DePhillips Timestamp Quick Tutorial 1 Three Timestamps beginTime beginTime DAQ Time entryTime entryTime Time value is put into the database Deactive Deactive Time value is ‘turned off’

17 STAR COLLABORATION MEETING July 2004Michael DePhillips Timestamp Quick Tutorial 2 Insert a record bt1 = daqTime = et1 Insert second record bt2>bt1 Insert third record bt3<bt1 Three validity windows 1)Valid for bt1 ->bt2 2)Valid for bt2 -> infinity 3)Valid to bt3 -> bt1

18 STAR COLLABORATION MEETING July 2004Michael DePhillips Timestamp Quick Tutorial 3 A production time (pt1) was tagged after bt1 but before the entry of bt2. Later, after entries of bt2 and bt3 we want to reproduce data from pt1. Pt1 gets passed (dbv switch) to StDbLib and based on et1 ONLY gets data from bt1.

19 STAR COLLABORATION MEETING July 2004Michael DePhillips Timestamp Quick Tutorial 4 Three entries are in and pt is done, time for a new production…BUT… a value must be changed….we DEACTIVATE. We don’t delete, or over write the value because we may still want to reproduce that “exact” original production. SQL has a where clause that enforces pt2>dt1 || dt = 0. A query can still be run that allows pt1 dt1 || dt = 0. A query can still be run that allows pt1<dt1 which means the first entry was valid at the time of pt1, therefore, it is used.