Construction of a database Per Weidenman PAR AB Database A collection of data It belongs together It models the ”world” Database management system (DBMS)

Slides:



Advertisements
Similar presentations
How data is stored. Data can be stored in paper-based systems including: Reference books Dictionaries Encyclopaedias Directories Index Files Filing systems.
Advertisements

A database is a collection of data that is stored in a computer system. Databases allow their users to enter, access, and analyze their data quickly and.
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Database Management An Introduction.
Your Interactive Guide to the Digital World Discovering Computers 2012 Chapter 10 Managing a Database.
Database Management: Getting Data Together Chapter 14.
1 1 Concepts of Database Management, 4 th Edition, Pratt & Adamski Chapter 1 Introduction to Database Management.
Getting Started (Excerpts) Chapter One DAVID M. KROENKE’S DATABASE CONCEPTS, 2 nd Edition.
Living in a Digital World Discovering Computers 2010.
Discovering Computers Fundamentals, 2011 Edition Living in a Digital World.
Attribute databases. GIS Definition Diagram Output Query Results.
IMS1907 Database Systems Summer Semester 2004/2005 Lecture 2 Relational DBMS Software An Overview of Microsoft Access.
Transaction Processing Systems, & Management Information Systems.
DAY 21: MICROSOFT ACCESS – CHAPTER 5 MICROSOFT ACCESS – CHAPTER 6 MICROSOFT ACCESS – CHAPTER 7 Akhila Kondai October 30, 2013.
Class 6 Data and Business MIS 2000 Updated: September 2012.
Class 3 Data and Business MIS 2000 Updated: January 2014.
Chapter 4: Organizing and Manipulating the Data in Databases
ACS1803 Lecture Outline 2 DATA MANAGEMENT CONCEPTS Text, Ch. 3 How do we store data (numeric and character records) in a computer so that we can optimize.
Database Lecture # 1 By Ubaid Ullah.
CPS120: Introduction to Computer Science Information Systems: Database Management Nell Dale John Lewis.
1 DATABASE TECHNOLOGIES BUS Abdou Illia, Fall 2012 (September 5, 2012)
Introduction –All information systems create, read, update and delete data. This data is stored in files and databases. Files are collections of similar.
Discovering Computers Fundamentals, 2012 Edition Your Interactive Guide to the Digital World.
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
Objectives Overview Define the term, database, and explain how a database interacts with data and information Define the term, data integrity, and describe.
DATABASE. A database is collection of information that is organized so that it can easily be accessed, managed and updated. It is also the collection.
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Databases. Database A database is an organized collection of related data.
Management Information Systems MS Access MS Access is an application software that facilitates us to create Database Management Systems (DBMS)
Databases Topic 4 Text Materials Chapter 3 – Databases and Data Warehouses.
Database Fred Durao What is a database? A database is any organized collection of data. Some examples of databases you may encounter in.
Fluency with Information Technology INFO100 and CSE100 Katherine Deibel Katherine Deibel, Fluency in Information Technology1.
CPS120: Introduction to Computer Science Lecture 19 Introduction to SQL.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
Objectives Overview Define the term, database, and explain how a database interacts with data and information Describe the qualities of valuable information.
DAY 12: DATABASE CONCEPT Tazin Afrin September 26,
Chapter 5 Database Processing. Neil uses software to query a database, but it has about 25 standard queries that don’t give him all he needs. He imports.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
MS Access 2007 Management Information Systems 1. Overview 2  What is MS Access?  Access Terminology  Access Window  Database Window  Create New Database.
Dimu' Rumpak © 2009 by Prentice Hall 1 Getting Started Didimus Rumpak, M.Si. Database Concepts Chapter 1 1.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Databases. What is a database?  A database is used to store data. The word DATA is actually Latin for FACTS. A database is, therefore, a place, or thing.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
INFO1408 Database Design Concepts Week 15: Introduction to Database Management Systems.
Database. Data Base A database is a collection of related data, and the software used in databases to store, organize and retrieve the data is called.
ITGS Databases.
1 CSE 2337 Introduction to Data Management Access Book – Ch 1.
Database revision.
Databases and Information Management Chapter 6. Outline Database Relational Database Database Management System (DBMS) Structured Query Language Data.
Course FAQ’s I do not have any knowledge on SQL concepts or Database Testing. Will this course helps me to get through all the concepts? What kind of.
Visual Programing SQL Overview Section 1.
SQL Jan 20,2014. DBMS Stores data as records, tables etc. Accepts data and stores that data for later use Uses query languages for searching, sorting,
Verification & Validation. Batch processing In a batch processing system, documents such as sales orders are collected into batches of typically 50 documents.
Information Systems Today: Managing in the Digital World TB3-1 3 Technology Briefing Database Management “Modern organizations are said to be drowning.
Chapter 10 Database Management. Data and Information How are data and information related? p Fig Next processing data stored on disk Step.
1 Chapter 9 Database Management. Objectives Overview Define the term, database, and explain how a database interacts with data and information Describe.
Instructor: Pavlos Pavlikas1 How Data is Stored Chapter 8.
SQL (3) Research questions, databases, and analytics; Importing data, exporting data, using other tools Information Structures and Implications 2015 Bettina.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. DATABASE.
Relational Database Systems Bartosz Zagorowicz. Flat Databases  Originally databases were flat.  All information was stored in a long text file, called.
MICROSOFT ACCESS – CHAPTER 5 MICROSOFT ACCESS – CHAPTER 6 MICROSOFT ACCESS – CHAPTER 7 Sravanthi Lakkimsety Mar 14,2016.
MySQL Tutorial. Databases A database is a container that groups together a series of tables within a single structure Each database can contain 1 or more.
Database Processing Chapter "No, Drew, You Don’t Know Anything About Creating Queries.” Copyright © 2015 Pearson Education, Inc. Operational database.
GCSE ICT How data is stored. How is data stored? Data can be stored in paper-based systems including: –Reference books –Dictionaries –Encyclopaedias –Directories.
Chapter Ten Managing a Database.
Database Management Systems
Database Management  .
DATABASE TECHNOLOGIES
Presentation transcript:

Construction of a database Per Weidenman PAR AB

Database A collection of data It belongs together It models the ”world” Database management system (DBMS) The database (a collection of interrelated data) Software to manage and access the data

DBMS Input: transactions User: Searching Reporting Statistical analysis Organised data DBMS requirements ”Database” Data Warehouse etc.

Database management systems (DBMS) Microsoft Access Microsoft SQL Server DB2 Oracle MySQL FirebirdSQL etc. SQL – Structured Query Language A computer language to define and search data

Relational databases Tables containing data, organised in rows and columns Keys, used for linking data in different tables

Example Simple database for collecting and organising statistical papers Created in Microsoft Access

Paper name and details Link to dokument (pdf file) Autors

A database with four tables Keys

One of the tables, containing paper name and details One paper on each row Rows containing paper name and other details Key

The keys are used to link data in the four tables

…56111…56 123…44123… Aaaa Bbbb Cccc Dddd Table ”artiklar” Table ”författare2” Table ”personer2” Key: artikel_id Key: artikel_id Key: person_id Key: person_id One paper having 3 autors One person being the autor of 2 papers

A query: the result of asking the database about papers and autors One paper and the corresponding 3 autors One autor and the corresponding 2 papers

DBMS Input: transactions User: Searching Reporting Statistical analysis Organised data DBMS requirements ”Database” Data Warehouse etc. IT Department ”Business” users

DBMS requirements from a statistical / analytical viewpoint Data quality Data types Performance Maximun information Historical data Regulation and secrecy

DBMS requirements from a statistical / analytical viewpoint DBMS Data quality Instead of entering text/data by typing… Input: transactions Sales System X Enter customer name: User: Searching Reporting Sales System X Choose customer name: Volvo Personvagnar AB Volvo Lastvagnar AB Volvo Construction AB Volvo Bussar AB Volvo Logistics AB … … use, if possible, selection from a list of valid values

DBMS requirements from a statistical / analytical viewpoint DBMS Data quality Input: transactions User: Searching Reporting Sales System X Enter customer age: Define rules for valid input (values, intervals, etc.) We dont want: Negative values

DBMS requirements from a statistical / analytical viewpoint DBMS Data quality Input: transactions User: Searching Reporting Handling of missing values … Missing values should stored as ”null” in the database. Not as 0 (digit zero)

DBMS requirements from a statistical / analytical viewpoint Data types Text Numeric

DBMS requirements from a statistical / analytical viewpoint Performance DBMS Input: transactions User: Searching Reporting Statistical analysis Organised data DBMS requirements Searching for individual records Creating ”prepared” reports by counting or summing Large datasets Multivariate methods Iterative estimation Etc.

DBMS requirements from a statistical / analytical viewpoint DBMS Maximum information Input: transactions User: Searching Reporting Sales System X Enter customer age: 34 We need to report on age groups: … Thus we store age as an interval, not as a value! The fallacy of beeing too user oriented!

DBMS requirements from a statistical / analytical viewpoint DBMS Historical data Input: transactions User: Searching Reporting Sales System X Customer name: Customer address: Order date: Order value: Table: Orders Customer ID Order date Order value Each new order for a specific customer … … will be added to table Orders and stored as a ”new row”

DBMS requirements from a statistical / analytical viewpoint DBMS Historical data Input: transactions User: Searching Reporting Sales System X Customer name: Customer address: Order date: Order value: Table: Customers Customer ID Customer name Customer address But a new address … … will probably UPDATE the existing record (row) for the specific customer Thus, the old value of ”customer address” will be deleted and replaced with the new value. But this will do fine for users focusing on searching / reporting!

DBMS requirements from a statistical / analytical viewpoint DBMS Historical data Input: transactions User: Searching Reporting Customer ID Customer name Customer address Table: Customers Table: Customers_history Customer ID Customer name Customer address FromTo Create av new table to contain historic records Each time a value is UPDATED for a certain customer … … the complete (previous) record is transfered to the table Customers_history

DBMS requirements from a statistical / analytical viewpoint DBMS Historical data Input: transactions User: Searching Reporting Customer ID Customer name Customer address Table: Customers Table: Customers_history Customer ID Customer name Customer address FromTo This structure will make analysis of processes possible But not easy!

DBMS requirements from a statistical / analytical viewpoint Regulation and sectrecy

DBMS requirements from a statistical / analytical viewpoint Current data Current + historical data Operating on individual records Operating on many records Next on this channel…

DBMS Input: transactions User: Searching Reporting Statistical analysis Organised data DBMS requirements A database containing historic transactions

Board data PAR / Bisnode database Tables Basic company data One record per company. Contains name, address, startdate, enddate, line of business, etc. Historic company data Many records per company. Contains the accumulated historic records from table FTG Balance sheet data One record per annual report (thus many records per company). Turnover, profit, key ratios, etc. Board member data Many records per company and person. FTG FTG_H BOKSLUT FUNKTION_ PERIOD And many more tables! Serrano Statistical analysis How? Historic names etc. Sampling for times series statistics

END

Basic company data One record per company. Contains name, address, startdate, enddate, line of business, etc.

Historic company data Many records per company. Contains the accumulated historic records from table FTG

Balance sheet data One record per annual report (thus many records per company). Turnover, profit, key ratios, etc.

Board member data Many records per company and person.

Serrano Balance sheet data from different periods transformed to yearly data records

Serrano Historic transactions from FTG_H transformed to yearly data records

Serrano Board Data Balance member data from any mix of startdate, enddata and period length transformed to yearly data records

Summing up register data to annual figures A ÅR Nu 321 Exampel. Register containing balance sheet data: Number of employes Turnover Profit Tangible assets Etc. Exampel. Register containing balance sheet data: Number of employes Turnover Profit Tangible assets Etc.

A ÅR Nu 321 B Brutet räkenskapsår Summing up register data to annual figures

A ÅR Nu 321 B C Omlagda räkenskapsår Summing up register data to annual figures

A ÅR Nu 321 B C D Missing data Summing up register data to annual figures

ÅR Nu 321 B Förslag: Bryt ner flödesvariablerna (omsättning, vinst, etc.) till månadsvärden … Förslag: Bryt ner flödesvariablerna (omsättning, vinst, etc.) till månadsvärden … Summing up register data to annual figures

ÅR Nu 321 B Förslag: … och summera månadsvärdena till ett ’fingerat’ kalenderårsvärde. Förslag: … och summera månadsvärdena till ett ’fingerat’ kalenderårsvärde. Förslag: … samt imputera för full täckning under sista året Förslag: … samt imputera för full täckning under sista året Summing up register data to annual figures

ÅR Nu 321 B Summing up register data to annual figures Database

First exampel Register based transport statistics for SIKA: Decreased response burden Increased understanding of the transporting companies (as a complement to the ”usual” fokus on type of goods) Time series describing economic status and change.

Objective: Describing economic status and change in transporting companies during the last ten years. Total number of employes and turnover …

Objective: Describing economic status and change in transporting companies during the last ten years. … or turnover growth compared to BNP

Objective: Describing economic status and change in transporting companies during the last ten years. … or profit development for different types of freight companies

Objective: Describing economic status and change in transporting companies during the last ten years. … or the number of employes in a cohort of new companies.

Aktiva företagAktiva aktiebolagBNP ÅrTotalt Därav aktie- bolag Antal anställda Nettoom- sättning (Mkr) Löpande priser (Mkr) Tables based on balance sheet data from each company

Aktiva företagAktiva aktiebolagBNP ÅrTotalt Därav aktie- bolag Antal anställda Nettoom- sättning (Mkr) Löpande priser (Mkr) What data is needed? Company data including micro level history. Exactly which companies where active in transport during each year? Company data including micro level history. Exactly which companies where active in transport during each year? Balance sheet data from all transporting companies for each year

Aktiva företagAktiva aktiebolagBNP ÅrTotalt Därav aktie- bolag Antal anställda Nettoom- sättning (Mkr) Löpande priser (Mkr) What data is needed? Company data including micro level history. Exactly which companies where active in transport during each year? Company data including micro level history. Exactly which companies where active in transport during each year? Balance sheet data from all transporting companies for each year Faster access to ”last years” data compared to taxation based registers

A ÅR Nu B C D 321 Sampling companies for time series statistics

A ÅR Nu B C D 321 Sampling companies for time series statistics

A ÅR Nu B C D 321 Sampling companies for time series statistics

A ÅR Nu B C D 321 ACDACD ABCDABCD ABCABC Sampling companies for time series statistics