Data Management David Nathan & Peter Austin & Robert Munro.

Slides:



Advertisements
Similar presentations
Introducing the ELAR information system architecture
Advertisements

LSA Archiving Tutorial January 2005 Archives, linguists, and language speakers.
Forest Markup / Metadata Language FML
C6 Databases.
Introduction to Databases
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
Data - Information - Knowledge
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
File Systems and Databases
1 1 File Systems and Databases Chapter 1 The Worlds of Database Systems Prof. Sin-Min Lee Dept. of Computer Science.
Modern Systems Analysis and Design Third Edition
XML(EXtensible Markup Language). XML XML stands for EXtensible Markup Language. XML is a markup language much like HTML. XML was designed to describe.
Information systems and databases Database information systems Read the textbook: Chapter 2: Information systems and databases FOR MORE INFO...
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Chapter 1: The Database Environment
Web Services Michael Smith Alex Feldman. What is a Web Service? A Web service is a message-oriented software system designed to support inter-operable.
Chapter 2 Introduction to Database Development Database Processing David M. Kroenke © 2000 Prentice Hall.
Page 1 ISMT E-120 Introduction to Microsoft Access & Relational Databases The Influence of Software and Hardware Technologies on Business Productivity.
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 9.1.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
An Introduction to Software Architecture
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Provenance Metadata for Shared Product Model Databases Etiel Petrinja, Vlado Stankovski & Žiga Turk University of Ljubljana Faculty of Civil and Geodetic.
Chapter 9 Designing Databases Modern Systems Analysis and Design Sixth Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich.
Concepts and Terminology Introduction to Database.
What is a database? Example of database application: –Banks, –Hospital, –etc Extensions: –Multimedia databases –Geographic Information Systems –Data warehouse.
1 XML: an introduction David Nathan. 2 XML  an in-line markup system  single sequence of plain text only (but can be unicode)  equivalent to a tree.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
Lecture 12 Designing Databases 12.1 COSC4406: Software Engineering.
MS Access: Creating Relational Databases Instructor: Vicki Weidler Assistant: Joaquin Obieta.
1 Data Warehouses BUAD/American University Data Warehouses.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Instructor: Dema Alorini Database Fundamentals IS 422 Section: 7|1.
1 Introduction to Software Engineering Lecture 1.
CIS 210 Systems Analysis and Development Week 6 Part II Designing Databases,
Databases Shortfalls of file management systems Structure of a database Database administration Database Management system Hierarchical Databases Network.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
1 Digital Preservation Testbed Database Preservation Issues Remco Verdegem Bern, 9 April 2003.
+ Information Systems and Databases 2.2 Organisation.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
CISB113 Fundamentals of Information Systems Data Management.
1 Database Systems Instructor: Nasir Minhas Assistant Professor UIIT PMAS-AAUR
Video in Documentary linguistics Louise Ashmore David Nathan.
Chapter 10 Designing Databases. Objectives:  Define key database design terms.  Explain the role of database design in the IS development process. 
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
1 February 2012 ILCAA, TUFS, Tokyo program David Nathan and Peter Austin Hans Rausing Endangered Languages Project SOAS, University of London Language.
The CEN Metalex Naming Convention Fabio Vitali University of Bologna.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 12 Designing.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Lecture 1 Introduction to Database
Learn about relations and their basic properties
Modern Systems Analysis and Design Third Edition
Data Warehouse.
File Systems and Databases
Data Model.
Introduction to Databases
Database Development Cycle
Emulation: Good or Bad? Emulation as a Digital Preservation Strategy – Stewart Granger Reality and Chimeras in the Preservation of Electronic Records –
Introducing the ELAR information system architecture
Database Design Hacettepe University
Database Systems: Design, Implementation, and Management Tenth Edition
Presentation transcript:

Data Management David Nathan & Peter Austin & Robert Munro

This section 1.Data management 2.Properties of data 3.Relational data model 4.XML 5.Example

Workflows - description vs documentation  something inscribed something happened you applied knowledge, made decisions NOT OF INTEREST! representations, lists, summaries, analyses cleaned up, selected, analysed archived, presented, published something happened  recording you applied knowledge, techniques FOCUS OF INTEREST! representations, eg transcription, annotation made decisions, applied linguistic knowledge archived &... ?? recapitulates Description Documentation

Choosing values/priorities  Standards & compliance  Adeptness with tools  Modelling of phenomena, architecture of data  Dissemination/publishing  Preserving  Ethics, responsibility, protocol  Range, comprehensiveness  Intellectual rigour  Which are priorities?  Which are dispensible?

Data should be:  explicit  consistent  robust  meaningful  conventional  adaptable, convertible, machine readable etc  useful!

“Portability”  Bird and Simons 2003: language documentation data needs to have integrity, flexibility, longevity

“Portability”  complete  explicit  documented  preservable  transferable  accessible  adaptable  not technology-specific  (also appropriate, accurate, useful etc!!)

Data management  the way that data is structured is also information, that may be complex  properly structured data allows:  usage including manipulation, conversion, derivation  preservation  machine readability

Data management systems  a data management system is a system you design for storing data and metadata:  information about content and structures  relationship between units of information  it is not necessarily tied to any particular software, or even a computer

Naive managment using filenames  a (too) simple management system:  information about a recording is captured in the filenames: 1st_int_john_5Aug.wav market_conv_mj.wav ….  what does ‘int’ mean?  what information about the recording is missing?

Data modeling  World/universe  Domain  Relevant  entities  properties  relationships  We also need formal ways to represent these

Data modeling  data modelling is the process of designing your data management system:  what information do you need to record?  what are the units of information?  what are their properties (attributes)?  what are the relationships between the units of information?  how is the information etc likely to change in the future?  how can all this be represented?

Data management  two well-known formats for structured data:  relational database  eXtensible Markup Language (XML)  these are methods, not softwares or hardwares  any system for well-structured data could be OK, but generally:  smaller community of users so less tools and support ... so errors more likely

Databases  Note that database has 3 senses:  a body of related information  type of software (eg Oracle, Access, Filemaker)  a model for the domain of information (ie. formulation of entities and relationships)

Relational format  Uses tables  Table rows represent entities in a domain  Table columns represent properties/attributes of entities  Each cell represents one atomic unit of data  The order of rows and columns has no significance

Representing a relational design field name TABLE NAME  simplest example

Representing a relational design field 1 TABLE NAME  less trivial entity field 2

Representing a relational design  less trivial domain name CONTINENT name COUNTRY = one to many

Non-trivial domains  non-trivial domains have many-to-many relationships name AUTHOR name SUBJECT.....

From model to implementation  implementing table relationships name CONTINENT name COUNTRY id continent_id

Designing a database  Determine the domain, entities and relationships  Experiment with scenarios  Any non-trivial model will evolve as it is thought out and tested  Normalisation is the process of refining models

Practical example  Create a database model for some audio metadata

What does all this achieve?  conceptual/intellectual validity  scalable, searchable, modular  machine readable  in fact, portable:  complete  explicit  documented  preservable  transferable  accessible  adaptable  not technology-specific

Stop here!