Introduction to databases from a bioinformatics perspective Misha Taylor.

Slides:



Advertisements
Similar presentations
Technology Guide 3 Data and Database T3-1. IT for Management Prof. Efraim Turban T3-2 File Management Hierarchy of data for a computer-based file Record.
Advertisements

Managing Data Resources
Managing data Resources: An information system provides users with timely, accurate, and relevant information. The information is stored in computer files.
Copyright ©2004 Cezary Z Janikow 1 Domain Model n Visualization of entities and relationships n In UP presented as Class Diagrams – Classes, Relationships,
Oct 31, 2000Database Management -- Fall R. Larson Database Management: Introduction to Terms and Concepts University of California, Berkeley School.
Chapter 6 Methodology Conceptual Databases Design Transparencies © Pearson Education Limited 1995, 2005.
CSCI 3 Introduction to Computer Science. CSCI 3 Course Description: –An overview of the fundamentals of computer science. Topics covered include number.
3-1 Chapter 3 Data and Knowledge Management
Databases and Processing Modes. Fundamental Data Storage Concepts and Definitions What is an entity? An entity is something about which information is.
1 Database Systems (Part I) Introduction to Databases I Overview  Objectives of this lecture.  History and Evolution of Databases.  Basic Terms in Database.
Introduction to Databases
Lecture Fourteen Methodology - Conceptual Database Design
1 Lecture 31 Introduction to Databases I Overview  Objectives of this lecture  History and Evolution of Databases  Basic Terms in Database and definitions.
Database Design Concepts Info 1408 Lecture 2 An Introduction to Data Storage.
“DOK 322 DBMS” Y.T. Database Design Hacettepe University Department of Information Management DOK 322: Database Management Systems.
© Prentice Hall CHAPTER 3 Computer Software.
8/28/97Information Organization and Retrieval Files and Databases University of California, Berkeley School of Information Management and Systems SIMS.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education, Inc. publishing as Prentice Hall 4-1.
Methodology Conceptual Database Design
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
INTRODUCTION TO DATABASE USING MS ACCESS 2013 PART 2 NOVEMBER 4, 2014.
CHAPTER 9 DATABASE MANAGEMENT © Prepared By: Razif Razali.
Overview of the Database Development Process
Information storage: Introduction of database 10/7/2004 Xiangming Mu.
ITEC224 Database Programming
CST203-2 Database Management Systems Lecture 2. One Tier Architecture Eg: In this scenario, a workgroup database is stored in a shared location on a single.
Chapter 16 Methodology - Conceptual Database Design.
Methodology - Conceptual Database Design Transparencies
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
Methodology Conceptual Databases Design
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
CSCI 3140 Module 2 – Conceptual Database Design Theodore Chiasson Dalhousie University.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
DATABASE MGMT SYSTEM (BCS 1423) Chapter 5: Methodology – Conceptual Database Design.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Database A database is a collection of data organized to meet users’ needs. In this section: Database Structure Database Tools Industrial Databases Concepts.
Announcements. Data Management Chapter 12 Traditional File Approach  Structure Field  Record  File  Fixed All records have common fields, and a field.
CS370 Spring 2007 CS 370 Database Systems Lecture 4 Introduction to Database Design.
Methodology - Conceptual Database Design
FEN Introduction to the database field:  Applications, concepts and terminology Seminar: Introduction to relational databases.
Databases: An Overview Chapter 7, Exploring the Digital Domain.
1 Mapping to Relational Databases Presented by Ramona Su.
Introduction to Database Tonga Institute of Higher Education NOS 215.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Chapter 9 Database Systems © 2007 Pearson Addison-Wesley. All rights reserved.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
DATA Spatial Data – where things are Non Spatial Data or Attribute Data – What things are Data in a computer database are managed and accessed through.
2/20: Ch. 6 Data Management What is data? How is it stored? –Traditional management storage techniques; problems –DBMS.
Chapter 13.3: Databases Invitation to Computer Science, Java Version, Second Edition.
Jemerson Pedernal IT 2.1 FUNDAMENTALS OF DATABASE APPLICATIONS by PEDERNAL, JEMERSON G. [BS-Computer Science] Palawan State University Computer Network.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
The Database Concept and the Database Management System (DBMS) Databases.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
SQL Basics Review Reviewing what we’ve learned so far…….
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
Methodology Conceptual Databases Design
Methodology Conceptual Database Design
Databases and Database Management Systems Chapter 9
Database.
Database Systems Instructor Name: Lecture-3.
Review of Week 1 Database DBMS File systems vs. database systems
Database Design Hacettepe University
Methodology Conceptual Databases Design
Presentation transcript:

Introduction to databases from a bioinformatics perspective Misha Taylor

Overview Background Flat text files ISAM Databases SQL/Relational Databases Object-Oriented/XML Databases The Future

What is “informatics” Derived from the French word informatique Tends to get associated with specific application areas – Medical informatics – Bioinformatics – Nursing informatics – Business informatics (MIS/IMS) – Social-science informatics

A good definition Informatics is the science that deals with information, its structure, its acquisition and its use

Informatics is not computer science Emphasis is on the acquisition, modeling, and representation of data and knowledge – not on the building of computational artifacts However, understanding computational artifacts very much helps to illustrate the underlying principles It’s impossible to provide examples of the principles independent of any application domain

Informatics is about systems modeling Creating and enhancing models of application areas Identifying relationships among models Creating algorithms that can automate domain tasks

Informatics is about knowledge and its representation Conceptualizing the knowledge required to drive applications Building useful, maintainable systems Developing better methods for management of knowledge within organizations and scientific communities

Problem-solving knowledge automates specific tasks Domain knowledge + Problem-solving method Intelligent behavior

Databases & Knowledge Databases are a tool for storing knowledge – Data – Relationships

A parable: Amazon vs. CDNOW

Database concepts Entity – thing that is being stored and is representative of something in the real world Attribute – descriptor of an entity Relationships

Flat text files Flat text files can act as the basis of these concepts (entity, attribute, relationships)

But… Most applications require that specific information can be quickly and efficiently retrieved Sometimes critical that performance does not degrade as more entities are added Flat text files don’t always fulfill these requirements, especially when there are many entities and/or relationships

Solution – indexes and keys Performance requirement is most often met through the use of indexes or keys More sophisticated database paradigms – ISAM – SQL/Relational – Object-oriented/XML

What is ISAM? Indexed Sequential Access Method Used in: – Cobol – Btrieve – dBase – FoxPro – Faircom c-tree Plus

ISAM Entities are records Attributes are understood to be data stored starting at a specific offset in the record Data & indexes are stored in files Applications are responsible for maintaining relationships and knowing which set of records is in which file

ISAM (contd.) ISAM database/library manages index and data files

SQL/Relational Entities are represented by rows Collections of entities are represented as tables Collections of entities and attributes may be arbitrarily defined at runtime. Applications are not responsible for maintaining relationships, but are responsible for conforming to the model

SQL/Relational (contd.) Incorporates an easy-to-use query language - SQL

Object-oriented/XML Ties data and behavior together - entities are objects, which have both attributes and methods XML is used as a portable persistance mechanism Applications can discover data and relationships at runtime – need not conform to an application-specific model

Comparing ISAM, SQL/Relational, and OO/XML ISAMSQL/RelationalXML User operates on file User operates on a file within a database User operates on objects The file may contain multiple entity types The table has a single defined entity type Objects may encapsulate multiple entity types

Comparing ISAM, SQL/Relational, and OO/XML (contd.) ISAMSQL/RelationalXML All instances of an entity type are contained in one file All instances of an entity type are maintained in one table Instances of an entity type may occur in multiple objects Every instance of a given entity type has the same composition. Every instance of a given entity type may have a different composition.

Comparing ISAM, SQL/Relational, and OO/XML ISAMSQL/RelationalXML The application is responsible for extracting attributes from entity instances The DBMS is responsible for extracting attributes from entity instances The data contains the description of the attributes for any particular entity instance. Relationships are maintained by the application code. Relationships are maintained by the DBMS. Relationships are described within the data itself.

Comparing ISAM, SQL/Relational, and OO/XML ISAMSQL/RelationalXML Indexes are granular to the file level Indexes are granular to the DBMS- understood table level Indexes must be granular to the element level.