Databases: Storage and Retrieval

Presentation transcript:

Databases: storage and retrieval

Different kinds of databases deal with biological information retrieved by various means. The entities covered are DNA, RNA, protein, and phenotype. The data types include DNA sequences (individual genes or complete genomes), cDNA, ESTs, non-coding RNA, protein sequences, translated nucleotide sequences, protein domains, protein structure, diseases, polymorphism, gene expression, and protein-protein interactions.

Common to all databases: A database is a structured collection of information. It is composed of basic objects called records or entries. Each record is composed of fields, which hold defined data related to that record. Let's consider the following database of students learning bioinformatics at HUJI.

Databases
A database can be thought of as a large table, where the rows represent records and the columns represent fields.

| ID | First Name | Last Name | … | Gender | Comments |
|---|---|---|---|---|---|
| /7 | Sharon | Asulin | | female | Likes scuba diving |
| /4 | Nurit | Niv | … | female | Comes from Cuba |
| 03321/3 | Nurit | Sharon | … | female | - |
| 88924/5 | Yossi | Yarkon | … | male | Father of Sharon, must go home earlier |

ID (accession number): the unique identifier of a database record.
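The table above can be sketched in code as a list of records, each mapping a field name to a value. This is a minimal Python illustration, not part of the original slides: the field keys and the helpers `has_unique_ids` and `empty_fields` are hypothetical names, and the ID values are copied exactly as they appear in the transcript, where some look truncated.

```python
# Field keys are hypothetical names chosen for this illustration.
FIELDS = ("id", "first_name", "last_name", "gender", "comments")

# IDs are copied as they appear in the transcript (some look truncated).
# The third record has no comment; the slide shows "-" there.
ROWS = [
    ("/7", "Sharon", "Asulin", "female", "Likes scuba diving"),
    ("/4", "Nurit", "Niv", "female", "Comes from Cuba"),
    ("03321/3", "Nurit", "Sharon", "female", ""),
    ("88924/5", "Yossi", "Yarkon", "male", "Father of Sharon, must go home earlier"),
]

# Each record is a dict: one key per field.
records = [dict(zip(FIELDS, row)) for row in ROWS]


def has_unique_ids(recs):
    """Return True if no two records share the same ID (accession number)."""
    ids = [r["id"] for r in recs]
    return len(ids) == len(set(ids))


def empty_fields(rec):
    """List the fields of a record that hold no data."""
    return [name for name, value in rec.items() if not value]


print(has_unique_ids(records))
print(empty_fields(records[2]))
```

Checking for empty fields is one simple way to look at the "partial information" problem that affects database quality.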

What can we learn about fields? Some fields are more defined (gender: male or female) and some are less defined (comments). A better database will try to store information in well-defined fields. Some records contain similar data in some of the fields. For some records there is only partial information: some fields contain no data (this affects the quality of the database). Each record needs a unique identifier.

Data Retrieval
The purpose of databases is not merely to collect and organize data, but mainly to allow advanced data retrieval. A query is a method to retrieve information from the database. The organization of each record into predetermined fields allows us to run queries on fields.

The best search strategy…

1. Think: phrase your scientific question.
2. Choose the appropriate database.
3. Phrase your query (today's topic): keywords, fields, Boolean operators, syntax.
4. Access additional entries discussing the same or similar entities by links to additional databases (DBXref).
Think, evaluate. The computer is just a machine. You are (hopefully) a thinking organism.

The secretary wants to locate the record of the student Sharon Asulin but does not remember the last name, so the search term is Sharon. The search was not limited to a certain field: Sharon[all fields]. In NCBI syntax, the keyword (Sharon) is followed by the field definition in square brackets ([all fields]).

| ID | First Name | Last Name | … | Gender | Comments |
|---|---|---|---|---|---|
| /7 | Sharon | Asulin | | female | Likes scuba diving |
| /4 | Nurit | Niv | … | female | Comes from Cuba |
| 03321/3 | Nurit | Sharon | … | female | Receives scholarship |
| 88924/5 | Yossi | Yarkon | … | male | Proud father of Sharon |

Oops! The search retrieved too many records that don't match the required data: too much noise.

Evaluating Search Results

| "Scientific truth" | Search result: found (+) | Search result: not found (-) |
|---|---|---|
| Related | True positive | False negative |
| Unrelated | False positive | True negative |

| ID | First Name | Last Name | … | Gender | Comments | Result |
|---|---|---|---|---|---|---|
| /7 | Sharon | Asulin | | female | Likes scuba diving | True positive (first name) |
| /4 | Nurit | Niv | … | female | Comes from Cuba | |
| 03321/3 | Nurit | Sharon | … | female | Receives scholarship | False positive (last name) |
| 88924/5 | Yossi | Yarkon | … | male | Proud father of Sharon | False positive (comments) |

What can we do to reduce or eliminate false positives without reducing true positives?

Sensitivity: the ability of a method to detect positives, irrespective of how many false positives are reported.
Selectivity: the ability of a method to reject negatives, irrespective of how many false negatives are rejected.
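The two measures can be put into numbers. The sketch below assumes the standard ratio definitions, which the slides do not state explicitly, and reads the slide's "selectivity" as what is often called specificity. The function names are hypothetical, and the counts come from the Sharon[all fields] search on the four-record student example.

```python
def sensitivity(tp, fn):
    """Fraction of related records that the search found: TP / (TP + FN)."""
    return tp / (tp + fn)


def selectivity(tn, fp):
    """Fraction of unrelated records that the search rejected: TN / (TN + FP)."""
    return tn / (tn + fp)


# Sharon[all fields] on the four-record example:
# 1 true positive, 0 false negatives, 2 false positives, 1 true negative.
tp, fn, fp, tn = 1, 0, 2, 1

print(f"sensitivity = {sensitivity(tp, fn):.2f}")
print(f"selectivity = {selectivity(tn, fp):.2f}")
```

The unrestricted search is fully sensitive (it finds the one related record) but poorly selective (it rejects only one of the three unrelated records).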

Let's refine our search
Find all students whose first name is Sharon: Sharon[first name]. In NCBI syntax, the keyword (Sharon) is followed by the field definition ([first name]).

| ID | First Name | Last Name | … | Gender | Comments |
|---|---|---|---|---|---|
| /7 | Sharon | Asulin | | female | Likes scuba diving |
| /4 | Nurit | Niv | … | female | Comes from Cuba |
| 03321/3 | Nurit | Sharon | … | female | Receives scholarship |
| 88924/5 | Yossi | Yarkon | … | male | Father of Sharon, must go home earlier |
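A field-restricted query in the keyword[field] form can be sketched like this. The parser, the `FIELD_KEYS` mapping, and the matching rules (exact match within a named field, substring match for "all fields") are assumptions made for this Python illustration; they are not the actual NCBI implementation.

```python
import re

# Hypothetical field keys; IDs copied as they appear in the transcript.
FIELDS = ("id", "first_name", "last_name", "gender", "comments")
ROWS = [
    ("/7", "Sharon", "Asulin", "female", "Likes scuba diving"),
    ("/4", "Nurit", "Niv", "female", "Comes from Cuba"),
    ("03321/3", "Nurit", "Sharon", "female", "Receives scholarship"),
    ("88924/5", "Yossi", "Yarkon", "male", "Father of Sharon, must go home earlier"),
]
records = [dict(zip(FIELDS, row)) for row in ROWS]

# Maps the field label written in the query to the record key (assumed names).
FIELD_KEYS = {
    "id": "id",
    "first name": "first_name",
    "last name": "last_name",
    "gender": "gender",
    "comments": "comments",
}


def parse_query(query):
    """Split 'keyword[field]' into (keyword, lower-cased field label)."""
    match = re.fullmatch(r"\s*(.+?)\s*\[(.+?)\]\s*", query)
    if match is None:
        raise ValueError(f"expected keyword[field], got {query!r}")
    return match.group(1), match.group(2).lower()


def search(recs, query):
    """Run one keyword[field] query against the records."""
    keyword, field = parse_query(query)
    needle = keyword.lower()
    if field == "all fields":
        # Unrestricted: the keyword may occur anywhere in any field.
        return [r for r in recs if any(needle in v.lower() for v in r.values())]
    # Restricted: the named field must equal the keyword.
    key = FIELD_KEYS[field]
    return [r for r in recs if r[key].lower() == needle]


print([r["id"] for r in search(records, "Sharon[first name]")])
print([r["id"] for r in search(records, "Sharon[all fields]")])
```

Restricting the query to the first-name field removes the two records that matched only through the last name or the comments.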

| ID | First Name | Last Name | … | Gender | Comments |
|---|---|---|---|---|---|
| /7 | Sharom | Asulin | | female | Likes scuba diving |
| /4 | Nurit | Niv | … | female | Comes from Cuba |
| 03321/3 | Nurit | Sharon | … | female | Receives scholarship |
| 88924/5 | Yossi | Yarkon | … | male | Father of Sharon, must go home earlier |

Here the first name in the first record is stored as Sharom. Now we don't retrieve any answer (a false negative?), and we are still not distracted by the noise. The original search phrase sharon[all fields] would have retrieved all the noise but not the required information.

Boolean Operators
With set 1 = cell and set 2 = cycle:
cell AND cycle (1 AND 2)
cell OR cycle (1 OR 2)
cell NOT cycle (1 NOT 2)
"cell cycle": the exact phrase
cell*: a wildcard that matches cell, cells, cellular, etc.
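The Boolean operators correspond to set operations on the sets of matching records. The sketch below uses four made-up sentences as a toy collection. The helper names `matching` and `phrase` are hypothetical, and whole-word matching is an assumption made for the illustration.

```python
# Toy collection: document ID -> text (made-up sentences for illustration).
docs = {
    1: "the cell cycle is tightly regulated",
    2: "cells divide by mitosis",
    3: "the Krebs cycle produces energy",
    4: "cellular respiration in each cell",
}


def matching(term):
    """IDs of documents containing the term as a whole word.

    A trailing * turns the term into a prefix wildcard (cell* matches cells).
    """
    term = term.lower()
    if term.endswith("*"):
        prefix = term[:-1]
        return {i for i, text in docs.items()
                if any(word.startswith(prefix) for word in text.lower().split())}
    return {i for i, text in docs.items() if term in text.lower().split()}


def phrase(words):
    """IDs of documents containing the exact phrase."""
    return {i for i, text in docs.items() if words.lower() in text.lower()}


cell, cycle = matching("cell"), matching("cycle")

print("cell AND cycle:", sorted(cell & cycle))    # set intersection
print("cell OR cycle: ", sorted(cell | cycle))    # set union
print("cell NOT cycle:", sorted(cell - cycle))    # set difference
print('"cell cycle":  ', sorted(phrase("cell cycle")))
print("cell*:         ", sorted(matching("cell*")))
```

AND narrows the result, OR broadens it, and NOT removes records, which is why the choice of operator directly trades sensitivity against selectivity.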

The secretary wants to locate the record of the female student who comes from Cuba but does not remember her name. The search is female[gender] AND *cuba*[comments], which combines keywords, field definitions (NCBI syntax), and a Boolean operator.

| ID | First Name | Last Name | … | Gender | Comments | Result |
|---|---|---|---|---|---|---|
| /7 | Sharon | Asulin | | female | Likes scuba diving | False positive |
| /4 | Nurit | Niv | … | female | Comes from Cuba | True positive |
| 03321/3 | Nurit | Sharon | … | female | Receives scholarship | |
| 88924/5 | Yossi | Yarkon | … | male | Proud father of Sharon | |
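The combined query can be sketched with the standard-library `fnmatch` module, whose `*` wildcard stands for any run of characters. The field keys and `field_matches` are hypothetical names. The whole-word variant at the end is one possible refinement added for illustration; it is not something the slides prescribe.

```python
from fnmatch import fnmatchcase

# Hypothetical field keys; IDs copied as they appear in the transcript.
FIELDS = ("id", "first_name", "last_name", "gender", "comments")
ROWS = [
    ("/7", "Sharon", "Asulin", "female", "Likes scuba diving"),
    ("/4", "Nurit", "Niv", "female", "Comes from Cuba"),
    ("03321/3", "Nurit", "Sharon", "female", "Receives scholarship"),
    ("88924/5", "Yossi", "Yarkon", "male", "Proud father of Sharon"),
]
records = [dict(zip(FIELDS, row)) for row in ROWS]


def field_matches(rec, key, pattern):
    """Case-insensitive wildcard match of one field against a pattern."""
    return fnmatchcase(rec[key].lower(), pattern.lower())


# female[gender] AND *cuba*[comments]
hits = [r for r in records
        if field_matches(r, "gender", "female")
        and field_matches(r, "comments", "*cuba*")]
print([r["id"] for r in hits])

# Possible refinement (an assumption): require "cuba" as a whole word.
word_hits = [r for r in records
             if field_matches(r, "gender", "female")
             and "cuba" in r["comments"].lower().split()]
print([r["id"] for r in word_hits])
```

The wildcard on both sides of cuba is what lets "scuba" through. Matching on whole words removes that false positive while keeping the true positive.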

And the main thing, the main thing: do not be afraid at all.