Bioinformatics Primer. HC Lee, July 2000.

Presentation transcript:

Bioinformatics Primer HC Lee 2000 July

What is Bioinformatics?
Biomedical/biotechnical information
Reproduction and annotation of biosequences – DNA and protein
Publicly accessible data banks on the web
Research & computation based on data
Software for such research
Links: NCBI education, review

The Human Genome Project: WHAT?
An international research effort to characterize the genomes of humans and selected model organisms through complete mapping and sequencing of their DNA.

The Human Genome Project: WHY?
To develop technologies for genomic analysis
To examine the ethical, legal, and social implications of human genetics research
To train scientists who will be able to use the tools and resources developed through the HGP to pursue biological studies that will improve human health

The Human Genome Project: WHEN and WHERE?
WHEN? Started in 1988
WHERE?
– First at DOE and NIH, then
– National Human Genome Research Institute, National Institutes of Health (USA)
Now also Europe and Japan
Many national projects
Website: HGP

The Data Banks - NCBI/EMBL/DDBJ
International DNA Sequence Database Collaboration (home)
– NCBI (GenBank) – USA (1982), homepage
– EMBL – Europe (1982), homepage
– DDBJ – Japan (1988), homepage

Growth of Data
Began in …
Slow growth in the first 10 years
First complete genome in 1995
First billion base pairs in 1997
Current size: 8.2 billion bp
Doubles every 6 months
Statistics: GenBank (gbkstats), EMBL (emblstats), DDBJ (ddbj)
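The doubling figure on this slide can be turned into a rough projection. The sketch below assumes steady exponential growth from the slide's 8.2 billion bp with a 6-month doubling time; the function names are illustrative, and real database growth need not follow this curve.

```python
import math


def projected_size(current_bp: float, months_ahead: float,
                   doubling_months: float = 6.0) -> float:
    """Size after `months_ahead` months, assuming a constant doubling time."""
    return current_bp * 2 ** (months_ahead / doubling_months)


def months_to_reach(current_bp: float, target_bp: float,
                    doubling_months: float = 6.0) -> float:
    """Months needed to grow from current_bp to target_bp under the same assumption."""
    return doubling_months * math.log2(target_bp / current_bp)


CURRENT_BP = 8.2e9  # figure quoted on the slide (mid-2000)

if __name__ == "__main__":
    for months in (6, 12, 24):
        print(months, "months:", projected_size(CURRENT_BP, months), "bp")
    print("months to reach 100 billion bp:", months_to_reach(CURRENT_BP, 1e11))
```

Under this assumption the data bank quadruples every year, which is why the projection should be read as an upper-end illustration, not a forecast.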

NCBI - National Center for Biotechnology Information
Established in the USA in 1988 as a national resource for molecular biology information, NCBI creates public databases, conducts research in computational biology, develops software tools for analyzing genome data, and disseminates biomedical information, all for the better understanding of molecular processes affecting human health and disease.

GenBank – data bank of NCBI
PubMed – publications in the life sciences (pubmed)
Taxonomy – Tree of Life (taxonomy)
Structure – 3D structures of proteins (structure)
Entrez – databank homepage

The Human Genome
1999 December 2
– Chromosome 22 completed (47.7 Mb)
2000 May 8
– Chromosome 21 completed (50.0 Mb)
2000 June 26
– Working Draft of complete human genome: 97% coverage, 85% complete (page)

A Genome
Entrez (psfile)
– Genome (query)
– Bacteria (eubac)
Haemophilus influenzae
– Complete genome (frame)
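The slide walks through Entrez by clicking links. As a hedged present-day counterpart, the sketch below only builds a search URL for NCBI's E-utilities ESearch endpoint; it makes no network request. The choice of the `nucleotide` database, the search term, and the helper name `build_esearch_url` are assumptions for illustration, not part of the original presentation.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities ESearch service.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_esearch_url(db: str, term: str, retmax: int = 20) -> str:
    """Return an ESearch URL for database `db` and Entrez query `term`."""
    query = urlencode({"db": db, "term": term, "retmax": retmax})
    return f"{EUTILS_ESEARCH}?{query}"


if __name__ == "__main__":
    # Assumed example query: complete-genome records for the slide's organism.
    print(build_esearch_url(
        "nucleotide",
        "Haemophilus influenzae[Organism] AND complete genome[Title]",
    ))
```

Opening the printed URL in a browser returns an XML list of matching record IDs, which is roughly what the slide's "query" link does interactively.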

A Gene
Haemophilus influenzae
– First contig (qmap)
» First gene (prot1)

Relatives of Gene
Homologies: peptide sequences with high similarity
BLAST search (webpage)
– Search (query1)
– Result (AAC216)
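"High similarity" between peptide sequences can be made concrete with a percent-identity score. The sketch below is a deliberately simple measure over two already-aligned sequences of equal length; it is not how BLAST works (BLAST uses heuristic local alignment with substitution-matrix scoring), and the example peptides are made up.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Columns containing a gap count as mismatches.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # Hypothetical peptides differing at one position out of eight.
    print(percent_identity("MKTAYIAK", "MKTAHIAK"))
```

A search tool then ranks database sequences by such scores (in practice by alignment score and statistical significance) to report likely relatives of the query gene.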

Protein Data Bank
PDB homepage (web)
Search for protein (searchlite): glyceraldehyde-3-phosphate dehydrogenase
– Result: 1A7K
View (view)

Human Genome Project -
The three main DNA banks: GenBank - EMBL - DDBJ -
Bioinformatics and Computational Biology sites
Integrated Bioinformatics site in TW - www2.nchc.gov.tw/~c00chh00/bioinfo.html

Important protein database - top.html
Mirror site in TW - expasy.nhri.org.tw/sprot/sprot-top.html
An internet course on protein structure
The main protein databank (USA) -
List of good links -