Bioinformatics and Computational Biology

Slides:



Advertisements
Similar presentations
On line (DNA and amino acid) Sequence Information Lecture 7.
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Basics of Comparative Genomics Dr G. P. S. Raghava.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
BIOINFORMATICS Ency Lee.
Bioinformatics What is bioinformatics? Why bioinformatics? The major molecular biology facts Brief history of bioinformatics Typical problems of bioinformatics:
How to use the web for bioinformatics Molecular Technologies Ethan Strauss X 1171
GENBANK, SWISSPROT AND OTHERS As Problem Sources for CSE 549 Andriy Tovkach Genetics.
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Archives and Information Retrieval
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Bioinformatics and Phylogenetic Analysis
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
The Cell, Central Dogma and Human Genome Project.
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
From T. MADHAVAN, & K.Chandrasekaran Lecturers in Zoology.. EXIT.
Making Sense of DNA and protein sequence analysis tools (course #2) Dave Baumler Genome Center of Wisconsin,
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Chapter 6 Gene Prediction: Finding Genes in the Human Genome.
On line (DNA and amino acid) Sequence Information
Bioinformatics.
Development of Bioinformatics and its application on Biotechnology
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
NCBI Review Concepts Chuong Huynh. NCBI Pairwise Sequence Alignments Purpose: identification of sequences with significant similarity to (a)
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
1 Review of Biological Database Utilization. 2 Biological Databases We will discuss: Usefulness to the bioinformaticist Database types Search methods.
CSCI 6900/4900 Special Topics in Computer Science Automata and Formal Grammars for Bioinformatics Bioinformatics problems sequence comparison pattern/structure.
Organizing information in the post-genomic era The rise of bioinformatics.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
ARE THESE ALL BEARS? WHICH ONES ARE MORE CLOSELY RELATED?
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
From Genomes to Genes Rui Alves.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
EB3233 Bioinformatics Introduction to Bioinformatics.
How can we find genes? Search for them Look them up.
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
JIGSAW: a better way to combine predictions J.E. Allen, W.H. Majoros, M. Pertea, and S.L. Salzberg. JIGSAW, GeneZilla, and GlimmerHMM: puzzling out the.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
BASICS OF BIOINFORMATICS Biotechnology Division North-East Institute of Science & Technology (Council of Scientific & Industrial Research) Jorhat ,
BME435 BIOINFORMATICS.
Bioinformatics for Research
bacteria and eukaryotes
Introduction to Genes and Genomes with Ensembl
Basics of Comparative Genomics
Archives and Information Retrieval
Mangaldai College, Mangaldai
GEP Annotation Workflow
Genome Center of Wisconsin, UW-Madison
Genomes and Their Evolution
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Ensembl Genome Repository.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Basics of Comparative Genomics
Gene Safari (Biological Databases)
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Bioinformatics and Computational Biology

Bioinformatics collection and storage of biological information derives knowledge from computer analysis of biological data Computational biology development of algorithms and statistical models to analyze biological data

Why bioinformatics is critical? Few people adequately trained in both biology and computer science Genome sequencing, microarrays etc. lead to large amounts of data to be analyzed Leads to important discoveries Saves time and money

Why is the relationship between Computer Science and Biology is essential? Three main reasons- First, massive amounts of data have to be stored, analyzed and made accessible Second, the nature of the data is often such that a computational statistical method is necessary. This applies in particular to the information on the building plans of proteins and spatial organization of their expression in the cell encoded by the DNA. Third, there is a strong analogy between the DNA sequence and a computer program

Key Areas/Scope of Bioinformatics Organizing biological knowledge in database Analysing sequence data Structural Bioinformatics Pharmacological relevance (Population genetics)

1. Organizing biological knowledge in database Genbank/Organized DNA sequences - NCBI, EMBL Protein sequence databank and its structure and functional characteristics. For example, SWISSPROT contains verified protein sequences and more annotations describing the function of a protein Literature database – PUBMED, MEDLINE

2. Analysing sequence data Establish the correct order of sequence contigs Find the translation and transcription initiation sites, find promoter sites, define open reading frames (ORF) Find splice sites, introns, exons Translate the DNA sequence into a protein sequence Compare the DNA sequence to known protein sequences in order to verify exons etc with homologous sequences. Multiple sequence alignments Studying evolutionary aspects, by the construction of phylogenetic trees Determining active site residues, and residues specific for subfamilies Predicting protein–protein interactions Analysing single nucleotide polymorphism to hunt for genetic sources of diseases.

3. Structural Bioinformatics This branch of bioinformatics is concerned with computational approaches to predict and analyse the spatial structure of proteins and nucleic acids. multiple sequence alignment, secondary structure, 3D structure can be predicted with an accuracy above 70 %.

4. Pharmacological relevance Drug targets in infectious organisms can be revealed by whole genome comparisons of infectious and non–infectious organisms. The analysis of single nucleotide polymorphisms reveals genes potentially responsible for genetic diseases. Prediction and analysis of protein 3D structure is used to develop drugs and understand drug resistance. Patient databases with genetic profiles, e.g. for cardiovascular diseases, diabetes, cancer, etc. may play an important role in the future for individual health care, by integrating personal genetic profile (population genetics) into diagnosis.

Genomic Browsers National Center for Biotechnology information (NCBI) (http://ncbi.nlm.nih.gov) Ensembl Genome Browser (http://www.ensembl.org) UCSC Genome Browser (http://genome.ucsc.edu/) WormBase (http://www.wormbase.org/) AceDB (http://www.acedb.org/) FlyBase (http://flybase.bio.indiana.edu/)

Protein databses SWISS-PROT/TrEMBL curated protein sequences http://www.expasy.ch/sprot InterPro: Protein families and domains http://www.ebi.ac.uk/interpro EXProt: proteins with experimentally verified functions http://www.cmbi.nl/exprot Protein Information Resource (PIR) http://pir.georgetown.edu/

NCBI

Continued..

NCBI text search of a protein

Abstract finding by NCBI

Nucleotide search of a typical gene

Continued..

FASTA format

FASTA: FASTA format is a text-based format for representing either nucleic acid sequences or protein sequences, in which base pairs or protein residues are represented using single letter codes.