Bioinformatics (Lecture 1)
- What is bioinformatics?
- Why bioinformatics?
- The major molecular biology facts
- Brief history of bioinformatics
- Typical problems of bioinformatics:
  - collection and retrieval of data
  - alignment and similarity search
  - prediction and classification
- Expectations and the level of requirements

What is bioinformatics?
A field at the intersection of biology, computer science, and mathematics and statistics.

What is bioinformatics?
A working definition is that of the House of Representatives Standing Committee on Primary Industries and Regional Services Inquiry: "All aspects of gathering, storing, handling, analyzing, interpreting and spreading vast amounts of biological information in databases. The information involved includes gene sequences, biological activity/function, pharmacological activity, biological structure, molecular structure, protein-protein interactions, and gene expression. Bioinformatics uses powerful computers and statistical techniques to accomplish research objectives, for example, to discover a new pharmaceutical or herbicide."

Areas of current and future development of bioinformatics
- Molecular biology and genetics
- Phylogenetic and evolutionary sciences
- Different aspects of biotechnology, including the pharmaceutical and microbiological industries
- Medicine
- Agriculture
- Eco-management

Why bioinformatics?
- Exponential growth of investments
- Constant deficit of trained professionals
- Diversification of bioinformatics applications
- Need for different types of bioinformaticians

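Central Dogma of Molecular Biology
- Genotype (e.g. Aa) -> phenotype (pink)
- Gene (DNA) -> messenger (RNA) -> protein -> trait
- DNA: ATGCAAGTCCACTGTATTCCA
- RNA: UACGUUCAGGUGACAUAAGGG
- Processes: replication (DNA -> DNA), transcription (DNA -> RNA), reverse transcription (RNA -> DNA), translation (RNA -> protein)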

DNA

Symbol  Meaning         Explanation
G       G               Guanine
A       A               Adenine
T       T               Thymine
C       C               Cytosine
U       U               Uracil (RNA)
R       A or G          puRine
Y       C or T          pYrimidine
N       A, C, G or T    Any base

Double helix:
5' A C G T C A T G 3'
3' T G C A G T A C 5'   (template)

RNA:
5' A C G U C A U G 3'
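The pairing rules and ambiguity symbols on this slide can be sketched in a few lines of Python. This is a minimal illustration and not part of the original lecture: the names `complement`, `transcribe` and `matches` are hypothetical, and the template strand is assumed to be written 3' to 5' so that the resulting RNA reads 5' to 3'.

```python
# Watson-Crick complement for the four DNA bases.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Ambiguity symbols from the table: each symbol stands for a set of bases.
AMBIGUITY = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",    # puRine
    "Y": "CT",    # pYrimidine
    "N": "ACGT",  # any base
}


def complement(strand: str) -> str:
    """Return the base-by-base complement, written in the same left-to-right order."""
    return strand.translate(DNA_COMPLEMENT)


def transcribe(template_3_to_5: str) -> str:
    """Build the RNA (5' to 3') paired with a template strand given 3' to 5'."""
    return complement(template_3_to_5).replace("T", "U")


def matches(symbol: str, base: str) -> bool:
    """True if a concrete base is covered by an ambiguity symbol."""
    return base in AMBIGUITY[symbol]


if __name__ == "__main__":
    top = "ACGTCATG"                 # upper strand of the double helix, 5' to 3'
    template = complement(top)       # lower strand, 3' to 5'
    print(template)                  # TGCAGTAC
    print(transcribe(template))      # ACGUCAUG
    print(matches("R", "G"), matches("Y", "G"))
```

The RNA has the same sequence as the upper DNA strand with U in place of T, which is why that strand is often called the coding strand.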

Genetic Code
1. Amino acids are coded by codons: triplets of nucleotides, e.g. |ACG|TAT|...
2. There are 4^3 = 64 codons for ~20 amino acids, so the code is degenerate.
3. Codons do not overlap.
4. Deletions or insertions of one or a few nucleotides (any number that is not a multiple of 3) usually destroy a message by shifting the reading frame.
5. Three specific codons (stop codons) do not code for any amino acid and are always located at the very end of the protein-coding part of a gene.
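The five points above can be checked with a short Python sketch. It assumes the standard genetic code (the lecture does not say which code table it uses) and reads a DNA coding sequence directly, with T in place of U. The names `CODON_TABLE` and `translate` are illustrative, and the test sequence is the DNA string from the central dogma slide.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate non-overlapping codons from position 0 until a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":      # stop codons code for no amino acid
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    stops = sorted(c for c, aa in CODON_TABLE.items() if aa == "*")
    print(len(CODON_TABLE), "codons,", len(set(AMINO_ACIDS) - {"*"}), "amino acids")
    print("stop codons:", stops)

    gene = "ATGCAAGTCCACTGTATTCCA"
    print(translate(gene))                # MQVHCIP

    # Deleting one nucleotide shifts the reading frame for everything after it.
    shifted = gene[:3] + gene[4:]
    print(translate(shifted))             # MKSTVF
```

Deleting three nucleotides instead of one would remove a single amino acid and leave the rest of the message in frame, which is the contrast point 4 is making.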

The genetic code

The 20 amino acids common in living organisms

PROTEINS
Green Fluorescent Protein (GFP)
  1 mcgkkfelki dnvrfvghpt llqpphtiqa sktdpspkre lptmilfsvv falranadas
 61 viscmhnlsr riaialqhee rrcqyltrea klmlamqdev ttiidsdgsp qspfrqilpk
121 cklardlkea ydslcttgvv rlhinnwlev sfclphkihr vggkhiplea lerslkairp

Genomic Hierarchy in Eukaryotes (human figures)
- Nuclear genome (1)
- Chromosomes (23 x 2)
- DNA molecules (23 x 2)
- Genes (~30,000); only a small fraction of the genome
- Nucleotides (~3 x 10^9)

Eukaryotic genes are complex
Figure labels: promoter; exons 1 to 4 separated by introns 1 to 3; start codon; stop codon; protein coding regions.
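To make the exon/intron layout concrete, here is a small Python sketch of joining exons once their coordinates are known. The toy sequence, the coordinates and the function name `splice` are all invented for illustration; real exon boundaries come from annotation or gene prediction.

```python
def splice(gene: str, exons: list[tuple[int, int]]) -> str:
    """Concatenate exon segments given as (start, end) with end exclusive."""
    return "".join(gene[start:end] for start, end in exons)


if __name__ == "__main__":
    # Toy gene: exons in upper case, one intron in lower case (hypothetical data).
    gene = "ATGGCC" + "gtaagttcag" + "AAATAG"
    exons = [(0, 6), (16, 22)]
    print(splice(gene, exons))   # ATGGCCAAATAG
```

The intron is simply skipped, so the start codon in the first exon and the stop codon in the last exon end up in one continuous reading frame.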

Brief history of bioinformatics: Databases
- The first biological database, the Protein Identification Resource, was established in 1972 by Margaret Dayhoff.
- Dayhoff and co-workers organized the proteins into families and superfamilies based on the degree of sequence similarity.
- The idea of sequence alignment was introduced, as well as special tables that reflected the frequency of changes observed in the sequences of a group of closely related proteins.
- Currently there are several huge protein banks: SwissProt, PIR International, etc.
- The first DNA database was established in [year missing in the transcript].
- Currently there are several powerful databases: GenBank, EMBL, DDBJ, etc.

Brief history of bioinformatics: evolutionary reconstructions

Brief history of bioinformatics: other important steps
- Development of sequence retrieval methods (dates missing)
- Development of principles of sequence alignment (1980s)
- Prediction of RNA secondary structure (1980s)
- Prediction of protein secondary structure and 3D (dates missing)
- The FASTA and BLAST methods for DB search (dates missing)
- Prediction of genes (1990s)
- Studies of complete genome sequences (late 1990s to 2000s)

Collection and retrieval of data. Alignment methods.
- Sequencing (DNA, proteins)
- Submission of sequences to the databases
- Computer storage of sequences
- Development of sequence formats
- Conversion of one sequence format to another
- Development of retrieval and alignment methods
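As an example of converting one sequence format to another, the sketch below turns a numbered, space-grouped listing (the GenBank-style layout used for the protein sequence earlier in the lecture) into FASTA text. It is a simplified illustration: the helper names and the record name `example_protein` are made up, and a real converter would also handle headers, annotations and malformed input.

```python
def numbered_block_to_sequence(block: str) -> str:
    """Strip position numbers and spacing from a numbered sequence listing."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if not token.isdigit():      # skip the leading position number
                residues.append(token)
    return "".join(residues).upper()


def to_fasta(name: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as FASTA: a '>' header line, then fixed-width lines."""
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + name + "\n" + "\n".join(lines) + "\n"


if __name__ == "__main__":
    block = """
      1 mcgkkfelki dnvrfvghpt llqpphtiqa sktdpspkre lptmilfsvv falranadas
     61 viscmhnlsr riaialqhee rrcqyltrea klmlamqdev ttiidsdgsp qspfrqilpk
    """
    sequence = numbered_block_to_sequence(block)
    print(len(sequence))
    print(to_fasta("example_protein", sequence), end="")
```

Keeping the parsing step separate from the writing step means another output format can be added without touching the reader.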

Prediction, reconstruction and classification
- Prediction of secondary and 3D structure of RNA and proteins
- Gene prediction in prokaryotes and eukaryotes
- Prediction of promoters and other functional sites
- Reconstruction of phylogeny
- Genome analysis
- Classification of proteins and genes
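The simplest form of gene prediction in prokaryotes is a scan for open reading frames (ORFs). The Python sketch below is a deliberately naive version under stated assumptions: it searches the forward strand only, treats ATG as the only start codon, and uses an arbitrary minimum length. The name `find_orfs` and the test sequence are hypothetical; practical gene finders use statistical models and both strands.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna: str, min_codons: int = 2) -> list[tuple[int, int]]:
    """Return (start, end) of forward-strand ORFs; end is exclusive and includes the stop codon."""
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i                         # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))   # keep ORFs that are long enough
                start = None                      # look for the next ORF in this frame
    return orfs


if __name__ == "__main__":
    dna = "CCATGAAATGGTAACC"
    for start, end in find_orfs(dna):
        print(start, end, dna[start:end])   # 2 14 ATGAAATGGTAA
```

Eukaryotic genes need more than this, because introns interrupt the reading frame in the genomic sequence.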

Prediction of RNA secondary structure: an example
A. Single-stranded RNA (5' to 3')
B. Stem and loop, or hairpin loop
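One classic way to illustrate how a single strand folds into a stem and loop is a Nussinov-style dynamic program that maximizes the number of base pairs. The lecture does not name a specific method, so this is a swapped-in textbook technique shown under assumptions: allowed pairs are A-U, G-C and the G-U wobble, and a hairpin loop must contain at least `min_loop` unpaired bases. Energy-based predictors are more realistic.

```python
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def max_base_pairs(rna: str, min_loop: int = 3) -> int:
    """Maximum number of nested base pairs in an RNA sequence."""
    n = len(rna)
    if n == 0:
        return 0
    # best[i][j] = maximum number of pairs within rna[i..j] (inclusive)
    best = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            score = best[i + 1][j]                # case 1: base i stays unpaired
            for k in range(i + min_loop + 1, j + 1):
                if (rna[i], rna[k]) in PAIRS:     # case 2: base i pairs with base k
                    inside = best[i + 1][k - 1]
                    outside = best[k + 1][j] if k < j else 0
                    score = max(score, 1 + inside + outside)
            best[i][j] = score
    return best[0][n - 1]


if __name__ == "__main__":
    # Three G-C pairs can form a stem around the unpaired loop in the middle.
    print(max_base_pairs("GGGAAAUCCC"))   # 3
```

The function returns only the pair count; recovering the actual stem would need a traceback through the `best` table.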

Expectations of students' performance
- Basic understanding of general principles of molecular biology
- Some mathematical and computer science background
- Focus on using computational methods and understanding the general ideas of analysis used in bioinformatics
- Formal description of algorithms and complex methodology will not be the core elements of this unit
- The core requirement is an understanding of the foundations of bioinformatics and a "hands on" approach