Prepared By: Syed Khaleelulla Hussaini. Outline Proteins DNA RNA Genetics and evolution The Sequence Matching Problem RNA Sequence Matching Complexity.

Slides:



Advertisements
Similar presentations
Proteins: Structure reflects function….. Fig. 5-UN1 Amino group Carboxyl group carbon.
Advertisements

Review.
Atoms Made of protons, neutrons, and electrons Protons and neutrons are clustered as the nucleus Electrons orbit the nucleus in energy levels or shells.
• Exam II Tuesday 5/10 – Bring a scantron with you!
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
The Organic Molecules of Living Organisms
Lectures on Computational Biology HC Lee Computational Biology Lab Center for Complex Systems & Biophysics National Central University EFSS II National.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
Molecular Techniques in Molecular Systematics. DNA-DNA hybridisation -Measures the degree of genetic similarity between pools of DNA sequences. -Normally.
Exciting Developments in Molecular Biology As seen by an amateur.
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Chapter 27 Amino Acids, Peptides, and Proteins. Nucleic Acids.
Proteins account for more than 50% of the dry mass of most cells
Unit 7 RNA, Protein Synthesis & Gene Expression Chapter 10-2, 10-3
How does DNA work? What is a gene?
C OMPUTATIONAL BIOLOGY. O UTLINE Proteins DNA RNA Genetics and evolution The Sequence Matching Problem RNA Sequence Matching Complexity of the Algorithms.
Protein Synthesis. DNA RNA Proteins (Transcription) (Translation) DNA (genetic information stored in genes) RNA (working copies of genes) Proteins (functional.
Proteins account for more than 50% of the dry mass of most cells
Human Genetic Variation Basic terminology. What is a gene? A gene is a functional and physical unit of heredity passed from parent to offspring. Genes.
CHAPTER 12 PROTEIN SYNTHESIS AND MUTATIONS -RNA -PROTEIN SYNTHESIS -MUTATIONS.
©CMBI 2006 Amino Acids “ When you understand the amino acids, you understand everything ”
Copyright © 2004 by Limsoon Wong A Biology Review.
Concept 5.4: Proteins have many structures, resulting in a wide range of functions Proteins account for more than 50% of the dry mass of most cells Protein.
How Proteins Are Made Mrs. Wolfe. DNA: instructions for making proteins Proteins are built by the cell according to your DNA What kinds of proteins are.
. Sequence Alignment. Sequences Much of bioinformatics involves sequences u DNA sequences u RNA sequences u Protein sequences We can think of these sequences.
Pairwise alignment of DNA/protein sequences I519 Introduction to Bioinformatics, Fall 2012.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
LESSON 4: Using Bioinformatics to Analyze Protein Sequences PowerPoint slides to accompany Using Bioinformatics : Genetic Research.
CSCI 6900/4900 Special Topics in Computer Science Automata and Formal Grammars for Bioinformatics Bioinformatics problems sequence comparison pattern/structure.
AMINO ACIDS.
Genetics in ~1920: 1. Cells have chromosomes Sketch of Drosophila chromosomes (Bridges, C. 1913)
Chapter 11 DNA and GENES. DNA: The Molecule of Heredity DNA, the genetic material of organisms, is composed of four kinds nucleotides. A DNA molecule.
WSSP Chapter 8 BLASTX Translated DNA vs Protein searches atttaccgtg ttggattgaa attatcttgc atgagccagc tgatgagtat gatacagttt tccgtattaa taacgaacgg ccggaaatag.
Amino Acids are the building units of proteins
Learning Targets “I Can...” -State how many nucleotides make up a codon. -Use a codon chart to find the corresponding amino acid.
Fig Second mRNA base First mRNA base (5 end of codon) Third mRNA base (3 end of codon)
Welcome Back! February 27, 2012 Sit in any seat for today. You will have assigned seats tomorrow Were you absent before the break? Plan on coming to tutorial.
NOTES: 2.3 part 2 Nucleic Acids & Proteins. So far, we’ve covered… the following MACROMOLECULES: ● CARBOHYDRATES… ● LIPIDS… Let’s review…
Macromolecules of Life Proteins and Nucleic Acids
CELL REPRODUCTION: MITOSIS INTERPHASE: DNA replicates PROPHASE: Chromatin condenses into chromosomes, centrioles start migrating METAPHASE: chromosomes.
End Show Slide 1 of 39 Copyright Pearson Prentice Hall 12-3 RNA and Protein Synthesis 12–3 RNA and Protein Synthesis.
RNA 2 Translation.
Transcription and Translation
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
. Sequence Alignment. Sequences Much of bioinformatics involves sequences u DNA sequences u RNA sequences u Protein sequences We can think of these sequences.
Proteins.
Chapter 3 Proteins.
M3/31EXAM IIChapters 8-12, parts of 2, 3 W4/2Transcription and TranslationChapters 4, 15 M4/7"Molecular" GeneticsChapter 16 W4/9"Classical" GeneticsChapter.
Amino Acids  Amino Acids are the building units of proteins. Proteins are polymers of amino acids linked together by what is called “ Peptide bond” (see.
Stephen Taylor i-Biology.net Photo credit: Firefly with glow, by Terry Priest on Flickr (Creative Commons)
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
Amino acids Common structure of 19 AAs H3N+H3N+ COO - R H C Proline.
GOVERNMENT ENGINEERING COLLEGE, BHARUCH Subject : Organic Chemistry and Unit Process.
Genomics Lecture 3 By Ms. Shumaila Azam. Proteins Proteins: large molecules composed of one or more chains of amino acids, polypeptides. Proteins are.
Could the first steps of chemical evolution have occurred on ancient Earth?  To find out, Stanley Miller combined methane (CH 4 ), ammonia (NH.
Amino acids.
BIOLOGY 12 Protein Synthesis.
Chapter 2 part 2: Biochemistry
Sequence Alignment ..
Fig. 5-UN1  carbon Amino group Carboxyl group.
The Structure and Function of Macromolecules
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
The 20 amino acids.
Translation.
The 20 amino acids.
Do now activity #5 How many strands are there in DNA?
The Chemical Building Blocks of Life
Example of regression by RBF-ANN
“When you understand the amino acids,
Presentation transcript:

Prepared By: Syed Khaleelulla Hussaini

Outline Proteins DNA RNA Genetics and evolution The Sequence Matching Problem RNA Sequence Matching Complexity of the Algorithms

DEFINITION Computational Biology encompasses all computational methods and theories applicable to molecular biology and areas of computer based techniques for solving biological problems.

Protiens They are building blocks of living organism It is a large molecule that is composed of sequences of amino acids There are 20 amino acids which are divided into classes hydrophobic(h-phob) hydrophillic(h-phil) polar(pos,neg)

Aspartic Acid Asp D Glutamic Acid Glu E Phenylanine Phe F Glycine Gly G Alanine Ala A Cystine Cys C Histidine His H Isoleucine Ile I Lysine Lys K LeucineLeu L Methionine Met M Asparagine Asn N Proline ProP GlutamineGln Q Arginine Arg R Serine Ser S Threonine Thr T Valine Val V Tryptophan Trp W Tyrosine Tyr Y Amino acid codes Name, 3-letter & single-letter codes

DNA(Deoxyribonucleic acid) Blueprint of living organisms DNA is composed of two strands hold by a weak hydrogen bond Each strand is a sequence of nucleotides DNA has four bases which are classified as two chemical types BASE SYMBOL TYPE Adenine A Purine Thymine T Purine Sytosine C Pyrimidine Guanine G Pyrimidine

DNA Double Helix

RNA RNA is chemically very similar to DNA There are two important differences  Four bases present in RNA are: Adenine(A) Guanine(G) Cystosine(C) Uracil(U)  RNA nucleotides contain a different sugar molecule(ribose)

Genetics and Evolution Mutation The changing of the structure of a gene, resulting in a variant form that may be transmitted to subsequent generations, caused by the alteration of single base units in DNA. Natural selection The process whereby organisms better adapted to their environment tend to survive and produce more offspring. Genetic Drift Variation in the relative frequency of different genotypes in a small population.

Sequence matching problem Proteins are longer and DNA strands are even longer We match them by breaking them in to shorter subsequences Breaking and matching is done by notion of alignment.

Sequence matching example Consider two amino acid sequences: ACCTGAGAG ACGTGGCAG sequence alignment A C C T G A G – A C A C G T G – G C A C

Finite state machines in blast It is used to find out which of the sequences in a database are related to the new given sequence using BLAST The BLAST system is a three step process: 1. Examine the query string and select set of substrings of length w(between 4 and 20) which are good for producing matches 2. Build a DFSM that uses set of substrings and find the sequences with the highest local matches in the database 3. Examine the matches found in step2 and try to build a longer matching sequences

Regular expressions specify protein motif Aligning collection of related proteins we can define a motif Example: E S G H D T Y Y N K N R M D T T T T T S W Q S R G S D T T T P D M T A G P T T W R N T Once an motif is defined we can search for the occurrences of it in other protein sequence by using regular expressions

HMM for sequence matching HMM’s are used when sequences become fairly diverse We can capture the variations among the members of the family and the probabilities associated with them So by using HMM’s we can find the best alignment between two sequences and from which family does a given new sequence belongs to

HMM profile is given by M = (K,O,π,A,B)  K is a set of n states, one for each position in the sequence  O is the output alphabet  Π contains the initial state probabilities  A contains the transition probabilities  B contains the output probabilities

Example of HMM describing protein sequence family

RNA sequence matching and secondary structure prediction using the tools of context-free languages In RNA a change to a single nucleotide in a stem region could completely alter the molecules shape and its function So an change in the stem must be matched by a corresponding change in the paired nucleotide Context free languages are used describe these nested dependencies and secondary structure

Example:

Complexity of algorithms used in computational biology Approaches to many of the problems described here are computational like breaking up of large protein and DNA molecules into substrings NP-hard Conversion to decision problem SHOERTEST-SUPERSTRING( ): S is a set of strings and there exists some superstring T such that every element of S is a substring of T and T has length less than or equal to K) – NP-complete

Reference

Thank you… Thank you…