Case Study. DNA Deoxyribonucleic acid (DNA) is a nucleic acid that contains the genetic instructions used in the development and functioning of all known.

Slides:



Advertisements
Similar presentations
Aim: What is a chromosome?
Advertisements

Nucleic Acids. Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Concept 5.5: Nucleic acids store and transmit hereditary information.
3.3 DNA Structure –
Mrs. Stewart Biology I Honors. STANDARDS: CLE Investigate how genetic information is encoded in nucleic acids. CLE Describe the relationships.
DEOXYRIBONUCLEIC ACID DNA. O.L Lesson Objectives At the end of this lesson you should be able to 1. Outline the simple structure of DNA – 2 strands and.
Nucleic Acids. Elements Nucleic Acids Contain C, H, O,N, P.
Prof. Drs. Sutarno, MSc., PhD..  Chromosomes are made up of Proteins and DNA  DNA carries the genetic information  This information is similar to digital.
Biomolecules Nucleic acids.  Are the genetic materials of all organisms and determine inherited characteristics.  The are two kinds of nucleic acids,
Unit 7 Lesson 1 DNA Structure and Function
Essential Idea The structure of DNA allows efficient storage of genetic information.
DNA
Unit 2 – PART A Inside the Nucleus DNA Sturcture.
Strand Design for Biomolecular Computation
Chapter 11: DNA and Genes (Part 1). 1. Although the environment influences how an organism develops, the genetic information that is held in the molecules.
DNA. Nucleic Acids Informational polymers Made of C,H,O,N and P No general formula Examples: DNA and RNA.
DNA The molecule of heredity. The molecules of DNA is the information for life (determine an organism’s traits) DNA achieves its control by determining.
What is this DNA you speak of?  DNA stands for deoxyribonucleic acid - Found in nucleus of eukaryotic cells - Found in cytoplasm of protists.
National 5 Biology Course Notes Part 4 : DNA and production of
Molecular Biology 2.6 Structure of DNA and RNA. Nucleic Acids The nucleic acids DNA and RNA are polymers of nucleotides.
DNA (deoxyribonucleic acid) consists of three components.
DNA Structure.
Regents Biology Nucleic Acids Information storage.
8.2 Structure of DNA Deoxyribonucleic acid (DNA) is a molecule that encodes the genetic instructions used in the development and functioning of all known.
DNA Structure and Protein Synthesis (also known as Gene Expression)
DNA.
DNADNA. Structure and replication of DNA - syllabus content Structure of DNA — nucleotides contain deoxyribose sugar, phosphate and base. DNA has a sugar–phosphate.
Overview DNA stands for d Deoxyribose Nucleic Acid Thinking Points What is the structure of DNA? Why is DNA so important?
The nucleic acids include the amazing DNA molecule. It is the source of constancy and variation among species, and is the foundation for the unity and.
Compression of Protein Sequences EE-591 Information Theory FEI NAN, SUMIT SHARMA May 3, 2003.
DNA- Deoxyribonucleic acid Each nucleotide of DNA is composed of a phosphate group, a sugar called deoxyribose and a molecule that is called a nitrogenous.
DNA and Genes. Prokaryotes VS Eukaryotes Prokaryotes: no defined nucleus and a simplified internal structure Eukaryotes: membrane limited nucleus and.
Have Your DNA and Eat It Too I will be able to describe the structure of the DNA molecule I will be able to explain the rules of base pairing I will understand.
DNA Replication. What is DNA?  Stands for Deoxyribonucleic Acid  Has three main parts: 1. Sugar molecules (deoxyribose) 2. Phosphate 3. Nitrogen bases.
DNA –Was known as a chemical in cells by the end of the nineteenth century –Has the capacity to store genetic information –Can be copied and passed from.
Section12.2.  3. DNA (deoxyribonucleic acid) is a nucleic acid made up of nucleotides joined into long strands by covalent bonds.
The Structure of DNA Read the title aloud to students.
DNA & GENES DNA: the molecule of heredity DNA ultimately determines an organism’s traits. Within the structure of DNA is the complete instructions for.
Unit 2 Lesson 6 DNA Structure and Function
DNA Deoxyribonucleic acid. DNA structure DNA is a nucleic acid –composed of many nucleotides –A nucleotide is composed of a sugar (deoxyribose), a phosphate.
Nucleotides and Nucleic acids.
DNA
(2) Structure and Replication of DNA
Nucleic Acids DNA & RNA.
Structure & Replication
DNA Structure and Protein Synthesis (also known as Gene Expression)
Nucleic acid Dr. Sahar Al Shabane.
DNA Structrue & Function
Unit 2 Lesson 6 DNA Structure and Function
Unit 2 Lesson 6 DNA Structure and Function
Higher Human Biology Sub topic 2a
DNA and The Genome Structure and Organisation of DNA
Overview of Genetics Genes make us who we are!.
DNA Structure Section
DNA Structure 2.6 & 7.1.
BTY100-Lec#4.1 Genetic Basis of Life Genetic Makeup © LPU: BTY100.
Deoxyribonucleic Acid
Topic 3: The chemistry of life
THINK ABOUT IT The DNA molecule must somehow specify how to assemble proteins, which are needed to regulate the various functions of each cell. What kind.
DNA Structure and Function
Cracking the Code What is DNA?
DNA & The Genetic Code The sequence (order) of bases in a strand of DNA acts as a template for DNA replication and makes the code for building proteins.
DNA and the Genome Key Area 1a The Structure of DNA.
DNA Section 6.1.
DNA Structure Section
Deoxyribonucleic Acid
DNA Learning Goal: To learn about the structure of DNA.
Nucleic Acids DNA & RNA.
Nucleic Acids DNA & RNA.
BC Science Connections 10
Presentation transcript:

Case Study

DNA Deoxyribonucleic acid (DNA) is a nucleic acid that contains the genetic instructions used in the development and functioning of all known living organism and some viruses. The main role of DNA molecules is the long- term storage of information. The DNA segments that carry this genetic information are called genes.

DNA DNA consists of two long polymers of simple units called nucleotides, with backbones made of sugars and phosphate groups joined by ester bonds. These two strands run in opposite directions to each other and are therefore Anti-parallel. Attached to each sugar is one of four types of molecules called heterocyclic bases. It is the sequence of these four bases along the backbone that encodes information.

DNA The heterocyclic bases are adenine (A), guanine (G), cytosine (C) and thymine (T), linked together with sugar units and phosphates. One of the biggest puzzles was that although the proportion of these bases varied from one DNA to another, it was always found that the number of A = T, and G = C.[4]

The Longest Common Subsequence (LCS) in DNA Finding Longest Common Subsequence (LCS) from molecular Sequences is an interesting variation of the classical string matching problem: the task is that of finding a subsequence from two molecular sequences such that it is of longest length. There can exist more than one LCS for the same given sequences. For example consider two short DNA sequences as given below S1=T C C A T A G T C S2=A G C T A A T A G One of the LCS for the above two sequences can be ‘CAAT’ of length four. ‘ATAT’,’ATAG’ etc can be other such subsequences of length four.

The Longest Common Subsequence (LCS) in DNA While determining the LCS we look for symbols in all the sequences in the same relative order but not necessarily contiguous. In molecular biology it is important to estimate the similarity of two DNA or protein sequences. Especially for pre-selection purposes, those biological sequences can be treated as strings from an appropriate input alphabet. The degree of similarity can be measured by counting the maximal number of identical symbols existing in both input strings in the same order.

The Longest Common Subsequence (LCS) in DNA Collecting these identical symbols and concatenating them produces (one of) the longest common subsequence(s) of the strings the length of which numerically describes the similarity between the strings. In molecular biology, LCS is an appropriate measure of the similarity of biological sequences. When we want to know how homologous those DNA or protein sequences are, we can calculate the maximum number of identical symbols among them. That is exactly an LCS of them

How The algorithm works In our case, we will consider only the sequences with the same length, so the relation between them can be represented using a square matrix which can be used to determine whether the relation has certain properties. we are given two character strings X=x 0 x 1 x 2 ……x n-1 and Y=y 0 y 1 y 2 ……y m-1 of length n and m respectively(m=n in our case), and are asked to find a longest string S that is a sequence of both X and Y.

How The algorithm works There are more than one way to solve this problem but the most efficient one is to use the dynamic programming technique where the solution to the global problem must be a composition of optimal sub problem solutions. The algorithm based on dynamic programming technique initialize an (n+1)(m+1) array (matrix), L, and iteratively build up values L [i,j] which used to denote the length of a longest string that is a subsequence of both X[0….i]= x 0 x 1 x 2 ……x i and Y[0…j]= y 0 y 1 y 2 …y j.

How The algorithm works There are more than one way to solve this problem but the most efficient one is to use the dynamic programming technique where the solution to the global problem must be a composition of optimal sub problem solutions. The algorithm based on dynamic programming technique initialize an (n+1)(m+1) array (matrix), L, and iteratively build up values L [i,j] which used to denote the length of a longest string that is a subsequence of both X[0….i]= x 0 x 1 x 2 ……x i and Y[0…j]= y 0 y 1 y 2 …y j.

How The algorithm works This allows us to rewrite L [i,j] in terms of optimal sub problem solutions and it depends on which of the following two cases we are in.

How The algorithm works In order to make both of these equations make sense in the boundary cases when i=0 or j=0, the algorithm assign L[i,-1]=0 for i=-1,0,1,…..,n-1 and L[-1,j]=0 for j=-1,0,1,…..,m-1. The algorithm iterative until we have L[n-1,m-1], the length of a longest common subsequence of X and Y.

Implementation Given two strands of DNA, X=AGCGA and Y=CAGAT, which could for example come from two individuals, we assumed that the two DNA are have the same length so we can determine whether the relation has certain properties, then after applying the algorithm we get the following matrix.

Implementation CAGAT A G C G A012233

Implementation CAGAT A G C G A The LCS of the sub string AG and the sub string CAG is AG and it is of length 2

Implementation CAGAT A G C G A The LCS of the sub string AGC and the sub string CA is A and it is of length 1

Implementation CAGAT A G C G A The LCS of the string AGCGA and the string CAGAT is AGA and it is of length 3

Implementing the binary relations on the LCS matrix We will consider the matrix L to be the relation between two distinct strands of DNA which come from two individuals and we will set L[i,j] as following:

Implementing the binary relations on the LCS matrix CAGAT 0A G C G A11111

As we can see the relation in our example is not reflexive i.e L[i,i] is not a member of the relation in all the time. Also the relation is not symmetric because L[1,0]=0 while L[0,1]=1.

Implementing the binary relations on the LCS matrix The relation is not transitive as shown below. × =

Implementing the binary relations on the LCS matrix consider that we have another two strands of DNA, X=CGAAT and Y=CGATT, then after applying LCS algorithm we get the following matrix CGATT C G A A T012344

Implementing the binary relations on the LCS matrix After converting the matrix to 0, 1 matrix and deleting the extra row and column we get: As we can see the binary relation between these two strands of DNA considers being an Equivalence Relation since it is reflexive, symmetric and transitive.

Conclusion We presented a case study about applying the relation concepts in to a given two strands of DNA through specific text similarity problem called Longest Common Subsequence problem.