DNA SPLICING RULES STAYING TRUE TO THE BIOLOGY Elizabeth Goode April 2015.

Slides:



Advertisements
Similar presentations
DNA and splicing (circular) Dipartimento di Informatica Sistemistica e Comunicazioni, Univ. di Milano - Bicocca ITALY Dipartimento di Informatica e Applicazioni,
Advertisements

Molecular Computing Formal Languages Theory of Codes Combinatorics on Words.
CS 345: Chapter 9 Algorithmic Universality and Its Robustness
Theory Of Automata By Dr. MM Alam
Chapter Chapter Summary Languages and Grammars Finite-State Machines with Output Finite-State Machines with No Output Language Recognition Turing.
Regular Grammars Formal definition of a regular expression.
LING 438/538 Computational Linguistics Sandiway Fong Lecture 8: 9/29.
CS5371 Theory of Computation
Theory of Computation What types of things are computable? How can we demonstrate what things are computable?
Regular Languages Sequential Machine Theory Prof. K. J. Hintz Department of Electrical and Computer Engineering Lecture 3 Comments, additions and modifications.
FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY
Languages, grammars, and regular expressions
79 Regular Expression Regular expressions over an alphabet  are defined recursively as follows. (1) Ø, which denotes the empty set, is a regular expression.
Topics Automata Theory Grammars and Languages Complexities
Fall 2003Costas Busch - RPI1 Turing Machines (TMs) Linear Bounded Automata (LBAs)
Formal Learning Theory Michele Friend (Philosophy) and Valentina Harizanov (Mathematics)
By: Er. Sukhwinder kaur.  Computation Computation  Algorithm Algorithm  Objectives Objectives  What do we study in Theory of Computation ? What do.
CSCI 4325 / 6339 Theory of Computation Zhixiang Chen Department of Computer Science University of Texas-Pan American.
Introduction Chapter 0. Three Central Areas 1.Automata 2.Computability 3.Complexity.
::ICS 804:: Theory of Computation - Ibrahim Otieno SCI/ICT Building Rm. G15.
CS/IT 138 THEORY OF COMPUTATION Chapter 1 Introduction to the Theory of Computation.
DECIDABILITY OF PRESBURGER ARITHMETIC USING FINITE AUTOMATA Presented by : Shubha Jain Reference : Paper by Alexandre Boudet and Hubert Comon.
TM Design Universal TM MA/CSSE 474 Theory of Computation.
Introduction to CS Theory Lecture 3 – Regular Languages Piotr Faliszewski
1 Section 14.2 A Hierarchy of Languages Context-Sensitive Languages A context-sensitive grammar has productions of the form xAz  xyz, where A is a nonterminal.
1 INFO 2950 Prof. Carla Gomes Module Modeling Computation: Language Recognition Rosen, Chapter 12.4.
Grammars CPSC 5135.
Languages & Grammars. Grammars  A set of rules which govern the structure of a language Fritz Fritz The dog The dog ate ate left left.
1 Introduction to Regular Expressions EELS Meeting, Dec Tom Horton Dept. of Computer Science Univ. of Virginia
1 Computability Five lectures. Slides available from my web page There is some formality, but it is gentle,
Introduction to Language Theory
Lecture # 5 Pumping Lemma & Grammar
Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.
Copyright © Curt Hill Languages and Grammars This is not English Class. But there is a resemblance.
Regular Expressions and Languages A regular expression is a notation to represent languages, i.e. a set of strings, where the set is either finite or contains.
Saeid Pashzadeh Jan 2009 Theory of Computation 1.
DNA Splicing Systems By William DeLorbe and Dr. Elizabeth Goode.
Grammars A grammar is a 4-tuple G = (V, T, P, S) where 1)V is a set of nonterminal symbols (also called variables or syntactic categories) 2)T is a finite.
Formal Languages and Grammars
Discrete Structures ICS252 Chapter 5 Lecture 2. Languages and Grammars prepared By sabiha begum.
CSCI 2670 Introduction to Theory of Computing September 14, 2005.
Molecular Computation and Splicing Systems J.H.M. Dassen, Summarized by Dongmin Kim
1 Linear Bounded Automata LBAs. 2 Linear Bounded Automata (LBAs) are the same as Turing Machines with one difference: The input string tape space is the.
1 Course Overview Why this course “formal languages and automata theory?” What do computers really do? What are the practical benefits/application of formal.
Introduction Why do we study Theory of Computation ?
Lecture 03: Theory of Automata:2014 Asif Nawaz Theory of Automata.
Week 14 - Friday.  What did we talk about last time?  Simplifying FSAs  Quotient automata.
CS6800 Advance Theory of Computation Spring 2016 Nasser Alsaedi
CSCI 2670 Introduction to Theory of Computing September 16, 2004.
Chapter 2. Formal Languages Dept. of Computer Engineering, Hansung University, Sung-Dong Kim.
Theory of Computation. Introduction to The Course Lectures: Room ( Sun. & Tue.: 8 am – 9:30 am) Instructor: Dr. Ayman Srour (Ph.D. in Computer Science).
Modeling Arithmetic, Computation, and Languages Mathematical Structures for Computer Science Chapter 8 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesAlgebraic.
Implementation of Haskell Modules for Automata and Sticker Systems
Formal Methods in software development
Why do we study Theory of Computation ?
Theory of Languages and Automata
Linear Bounded Automata LBAs
Complexity and Computability Theory I
Natural Language Processing - Formal Language -
Context Sensitive Grammar & Turing Machines
Lecture 9 Theory of AUTOMATA
Course 2 Introduction to Formal Languages and Automata Theory (part 2)
REGULAR LANGUAGES AND REGULAR GRAMMARS
Non-deterministic Finite Automata (NFA)
A HIERARCHY OF FORMAL LANGUAGES AND AUTOMATA
Regular Grammar.
Formal Methods in software development
Sub: Theoretical Foundations of Computer Sciences
Presentation transcript:

DNA SPLICING RULES STAYING TRUE TO THE BIOLOGY Elizabeth Goode April 2015

HISTORY

TOM HEAD introduced a mathematical model of the biological activity we call “DNA splicing.” [Tom Head, 1987]

HISTORY TOM HEAD introduced a mathematical model of the biological activity we call “DNA splicing.” [Tom Head, 1987] The model provided a way to study the generative power of double- stranded DNA (dsDNA) in the presence of enzymes that cut and paste in a site-specific manner – i.e., the computational power of the splicing operation.

HISTORY Over 5,000 articles and books have been published about DNA splicing systems since Head introduced the concept in 1987.

HISTORY Over 5,000 articles and books have been published about DNA splicing systems since Head introduced the concept in Algebra and graph theory have been used to attack the questions that arise in the area of DNA splicing theory.

HISTORY Over 5,000 articles and books have been published about DNA splicing systems since Head introduced the concept in Algebra and graph theory have been used to attack the questions that arise in the area of DNA splicing theory. Leonard Adleman examined the possibility of using DNA and its various biochemical operations to crack the RSA encryption code. DNA-computing was born. [Adleman 1994, 20-var SAT]

What is DNA? Double-stranded DNA (dsDNA) is a double helix (twisted ladder) with a sugar-phosphate backbone. The “rungs” are made of nucleotides held together in a very specific orientation by hydrogen bonding.

What is DNA?

What is DNA splicing?

Splicing is the cutting and pasting of dsDNA. Restriction enzymes are proteins that can perform site-specific cutting of dsDNA. By site-specific cutting we mean the enzymes can recognize a specific sequence of nucleotides and perform a specific type of cut at that site. Example: Bgl II

What is DNA splicing? Example: Bgl II

What is DNA splicing?

Splicing as an Operation on dsDNA

Modeling the splicing of dsDNA

The dsDNA that is initially in the test tube is represented as a set I.

Modeling the splicing of dsDNA

Modeling the Cutting using Rule Sets

What is a Formal Language?

We can think of a formal language as a set of strings written over a finite alphabet. In general, two alphabet letters suffice.

What is a Formal Language? We can think of a formal language as a set of strings written over a finite alphabet. In general, two alphabet letters suffice. Example: A = {a, b}

What is a Formal Language? We can think of a formal language as a set of strings written over a finite alphabet. In general, two alphabet letters suffice. Example: A = {a, b} L 1 = aa + ab + ba + bb

What is a Formal Language? We can think of a formal language as a set of strings written over a finite alphabet. In general, two alphabet letters suffice. Example: A = {a, b} L 1 = aa + ab + ba + bb L 2 = ab * a

What is a Formal Language? We can think of a formal language as a set of strings written over a finite alphabet. In general, two alphabet letters suffice. Example: A = {a, b} L 1 = aa + ab + ba + bb L 2 = ab * a L 3 = a(bb) * a

What is a Formal Language? We can think of a formal language as a set of strings written over a finite alphabet. In general, two alphabet letters suffice. Example: A = {a, b} L 1 = aa + ab + ba + bb L 2 = ab * a L 3 = a(bb) * a L 4 = a(bb) * a + ab + ba + bb

Formal Language Characterization Schemas

The Chomsky Hierarchy of Languages classifies languages into families. [Noam Chomsky 1956,1959]

Formal Language Characterization Schemas The Chomsky Hierarchy of Languages classifies languages into families. [Noam Chomsky 1956,1959] REGULAR

Formal Language Characterization Schemas The Chomsky Hierarchy of Languages classifies languages into families. [Noam Chomsky 1956,1959] REGULAR CONTEXT-FREE

Formal Language Characterization Schemas The Chomsky Hierarchy of Languages classifies languages into families. [Noam Chomsky 1956,1959] REGULAR CONTEXT-FREE CONTEXT-SENSITIVE

Formal Language Characterization Schemas The Chomsky Hierarchy of Languages classifies languages into families. [Noam Chomsky 1956,1959] REGULAR CONTEXT-FREE CONTEXT-SENSITIVE RECURSIVE

Formal Language Characterization Schemas The Chomsky Hierarchy of Languages classifies languages into families. [Noam Chomsky 1956,1959] REGULAR CONTEXT-FREE CONTEXT-SENSITIVE RECURSIVE RECURSIVELY ENUMERABLE

Computational Models for Formal Languages

The “Machine” Model

Computational Models for Formal Languages The “Machine” Model Finite Automata and Regular Languages

Computational Models for Formal Languages The “Machine” Model Finite Automata and Regular Languages L 2 = ab * a

Computational Models for Formal Languages The “Machine” Model Finite Automata and Regular Languages L 2 = ab * a

Computational Models for Formal Languages The “Machine” Model Finite Automata and Regular Languages L 2 = ab * a

Computational Models for Formal Languages The “Machine” Model Finite Automata and Regular Languages L 2 = ab * a a a b

Computational Models for Formal Languages The “Machine” Model Finite Automata and Regular Languages [McCulloch & Pitts 1943, Kleene 1956, Huffman 1954, Moore 1956, Mealy 1955]

Computational Models for Formal Languages

The “Grammar” Model Computational Models for Formal Languages

The “Grammar” Model Right (Left) Linear grammars and Regular Languages Computational Models for Formal Languages

The “Grammar” Model Right (Left) Linear grammars and Regular Languages L 2 = ab * a Computational Models for Formal Languages

The “Grammar” Model Right (Left) Linear grammars and Regular Languages L 2 = ab * a S -> aAA -> a | bBB -> a | bB Computational Models for Formal Languages

The “Splicing” Model Gheorghe Paun proved that splicing is Turing complete – splicing systems not having finite initial sets can generate all languages that can be generated by the universal Turing machine. [1995] Computational Models for Formal Languages

The “Splicing” Model Gheorghe Paun proved that splicing is Turing complete – splicing systems not having finite initial sets can generate all languages that can be generated by the universal Turing machine. [1995] Pixton proved that all finitely generated splicing languages are regular. [1994] Computational Models for Formal Languages

The “Splicing” Model Gheorghe Paun proved that splicing is Turing complete – splicing systems not having finite initial sets can generate all languages that can be generated by the universal Turing machine. [1995] Pixton proved that all finitely generated splicing languages are regular. [1994] There are assorted results “in-between” the classes of languages belonging to the regular set and the universal set of languages, primarily due to Gheorghe Paun. Computational Models for Formal Languages

The “Splicing” Model Computational Models for Formal Languages

Rules for Splicing

Splicing languages that respect the biology Rules have form (c,x,d) (p,x,q) and handedness [Head 1987] Rules for Splicing

Splicing languages that respect the biology Rules have form (c,x,d) (p,x,q) and handedness [Head 1987] Rules for Splicing REGULAR

Rules for Splicing REGULAR

Rules for Splicing REGULAR

Rules for Splicing

REGULAR Rules for Splicing

REGULAR Rules for Splicing R ule (a, 1; b, 1) cuts & pastes these strings:

REGULAR Rules for Splicing R ule (a, 1; b, 1) cuts & pastes these strings:

REGULAR Rules for Splicing R ule (a, 1; b, 1) cuts & pastes these strings:

REGULAR Rules for Splicing R ule (a, 1; b, 1) cuts & pastes these strings:

REGULAR Rules for Splicing

REGULAR Rules for Splicing

REGULAR Rules for Splicing

REGULAR Rules for Splicing

REGULAR Rules for Splicing

The form of rules can reveal or obscure the biological activity.

Rules for Splicing The form of rules can reveal or obscure the biological activity. Ex: Head’s original notation was intended to show the “sticky ends” or “crossing” of the restriction enzyme cutting sites. (u, x, v) (p, x, q)

Rules for Splicing The form of rules can reveal or obscure the biological activity. Ex: Head’s original notation was intended to show the “sticky ends” or “crossing” of the restriction enzyme cutting sites. (u, x, v) (p, x, q) Ex: A Pixton-type rule of the form (u, v ; u’, v’) does not necessarily respect the biological notion of the crossing of the restriction site.

Rules for Splicing The form of rules can reveal or obscure the biological activity. Ex: Head’s original notation was intended to show the “sticky ends” or “crossing” of the restriction enzyme cutting sites. Ex: A Pixton-type rule of the form (u, v ; u’, v’) does not necessarily respect the biological notion of the crossing of the restriction site.

Rules for Splicing The form of rules can reveal or obscure the biological activity. Ex: Head’s original notation was intended to show the “sticky ends” or “crossing” of the restriction enzyme cutting sites. Ex: A Pixton-type rule of the form (u, v ; u’, v’) does not necessarily respect the biological notion of the crossing of the restriction site. But it can: (px, q ; sx, t) or (p, xq ; s, xt)

Rules for Splicing The form of rules can reveal or obscure the biological activity.

Rules for Splicing The form of rules can reveal or obscure the biological activity. Yuhani Yusof and E Goode decided to rewrite the rules in a form that makes the biological activity apparent. Simple rules: (a, 1; a, 1) -> (a, 1, 1: a, 1, 1) or (1, 1, a: 1, 1, a) or (1, a, 1: 1, a, 1)

Rules for Splicing The form of rules can reveal or obscure the biological activity. Yuhani Yusof and E Goode decided to rewrite the rules in a form that makes the biological activity apparent. Simple rules: (a, 1; a, 1) -> (a, 1, 1: a, 1, 1) or (1, 1, a: 1, 1, a) or (1, a, 1: 1, a, 1) Semi-simple rules: (a, 1: b, 1) -> (a, 1, 1: b, 1, 1)

Rules for Splicing The form of rules can reveal or obscure the biological activity. Yuhani Yusof and E Goode decided to rewrite the rules in a form that makes the biological activity apparent. Simple rules: (a, 1; a, 1) -> (a, 1, 1: a, 1, 1) or (1, 1, a: 1, 1, a) or (1, a, 1: 1, a, 1) Semi-simple rules: (a, 1; b, 1) -> (a, 1, 1: b, 1, 1) Semi-null rules: (u, 1; v, 1) -> (u, 1, 1: v, 1, 1) or (u 1, w, 1: v 1, w, 1)

Rules for Splicing The form of rules can reveal or obscure the biological activity. Yuhani Yusof and E Goode decided to rewrite the rules in a form that makes the biological activity apparent. Simple rules: (a, 1; a, 1) -> (a, 1, 1: a, 1, 1) or (1, 1, a: 1, 1, a) or (1, a, 1: 1, a, 1) Semi-simple rules: (a, 1; b, 1) -> (a, 1, 1: b, 1, 1) Semi-null rules: (u, 1; v, 1) -> (u, 1, 1: v, 1, 1) or (u 1, w, 1: v 1, w, 1) Theorems include: The languages generated by simple semi-simple and semi-null systems are exactly those generated by the corresponding rule sets written in the Yusof-Goode rule notation.

Rules for Splicing As the biology demands that symmetry and reflexivity be respected, the Yusof- Goode model adopted the constraint that the splicing system would always be built to include any rules required to ensure its reflexive and symmetric closure.

Rules for Splicing As the biology demands that symmetry and reflexivity be respected, the Yusof- Goode model adopted the constraint that the splicing system would always be built to include any rules required to ensure its reflexive and symmetric closure. These notions become important in the investigation of the limit language associated with a splicing schema.

Wet Splicing of dsDNA

Planning Experiment

Planning Experiment

From Yuhani on 03/12/11 Dear Dr Liz, These are actually my plan tomorrow: Run on the double 8-well 1. 3 alpha-beta+3 gamma-delta+1 combination alpha-beta & gamma-delta of the PCR Rx#4 +LMW ladder 2. 1 alpha-beta, 1 gamma-delta, 1 combination of alpha-beta & gamma-delta of the Purification PCR Rx#3 (that I did last Thursday)-I'll take 5 ul each since it is suggested from Qiagen purification kit to run the purification product with 5ul purification lambda with 1 ul 6X loading dye, 1 combination of alpha-beta that you purified, 1 combination alpha-beta & gamma-delta (2nd) purified, 1 alpha-beta & 1 gamma delta purified from PCR Rx#4 and LMW ladder. If all look goods, I will do the RE digestion and ligation after that and run for final results. Just two question that I want to ensure. 1.If I did all the digestion and ligation sample-mean until the last sample of Lane 8 that stored in -70 degC, how long that I need to wait to run all the sample (Lane 1-Lane 8) on 3% gel. 2. The RE digestion and ligation sample should be done in 37 degC. So, I 'll put in the heater box at 37 degC and take out the sample from that with the time t =5, that I plan. Is it right? Planning Experiment

Cutting out Stands 1 and 2 from E.Coli genome

Purification of Cut outs from E.Coli

PCR results from purified DNA for Initial Strand 1

Dynamics of the DNA splicing system: Lanes 1 and 9 contain DNA Ladder Limit language is apparent in Lanes 4, 5, 6, 7 Transient strings appear in Lanes 3 –

Computational Models for Formal Languages

The end. Thank you for your time.