DNA and splicing (circular) Dipartimento di Informatica Sistemistica e Comunicazioni, Univ. di Milano - Bicocca ITALY Dipartimento di Informatica e Applicazioni,

Slides:



Advertisements
Similar presentations
Molecular Computing Formal Languages Theory of Codes Combinatorics on Words.
Advertisements

Formal Languages Theory of Codes Combinatorics on words Molecular Computing.
Specifying Languages Our aim is to be able to specify languages for use in the computer. The sketch of the FSA is easy for us to understand, but difficult.
Towards a characterization of regular languages generated by finite splicing systems: where are we? Ravello, Settembre 2003 Paola Bonizzoni, Giancarlo.
Grammar types There are 4 types of grammars according to the types of rules: – General grammars – Context Sensitive grammars – Context Free grammars –
FORMAL LANGUAGES, AUTOMATA, AND COMPUTABILITY
Basic Properties of Relations
8.4 Closures of Relations. Intro Consider the following example (telephone line, bus route,…) abc d Is R, defined above on the set A={a, b, c, d}, transitive?
Quit Permutations Combinations Pascal’s triangle Binomial Theorem.
CS5371 Theory of Computation
1 Module 19 LNFA subset of LFSA –Theorem 4.1 on page 131 of Martin textbook –Compare with set closure proofs Main idea –A state in FSA represents a set.
CS Master – Introduction to the Theory of Computation Jan Maluszynski - HT Lecture 4 Context-free grammars Jan Maluszynski, IDA, 2007
Languages, grammars, and regular expressions
CS 310 – Fall 2006 Pacific University CS310 Strings, String Operators, and Languages Sections: August 30, 2006.
79 Regular Expression Regular expressions over an alphabet  are defined recursively as follows. (1) Ø, which denotes the empty set, is a regular expression.
CS5371 Theory of Computation Lecture 9: Automata Theory VII (Pumping Lemma, Non-CFL)
VLSI Digital System Design Datapath Operations and Ripple-Carry Add.
1 Regular Expressions/Languages Regular languages –Inductive definitions –Regular expressions syntax semantics Not covered in lecture.
1 Section 7.1 Relations and their properties. 2 Binary relation A binary relation is a set of ordered pairs that expresses a relationship between elements.
DNA SPLICING RULES STAYING TRUE TO THE BIOLOGY Elizabeth Goode April 2015.
Formal Language Finite set of alphabets Σ: e.g., {0, 1}, {a, b, c}, { ‘{‘, ‘}’ } Language L is a subset of strings on Σ, e.g., {00, 110, 01} a finite language,
Introduction Chapter 0. Three Central Areas 1.Automata 2.Computability 3.Complexity.
1Introduction 2Theoretical background Biochemistry/molecular biology 3Theoretical background computer science 4History of the field 5Splicing systems.
“Developments on linear and circular splicing” Paola Bonizzoni, Clelia De Felice, Giancarlo Mauri, Rosalba Zizza Dipartimento di Informatica Sistemistica.
Some simple basics. 1Introduction 2Theoretical background Biochemistry/molecular biology 3Theoretical background computer science 4History of the field.
THEORY OF COMPUTATION 08 KLEENE’S THEOREM.
Introduction to CS Theory Lecture 3 – Regular Languages Piotr Faliszewski
1 Language Definitions Lecture # 2. Defining Languages The languages can be defined in different ways, such as Descriptive definition, Recursive definition,
Moore automata and epichristoffel words
Computing languages by (bounded) local sets Dora Giammarresi Università di Roma “Tor Vergata” Italy.
Membership problem CYK Algorithm Project presentation CS 5800 Spring 2013 Professor : Dr. Elise de Doncker Presented by : Savitha parur venkitachalam.
Random Context and Programmed Grammars of Finite Index Have The Same Generative Power Doc. RNDr. Alexander Meduna, CSc. Ing. Zbyněk Křivka DIFS, FIT, Brno.
Properties of Addition and Multiplication. Commutative Property In the sum you can add the numbers in any order. a+b = b+a In the product you can multiply.
1 Module 14 Regular languages –Inductive definitions –Regular expressions syntax semantics.
Phrase-structure grammar A phrase-structure grammar is a quadruple G = (V, T, P, S) where V is a finite set of symbols called nonterminals, T is a set.
Regular Grammars Chapter 7. Regular Grammars A regular grammar G is a quadruple (V, , R, S), where: ● V is the rule alphabet, which contains nonterminals.
Properties and Mental Computation p. 80. Math talk What are some math properties that we use? Why do you think we have them? Do you ever use them?
MA/CSSE 474 Theory of Computation Regular Expressions Intro.
Language: Set of Strings
Strings and Languages CS 130: Theory of Computation HMU textbook, Chapter 1 (Sec 1.5)
A Quick Math Overview (not the last one!) Section 1.4, you should it read carefully! Reading the corresponding material from the book is strongly encouraged.
“New results on finite H-systems” Budapest, 29/30 November 2002 Jointly with Paola Bonizzoni, Clelia De Felice, Giancarlo Mauri Dipartimento di Informatica.
Discrete Mathematical Structures 4 th Edition Kolman, Busby, Ross © 2000 by Prentice-Hall, Inc. ISBN
ON THE EXPRESSIVE POWER OF SHUFFLE PRODUCT Antonio Restivo Università di Palermo.
Grammars A grammar is a 4-tuple G = (V, T, P, S) where 1)V is a set of nonterminal symbols (also called variables or syntactic categories) 2)T is a finite.
Introduction Finite Automata accept all regular languages and only regular languages Even very simple languages are non regular (  = {a,b}): - {a n b.
Properties of Algebra (aka all the rules that holds the math together!)
UNIT - I Formal Language and Regular Expressions: Languages Definition regular expressions Regular sets identity rules. Finite Automata: DFA NFA NFA with.
Lecture # 31 Theory Of Automata By Dr. MM Alam 1.
Binary Relations Definition: A binary relation R from a set A to a set B is a subset R ⊆ A × B. Example: Let A = { 0, 1,2 } and B = {a,b} {( 0, a), (
§R1∪R2§R1∪R2 §R 1 ∩R 2 R1-R2R1-R2 2.4 Operations on Relations.
CSCI 4325 / 6339 Theory of Computation Zhixiang Chen.
Refining the Nonterminal Complexity of Graph-controlled Grammars Faculty of Informatics TU Wien, Wien, Austria Rudolf FREUND Wilhelm-Schickard-Institut.
Section 9.1. Section Summary Relations and Functions Properties of Relations Reflexive Relations Symmetric and Antisymmetric Relations Transitive Relations.
1/29/02CSE460 - MSU1 Nondeterminism-NFA Section 4.1 of Martin Textbook CSE460 – Computability & Formal Language Theory Comp. Science & Engineering Michigan.
Implementation of Haskell Modules for Automata and Sticker Systems
Formal Language & Automata Theory
Context Sensitive Grammar & Turing Machines
Formal Language & Automata Theory
Chapter 2.6 (Part 1): Prove Statements about Segments and Angles
Lecture 9 Theory of AUTOMATA
Regular Grammar.
Regular Grammars.
Chapter 7 Regular Grammars
Dipartimento di Informatica e Applicazioni, Univ. of Salerno, ITALY
Математици-юбиляри.
4-5 Proving Congruence Included side: the side between the 2 angles used. AB is the included side between angles A and B. BC is the included side between.
Chapter 1 Introduction to the Theory of Computation
Clements MAΘ October 30th, 2014
LECTURE # 07.
Presentation transcript:

DNA and splicing (circular) Dipartimento di Informatica Sistemistica e Comunicazioni, Univ. di Milano - Bicocca ITALY Dipartimento di Informatica e Applicazioni, Univ. di Salerno, ITALY Paola Bonizzoni, Clelia De Felice, Giancarlo Mauri, Rosalba Zizza Circular splicing, definitions State of the art Our contributions Works in progress

<<An important aspect of this years meeting can be summed up us: SHOW ME THE EXPERIMENTAL RESULT! >> (T. Amenyo, Informal Report on 3rd Annual DIMACS Workshop on DNA Computing, 1997) We apologize... theoretical results

Before Adleman experiment (1994)... Tom Head 1987 (Bull. of Math. Biology) Formal Language Theory and DNA : an analysis of the generative capacity of specific recombinant behaviors SPLICING Unconventional models of computation

SPLICING LINEAR CIRCULAR

CIRCULAR SPLICING restriction enzyme 1 restriction enzyme 2 ligase enzymes

Circular languages: definitions and examples Conjugacy relation on A* w, w A*, w ~ w w=xy, w = yx Example abaa, baaa, aaab,aaba are conjugate A ~ = A* ~ = set of all circular words ~ w = [w] ~, w A* Circular language C A ~ set of equivalence classes A* A* ~ L Cir(L) = { ~ w | w L} (circularization of L) C L C {w A*| ~ w C}= Lin(C) (Full linearization of C) (A linearization of C, i.e. Cir(L)=C )

FA ~ ={ C A ~ | L A*, Cir(L) = C, L FA, FA Chomsky hierarchy} Definition: Theorem [Head, Paun, Pixton] C Reg Lin (C) Reg C Reg ~ Lin (C) Reg

Pauns definition Circular splicing systems (A= finite alphabet, I A ~ initial language) SC PA = (A, I, R) R A* | A* $ A* | A* rules ~hu1u2,~hu1u2, ~ku3u4~ku3u4 A ~ r = u 1 | u 2 $ u 3 | u 4 R u 2 hu 1 u4ku3u4ku3 ~ u 2 hu 1 u 4 ku 3 Definition I and closed under the application of the rules in R A circular splicing language C(SC PA ) (i.e. a circular language generated by a splicing system SC PA ) is the smallest circular language containing

Other definitions of splicing systems Heads definitionSC H = (A, I, T) T A* A* A* triples A ~ (p, x, q ), ( u,x,v) T vkux ~ hpx vkux q ~ hpxq, ~ kuxv q hpx (A= finite alphabet, I A ~ initial language) SC PI = (A, I, R) A ~ (, ; ), (, ; ) R ~ h h ~ h, ~ h h Pixtons definition R A* A* A* rules h

Problem: Theorem [ Paun96] Characterize FA ~ C(Fin, Fin) C(Reg, Fin) class of circular languages C= C(SC PA ) generated by SC PA with I and R both finite sets. F {Reg ~, CF ~, RE ~ } R +add. hyp. (symmetry, reflexivity, self-splicing) Theorem [Pixton95-96] R Fin+add. hyp. (symmetry, reflexivity) C(F, Fin) F F {Reg ~, CF ~, RE ~ } C(F, Reg) FC(Reg ~, Fin) Reg ~,

Circular finite splicing languages and Chomsky hierarchy CS ~ CF ~ Reg ~ ~ ((aa)*b) ~ (aa)* ~ (a n b n ) I= ~ aa ~ 1, R={aa | 1 $ 1 | aa} I= ~ ab ~ 1, R={a | b $ b | a}

Our contributions Reg ~ Fingerprint closed star languages X*, X regular group code Cir (X*) X finite cyclic languages weak cyclic, other examples ~ (a*ba*)* Reg ~ C(Fin, Fin)

Our contributions (continued) Comparing the three definitions of splicing systems C(SC H ) C(SC PA ) C(SC PI ) ~ (a*ba*)*, ~ ((aa)*b) =... ?

Star languages L A* is star language if L is regular, closed under conjugacy relation and L=X*, with X regular Proposition: SC PA =(A,I,R), I Cir(X*) C(SC PA ) Cir (X*) Consistence easily follows!!! Examples (b*(ab*a)*)* = X* (a*ba*)* = X* X=b ab*a X= a*ba* Definition

Fingerprint closed languages Definition For any cycle c, L contains the Fingerprints of c Fingerprint of a cycle c n c L power of the cycle, where the internal cycles are crossed a finite number of times c=(x(y(zz) j y) i x) n c i n y, j n x c q0q0 x x y y z z q0q0

Fingerprint closed star languages C(Fin,Fin) Theorem I=Cir({successful path containing fingerprint of cycles}) R={1 | 1 $ 1 | ƒ | ƒ fingerprint of cycle c, for any cycle c} Star languages not fingerprint closed (a*ba*)* but not generated!!! Star languages fingerprint closed X*, X regular group code X finite, Cir(X*) Sketch Take SC PA = (A, I, R) with (for example X=b ab*a) (for example X=A d )

Not Star Languages in C(Fin, Fin) new! Definition Cyclic(z) ={( ~ (z* p)) | p Pref (Lin( ~ z))} Example Cyclic(abc)= ~ (abc)*a ~ (abc)*ab ~ (abc)*b ~ (abc)*bc ~ (abc)*c ~ (abc)*ca z = abc A* Lin ( ~ z) =Lin ( ~ abc) ={abc, bca,cab} Pref(Lin ( ~ z)) =Pref(Lin ( ~ abc)) =Pref({abc, bca,cba}) = {a, ab, b, bc, c, ca} Cyclic Languages

Theorem Cyclic(z) C(Fin,Fin) The proof is quite technical... Example (continued) Cyclic (abc) is generated by SC PA = (A,I,R) where I,R are defined as follows I={ ~ ((abc) i p | 0 i 3, p Pref(Lin( ~ (abc))) } R={z ab | z $ z | ca z, z ab | z $ z b | c z, z ca | z $ z $ bc z, z a | z $ z | b z, z b | z $ z $ c z, z c | z $ z | a z } For any z, |z|>2, z unbordered word, then i.e. z uA* A*u

Other circular regular splicing languages ~ (abc)*a ~ (abc)*ab ~ (abc)*b ~ (abc)*bc ~ (abc)*c ~ (abc)*ca Cyclic(abc) ~ (abc)*ac weak cyclic languages Cyclic (abca).... bordered word...

Works in progress Characterize Reg ~ C(Fin, Fin) Characterize FA ~ C(Fin, Fin) C(SC PI ) = Star languages Additional hypothesis r= u 1 | u 2 $ u 3 | u 4 in R Reflexive: r = u 1 | u 2 $ u 1 | u 2 Symmetric: r = u 3 | u 4 $ u 1 | u 2 Self-splicing: From ~ xu 1 u 2 yu 3 u 4, with r,r as above, generates ~ u 4 xu 1, ~ u 2 yu 3.

DNA6 auditorium Thanks!