Exact reconstruction of finite memory automata with the GSPS And a surprising application to the reconstruction of cellular automata James Nutaro

Slides:



Advertisements
Similar presentations
1 Approximate string matching using factor automata J. Holub and B. Melichar Theoretical Computer Science vol.249 p Speaker: L. C. Chen Advisor:
Advertisements

4b Lexical analysis Finite Automata
Theory Of Automata By Dr. MM Alam
L ECTURE 3 T HEORY OF AUTOMATA. E QUIVALENT R EGULAR E XPRESSIONS Definition Two regular expressions are said to be equivalent if they generate the same.
Finite Automata CPSC 388 Ellen Walker Hiram College.
1 1 CDT314 FABER Formal Languages, Automata and Models of Computation Lecture 3 School of Innovation, Design and Engineering Mälardalen University 2012.
YES-NO machines Finite State Automata as language recognizers.
Theory of Computation What types of things are computable? How can we demonstrate what things are computable?
1 Approximate string matching using factor automata Jan Holub and Borivoj Melichar Theoretical Computer Science vol.249 p Speaker: L. C. Chen Advisor:
1 CSCI-2400 Models of Computation. 2 Computation CPU memory.
Finite Automata Finite-state machine with no output. FA consists of States, Transitions between states FA is a 5-tuple Example! A string x is recognized.
1 The scanning process Goal: automate the process Idea: –Start with an RE –Build a DFA How? –We can build a non-deterministic finite automaton (Thompson's.
COMMONWEALTH OF AUSTRALIA Copyright Regulations 1969 WARNING This material has been reproduced and communicated to you by or on behalf of Monash University.
1 Lecture 16 FSA’s –Defining FSA’s –Computing with FSA’s Defining L(M) –Defining language class LFSA –Comparing LFSA to set of solvable languages (REC)
COMMONWEALTH OF AUSTRALIA Copyright Regulations 1969 WARNING This material has been reproduced and communicated to you by or on behalf of Monash University.
1 Lecture 16 FSA’s –Defining FSA’s –Computing with FSA’s Defining L(M) –Defining language class LFSA –Comparing LFSA to set of solvable languages (REC)
Aho-Corasick String Matching An Efficient String Matching.
A sample processing of an input molecule. S0 S1 a a b b A1: even number of b’s Automaton A1 accepting inputs with an even number of b ’s.
Derrick Coetzee, Microsoft Research CC0 waiverCC0 waiver: To the extent possible under law, I waive all copyright and related or neighboring rights to.
Costas Busch - RPI1 CSCI-2400 Models of Computation.
CS 3240 – Chuck Allison.  A model of computation  A very simple, manual computer (we draw pictures!)  Our machines: automata  1) Finite automata (“finite-state.
Topic : algorithms on FSA -- M.Mohri,on some applications of Finite- state automata theory to natural language processing. Natural Language Eng 1 (1996)
Parallelization: Conway’s Game of Life. Cellular automata: Important for science Biology – Mapping brain tumor growth Ecology – Interactions of species.
Lecture # 1 (Automata Theory)
Formal Methods in SE Theory of Automata Qasiar Javaid Assistant Professor Lecture # 06.
1 For each language, give a regular expression that generates the language and a DFA that accepts it. L 1 = { w  {0, 1}* : has both 00 and 11 as substrings}
MA354 Mathematical Modeling T H 2:45 pm– 4:00 pm Dr. Audi Byrne.
1 Computability Five lectures. Slides available from my web page There is some formality, but it is gentle,
4b 4b Lexical analysis Finite Automata. Finite Automata (FA) FA also called Finite State Machine (FSM) –Abstract model of a computing entity. –Decides.
Module 2 How to design Computer Language Huma Ayub Software Construction Lecture 8.
L ECTURE 3 Chapter 4 Regular Expressions. I MPORTANT T ERMS Regular Expressions Regular Languages Finite Representations.
1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 3 Mälardalen University 2010.
Inferring Finite Automata from queries and counter-examples Eggert Jón Magnússon.
Models of Computation. Computation: Computation is a general term for any type of information processing information processing CPU memory.
Fundamentals of Informatics
MA354 An Introduction to Math Models (more or less corresponding to 1.0 in your book)
An Enhanced Cellular Automata and Image Pyramid Decomposition Based Algorithm for Image Segmentation : A New Concept Anand Prakash Shukla Suneeta Agarwal.
Three Basic Concepts Languages Grammars Automata.
TFA: A Tunable Finite Automaton for Regular Expression Matching Author: Yang Xu, Junchen Jiang, Rihua Wei, Yang Song and H. Jonathan Chao Publisher: ACM/IEEE.
CS 154 Formal Languages and Computability February 4 Class Meeting Department of Computer Science San Jose State University Spring 2016 Instructor: Ron.
Lecture # 4.
Mathematical Foundations of Computer Science Chapter 3: Regular Languages and Regular Grammars.
Overview of Previous Lesson(s) Over View  A token is a pair consisting of a token name and an optional attribute value.  A pattern is a description.
BİL711 Natural Language Processing1 Regular Expressions & FSAs Any regular expression can be realized as a finite state automaton (FSA) There are two kinds.
1 Section 11.2 Finite Automata Can a machine(i.e., algorithm) recognize a regular language? Yes! Deterministic Finite Automata A deterministic finite automaton.
Lecture 03: Theory of Automata:2014 Asif Nawaz Theory of Automata.
Akram Salah ISSR Basic Concepts Languages Grammar Automata (Automaton)
1 1. Eliminate all  -transitions from the following FA without changing the number of states and the language accepted by the automaton. You should also.
L ECTURE 3 T HEORY OF AUTOMATA. E QUIVALENT R EGULAR E XPRESSIONS Definition Two regular expressions are said to be equivalent if they generate the same.
Compilers Lexical Analysis 1. while (y < z) { int x = a + b; y += x; } 2.
Recap Lecture 3 RE, Recursive definition of RE, defining languages by RE, { x}*, { x}+, {a+b}*, Language of strings having exactly one aa, Language of.
Theory of Computation Lecture #
Implementation of Haskell Modules for Automata and Sticker Systems
Lexical analysis Finite Automata
Theory of Automata.
CSC312 Automata Theory Chapter # 5 by Cohen Finite Automata
Chapter 7 PUSHDOWN AUTOMATA.
[Week#03,04] (b) - Finite Automata
REGULAR LANGUAGES AND REGULAR GRAMMARS
THEORY OF COMPUTATION Lecture One: Automata Theory Automata Theory.
CSCI-2400 Models of Computation Costas Busch - RPI.
Physics-based simulation for visual computing applications
Principles of Computing – UFCFA3-30-1
4b Lexical analysis Finite Automata
MA/CSSE 474 Theory of Computation Minimizing DFSMs.
4b Lexical analysis Finite Automata
CSC312 Automata Theory Chapter # 5 by Cohen Finite Automata
Duo, trio and tetrad methods: Is there a difference?
Recap Lecture 3 RE, Recursive definition of RE, defining languages by RE, { x}*, { x}+, {a+b}*, Language of strings having exactly one aa, Language of.
Presentation transcript:

Exact reconstruction of finite memory automata with the GSPS And a surprising application to the reconstruction of cellular automata James Nutaro

Reconstruction with the GSPS Begin with one or more time series Hypothesize a relationship between the variables in these time series – Visualized as a mask with squares for output and circles for input Construct an input-output model from the mask tv1v2v1v2 7ABAB 6ABAB 5BABB 4ABBA 3BBBA 2BAAB 1BBAA v1(t)v1(t-1)v2(t)% AAB100 (3/3) ABB BAA100 (2/2) BBA50 (2/4) B

The reconstruction procedure, step #1 tv1v2v1v2 7ABAB 6ABAB 5BABB 4ABBA 3BBBA 2BAAB 1BBAA v1(t)v1(t-1)v2(t)% AAB100 (1/1) Input observation Output observation v2(t)=f(v1(t),v1(t-1)) B=f(A,A)

The reconstruction procedure, step #2 tv1v2v1v2 7ABAB 6ABAB 5BABB 4ABBA 3BBBA 2BAAB 1BBAA v1(t)v1(t-1)v2(t)% AAB100 (1/1) ABB Input observation Output observation v2(t)=f(v1(t),v1(t-1)) B=f(A,B)

The reconstruction procedure, step #3 tv1v2v1v2 7ABAB 6ABAB 5BABB 4ABBA 3BBBA 2BAAB 1BBAA v1(t)v1(t-1)v2(t)% AAB100 (1/1) ABB BAA Input observation Output observation v2(t)=f(v1(t),v1(t-1)) A=f(B,A)

f may not be deterministic tv1v2v1v2 7ABAB 6ABAB 5BABB 4ABBA 3BBBA 2BAAB 1BBAA v1(t)v1(t-1)v2(t)% AAB100 (1/1) ABB BAA BBA50 (1/2) B Input observation Output observation v2(t)=f(v1(t),v1(t-1)) B=f(B,B) A=f(B,B)

Simulation with the GSPS Begin with first observation and observations of all data not generated by the model Generate subsequent observations with the model tv1v2 7A 6B 5A 4A 3BB 2BA 1BB v1(t)v1(t-1)v2(t)% AAB100 (3/3) ABB BAA100 (2/2) BBA50 (2/4) B

A simulation with the GSPS, step #1 First observation is v1(t)=v1(t-1)=B Outcome is A with 50% change and B with 50 % – A selected at random tv1v2 7A 6B 5A 4A 3B 2BA 1BB v1(t)v1(t-1)v2(t)% AAB100 (3/3) ABB BAA100 (2/2) BBA50 (2/4) B

A simulation with the GSPS, step #2 Second observation is v1(t)=v1(t-1)=B Outcome is A with 50% change and B with 50 % – B selected at random tv1v2 7A 6B 5A 4A 3BB 2BA 1BB v1(t)v1(t-1)v2(t)% AAB100 (3/3) ABB BAA100 (2/2) BBA50 (2/4) B

Finite memory automata

Examples of finite memory automata a b 0/0 1/1 0/0 a b 1/0 1/1 0/1 0/0

Not a finite memory automaton a b 1/1 0/1 0/0 Consider the input string What is the outcome? We can’t know.

GSPS and finite memory automata txy 7 6 Given a complete set of observations of a finite memory automaton, there is a mask that can exactly reconstruct its input/output behavior. This mask is the one corresponding to the function The number of unique entries in a complete set of observations is at most txy 7

GSPS and stochastic, finite memory automata a 1/1 1/ Example of a stochastic automaton with single input, single state, and two outputs.

Cellular automata

Wolfram’s rule # txyxy 5WBBW 4BBWW 3BWWB 2WWBB 1WBBW

Reconstruction of Wolfram’s rule #24 txyxy 5WBBW 4BBWW 3BWWB 2WWBB 1WBBW x(t-1)y(t-1)y(t) WWW WBW BWB BBB Reconstruction Simulation

Activity in cellular automata

Activity and computational costs

Is exact reconstruction of highly active systems feasible? Problem posed by highly active systems The necessary data grows exponentially with the variety of input and output – Exponential growth factor increases with the memory Can quickly reach peta- and exa- scale data Taming activity: directions for research Simplification – Preserve essential behaviors while reducing the level of activity High performance computing – GSPS algorithms implemented for large-scale computing and storage systems

In conclusion…a curious example of simplification and HPC Simulated tumor growth at day 90 beginning from 5 occupied pixels on day 1. Expected error in size of the tumor’s bounding box at 90 days is 3 pixels. Simplification: GSPS model has c. 190,000 possible observations at each cell; biological model has millions. Computing: Divide and conquer type parallel algorithm for constructing the GSPS table; required c. 2 days of computing on four cores to process c. 250,000,000 time series. Software for this (a) Biologically based simulation (b) GSPS simulation based on data produced by (a)