Property testing of Tree Regular Languages Frédéric Magniez, LRI, CNRS Michel de Rougemont, LRI, University Paris II.

Slides:



Advertisements
Similar presentations
1 Property testing and learning on strings and trees Michel de Rougemont University Paris II & LRI Joint work with E. Fischer, Technion, F. Magniez, LRI.
Advertisements

Hardness of testing 3- colorability in bounded degree graphs Andrej Bogdanov Kenji Obata Luca Trevisan.
The Primal-Dual Method: Steiner Forest TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A AA A A A AA A A.
December 2, 2009 IPAM: Invariance in Property Testing 1 Invariance in Property Testing Madhu Sudan Microsoft/MIT TexPoint fonts used in EMF. Read the TexPoint.
CS 336 March 19, 2012 Tandy Warnow.
Great Theoretical Ideas in Computer Science for Some.
Connectivity - Menger’s Theorem Graphs & Algorithms Lecture 3.
Bart Jansen 1.  Problem definition  Instance: Connected graph G, positive integer k  Question: Is there a spanning tree for G with at least k leaves?
Deterministic vs. Non-Deterministic Graph Property Testing Asaf Shapira Tel-Aviv University Joint work with Lior Gishboliner.
Gillat Kol joint work with Ran Raz Locally Testable Codes Analogues to the Unique Games Conjecture Do Not Exist.
1 Nondeterministic Space is Closed Under Complement Presented by Jing Zhang and Yingbo Wang Theory of Computation II Professor: Geoffrey Smith.
Lecture 24 Coping with NPC and Unsolvable problems. When a problem is unsolvable, that's generally very bad news: it means there is no general algorithm.
Lecture 16: DFS, DAG, and Strongly Connected Components Shang-Hua Teng.
Global Flow Optimization (GFO) in Automatic Logic Design “ TCAD91 ” by C. Leonard Berman & Louise H. Trevillyan CAD Group Meeting Prepared by Ray Cheung.
1 Markov Decision Processes: Approximate Equivalence Michel de Rougemont Université Paris II & LRI
CS774. Markov Random Field : Theory and Application Lecture 17 Kyomin Jung KAIST Nov
Approximation Algorithms for Unique Games Luca Trevisan Slides by Avi Eyal.
Christian Sohler | Every Property of Hyperfinite Graphs is Testable Ilan Newman and Christian Sohler.
Artur Czumaj Dept of Computer Science & DIMAP University of Warwick Testing Expansion in Bounded Degree Graphs Joint work with Christian Sohler.
Gillat Kol joint work with Ran Raz Locally Testable Codes Analogues to the Unique Games Conjecture Do Not Exist.
Learning and Fourier Analysis Grigory Yaroslavtsev CIS 625: Computational Learning Theory.
Testing the Diameter of Graphs Michal Parnas Dana Ron.
Sparse Random Linear Codes are Locally Decodable and Testable Tali Kaufman (MIT) Joint work with Madhu Sudan (MIT)
Testing of Clustering Noga Alon, Seannie Dar Michal Parnas, Dana Ron.
Graphs and Trees This handout: Trees Minimum Spanning Tree Problem.
Validating Streaming XML Documents Luc Segoufin & Victor Vianu Presented by Harel Paz.
Michael Bender - SUNY Stony Brook Dana Ron - Tel Aviv University Testing Acyclicity of Directed Graphs in Sublinear Time.
Testing Metric Properties Michal Parnas and Dana Ron.
On Proximity Oblivious Testing Oded Goldreich - Weizmann Institute of Science Dana Ron – Tel Aviv University.
Is the following graph Hamiltonian- connected from vertex v? a). Yes b). No c). I have absolutely no idea v.
1 Separator Theorems for Planar Graphs Presented by Shira Zucker.
Steiner trees Algorithms and Networks. Steiner Trees2 Today Steiner trees: what and why? NP-completeness Approximation algorithms Preprocessing.
A Tutorial on Property Testing Dana Ron Tel Aviv University.
Model Checking Lecture 5. Outline 1 Specifications: logic vs. automata, linear vs. branching, safety vs. liveness 2 Graph algorithms for model checking.
Some 3CNF Properties are Hard to Test Eli Ben-Sasson Harvard & MIT Prahladh Harsha MIT Sofya Raskhodnikova MIT.
Correlation testing for affine invariant properties on Shachar Lovett Institute for Advanced Study Joint with Hamed Hatami (McGill)
Approximating the MST Weight in Sublinear Time Bernard Chazelle (Princeton) Ronitt Rubinfeld (NEC) Luca Trevisan (U.C. Berkeley)
Japanjune The correction of XML data Université Paris II & LRI Michel de Rougemont 1.Approximation and Edit Distance.
1 Approximate Satisfiability and Equivalence Michel de Rougemont University Paris II & LRI Joint work with E. Fischer, Technion, F. Magniez, LRI, LICS.
Transitive-Closure Spanner of Directed Graphs Kyomin Jung KAIST 2009 Combinatorics Workshop Joint work with Arnab Bhattacharyya MIT Elena Grigorescu MIT.
1 Sublinear Algorithms Lecture 1 Sofya Raskhodnikova Penn State University TexPoint fonts used in EMF. Read the TexPoint manual before you delete this.
A Simpler 1.5-Approximation Algorithm for sorting by transposition Tzvika Hartman.
Approximate schemas Michel de Rougemont, LRI, University Paris II.
 Rooted tree and binary tree  Theorem 5.19: A full binary tree with t leaves contains i=t-1 internal vertices.
1 Approximate Schemas and Data Exchange Michel de Rougemont University Paris II & LRI Joint work with Adrien Vielleribière, University Paris-South.
Approximate schemas Michel de Rougemont, LRI, University Paris II Joint work with E. Fischer, Technion, F. Magniez, LRI.
1 Approximate Data Exchange Michel de Rougemont Adrien Vieilleribière University Paris II & LRI University Paris-Sud & LRI ICDT 2007.
Artur Czumaj DIMAP DIMAP (Centre for Discrete Maths and it Applications) Computer Science & Department of Computer Science University of Warwick Testing.
狄彥吾 (Yen-Wu Ti) 華夏技術學院資訊工程系 Property Testing on Combinatorial Objects.
CSE 421 Algorithms Richard Anderson Winter 2009 Lecture 5.
Testing Low-Degree Polynomials over GF(2) Noga AlonSimon LitsynMichael Krivelevich Tali KaufmanDana Ron Danny Vainstein.
5.6 Prefix codes and optimal tree Definition 31: Codes with this property which the bit string for a letter never occurs as the first part of the bit string.
Algorithms for Big Data: Streaming and Sublinear Time Algorithms
On Sample Based Testers
Property Testing (a.k.a. Sublinear Algorithms )
Algorithms for Finding Distance-Edge-Colorings of Graphs
New Characterizations in Turnstile Streams with Applications
Approximating the MST Weight in Sublinear Time
What is the next line of the proof?
Lecture 18: Uniformity Testing Monotonicity Testing
(edited by Nadia Al-Ghreimil)
Structural graph parameters Part 2: A hierarchy of parameters
On Inferring K Optimum Transformations of XML Document from Update Script to DTD Nobutaka Suzuki Graduate School of Library, Information and Media Studies.
Approximate Validity of XML Streaming Data
Algebraic Property Testing
On the effect of randomness on planted 3-coloring models
(edited by Nadia Al-Ghreimil)
Lecture 6: Counting triangles Dynamic graphs & sampling
Richard Anderson Lecture 5 Graph Theory
Treewidth meets Planarity
Presentation transcript:

Property testing of Tree Regular Languages Frédéric Magniez, LRI, CNRS Michel de Rougemont, LRI, University Paris II

1.Tester for regular words with the Edit Distance with Moves 2. Tester for ranked regular trees with the Tree-Edit Distance with Moves, Property testing of Tree Regular Languages

Let F be a property on a class K of structures U An ε -tester for F is a probabilistic algorithm A such that: If U |= F, A accepts If U is ε far from F, A rejects with high probability Time(A) independent of n. (Goldreich, Golwasser, Ron 1996, Rubinfeld, Sudan 1994) Tester usually implies a linear time corrector. Testers on a class K

History of Testers Self-testers and correctors for Linear Algebra,Blum & Kanan 1989 Robust characterizations of polynomials, R. Rubinfeld, M. Sudan, 1994 Testers for graph properties : k-colorability, Goldreich and al graph properties have testers, Alon and al Regular languages have testers, Alon and al. 2000s Testers for Regular tree languages, Mdr and Magniez, ICALP 2004

1.Classical Edit Distance: Insertions, Deletions, Modifications 2.Edit Distance with moves Edit Distance with Moves generalizes to Trees Edit distance on Words

Testers on words Simpler proof which generalizes to regular trees. L is a regular language and A an automaton for L. Admissible Z= A word W is Z-feasible if there are two states init accept

The Tester For every admissible path Z: else REJECT. Theorem: Tester(W,A, ε ) is an ε -tester for L(A). Tester. Input : W,A, ε

Proof schema of the Tester Theorem: Regular words are testable. Robustness lemma: If W is ε-far from L, then for every admissible path Z, there exists such that the number of Z-infeasible subwords Splitting lemma: if W is far from L there are many disjoint infeasible subwords. Amplifying lemma: If there are many infeasible words, there are many short ones.

Merging Merging lemma: Let Z be an admissible path, and let F be a Z- feasible cut of size h’. Then C CC C C C Take each word and split it along its connected components, removing single letters. Rearrange all the words of the same component in its Z-order. Add gluing words to obtain W’ in L:

Splitting Splitting lemma: If Z is an admissible path, W a word s.t. dist(W,L) > h, then W has Proof by contraposition:

Tree-Edit-Distance a e b cd a e b c a e b c d f e Deletion Edge Insertion Node and Label Tree Edit distance with moves: a e b cd a e b cd 1 move Distance Problem is NP-complete, non-approximable.

Binary trees : Distance with moves allows permutations Tree-Edit-Distance on binary trees Distance(T1,T2) =4 m-Distance (T1,T2) =2

(q0, q0)  q1 (q0,q1)  q1 Tree automata q0 q1 q0 q1 q2 (q1,q1)  q2 (q1,q0)  q2 (q2,-)  q2 (-,q2)  q2

Fact. If then the number of infeasible subtrees of constant size is O(n). Infeasible subtrees

Tester for regular Trees Theorem: Tester(T,A, ε ) is an ε -tester for L(A). Tester. Input : T,A,

Proof schema of the Tester Theorem: Regular trees are testable. Robustness lemma: If T is ε-far from L, then for every admissible path Z, there exists such that the number of Z-infeasible i-subtrees Splitting lemma: if T is far from L there are many disjoint infeasible subtrees. Amplifying lemma: If there are many infeasible subtrees, there are many small ones.

Splitting and Merging C CC C C C Splitting and Merging on words: Splitting and Merging on trees:

Splitting and Merging trees C D D C C E Connected Components Corrected tree

Conclusion Verification is hard. Approximate verification can be feasible. 1.Testers and Correcters for regular words 2.Tester for regular trees 3.Corrector for regular trees 4.Unranked trees: XML files 5.Applications: Constant algorithm for Edit Distance with moves (Fischer, Magniez, Mdr)