Dr. David Dailey Dr. Beverly Gocal Dr. Deborah Whitfield

Slides:



Advertisements
Similar presentations
Recap: Mining association rules from large datasets
Advertisements

Chapter 6 Languages: finite state machines
Properties of Regular Languages
Strings and Languages Operations
Chapter 3 Boolean Algebra and Logic Gate (Part 2).
Discussion #25 1/13 Discussion #25 Set Topics & Applications.
Regular Expression to NFA-  (a+ba) * a. First Parsing Step concatenate (a+ba) * a.
79 Regular Expression Regular expressions over an alphabet  are defined recursively as follows. (1) Ø, which denotes the empty set, is a regular expression.
Preliminaries/ Chapter 1: Introduction. Definitions: from Abstract to Linear Algebra.
1 Module 32 Chomsky Normal Form (CNF) –4 step process.
Lecture 07 Prof. Dr. M. Junaid Mughal
1 Section 7.1 Relations and their properties. 2 Binary relation A binary relation is a set of ordered pairs that expresses a relationship between elements.
Chapter 12: Context-Free Languages and Pushdown Automata
Boolean Algebra Dr. Bernard Chen Ph.D. University of Central Arkansas Spring 2009.
CMSC 330 Exercise: Write a Ruby function that takes an array of names in “Last, First Middle” format and returns the same list in “First Middle Last” format.
Lecture 08 Prof. Dr. M. Junaid Mughal
Chapter 2 Languages.
Theory Of Automata By Dr. MM Alam
CMSC 330: Organization of Programming Languages Theory of Regular Expressions.
MAT 360 Lecture 5 Hilbert’s axioms - Betweenness.
1 Chapter 1 Introduction to the Theory of Computation.
Grammars CPSC 5135.
Introduction to Language Theory
Module 2 How to design Computer Language Huma Ayub Software Construction Lecture 8.
L ECTURE 3 Chapter 4 Regular Expressions. I MPORTANT T ERMS Regular Expressions Regular Languages Finite Representations.
Moore automata and epichristoffel words
1S Algebra Revision! $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400.
Counting CSC-2259 Discrete Structures Konstantin Busch - LSU1.
Phrase-structure grammar A phrase-structure grammar is a quadruple G = (V, T, P, S) where V is a finite set of symbols called nonterminals, T is a set.
The Real Number System Section P.1. Set, Unions, and Intersections Part 1.
Section 12.4 Context-Free Language Topics
Language: Set of Strings
Chapter 4 Pumping Lemma Properties of Regular Languages Decidable questions on Regular Languages.
Mathematical Preliminaries
CONSENSUS THEOREM Choopan Rattanapoka.
UNIT - 2.  A binary operation on a set combines two elements of the set to produce another element of the set. a*b  G,  a, b  G e.g. +, -, ,  are.
BOOLEAN ALGEBRA Only involves in calculations of TRUE and FALSE; either be inputs or output. When a logic statement is TRUE it is assigned a Boolean logic.
1 CS 140 Discrete Mathematics Combinatorics And Review Notes.
Properties of Algebra (aka all the rules that holds the math together!)
Lecture # Book Introduction to Theory of Computation by Anil Maheshwari Michiel Smid, 2014 “Introduction to computer theory” by Daniel I.A. Cohen.
Strings and Languages Denning, Section 2.7. Alphabet An alphabet V is a finite nonempty set of symbols. Each symbol is a non- divisible or atomic object.
Lecture 2 Theory of AUTOMATA
1 Chapter Constructing Efficient Finite Automata.
Axioms for Rational Numbers 9/14-9/15. Natural Numbers – Numbers used for counting: 1, 2, 3, and so on. Whole Numbers – Natural numbers and zero: 0, 1,
Transparency No. 1 Formal Language and Automata Theory Homework 5.
1 Chapter 3 Regular Languages.  2 3.1: Regular Expressions (1)   Regular Expression (RE):   E is a regular expression over  if E is one of:
1 Section 11.3 Constructing Efficient Finite Automata First we’ll see how to transform an NFA into a DFA. Then we’ll see how to transform a DFA into a.
1 Strings and Languages Lecture 2-3 Ref. Handout p12-17.
Set, Alphabets, Strings, and Languages. The regular languages. Clouser properties of regular sets. Finite State Automata. Types of Finite State Automata.
CHAPTER TWO LANGUAGES By Dr Zalmiyah Zakaria.
Equivalence Relations and Classes
Objectives The student will be able to:
Theory of Computation Lecture #
Lecture # 2.
CSC-2259 Discrete Structures
The greatest mathematical tool of all!!
LANGUAGES Prepared by: Paridah Samsuri Dept. of Software Engineering
SLIDES FOR CHAPTER 2 BOOLEAN ALGEBRA
Aim: Full House Grid: 9 Grid Play: Calculate answer & cross it off
تصنيف التفاعلات الكيميائية
Boolean Algebra.
Boolean Algebra.
Definition: Let G = (V, T, P, S) be a CFL
Recap lecture 29 Example of prefixes of a language, Theorem: pref(Q in R) is regular, proof, example, Decidablity, deciding whether two languages are equivalent.
CS 250, Discrete Structures, Fall 2015 Nitesh Saxena
QUANTITATIVE METHODS 1 SAMIR K. SRIVASTAVA.
Chapter 1 Introduction to the Theory of Computation
Preliminaries/ Chapter 1: Introduction
Line and Angle Relationships
Practice makes perfect!
Presentation transcript:

Dr. David Dailey Dr. Beverly Gocal Dr. Deborah Whitfield

 Introduction  Graph distance  String Distance ◦ Definitions ◦ Examples ◦ Implementation ◦ Theoretical Results ◦ String Space Examples

 Distance ◦ may be defined for any structure  Overlap of the substructures of two structures ◦ Strings ◦ Graphs ◦ Algebraic structures ◦ Semi-groups ◦ Trees  Web site and web page similarity

 Past 15 years ◦ Over 20 papers on graph similarity ◦ Several more on string similarity  Semi-Group  Let T=(S, A) together with the concatenation operation, where A consists of the set of axioms ◦  x, y  S, xy  S ◦  x, y, z  S, x(yz) = (xy)z

 Graph: Let T=(S, A) together with a relation ~ where A consists of the set of axioms ◦  x, y  S, x ~ y  y ~ x ◦  x ,  (x ~ x)  String Let T=(S,A) together with an associative operation (expressed by concatenation). ◦ Then let S n be defined recursively by  S 1 = S and  S n = S x S n-1 and  S* be defined as the infinite union of ordered tuples: S 1  S 2  …  S n

 Levenshtein distance calculates minimum number of transformations  Largest shared substructure  Smallest super structure  All of these approaches are relative

 Enumerate all substructures within T and U  Union those two sets (T*  U*) =Z  |Z|-dimensional vector space  z(T) be the number of occurrences of structure z as a substructure of T  Calculate Minkowski distance d(T,U)

 Alphabet S = {a,b,c},  = abaac and  = cbaac   *= {a,b,c,ab, ba,aa,ac,aba,baa,aac, abaa, baac, abaac}   * = {a,b,c,cb,ba,aa,ac,cba, baa, aac,cbaa, baac,cbaac}  Z= { a, b, c, ab, cb, ba, aa, ac, cba, aba, baa, aac, cbaa, abaa, baac, cbaac, abaac } (underlined elements are unique to  and boldfaced are unique to  *)  Equal frequency: I = {b, c, ba, aa, ac, baa, aac, baac}  Different frequency: D={a},  Unique: O= {ab, cb, cba,aba, cbaa, abaa, cbaac, abaac}  |I| = 8, |D| = 1, and |O| = 8

 |I| +|D| +|O| = |Z| = 18.  Contribution of O is |O|  Contribution of I is 0 - substrings appear equally often  Contribution of D, in this case will be 1.  d( ,  ) = contribution(I)+ contribution(D)+ contribution(O) = 9

 A= aabc B= abcd  S= {a, a, aa, aab, aabc, ab, abc, b, bc, c}  T= {a, ab, abc, abcd, b, bc, bcd, c, cd, d}  Counts for S and T ◦ a:2 aa:1 aab:1 aabc:1 ab:1 abc:1 b:1 bc:1 c:1 ◦ a:1 ab:1 abc:1 abcd:1 b:1 bc:1 bcd:1 c:1 cd:1 d:1  Differences: a:1 aad:1 aab:1 aabc:1 ab:0 abc:0 abcd:1 b:0 bc:0 bcd:1 c:0 cd1:0 d:1  Distance (aabc, abcd) = 8

 Too tedious by hand  ipt/StringDistances.html ipt/StringDistances.html  Distance (aabc, abcd) = 8

 Conjecture: if |  |=|  |=n and  and  share no substrings in common (i.e., |I  D|=0), then d(  ) = n(n+1)  Conjecture: if |  |=|  |=n and a and b share no substrings in common (i.e., |I  D|=0), then d(  ) = n(n+1)  Lemma: if  =a n then d(  )= n 2 + n(n+1)/2  Conjecture: if |  |=|  |=n, then d(  )=d(  )=d(  )=d(  )= n 2 + n(n+1)/2

 Pretty pics

 Exhaustive substructure vector space  Calculate distance  Interesting observations used to study structure similarity based on size