Automated Reasoning for Classifying Finite Algebras Simon Colton Computational Bioinformatics Laboratory Imperial College, London.

Slides:



Advertisements
Similar presentations
Automated Theorem Proving Lecture 1. Program verification is undecidable! Given program P and specification S, does P satisfy S?
Advertisements

Presented by: Ms. Maria Estrellita D. Hechanova, ECE
ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Introduction The concept of transform appears often in the literature of image processing and data compression. Indeed a suitable discrete representation.
Computing Fundamentals 2 Introduction to CafeOBJ Lecturer: Patrick Browne Lecture Room: K408 Lab Room: A308 Based on work by: Nakamura Masaki, João Pascoal.
Types of Logic Circuits
Mathematics of Cryptography Part II: Algebraic Structures
Introducing Formal Methods, Module 1, Version 1.1, Oct., Formal Specification and Analytical Verification L 5.
Copyright © Cengage Learning. All rights reserved.
Automated Exploration of Bioinformatics Spaces Simon Colton Computational Bioinformatics Laboratory.
1.  Detailed Study of groups is a fundamental concept in the study of abstract algebra. To define the notion of groups,we require the concept of binary.
1. Introduction Consistency of learning processes To explain when a learning machine that minimizes empirical risk can achieve a small value of actual.
Tirgul 10 Rehearsal about Universal Hashing Solving two problems from theoretical exercises: –T2 q. 1 –T3 q. 2.
CSE115/ENGR160 Discrete Mathematics 02/07/12
Hidden pairings and trapdoor DDH groups Alexander W. Dent Joint work with Steven D. Galbraith.
Describing Syntax and Semantics
Copyright © Cengage Learning. All rights reserved.
THE TRANSITION FROM ARITHMETIC TO ALGEBRA: WHAT WE KNOW AND WHAT WE DO NOT KNOW (Some ways of asking questions about this transition)‏
1 Topic Mathematical Proofs. 2 Topic Mathematical Proofs California Standards: 24.2 Students identify the hypothesis and conclusion in logical.
Automated Puzzle Generation Simon Colton Universities of Edinburgh and York.
Introduction to Computer Science. A Quick Puzzle Well-Formed Formula  any formula that is structurally correct  may be meaningless Axiom  A statement.
Automated Theory Formation for Tutoring Tasks in Pure Mathematics Simon Colton, Roy McCasland, Alan Bundy, Toby Walsh.
ILP for Mathematical Discovery Simon Colton & Stephen Muggleton Computational Bioinformatics Laboratory Imperial College.
The HOMER System for Discovery in Number Theory Simon Colton Imperial College, London.
Lakatos-style Methods in Automated Reasoning Alison Pease University of Edinburgh Simon Colton Imperial College, London.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Automated Theory Formation: First Steps in Bioinformatics Simon Colton Computational Bioinformatics Laboratory.
The TM System for Repairing Non-Theorems Alison Pease – University of Edinburgh Simon Colton – Imperial College, London.
Nattee Niparnan. Easy & Hard Problem What is “difficulty” of problem? Difficult for computer scientist to derive algorithm for the problem? Difficult.
Automated Theory Formation in Bioinformatics Simon Colton Computational Bioinformatics Lab Imperial College, London.
Empirical Explorations with The Logical Theory Machine: A Case Study in Heuristics by Allen Newell, J. C. Shaw, & H. A. Simon by Allen Newell, J. C. Shaw,
LDK R Logics for Data and Knowledge Representation Modeling First version by Alessandro Agostini and Fausto Giunchiglia Second version by Fausto Giunchiglia.
1 Inference Rules and Proofs (Z); Program Specification and Verification Inference Rules and Proofs (Z); Program Specification and Verification.
Descriptive ILP for Mathematical Discovery Simon Colton Computational Bioinformatics Lab Department of Computing Imperial College, London.
A Theory of Theory Formation Simon Colton Universities of Edinburgh and York.
Temperature Readings The equation to convert the temperature from degrees Fahrenheit to degrees Celsius is: c(x) = (x - 32) The equation to convert the.
Properties of Real Numbers. Sets In mathematics, a set is a collection of things Sets can be studies as a topic all on its own (known as set theory),
Edinburgh and Calculemus Simon Colton Universities of Edinburgh and York.
Lecture Introduction to Proofs 1.7 Proof Methods and Strategy.
The HR Program for Theorem Generation Simon Colton Mathematical Reasoning Group University of Edinburgh.
Making Conjectures About Maple Functions Simon Colton Universities of Edinburgh & York.
A Theory of Theory Formation Simon Colton Universities of Edinburgh and York.
MA10209 – Week 6 Tutorial B3/B4, Andrew Kennedy. people.bath.ac.uk/aik22/ma10209 Top Tips (response to sheet 5)  Proof by example is not a proof at all.
A Logic of Partially Satisfied Constraints Nic Wilson Cork Constraint Computation Centre Computer Science, UCC.
Boolean Algebra and Computer Logic Mathematical Structures for Computer Science Chapter 7.1 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesBoolean Algebra.
Automated Discovery in Pure Mathematics Simon Colton Universities of Edinburgh and York.
The Homer System Simon Colton – Imperial College, London Sophie Huczynska – University of Edinburgh.
DISCRETE COMPUTATIONAL STRUCTURES CSE 2353 Fall 2010 Most slides modified from Discrete Mathematical Structures: Theory and Applications by D.S. Malik.
CSE Winter 2008 Introduction to Program Verification January 15 tautology checking.
1.2 Logical Reasoning page 9. Inductive Reasoning: Reasoning that is based on patterns you observe. Conjecture: A conclusion that is reached using inductive.
Math 344 Winter 07 Group Theory Part 2: Subgroups and Isomorphism
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
{ What is a Number? Philosophy of Mathematics.  In philosophy and maths we like our definitions to give necessary and sufficient conditions.  This means.
Math 344 Winter 07 Group Theory Part 1: Basic definitions and Theorems.
Automatic Generation of First Order Theorems Simon Colton Universities of Edinburgh and York Funded by EPSRC grant GR/M98012 and the Calculemus Network.
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
Automated Theorem Discovery Simon Colton Universities of Edinburgh and York.
Calculation Invention and Deduction Dr. Simon Colton Imperial College London (Formerly at Edinburgh) YVR in Karlsruhe & Saarbrucken.
Machine Creativity Edinburgh Simon Colton Universities of Edinburgh and York.
Boolean Algebra and Computer Logic Mathematical Structures for Computer Science Chapter 7 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesBoolean Algebra.
Discovery Systems Author: Kenneth W. Hasse Jr. Presenter: Peter Yoon.
ESFOR Panel Application Developers’ Wish Lists for Automated Theorem Provers.
Math 3121 Abstract Algebra I Lecture 6 Midterm back over+Section 7.
CSE15 Discrete Mathematics 02/08/17
Knowledge Representation
Number Line Where are you on the learning journey?
Section 10.2 Finite Mathematical Systems
Finite Mathematical Systems
Presentation transcript:

Automated Reasoning for Classifying Finite Algebras Simon Colton Computational Bioinformatics Laboratory Imperial College, London

Joint work with Roy McCasland (Edinburgh) –Mathematical insights Andreas Meier (Saarbrucken) –Theorem proving expertise Volker Sorge (Birmingham) –ATP and CAS expertise Truly collaborative –i.e., I may not be able to answer some questions

Classification of Finite Algebras Major driving force in mathematics –E.g., Kronecker’s 1870 classification of Abelian groups –Also, 1980 classification of finite simple groups For loops and quasigroups, etc. –Large numbers of isomorphism/isotopy classes –E.g., 109 loops of size, 1441 quasigroups of size 5 Computational approaches have been used –In a quantitative, rather than a qualitative way –E.g., existence of QGX quasigroups of certain sizes

The Task We Set Ourselves Write a system which can… Be given only a particular size and an algebraic specification (in terms of a set of axioms) And produce a fully verified classification theorem –Which can be used to classify algebras of that size Up to isomorphism As a simple example –Given the axioms of group theory and the size 6 –Our system proves that groups of size six are either Abelian or non-Abelian up to isomorphism

The Tools We Used Automated Reasoning: –Spass theorem prover –MACE-4 model generator –Omega proof planning system Machine Learning: –HR automated theory formation system –C4.5 decision tree learner Computer Algebra –Gap system

Why Machine Learning? Why are these two algebras non-isomorphic? Did you use deduction (only) to show this? My problem with the term “automated reasoning” Doesn’t include inductive reasoning abcd aabcd bbacd ccbad ddbca abcd aabcd bbdca ccbad dabcd

The HR System Starts with minimal information –E.g., dividing two numbers, ring theory axioms Produces a rich theory containing: –Examples, concepts, conjectures, proofs 15 Generic production rules form concepts 20+ Measures of interestingness –Drive a best-first search Conjecture making performed empirically Theorem proving/disproving by third party software –Usually Otter and MACE

Approach One Use MACE (+isofilter) to produce: –A single example of each isomorphism class Use HR to form a theory: –With a concept describing each class uniquely Use Spass to: –Verify MACE’s results That each example satisfies axioms Every algebra is isomorphic to one of the classes –Verify HR’s results That each example has the concept’s property –Prove that each concept is a classifier Discriminant and isomorphism-class theorems are true

Approaches Two and Three Same as approach 1 But HR allowed to stop before it has found a classifying concept for each class –In many cases, this is necessary Approach 2: use Prolog to combine concepts Approach 3: use C4.5 to learn a decision tree –Problem: sometimes sub-optimal trees produced

Example Discriminating Concept First one: –Idempotent element appearing twice on the diagonal

Difficulties and Lessons Learned Difficulty 1: –MACE intermediate files > 4GB –Solution: don’t require generation of all isomorphism classes Difficulty 1: –HR has trouble with more than 6 or 7 examples –Solution: only use HR to discriminate a few examples (pairs) Difficulty 2: –Spass has trouble with sizes greater than 6 or 7 –(Partial) solution: use CAS to describe problem in terms of generators and relations (decrease potential mappings)

Approach Four (Bootstrapping) Want fully automated decision tree process –See IJCAR’04 paper for full algorithm description Step 1: MACE produces a non-isomorphic pair Step 2: HR discriminates the pair Step 3: Spass proves that some discriminants are actually classifiers Step 4: For non-classifiers, use MACE to produce a non-iso pair which both have the property –If successful, go to step 2 –If not, use Spass to prove it’s a dead-end

Example Decision Tree

Nice Result in Group Theory (Produced by Approach 1) Class 1: -(exists b (-(inv(b)=b))) Class 2: exists b c (-(inv(b)=b) & c*c=b) Class 3: -(exists b (inv(b)=b & -(exists c d (commutator(d,c)=b))) Class 4: exists b c d (b*c=d & -(c*b=d) & inv(d)=d) Class 5: none of the above

In English… Groups of order 8 can be classified according to the self-inverse (inv(x)=x) elements they contain: they will either have: (i) all self inverse elements (ii) an element which squares to give a non-self inverse element (iii) no self-inverse elements which aren't also commutators (iv) a self inverse element which can be expressed as the product of two non-commutative elements (v) none of these properties

Classification Theorems Produced Using Approach 4 Generated classifying theorems for –Groups of size 4 (#2), 6 (#2), 8 (#5) –Loops of size 4 (#2), 5 (#6), 6 (#109) –Quasigroups Of size 3 (#5), 4 (#35), 5 (#1441) –Monoids of size 3 (#7) –QG4-quasigroups of size 5 (#4) –QG5-quasigroups of size 7 (#3)

Conclusions Computers can help in classification tasks –In a qualitative, as well as quantitative way –Can produce fully verified classification theorems Cannot be achieved by deduction alone –Our approach requires deduction (ATP), induction (ML), and symbolic manipulation (CAS) –Long live the Calculemus project!! Application to model generation (please ask) –Results are not conclusive yet…

Future Work #1 Improve the current system –By trying out different tools/methods SEM, FINDER for model generation SAT solvers for the ATP tasks Progol (ILP) for machine learning tasks –First test: 68% success (HR was 96%) Look at different domains –Possibly domains associated with Zariski spaces Also look at isotopy as well as isomorphism

Future Work #2 Produce general classification theorems Analysis of trees produced so far –Important concepts, etc. Generalise results over sizes –One possibility: Use smaller size decision trees as seeds for the larger trees Determine families and parameterisations of the family members –Use the counting abilities of HR May be difficult for first order provers

Future Work #3 Look at sub-algebra structures/mappings E.g., centre of a group forms a subgroup –Look for more specific results than this Look for algebras embedded within others –HR has abilities to do this –May be a tough problem for theorem proving Build up an “Atlas” for loops & quasigroups Start building more constructive classification results –E.g., using cross products, etc.

Future Work #4 Find mathematical applications of this Any help……..?