Class Presentations Guidelines:  You should go beyond the assigned material  Prepare for a full class presentation (50 minutes)  e-mail to me a draft.

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

Learning from Observations Chapter 18 Section 1 – 3.
Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
Knowledge Representation
1 NP-completeness Lecture 2: Jan P The class of problems that can be solved in polynomial time. e.g. gcd, shortest path, prime, etc. There are many.
Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 3 of Data Mining by I. H. Witten, E. Frank and M. A. Hall.
Constraint Logic Programming Ryan Kinworthy. Overview Introduction Logic Programming LP as a constraint programming language Constraint Logic Programming.
Case-based Reasoning System (CBR)
LEARNING DECISION TREES
Learning decision trees derived from Hwee Tou Ng, slides for Russell & Norvig, AI a Modern Approachslides Tom Carter, “An introduction to information theory.
© Copyright Eliyahu Brutman Programming Techniques Course.
Clique Cover Cook’s Theorem 3SAT and Independent Set
Copyright © Cengage Learning. All rights reserved.
© 2005 by Prentice Hall Chapter 9 Structuring System Requirements: Logic Modeling Modern Systems Analysis and Design Fourth Edition.
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
Themes of Presentations Rule-based systems/expert systems (Catie; October 13) Software Engineering () Fuzzy Logic (Mark; Dec. 1) Configuration Systems.
Case Representation Cont’d Sources: –Chapter 3 – –
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Signals and Systems March 25, Summary thus far: software engineering Focused on abstraction and modularity in software engineering. Topics: procedures,
Computability and Modeling Computation What are some really impressive things that computers can do? –Land the space shuttle (and other aircraft) from.
Object-Oriented Software Engineering Practical Software Development using UML and Java Chapter 8: Modelling Interactions and Behaviour.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
CSCI-383 Object-Oriented Programming & Design Lecture 4.
Slides for “Data Mining” by I. H. Witten and E. Frank.
Capacity analysis of complex materials handling systems.
Nattee Niparnan. Easy & Hard Problem What is “difficulty” of problem? Difficult for computer scientist to derive algorithm for the problem? Difficult.
Help Desk System How to Deploy them? Author: Stephen Grabowski.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 20 Object-Oriented.
CS 682, AI:Case-Based Reasoning, Prof. Cindy Marling1 Chapter 3: Reasoning Using Cases In this chapter, we look at how cases are used to reason We’ve already.
11 Table 2.1 defines a correspondence between a set of percent scores and a set of letter grades. For each score from 0 to 100, there corresponds only.
LEARNING DECISION TREES Yılmaz KILIÇASLAN. Definition - I Decision tree induction is one of the simplest, and yet most successful forms of learning algorithm.
Intelligent Decision Support Systems: A Summary. Case-Based Reasoning Example: Slide Creation Repository of Presentations: -5/9/00: ONR review -8/20/00:
Source for Information Gain Formula Artificial Intelligence: A Modern Approach by Stuart Russell and Peter Norvig Chapter 18: Learning from Observations.
Similarity in CBR (Cont’d) Sources: –Chapter 4 – –
Learning from Observations Chapter 18 Through
CHAPTER 18 SECTION 1 – 3 Learning from Observations.
Next-generation databases Active databases: when a particular event occurs and given conditions are satisfied then some actions are executed. An active.
Introduction CS 3358 Data Structures. What is Computer Science? Computer Science is the study of algorithms, including their  Formal and mathematical.
Andrew S. Budarevsky Adaptive Application Data Management Overview.
Week 10Complexity of Algorithms1 Hard Computational Problems Some computational problems are hard Despite a numerous attempts we do not know any efficient.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
Unit-1 Introduction Prepared by: Prof. Harish I Rathod
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 14 Slide 1 Object-oriented Design.
Visual Information Systems Recognition and Classification.
Data Structures and Algorithms Dr. Tehseen Zia Assistant Professor Dept. Computer Science and IT University of Sargodha Lecture 1.
IDSS: Overview of Themes AI  Introduction  Overview IDT  Attribute-Value Rep.  Decision Trees  Induction CBR  Introduction  Representation  Similarity.
Graphs A ‘Graph’ is a diagram that shows how things are connected together. It makes no attempt to draw actual paths or routes and scale is generally inconsequential.
Similarity in CBR (Cont’d) Sources: –Chapter 4 – –
Chapter 7 Complex Similarity Topix. About this chapter Extends previous discussed methods The reader can choose to read about only specific methods, depending.
1 Compiler Construction (CS-636) Muhammad Bilal Bashir UIIT, Rawalpindi.
© 2005 by Prentice Hall Chapter 9 Structuring System Requirements: Logic Modeling Modern Systems Analysis and Design Fourth Edition Jeffrey A. Hoffer Joey.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
Chapter 11 Introduction to Computational Complexity Copyright © 2011 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1.
Lecture 5 Frames. Associative networks, rules or logic do not provide the ability to group facts into associated clusters or to associate relevant procedural.
Chapter 18 Section 1 – 3 Learning from Observations.
Learning From Observations Inductive Learning Decision Trees Ensembles.
16 April 2011 Alan, Edison, etc, Saturday.. Knowledge, Planning and Robotics 1.Knowledge 2.Types of knowledge 3.Representation of knowledge 4.Planning.
Human Computer Interaction Lecture 21 User Support
Knowledge Representation Techniques
Learning from Observations
Learning from Observations
Introduce to machine learning
Presented By S.Yamuna AP/CSE
Introduction to Data Structure
Data Queries Raster & Vector Data Models
Chapter 9 Structuring System Requirements: Logic Modeling
Learning from Observations
Chapter 9 Structuring System Requirements: Logic Modeling
Trees-2, Graphs Data Structures with C Chpater-6 Course code: 10CS35
Learning from Observations
Presentation transcript:

Class Presentations Guidelines:  You should go beyond the assigned material  Prepare for a full class presentation (50 minutes)  to me a draft of the powerpoint presentation at least 1 week before the due date  You are responsible for understanding the material you are presenting. If there are things you don't understand, look for the citations, and read the references needed for you to understand.  Be prepared for interruptions during your presentation. I want to understand your presentation very well; when I don't understand something I will ask.

Example Domains So far we have discussed domains for analysis tasks:  Help-desk systems  Medicine  Yes/No classification for waiting in a restaurant  Prediction of debt recovery (credit card) Domains for synthesis tasks:  Printer configuration domain  Machining domain  Military domain: non combatant evacuation operations

Mechanical Manufacturing Domain Process planning problem: to determine the methods and sequence of machining operations to produce a finish component to design specifications Machines Tools Sequence of steps Timing Routing/cost combinations Provision for alternate and concurrent operations

Mechanical Manufacturing Domain (II) I. diameter < 0.5” A. true position > tolerance > 0.01 Drill the hole 2. tolerance < 0.0 drill and ream B. true position < tolerance > 0.01 drill and finish bore 2. Tolerance < 0.01 drill, semi-bore, and finish bore…… III. …... drillingboringreaming

Mechanical Manufacturing Domain (III)

Noncombatant Evacuation Operations Domain Goal: Assist DoS to evacuate noncombatants, whose lives are in danger Characteristics: –Uncertainty; complex (200+ tasks); distributed –US Ambassador is senior authority!! Characteristics: –Uncertainty; complex (200+ tasks); distributed –US Ambassador is senior authority!! Planning: Responsibility of geographic combatants –Resources: Doctrine, DoS, EAP, etc. Planning: Responsibility of geographic combatants –Resources: Doctrine, DoS, EAP, etc.

Noncombatant Evacuation Operations Domain (II) Some Interesting Happenings: - During Operation Desert Shield - Mid air refueling: Fuel spill, inexperienced pilots - Buzzing the embassy the hostiles scattered - Transported evacuees to embassy (e.g., non-citizen ambassadors) - Night Vision Goggles & Problems with Lights - Pistol removed upon entry to a helicopter - Bribe to a Somali Major - Clown on board - Birth on-board; 281  282 evacuees Some Interesting Happenings: - During Operation Desert Shield - Mid air refueling: Fuel spill, inexperienced pilots - Buzzing the embassy the hostiles scattered - Transported evacuees to embassy (e.g., non-citizen ambassadors) - Night Vision Goggles & Problems with Lights - Pistol removed upon entry to a helicopter - Bribe to a Somali Major - Clown on board - Birth on-board; 281  282 evacuees

Case Representation Sources: –Chapter 3 (Book) – –

Contents of a Case Generally a case contains specific knowledge about a previous problem solving experience Typically a case contains the following information:  Problem/Situation  Solution  Adequacy (utility) Scope of the information:  Complete solution/partial solution  Detail or abstracted solution Representation formalism:  Attribute-value pairs  Structured representation: Objects, trees  High-order: predicate logic, plans (example: help-desk systems) (example: planning)

Information About the Problem/Situation A problem/situation may described:  A solved problem  An analyzed situation (example: diagnosis) (example: military domain) The description of a situation must contain all information necessary to determine if a case can be reused in other situations (the Eastern Exit Operation example) Situation/problem description may contain:  Goal of the case  Constraints and conditions  In general, any relevant information

Example of Case Conditions Title: Perform Long Range Troop Insertion (Eastern Exit) Goal: Determine Insertion Procedure Conditions: 1. Aerial refueling aircraft available 2. Helicopters equipped for aerial refueling 3. Pilots trained for aerial refueling … Conditions: 1. Aerial refueling aircraft available 2. Helicopters equipped for aerial refueling 3. Pilots trained for aerial refueling … Actions: 1. Embark troops in helicopters 2. Determine intermediate refueling point 3. Fly helicopters to refueling point 4. Continue helicopters to final destination Actions: 1. Embark troops in helicopters 2. Determine intermediate refueling point 3. Fly helicopters to refueling point 4. Continue helicopters to final destination

Information About the Solution The kind of information about the solution depends on the adaptation procedure:  Simple Yes or No  Sequence of actions  Complex structure including constrains and justifications (example: help-desk) (example: manufacturing)

Example of Complex Solution Mount Piece on the Lathe machine at position X, Y Rotate machine at Z speed Select drilling tool with M cm head diameter Select trajectory for the tool Justification: tool and speed determine possible trajectories

Information About Adequacy Feedback from the environment:  Was the solution applicable?  What was the cost of adapting this solution? (example: help-desk) (example: manufacturing) Example:  Is the diagnostic correct?  How long does it took to adapt the manufacturing plan?  What is the cost of the machining operations of the new plan?

Complete/Partial Solution This question arises primarily for synthesis tasks  If there are too many interdependencies between pieces of the solutions, cases will contain the whole solution. (example: manufacturing plans)  If pieces of the solutions are more or less independent, cases will contain those pieces. (example: military domain)

Detail/Abstract Solution This question arises primarily for synthesis tasks  If there are ways to abstract the concrete solutions and vice- versa, we could consider storing these abstracted solutions. Why? To increase the re-usability of the cases If such a way doesn’t exists, we are “stuck” with the concrete solutions

Attribute-Value Case Representation Case: a collection of attribute-value pairs Example: Each row in the wait-restaurant table is a case Examples in the IDT context correspond to cases Each attribute is from a certain type. For example:  Integer: all integers or an interval  Real: all numbers or an interval  Symbol: finite set of alternatives (e.g., Thai, Italian,…)  Hypertext: HTML (e.g., HICAP) Attributes can be the same for all cases or vary from case to case

Formalization Attributes: A 1, A 2,.., A n Types: T 1, T 2, …, T n Values a 1 in T 1, a 2 in T 2, …, a n in T n A case is defined as follows:  If all cases have the same number of attributes, a case is a vector: (a 1, …, a n ) in T 1  {unknown}  …  T n  {unknown}  If cases have a varying number of attributes, a case is a set: {A p = a p, …, A k = a k } (attributes that are not in the set are considered unknown) Unknown values is the main difference between a case and an example in the sense of IDT

Selection of Attributes Situation description:  Independence: Attributes should represent independent features whenever possible  Completeness: the attributes should be sufficient to determine if the case can be reused in a new situation  Minimalist: The only attributes that should be included in a case are those used in to compute similarity (ex: type of restaurant versus week day) (not always possible: patrons and day of the week are related)

Selection of the Types Selection of the types is defined by the elements needed to compute similarity Symbolic:  Ideal for a small number of alternatives (e.g., type of restaurant) Integer/Real  Ideal for measures and other numeric values  Computation of similarity is straightforward Text:  Ideal for unstructured information  Computation of similarity can be very difficult

Example Case 1 Front-light = doesn’t work Car-type = Golf II, 1.6 Year = 1993 Batteries = 13.6V … Symptoms: Solution: Diagnosis: Front-lights-safeguard = broken Help measures: “Replace front lights safeguard” Symbol: work,doesn’t work Symbol: Golf, Mercedes,… Symbol: 1960, …, 2002 Real: 1V … 30V Text Symbol: ok, broken

Homework Assignment 1.(ALL) Select a machine that you feel particularly familiar with it (e.g., your PC, the graphic card of your pc). List at least 10 attributes and their types that you feel are relevant to make a diagnosis of a failure for that machine 2.(ALL) Suppose that (1) you have a classification-task domain with many attributes (say attributes) and (2) there is a lot of data collected (basically a table with many rows, whose columns are the attributes). Suppose that you know there are many attributes that are not needed for the classification problem. Indicate how to determine a subset of relevant features

Tree Representation Structured representations are needed when there are multiple relations between elements of the problem

Objects and Classes An object class describes the structure of an object through a (finite) collection of attributes and their types An instance (or an object) of an object class assigns values of the corresponding type for each attribute in the class

Example (Objects and Classes) Front-light = doesn’t work Car-type = Golf II, 1.6 Year = 1993 Batteries = 13.6V … Front-light = doesn’t work Car-type = Golf II, 1.6 Year = 1993 Batteries = 13.6V … Instance: Entry # 314 Front-light: symbol Car-type : symbol Year: Symbol Batteries: Real … Front-light: symbol Car-type : symbol Year: Symbol Batteries: Real … Class: Symptoms

Relations Between Objects Relations between objects are important Typical kinds of relations:  Taxonomical relations: “is-a-kind-of” indicates abstraction/refinement relations between objects  Compositional relations: “is-a-part-of” indicates that objects are parts of other objects (example: car is a kind of transportation means) (example: motor is a part of a car)

Compositional Relations Car Fuel systemMotorElectrical system Carburator Exhaust … Compositional relations are described through relational attributes Relational Attributes are attributes whose values are objects

Example (Compositional Relation) Model: symbol Make : symbol Year: Symbol Motor: MotorC … Model: symbol Make : symbol Year: Symbol Motor: MotorC … Class: CarC SerialN: int Liter : real Carburator: CarbC … SerialN: int Liter : real Carburator: CarbC … Class: MotorC Model: Tercel Make : Toyota Year: 1991 Motor: … Model: Tercel Make : Toyota Year: 1991 Motor: … Instance: Toyota Tercel SerialN: Liter : 1.5 Carburator: … … SerialN: Liter : 1.5 Carburator: … … Instance: …

Taxonomical Relations Transportation Means Air trans.Land trans.Sea trans car Sport utility … Taxonomical relations are explicitly represented The subclass inherits all the attributes of the superclass

Example (Taxonomical Relation) Model: symbol Make : symbol Year: Symbol Price: int … Model: symbol Make : symbol Year: Symbol Price: int … Class: CarC Max speed: int horseP: int … Max speed: int horseP: int … Class: Land Transport Max speed: 100 mph horseP: … … Model: Tercel Make : Toyota Year: 1991 Price: $2000 … Max speed: 100 mph horseP: … … Model: Tercel Make : Toyota Year: 1991 Price: $2000 … instance: ToyotaTercel

Analysis of Object-Oriented Case Representations Advantages:  Structured and natural in many domains  Relations between objects are explicitly represented  More compact storage compared to with attribute-values  Structured relations can be used to define similarity  Disadvantages:  Similarity computation and retrieval can be time costly  Time order cannot be represented  Example domain: design and configuration Example domain: planning

Predicate Logic Representation Problem/Solution from a case can be represented through predicates: Front-light = doesn’t work Car-type = Golf II, 1.6 Year = 1993 Batteries = 13.6V … Symptoms: Solution: Diagnosis: Front-lights-safeguard = broken Help measures: “Replace front lights safeguard” Case( symptoms(frontLight(dw), carType(GolfII_1.6), year(1993), batteries(13.6),…), diagnosis(broken(fls), measures(rfls))) Case: term predicate

Predicate Logic Representation (cont’d) Attribute-value pairs representation of cases can be represented as predicates (each attribute is represented as a term and a predicate “encapsulates” all terms)  Tree can also be represented as predicates (each node is a predicate and the links are terms)  Object representations can also be represented as predicates (terms represent the hierarchical relations)

Predicate Logic Representation (cont’d) Advantages:  As flexible as it gets (I am exaggerating)  Complex structural relations can be represented  Can take advantage of inference mechanism (i.e., prolog) Disadvantages:  Computing similarity can be very complicated  Inference procedures are frequently very time costly  SAT is NP-complete.

Formulas (SAT): Definition Definition. A Boolean formula is defined recursively as follows: A Boolean variable is a Boolean formula If  1 and  2, are Boolean formulas then:  (  1   2 )  (  1   2 )  (  1   2 ) are also Boolean formulas If  is a Boolean formula then ¬(  ) is a Boolean formula Assume that there are no redundancies in parenthesis Definition. (SAT) Given a Boolean formula , is there an assignment of the variables in  that makes the formula true? Example: ((x  y)  ¬x)  y

Graph Representation Graph representations are useful in many domains: Data flow Planning Query answer Mount Piece on the Lathe machine at position X, Y Rotate machine at Z speed Select drilling tool with M cm head diameter Select trajectory for the tool Can’t be represented as a tree

Analysis of Graph Representations Advantages:  Structured and natural in many domains  Relations between objects are explicitly represented  Structured relations can be used to define similarity Disadvantages:  Similarity computation and retrieval can be time costly  Graph-Subgraph Isomorphism is NP-complete!

Graphs: Definition G = (V, E) Vertices (nodes) Edges (arcs) Edges are a subset of V  V {(v,v’) : v and v’ are in V} We also write v  v’ instead of (v.v’)

Subgraphs Given a graph G = (V, E) and a graph G’ = (V’, E’), G is a subgraph of G’ if: V  V’ E  E’ Every element in the left set is an element in the right set

Graph-Subgraph Isomorphism Two graphs G1 = (V1,E1) and G2 = (V2,E2) are isomorphic if a bijective function f: V1  V2 exists such that: –If (u,v) is in E1 then (f(u),f(v)) is in E2 –If (u’,v’) is in E2 then (f(u’),f(v’)) is in E1 Graph-Subgraph Isomorphism problem is NP-complete: Given two graphs G1 and G2 is G1 isomorphic to a subgraph of G2? Example of 2 isomorphic graphs: