Tutorial 3 Empirical evaluation in AI

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Science Fair Project 2015.
The Scientific Method.
Observation Tools Overview and User Guide. Does the need to determine the impact a student's ADHD is having in the classroom or quantitatively describe.
Technical Writing II Acknowledgement: –This lecture notes are based on many on-line documents. –I would like to thank these authors who make the documents.
Intelligent Agents What is the basic framework we use to construct intelligent programs?
UMass Lowell Computer Science Advanced Algorithms Computational Geometry Prof. Karen Daniels Spring, 2004 Project.
July 16, Introduction to CS II Data Structures Hongwei Xi Comp. Sci. Dept. Boston University.
Advanced Research Methodology
Explain the steps in the scientific method.
Assignments  1. Grade graphs and conclusions.  2. Introduction to Reaction Time.  3. Begin Pre-Lab of Reaction Time.
 For the IB Diploma Programme psychology course, the experimental method is defined as requiring: 1. The manipulation of one independent variable while.
 To complete and submit a science based investigation according to the following guidelines and procedures.
Tables and Figures. The “Big Picture” For other scientists to understand the significance of your data/experiments, they must be able to: understand precisely.
Science Project Type your project title here Your name Mueller Park Junior High You are welcome to choose any design style for your project!
Notes from Instructor Regarding Term Papers. Howard Godfrey UNC Charlotte Fall, 2007.
ENG - W232 The Formal Business Report. Audience supervisors at Air America.
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
“What Makes a Good Science/Technology Project” Derresa Davis-Tobin
Copyright 2010, The World Bank Group. All Rights Reserved. COMMUNICATION AND DISSEMINATION, PART 2 DEVELOPING DISSEMINATION PRODUCTS 1.
Game tree search Chapter 6 (6.1 to 6.3 and 6.6) cover games. 6.6 covers state of the art game players in particular. 6.5 covers games that involve uncertainty.
Science Fair Project Type your project title here Your name Your teacher’s name Your school.
Statistical Analysis: Chi Square
Type your project title here Your name Mueller Park Junior High
Chapter 1 Introduction and Data Collection
Writing Scientific Research Paper
The Scientific Method & Experimental Design
AI Classnotes #5, John Shieh, 2012
Parts of a Lab Write-up.
Lab Roles and Lab Report
Heuristic Search A heuristic is a rule for choosing a branch in a state space search that will most likely lead to a problem solution Heuristics are used.
For Monday Chapter 6 Homework: Chapter 3, exercise 7.
Department of Computer Science
Report Writing.
Just What Is Science Anyway???
Finding Heuristics Using Abstraction
Understanding Randomness
CS Fall 2016 (Shavlik©), Lecture 8, Week 5
Hypothesis Theory examples.
Technical Report Writing
Lab Write-ups.
Objective of This Course
CS 188: Artificial Intelligence Fall 2008
Academic Communication Lesson 3
Problem Solving and Searching
Introduction to CS II Data Structures
Lab Reports What goes into a lab report? Why do we need a report?
The Scientific Method.
Lectures on Graph Algorithms: searching, testing and sorting
Tutorial 1 Uninformed Search
Assignment Operators Topics Increment and Decrement Operators
Problem Solving and Searching
Understanding Standards:
Assignment Operators Topics Increment and Decrement Operators
BEST FIRST SEARCH -OR Graph -A* Search -Agenda Search CSE 402
Artificial Intelligence
Tutorial: Writing a Lab Report CHEM 1154
Project 1 Guidelines.
The Scientific Method.
Synthesis.
Business Statistics: A First Course (3rd Edition)
HCI – Lesson 7 USABILITY Homework Prof. Garzotto.
EVALUATION OF COUNTERMEASURES PDCA
CMSC 471 Fall 2011 Class #4 Tue 9/13/11 Uninformed Search
EVALUATION OF COUNTERMEASURES PDCA
Creating Additional Input Items
Assignment Operators Topics Increment and Decrement Operators
The Scientific Method & Experimental Design
Advanced Operating System Maekawa vs. Ricart-Agrawala By Rizal M Nor
Scalable light field coding using weighted binary images
Presentation transcript:

Tutorial 3 Empirical evaluation in AI CS 236501 Introduction to AI Tutorial 3 Empirical evaluation in AI

Intro. to AI – Tutorial 3 – By Nela Gurevich Agenda Importance of empirical evaluation in AI Empirical evaluation guidelines Designing and running an experiment Experimental results presentation Discussing the experimental results 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Empirical evaluation in AI Empirical = Exploratory + Experimental We wish to explain the behavior of an algorithm We conduct experiments to prove our explanation Sometimes, the opposite occurs – we make conclusion about an algorithm behavior from the results of an experiment Empirical evaluation is an important part in AI research Many algorithms do not have a sound theoretical proof for their properties 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Empirical evaluation in AI We use experiments to prove our theories It is important to design experiments properly It is important to clearly present our experimental results Clear presentation and explanation of experimental results are a major part of any AI research and of this course Home Assignments! We will guide you in the home assignments 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Designing an experiment: first step What would you like to do? Examine the effect of a certain parameter on an algorithm performance Example: Beam search with beam width = k We would like to examine the effect of k on algorithm performance Compare the performance of two (or more) algorithms Example: DFS vs. BFS on Tic-Tac-Toe puzzle 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Parameter effect on algorithm performance Fix all algorithm parameters, except the one that is inspected Example: Effect of beam width (k) on Beam Search Fix the problem difficulty Run the experiment for different values of the inspected parameter Example: Run Beam Search with k = {1, 5, 10, 50, 100, 500, 1000} – small, medium and large values Important: choosing the parameter values range correctly K = {1, 5, 7} or {k = 1001, 1005, 1007} is a bad decision The range of the values may vary for different problems 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Comparing two (or more) algorithms Run two algorithms with fixed parameters For proper comparison, avoid all possible differences Example: DFS vs. BFS on Tic-Tac-Toe Run the algorithms on problems of the same difficulty, or even on the same problem Example: Comparing two heuristics Use the same search algorithm with the same parameters Use problems of the same difficulty (or same problems) 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Designing an experiment: Random Elements Sometimes random elements affect the performance of an algorithm, for example: Random initial problems In Hill-Climbing search, when two successors have the same heuristic value, one is chosen at random In such cases, the experiment should be repeated more than once, and results should be averaged Important: initialize the random seed in your program, to avoid repeating the same “random” experiment over and over srand( (unsigned)time( NULL ) ); // in C 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Intro. to AI – Tutorial 3 – By Nela Gurevich Running an experiment Trace the experiment For debugging purposes Record all data that seems important Saves time later (no need to re-run experiments to get the needed data) Example: DFS Record number of expanded nodes, execution time, whether solution was found, solution length. Use batch (script) files Easy to reproduce results Save time near computer 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Presenting experimental results Use tables or graphs – visual methods, easy to understand First of all, decide what is the purpose of the table/graph you want to present 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Presenting experimental results Example: I run Beam Search on n random problems. Beam Search is not complete: sometimes it finds a solution to a problem, and sometimes it does not. I would like to present a graph that shows the effect of the beam width (k) parameter in Beam Search on the percentage of problems for which solution is found. 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Presenting experimental results – common guidelines Graphs/tables should be very easy to understand No additional mathematical calculations should be required to understand the graph/table Run Alg 1 2 3 4 5 Alg1 100 140 200 90 20 Alg2 30 40 210 95 60 X Table1: Solution length found by Alg1 and Alg2 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Presenting experimental results – common guidelines Avoid combining large amount of data in one graph/table Run Results 1 2 3 4 5 Alg1 expanded nodes 100 140 200 90 20 Alg1 solution length 7 9 10 Alg1 – time 22.2 34.2 55.4 33.2 11.1 Alg2 expanded nodes 150 110 67 99 105 Alg2 solution length 6 Alg2 - time 12.4 41.2 31.3 22 23.5 X 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Presenting experimental results: graphs Name the axis Scale the axis properly Use graphs for continuous values Use graphs for discrete values 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Presenting experimental results: tables Name the data columns properly Specify the measurement units used in the table Use short tables When long, detailed tables need to be attached, attach them as appendices Summarize with averages, when needed 87 110 Avg 60 95 210 40 30 Alg2 20 90 200 140 100 Alg1 5 4 3 2 1 Problem Alg Table1: Solution length found by Alg1 and Alg2 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Discussing the experimental results It is essential to discuss the obtained experimental results in words ! Verbal discussion with visual presentation of the results is the main part of the empirical evaluation of algorithms 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Discussing the experimental results: guidelines Support every conclusion you make with data that proves this conclusion Insert graphs/tables in appropriate places in the text for easy reference Use short and clear sentences Avoid using many adjectives Avid combining many conclusions in one sentence 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Discussing the experimental results: guidelines Not always the conclusion is that Algorithm1 is clearly better than Algorithm2 on a given problem Discuss advantages/disadvantages of each algorithm Compare the different performance elements of the algorithms Example: Alg1 expands twice as many nodes as Alg2, on average, Alg1 and Alg2 take the same time to run on average Not always the conclusion from an experiment is that Algorithm1 is better than Algorithm2 in general Discuss how the algorithms are affected by the problem on which they are tested 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Intro. to AI – Tutorial 3 – By Nela Gurevich Summary Designing and running an experiment should be done carefully Present experimental results with graphs and tables. It is important that the visual presentation of the results is clear to the reader Findings should be summarized with a verbal discussion of the experimental results, backed up with visual presentation of results 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich

Intro. to AI – Tutorial 3 – By Nela Gurevich Final Tip Print homework on both sides of the paper – save trees 14-Jan-19 Intro. to AI – Tutorial 3 – By Nela Gurevich