Gesture recognition techniques. Definitions Gesture – some type of body movement –a hand movement –Head movement, lips, eyes Depending on the capture.

Slides:



Advertisements
Similar presentations
Huffman code and ID3 Prof. Sin-Min Lee Department of Computer Science.
Advertisements

Sensor-Based Abnormal Human-Activity Detection Authors: Jie Yin, Qiang Yang, and Jeffrey Junfeng Pan Presenter: Raghu Rangan.
Using Data Flow Diagrams
GP Applications Two main areas of research Testing genetic programming in areas other techniques have been applied to. Applying genetic programming to.
Final Exam: May 10 Thursday. If event E occurs, then the probability that event H will occur is p ( H | E ) IF E ( evidence ) is true THEN H ( hypothesis.
Computer vision: models, learning and inference
Learn how to make your drawings come alive…  Lecture 3: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Learn how to make your drawings come alive…  NEW COURSE: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Region labelling Giving a region a name. Image Processing and Computer Vision: 62 Introduction Region detection isolated regions Region description properties.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
SubSea: An Efficient Heuristic Algorithm for Subgraph Isomorphism Vladimir Lipets Ben-Gurion University of the Negev Joint work with Prof. Ehud Gudes.
Learn how to make your drawings come alive…  NEW COURSE: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Learn how to make your drawings come alive…  NEW COURSE: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Learn how to make your drawings come alive…  Lecture 2: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Lecturing with Digital Ink Richard Anderson University of Washington.
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
L1 Sketch Tools Advanced HCI Beryl Plimmer. Agenda What’s the difference between a keyboard and a pencil?
Data Mining Techniques
Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.
Ink and Gesture recognition techniques. Definitions Gesture – some type of body movement –a hand movement –Head movement, lips, eyes Depending on the.
Short Introduction to Machine Learning Instructor: Rada Mihalcea.
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Knowledge Discovery and Delivery Lab (ISTI-CNR & Univ. Pisa)‏ www-kdd.isti.cnr.it Anna Monreale Fabio Pinelli Roberto Trasarti Fosca Giannotti A. Monreale,
1 © Unitec New Zealand DE4401&APTE 5601 Topic 4 N ETWORK A NALYSIS.
1 Sketch tools and Related Research Rachel Patel.
Combining geometry and domain knowledge to interpret hand-drawn diagrams As Presented By: Andrew Campbell Christopher Dahlberg.
Research Projects 6v81 Multimedia Database Yohan Jin, T.A.
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
Data Management and Database Technologies 1 DATA MINING Extracting Knowledge From Data Petr Olmer CERN
Stefan Mutter, Mark Hall, Eibe Frank University of Freiburg, Germany University of Waikato, New Zealand The 17th Australian Joint Conference on Artificial.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
Map of the Great Divide Basin, Wyoming, created using a neural network and used to find likely fossil beds See:
Machine Learning Chapter 5. Artificial IntelligenceChapter 52 Learning 1. Rote learning rote( โรท ) n. วิถีทาง, ทางเดิน, วิธีการตามปกติ, (by rote จากความทรงจำ.
Chapter 12 Object Recognition Chapter 12 Object Recognition 12.1 Patterns and pattern classes Definition of a pattern class:a family of patterns that share.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
Network Community Behavior to Infer Human Activities.
Graphs A ‘Graph’ is a diagram that shows how things are connected together. It makes no attempt to draw actual paths or routes and scale is generally inconsequential.
Gesture Input and Gesture Recognition Algorithms.
Feature Point Detection and Curve Approximation for Early Processing of Free-Hand Sketches Tevfik Metin Sezgin and Randall Davis MIT AI Laboratory.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
CfE Advanced Higher Physics
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Knowledge Discovery and Data Mining 19 th Meeting Course Name: Business Intelligence Year: 2009.
1 SWE Introduction to Software Engineering Lecture 14 – System Modeling.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Pen Based User Interface II CSE 481b January 25, 2005.
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
Real-Time Hierarchical Scene Segmentation and Classification Andre Uckermann, Christof Elbrechter, Robert Haschke and Helge Ritter John Grossmann.
Sketch Tools L2 Advanced HCI. Agenda What is the problem with computer-based design tools? Why a toolkit? Framework Implementation Does it work? What.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Gesture Input and Gesture Recognition Algorithms
Chapter 13 Editing and Topology.
CS 9633 Machine Learning Support Vector Machines
Automated scoring of student trees
Automated scoring of student trees
Map of the Great Divide Basin, Wyoming, created using a neural network and used to find likely fossil beds See:
K Nearest Neighbor Classification
Craig Schroeder October 26, 2004
Copyright © Cengage Learning. All rights reserved.
Theory of Computation Turing Machines.
Lecture 31: Graph-Based Image Segmentation
Spreadsheets, Modelling & Databases
Presentation transcript:

Gesture recognition techniques

Definitions Gesture – some type of body movement –a hand movement –Head movement, lips, eyes Depending on the capture this could be –Digital ink –Accelerometer data –Actual body movement detected by vision analysis (ie what the vision group do With digital ink –Stroke – time series of x,y points may include pressure and pen tilt data –Sometime people use the term ‘gesture’ to mean an editing stroke – delete, cut, copy etc

More definitions Bounding box, the smallest enclosing rectangle Distance is measured in himetric units =.01mm

Dissecting a diagram Components –Nodes Contain label –Arch/edge Line and arrow Semantic meaning –Actions –Connections –Directed flow

What are the components here? What is the semantic meaning?

Where to start? Step 1 is dividing writing and drawing because there is a fundamental semantic are different –At least for languages that use an alphabet – languages such as Chinese there is some relationship The Microsoft OS (tablet and vista) has a pretty good writing recognizer It has a ‘divider’ that will separate writing from drawing –Trouble is it classifies nearly everything as writing (about 80%) How do you tell writing from drawing?

How do you find a better way? Look for effective ink features –We found 47! Pressure Time Intersections Curvature OS values –We have another ~20 on the list we have thought of since

Time Features FeatureDescriptionOrigin Total durationTotal duration of the stroke from pen up to pen down.(Rubine 1991) Maximum speedMaximum speed when drawing the stroke.Adapted from (Rubine 1991) Minimum speedMinimum speed when drawing the stroke. Average SpeedMean average speed when drawing the stroke. Time from last strokeThe time between the current stroke and the previous stroke in the sketch. Not applicable to the first stroke of a diagram.New Time till next strokeThe time between the current stroke and the next stroke in the sketch. Not applicable to the last stroke of a diagram. Speed from last strokeSpeed (distance/time) between the current stroke and the previous stroke in the sketch. Not applicable to the first stroke of a diagram. Speed to next strokeSpeed (distance/time) between the current stroke and the next stroke in the sketch. Not applicable to the last stroke of a diagram. # Speed minimaThe number of extreme minima in the speed values for the stroke, this excludes the minima that occur at the beginning and end of the stroke for pen up/down events. Adapted from (Sezgin, Stahovich et al. 2001) From: Patel, Rachael. 2007, Exploring better techniques for diagram recognition, MSc Thesis, UoA

Data mining techniques R – statistical programming, used to partition the data Weka – a range of different techniques all in the one package.

Classification Tree Created using statistical partitioning on data set from about 30 people

This is an improvement Increased % correct and decreased % incorrect About 80% correct How can we do better?

Features and Algorithms There are two parts to this divider The features – those most discriminating from the 47 examined The algorithm – a tree is a blunt instrument – more fuzziness is needed and also some idea of certainty –Probably the future down the tree the less confidence there is in the result

Context The spatial relationship between strokes Semantic Rules – could this possibly be a character/drawing - containment

Shape recognition There is quite a small set of basic shapes Most common approach Rubine’s Algorithm[1] –13 features – defined by heuristics –Hidden Markoff model (statistical pattern matching) This looks at single stroke –It isn’t too difficult to join strokes [1] Rubine, D. Specifying gestures by example. in Proceedings of Siggraph ' : ACM

Putting the bits together How would we classify these? Types Spatial relationships –Containment –Adjacency Connectivity

Inkkit component recognizer Analyses relationships of shapes in library Does similar analysis on diagram Builds a huge tree of all possible relationships, each with a probability Parses the tree looking for most probably component Assigns that, removes it from the tree and searches again.

Other techniques Fuzzy logic User written rules Semantic networks Bayesian networks All of these rely on calculated ink features –How people arrived at the particular features they have chosen is rarely stated

Data Repositories for Recognition Problem description Data Collection

Dataset generation Data generation The Data