Byron Hood | version 0.4 Computer Systems Lab Project2007-2008 Sign Language Recognition using Webcams.

Slides:



Advertisements
Similar presentations
Inside an XSLT Processor Michael Kay, ICL 19 May 2000.
Advertisements

OpenCV Introduction Hang Xiao Oct 26, History  1999 Jan : lanched by Intel, real time machine vision library for UI, optimized code for intel 
QR Code Recognition Based On Image Processing
ECE 454 Computer Systems Programming Compiler and Optimization (I) Ding Yuan ECE Dept., University of Toronto
Wavelets Fast Multiresolution Image Querying Jacobs et.al. SIGGRAPH95.
Overview Motivations Basic static and dynamic optimization methods ADAPT Dynamo.
Low Complexity Keypoint Recognition and Pose Estimation Vincent Lepetit.
Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
To solve the problem of limited documentation and example code available on the subject of biometrics. Research that is done in this field can not directly.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
Lecture 5 Template matching
A new face detection method based on shape information Pattern Recognition Letters, 21 (2000) Speaker: M.Q. Jing.
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
Video Object Tracking and Replacement for Post TV Production LYU0303 Final Year Project Spring 2004.
A Study of Approaches for Object Recognition
System Design and Analysis
Information Retrieval IR 4. Plan This time: Index construction.
CSE 160 – Lecture 10 Programs 1 and 2. Program 1 Write a “launcher” program to specify exactly where programs are to be spawned, gather output, clean.
Cindy Song Sharena Paripatyadar. Use vision for HCI Determine steps necessary to incorporate vision in HCI applications Examine concerns & implications.
Hand Movement Recognition By: Tokman Niv Levenbroun Guy Instructor: Todtfeld Ari.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Oral Defense by Sunny Tang 15 Aug 2003
Presenter: Stefan Zickler
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Knowledge Systems Lab JN 8/24/2015 A Method for Temporal Hand Gesture Recognition Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Development of Image Processing Based Feedback Systems for Interactive Gaming Using Non-Traditional Controllers Adam Hedji Mantas Pulinas Philip San III.
By Dominik Seifert B Overview Data Alignment Hashtable MurMurHash Function “Stupid Parallel Hashing” Lookup The Complete Algorithm The Little.
There are different types of translator. An Interpreter Interpreters translate one instruction at a time from a high level language into machine code every.
1 Lecture 19: Motion Capture. 2 Techniques Morphing Motion Capture.
Reconstructing 3D mesh from video image sequences supervisor : Mgr. Martin Samuelčik by Martin Bujňák specifications Master thesis
Extracting Barcodes from a Camera-Shaken Image on Camera Phones Graduate Institute of Communication Engineering National Taiwan University Chung-Hua Chu,
Python – Part 1 Python Programming Language 1. What is Python? High-level language Interpreted – easy to test and use interactively Object-oriented Open-source.
Presented by On the Path to Petascale: Top Challenges to Scientific Discovery Scott A. Klasky NCCS Scientific Computing End-to-End Task Lead.
Optimization_fff as a Data Product J.McTiernan HMI/AIA Meeting 16-Feb-2006.
Data Capture Understand the concept of data encoding. Describe methods of data capture and identify appropriate contexts for their.
Wei Dang Kevin Ellsworth Cory Shirts.  Goal: have a user interface to allow user text input using sign language digits and letters ◦ User interface ◦
CFD Refinement By: Brian Cowley. Overview 1.Background on CFD 2.How it works 3.CFD research group on campus for which problem exists o Our current techniques.
SDSS photo-z with model templates. Photo-z Estimate redshift (+ physical parameters) –Colors are special „projection” of spectra, like PCA.
High Speed Digital Systems Lab. Agenda  High Level Architecture.  Part A.  DSP Overview. Matrix Inverse. SCD  Verification Methods. Verification Methods.
CS 460/660 Compiler Construction. Class 01 2 Why Study Compilers? Compilers are important – –Responsible for many aspects of system performance Compilers.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
TEMPLATE DESIGN © E-Eye : A Multi Media Based Unauthorized Object Identification and Tracking System Tolgahan Cakaloglu.
1 ENGI 2420 Structured Programming (Lab Tutorial 8) Memorial University of Newfoundland.
CSCI1600: Embedded and Real Time Software Lecture 33: Worst Case Execution Time Steven Reiss, Fall 2015.
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
Turning a Mobile Device into a Mouse in the Air
Wire Detection Version 2 Joshua Candamo Friday, February 29, 2008.
Byron Hood | version 0.3 Computer Systems Lab Project Sign Language Recognition.
Camera calibration from multiple view of a 2D object, using a global non linear minimization method Computer Engineering YOO GWI HYEON.
ICSE 2011 Implementation Update Aaron Valdez Liane Nakamura Deeksha Malhotra Phase Manager: Max Mueller.
Whole Slide Image Stitching for Osteosarcoma detection Ovidiu Daescu Colaborators: Bogdan Armaselu and Harish Babu Arunachalam University of Texas at Dallas.
Level Set Segmentation ~ 9.37 Ki-Chang Kwak.
Over the recent years, computer vision has started to play a significant role in the Human Computer Interaction (HCI). With efficient object tracking.
Using Hand Gestures for Alternative User Verification
CSC391/691 Intro to OpenCV Dr. Rongzhong Li Fall 2016
Lecture 26 Hand Pose Estimation Using a Database of Hand Images
CSCI1600: Embedded and Real Time Software
Efficient Deformable Template Matching for Face Tracking
Register Pressure Guided Unroll-and-Jam
Online Graph-Based Tracking
The Implementation of a Glove-Based User Interface
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Applications of Image Processing to Sign Language Recognition
Presented by Xu Miao April 20, 2005
Working with the Compute Block
CSCI1600: Embedded and Real Time Software
Sign Language Recognition With Unsupervised Feature Learning
Presentation transcript:

Byron Hood | version 0.4 Computer Systems Lab Project Sign Language Recognition using Webcams

Byron Hood | version 0.4 Computer Systems Lab Project Overview Average person’s typing speed  Composing: ~19 words per minute  Transcribing: ~33 words per minute Sign speaker  Full sign language: ~200 words per minute  Spelling out: estimate: 50 words per minute  Up to 3x faster

Byron Hood | version 0.4 Computer Systems Lab Project Purpose and Scope Native signers can input faster Benefits:  Hearing & speaking disabled  Sign interpreters Just letters & numbers for now  Additional complexity too much to handle  Would require smaller distinctions

Byron Hood | version 0.4 Computer Systems Lab Project Research Related projects  Using mechanical gloves, colored gloves  Tracking body parts  Neural network-based application Still images: 92% accuracy Motion: less than 50% accuracy  Feature vector-based application Also about 90% accuracy on stills No motion tests

Byron Hood | version 0.4 Computer Systems Lab Project More Research Image techniques:  Edge detection (Robert’s Cross)‏  Line detection (Hough transform)‏  Line interpretation methods Chaining groups of lines Macro-scale templates  Residual math Memory management

Byron Hood | version 0.4 Computer Systems Lab Project Testing Model Human interaction necessary General testing model: ~/syslab-tech $ \ >./main images/hand.png [DEBUG] Edge detect time: 29 ms Errors: 0 Warnings: 0

Byron Hood | version 0.4 Computer Systems Lab Project Program Architecture Webcam capture Edge detection Line detection Interpretation Attribute matching IMAGE FEATURE OUTLINE LINE LIST FINGER POSITONS SERVER PROCESS

Byron Hood | version 0.4 Computer Systems Lab Project Edge Detection Results Results:  Outlines the important edges and not much besides  Robert’s Cross balances detection of major and minor lines Original image (800 x 703)‏ Final image (800 x 703)‏

Byron Hood | version 0.4 Computer Systems Lab Project Cropping Results Original (800 x 703)‏ Result (633 x 645)‏ Remove useless rows & columns with no features Better contrast Very large optimization  Memory savings  Area difference means order n 2

Byron Hood | version 0.4 Computer Systems Lab Project Line detection Finished!  Recently finished tweaking sensitivities  Still a few potential memory issues

Byron Hood | version 0.4 Computer Systems Lab Project Line Grouping Part of line detection Large optimization  Iterate over an order of magnitude fewer items  Easier to handle, more pronounced trends Examples of line groups, called “chains”

Byron Hood | version 0.4 Computer Systems Lab Project Line Interpretation Chaining groups of lines Templates  Generation  Template-based comparison Line residuals  Use point coordinate averages  Calculate average offset from average  Easy to find height of finger

Byron Hood | version 0.4 Computer Systems Lab Project Sample Output After a typical run: 10 days, 6:12:19 until graduation!! ~/syslab-tech/src $./main hand.png Edge detection took 0.04 sec Image cropping took 0.00 sec Line detection took 0.17 sec (detected 1424 lines)‏ Line chaining took 0.25 sec (detected 130 chains)‏ Getting orientation took src (1 => ORIENTATION_FORWARD)‏ Getting pinky pos. took 0.00 sec (2 => FINGER_BENT)‏ Getting ring pos. took 0.00 sec (2 => FINGER_BENT)‏ Getting middle pos. took 0.01 sec (2 => FINGER_BENT)‏ Getting index pos. took sec (4 => FINGER_TUCKED)‏ Overall process took 0.47 sec [TOTALCOUNT] allocated: , freed: ; leaked:

Byron Hood | version 0.4 Computer Systems Lab Project Timing Timing data from runs:  To nearest hundredth of a second (1)Edge detection: 0.04 sec (2)Image cropping: 0.00 sec (3)Line detection: 0.17 sec (4)Line chaining: 0.25 sec (5)orientation: 0.08 sec (6)Pinky finger: 0.00 sec (7)Ring finger: 0.00 sec (8)Middle finger: 0.01 sec (9)Index finger: 0.00 sec (10)Overall process: 0.47 sec  A little slow considering goal of real-time

Byron Hood | version 0.4 Computer Systems Lab Project The Mysterious Future Perfect line interpretation Work on memory management  Am leaking large quantities (~50K) of memory  Aggressive profiling needed Finish camera-computer interaction  Device control must be precise, picky

Byron Hood | version 0.4 Computer Systems Lab Project The End! Code will be available to future years Contact me for a copy: