Converting sign language gestures from digital images to text

Slides:



Advertisements
Similar presentations
QR Code Recognition Based On Image Processing
Advertisements

CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
Computational Biology, Part 23 Biological Imaging II Robert F. Murphy Copyright  1996, 1999, All rights reserved.
E.G.M. PetrakisImage Segmentation1 Segmentation is the process of partitioning an image into regions –region: group of connected pixels with similar properties.
Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.
Image Processing in Matlab An Introductory Approach by Sabih D. Khan
July 27, 2002 Image Processing for K.R. Precision1 Image Processing Training Lecture 1 by Suthep Madarasmi, Ph.D. Assistant Professor Department of Computer.
1 Video Processing Lecture on the image part (8+9) Automatic Perception Volker Krüger Aalborg Media Lab Aalborg University Copenhagen
Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
Department of Electrical and Computer Engineering He Zhou Hui Zheng William Mai Xiang Guo Advisor: Professor Patrick Kelly ASLLENGE.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
1 Preprocessing for JPEG Compression Elad Davidson & Lilach Schwartz Project Supervisor: Ari Shenhar SPRING 2000 TECHNION - ISRAEL INSTITUTE of TECHNOLOGY.
Segmentation Divide the image into segments. Each segment:
Traffic Sign Recognition Jacob Carlson Sean St. Onge Advisor: Dr. Thomas L. Stewart.
CSSE463: Image Recognition Day 30 Due Friday – Project plan Due Friday – Project plan Evidence that you’ve tried something and what specifically you hope.
Cindy Song Sharena Paripatyadar. Use vision for HCI Determine steps necessary to incorporate vision in HCI applications Examine concerns & implications.
Highlights Lecture on the image part (10) Automatic Perception 16
Computer Vision Basics Image Terminology Binary Operations Filtering Edge Operators.
Hand Movement Recognition By: Tokman Niv Levenbroun Guy Instructor: Todtfeld Ari.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
California Car License Plate Recognition System ZhengHui Hu Advisor: Dr. Kang.
Real-Time Face Detection and Tracking Using Multiple Cameras RIT Computer Engineering Senior Design Project John RuppertJustin HnatowJared Holsopple This.
Face Detection and Neural Networks Todd Wittman Math 8600: Image Analysis Prof. Jackie Shen December 2001.
ICBV Course Final Project Arik Krol Aviad Pinkovezky.
Knowledge Systems Lab JN 8/24/2015 A Method for Temporal Hand Gesture Recognition Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
CS 101 – Sept. 2 Chapter 3: data representation Many kinds of data  all in binary Today: integers –Unsigned –Signed Later: text, images, code, etc.
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
Knowledge Systems Lab JN 9/10/2002 Computer Vision: Gesture Recognition from Images Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Hand Gesture Recognition System for HCI and Sign Language Interfaces Cem Keskin Ayşe Naz Erkan Furkan Kıraç Özge Güler Lale Akarun.
Data Representation CS280 – 09/13/05. Binary (from a Hacker’s dictionary) A base-2 numbering system with only two digits, 0 and 1, which is perfectly.
A Method for Hand Gesture Recognition Jaya Shukla Department of Computer Science Shiv Nadar University Gautam Budh Nagar, India Ashutosh Dwivedi.
Hands segmentation Pat Jangyodsuk. Motivation Alternative approach of finding hands Instead of finding bounding box, classify each pixel whether they’re.
Application in Computer Vision Final Project Nir Slakman, Oren Zur and Noam Ben-Ari.
DEVELOPMENT OF ALGORITHM FOR PANORAMA GENERATION, AND IMAGE SEGMENTATION FROM STILLS OF UNDERVEHICLE INSPECTION Balaji Ramadoss December,06,2002.
Morphological Image Processing
出處: Signal Processing and Communications Applications, 2006 IEEE 作者: Asanterabi Malima, Erol Ozgur, and Miijdat Cetin 2015/10/251 指導教授:張財榮 學生:陳建宏 學號: M97G0209.
Wei Dang Kevin Ellsworth Cory Shirts.  Goal: have a user interface to allow user text input using sign language digits and letters ◦ User interface ◦
ECE 8443 – Pattern Recognition EE 3512 – Signals: Continuous and Discrete Objectives: Spectrograms Revisited Feature Extraction Filter Bank Analysis EEG.
CS 376b Introduction to Computer Vision 03 / 21 / 2008 Instructor: Michael Eckmann.
Presented By: ROLL No IMTIAZ HUSSAIN048 M.EHSAN ULLAH012 MUHAMMAD IDREES027 HAFIZ ABU BAKKAR096(06)
Week 2 REU Nolan Warner. Overview This weeks progress/projects Things learned Tentative research topics.
Eurecom, 6 Feb 2007http://biobimo.eurecom.fr Project BioBiMo 1.
JASON BANICH ADVISOR: DR. JOHN SENG Crosswalk Detection via Computer Vision.
1 Eye Detection in Images Introduction To Computational and biological Vision Lecturer : Ohad Ben Shahar Written by : Itai Bechor.
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot Yinxiao Li and Stanley T. Birchfield The Holcombe Department of Electrical and Computer.
Image Segmentation by Histogram Thresholding Venugopal Rajagopal CIS 581 Instructor: Longin Jan Latecki.
Magic Camera Master’s Project Defense By Adam Meadows Project Committee: Dr. Eamonn Keogh Dr. Doug Tolbert.
CS 376b Introduction to Computer Vision 03 / 31 / 2008 Instructor: Michael Eckmann.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 4 – Audio and Digital Image Representation Klara Nahrstedt Spring 2010.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
EE368: Digital Image Processing Bernd Girod Leahy, p.1/15 Face Detection on Similar Color Images Scott Leahy EE368, Stanford University May 30, 2003.
Vision & Image Processing for RoboCup KSL League Rami Isachar Lihen Sternfled.
License Plate Recognition of A Vehicle using MATLAB
Over the recent years, computer vision has started to play a significant role in the Human Computer Interaction (HCI). With efficient object tracking.
Coin Recognition Using MATLAB - Emad Zaben - Bakir Hasanein - Mohammed Omar.
IMAGE PROCESSING APPLIED TO TRAFFIC QUEUE DETECTION ALGORITHM.
Student Gesture Recognition System in Classroom 2.0 Chiung-Yao Fang, Min-Han Kuo, Greg-C Lee, and Sei-Wang Chen Department of Computer Science and Information.
IMAGE PROCESSING Tadas Rimavičius.
KINECT AMERICAN SIGN TRANSLATOR (KAST)
Hand Gestures Based Applications
Histogram—Representation of Color Feature in Image Processing Yang, Li
Machine Vision Acquisition of image data, followed by the processing and interpretation of these data by computer for some useful application like inspection,
Higher School of Economics , Moscow, 2016
Presented by :- Vishal Vijayshankar Mishra
Joshua Kahn, Scott Wiese ECE533 – Fall 2003 December 12, 2003
Filtering An image as a function Digital vs. continuous images
Wadner Joseph • James Haralambides, PhD Abstract
Higher School of Economics , Moscow, 2016
Higher School of Economics , Moscow, 2016
Presentation transcript:

Converting sign language gestures from digital images to text ASL2TXT Converting sign language gestures from digital images to text George Corser

Presentation Overview Concept Foundation: Barkoky & Charkari (2011) Segmentation Thinning My Contribution: Corser (2012) Segmentation (similar to Barkoky) CED: Canny Edge Dilation (Minus Errors) Assumption: User trains his own phone

Concept Deaf and hearing people talking on the phone, each using their natural language Sign-activated commands like voice-activated

Situation: Drive Thru Window Think: Stephen Hawking Deaf person signs order Phone speaks order Confirmation on screen

Process Flow Requires several conversion processes Many have been accomplished Remaining: ASL2TXT

Goal: Find an Algorithm Find an image processing algorithm that recognizes ASL alphabet = A Web site

Barkoky: Segmentation & Thinning Barkoky counts endpoints to determine sign (doesn’t work for ASL)

Barkoky Process Segmentation Thinning Capture RGB image Rescale Extract using colors Reduce noise Crop at wrist Result: hand segment Input: hand segment Apply thinning Find endpoints, joints Calculate lengths Clean short lengths Identify gesture by counting endpoints

1. Capture RGB Image 2. Rescale % ---------- 1. Capture RGB image a = imread('DSC04926.JPG'); figure('Name','RGB image'),imshow(a); % ---------- 2. Rescale image to 205x154 a10 = imresize(a, 0.1); figure('Name','Rescaled image'),imshow(a10);

3. Extract Hand Using Colors % ---------- 3. Extract hand using color abw10 = zeros(205,154,1); for i=1:205, for j=1:154, if a10(i,j,2)<140 && a10(i,j,3)<100, abw10(i,j,1)=255; end; figure('Name','Extracted'),imshow(abw10); Note: Color threshold code differs from Barkoky

Colors: Training Set Histograms

Colors: Training Set (2) Red Green Blue Excel

Colors: Test Set Histograms

4. Reduce Noise % ---------- 4. Reduce noise for i=2:204, for j=1:154, if abw10(i-1,j,1)==0 if abw10(i+1,j,1)==0, abw10(i,j,1)=0; end; end; if abw10(i-1,j,1)==255 if abw10(i+1,j,1)==255, end; end; abw10 = imfill(abw10,'holes');

5. Identify Wrist Position % ---------- 5. Identify wrist position for i=204:-1:1, for j=1:154, if abw10(i,j,1)==255, break; end; end; if j ~= 154 && abw10(i+1,j,1)~=255, wristi=i+1; wristj=j+1; break;

Wrist Detection Algorithm searches bottom-to-top of image Finds a leftmost white pixel above black pixel Sets wrist position SE of found white pixel

Corser: Segmentation & CED Segmentation (similar to Barkoky) Color threshold technique slightly different American Sign Language (ASL) alphabet, not Persian Sign Language (PSL) numbers Image Comparison: Tried Several Methods Full Threshold (Minus Errors) Diced Segments (Minus Errors) Endpoint Count Difference CED: Canny Edge Dilation

ASL Training Set Hit-or-miss: 23% Barkoky: 8%

ASL Test Set MATLAB

A

A

B

B

C

C

D

D

E

E

F

F

G

G

H

H

I

I

J

J

K

K

L

L

M

M

N

N

O

O

P

P

Q

Q

R

R

S

S

T

T

U

U

V

V

W

W

X

X

Y

Y

Z

Z

Z

Hybrid Algorithm Example % ---------- MATLAB Code ------------------- matchtotal = 0; if abs(x10range - x20range) < 20, matchtotal = matchtotal + 10; end; if abs(y10range - y20range) < 20, matchtotal = matchtotal + 11; matchtotal = matchtotal - abs(h10 - h20); % ----- h10, h20 are vector magnitudes -----

Erosion Subtraction

Canny Edge

Canny Edge Dilation Code % ---------- MATLAB Code ------------------- se = strel('disk',5); a10 = edge(a10,'canny'); a20 = edge(a20,'canny'); a10 = imdilate(a10,se); a20 = imdilate(a20,se); % ----- Then calculate matches minus errors

Experimental Results Technique Correct Full Threshold (Minus Errors) 19% (27%) Diced Segments (Minus Errors) 23% (27%) Barkoky Endpoint Count Diff. 8% Hybrid - Height/Width/Endpoints 19% Erosion Subtraction 15% Canny Edge Dilation (Minus Errors) 12% (35%)

Disadvantages Dependent on lighting conditions Fails with flesh-tone backgrounds Requires calibration to a specific user Limited applications: text messaging, activation (“sign” similar to voice activation) ASL numbers (A=10, D=1, O=0, V=2, W=6) Alphabet is tiny portion of full translation: complete translation maybe many years away

Future Work Barkoky claims flesh tones can be detected, but I have yet to replicate (even Barkoky changed his color detection scheme) Could write letter-by-letter algorithm Could use range camera to compute distance of finger instead of shape of hand Motion analysis or edge count Many possibilities… we’ve only just begun! Cue: music http://www.youtube.com/watch?v=__VQX2Xn7tI

The End