LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang.

Slides:



Advertisements
Similar presentations
Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Advertisements

Finger Gesture Recognition through Sweep Sensor Pong C Yuen 1, W W Zou 1, S B Zhang 1, Kelvin K F Wong 2 and Hoson H S Lam 2 1 Department of Computer Science.
Advanced Image Processing Student Seminar: Lipreading Method using color extraction method and eigenspace technique ( Yasuyuki Nakata and Moritoshi Ando.
FACIAL EMOTION RECOGNITION BY ADAPTIVE PROCESSING OF TREE STRUCTURES Jia-Jun Wong and Siu-Yeung Cho Forensic and Security Lab School of Computer Engineering.
1 CPC group SeminarThursday, June 1, 2006 Classification techniques for Hand-Written Digit Recognition Venkat Raghavan N. S., Saneej B. C., and Karteek.
A New Scheme For Robust Blind Digital Video Watermarking Supervised by Prof. LYU, Rung Tsong Michael Presented by Chan Pik Wah, Pat Mar 5, 2002 Department.
An Infant Facial Expression Recognition System Based on Moment Feature Extraction C. Y. Fang, H. W. Lin, S. W. Chen Department of Computer Science and.
July 27, 2002 Image Processing for K.R. Precision1 Image Processing Training Lecture 1 by Suthep Madarasmi, Ph.D. Assistant Professor Department of Computer.
Page 1 SIXTH SENSE TECHNOLOGY Presented by: KIRTI AGGARWAL 2K7-MRCE-CS-035.
Identifying Image Spam Authorship with a Variable Bin-width Histogram-based Projective Clustering Song Gao, Chengcui Zhang, Wei Bang Chen Department of.
Facial feature localization Presented by: Harvest Jang Spring 2002.
As applied to face recognition.  Detection vs. Recognition.
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
Combining Human and Machine Capabilities for Improved Accuracy and Speed in Visual Recognition Tasks Research Experiment Design Sprint: IVS Flower Recognition.
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
LYU0603 A Generic Real-Time Facial Expression Modelling System Supervisor: Prof. Michael R. Lyu Group Member: Cheung Ka Shun ( ) Wong Chi Kin ( )
LYU0603 A Generic Real-Time Facial Expression Modelling System Supervisor: Prof. Michael R. Lyu Group Member: Cheung Ka Shun ( ) Wong Chi Kin ( )
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-based Interactive Toy Jacky CHAU Department of Computer Science and Engineering.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
ART: Augmented Reality Table for Interactive Trading Card Game Albert H.T. Lam, Kevin C. H. Chow, Edward H. H. Yau and Michael R. Lyu Department of Computer.
Chinese Character Recognition for Video Presented by: Vincent Cheung Date: 25 October 1999.
Cindy Song Sharena Paripatyadar. Use vision for HCI Determine steps necessary to incorporate vision in HCI applications Examine concerns & implications.
FACE RECOGNITION BY: TEAM 1 BILL BAKER NADINE BROWN RICK HENNINGS SHOBHANA MISRA SAURABH PETHE.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
PCA Channel Student: Fangming JI u Supervisor: Professor Tom Geoden.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
Multimedia Security Digital Video Watermarking Supervised by Prof. LYU, Rung Tsong Michael Presented by Chan Pik Wah, Pat Nov 20, 2002 Department of Computer.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Biometrics & Security Tutorial 6. 1 (a) Understand why use face (P7: 3-4) and face recognition system (P7: 5-10)
Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.
Oral Defense by Sunny Tang 15 Aug 2003
LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang.
Vision-Based Biometric Authentication System by Padraic o hIarnain Final Year Project Presentation.
Evaluating the use of OCR on a Mobile Device Presented by : Hamed Alharbi Supervisor by :Dr Brett Wilkinson.
Eigenfaces for Recognition Student: Yikun Jiang Professor: Brendan Morris.
Presentation by: K.G.P.Srikanth. CONTENTS  Introduction  Components  Working  Applications.
Final Year Student Projects: Prelude Michael R. Lyu.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
(CMSC5720-1) MSC projects by Prof K.H. Wong (21 July2014) (shb907) MSC projects supervised by Prof.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Face Recognition System By Arthur. Introduction  A facial recognition system is a computer application for automatically identifying or verifying a person.
Submitted by:- Vinay kr. Gupta Computer Sci. & Engg. 4 th year.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
Week 2 REU Nolan Warner. Overview This weeks progress/projects Things learned Tentative research topics.
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
Reporter: 資訊所 P Yung-Chih Cheng ( 鄭詠之 ).  Introduction  Data Collection  System Architecture  Feature Extraction  Recognition Methods  Results.
Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,
Mobile Image Processing
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
Final Year Project Vision based biometric authentication system By Padraic ó hIarnain.
By: Hossein and Hadi Shayesteh Supervisor: Mr. James Connan.
 Handwritten character recognition is a frontier area for research for the past few decades  OCR-process of translation of images of handwritten shorthand.
An intelligent strategy for checking the annual inspection status of motorcycles based on license plate recognition Yo-Ping Huang a, Chien-Hung Chen b,
License Plate Recognition of A Vehicle using MATLAB
ZAGAZIG UNIVERSITY-BENHA BRANCH SHOUBRA FACULTY OF ENGINEERING ELECTRICAL ENGINGEERING DEPT. COMPUTER SYSTEM DIVISION GRAUDATION PROJECT 2003.
Optical Character Recognition
Presented By Bhargav (08BQ1A0435).  Images play an important role in todays information because A single image represents a thousand words.  Google's.
Automatic License Plate Recognition for Electronic Payment system Chiu Wing Cheung d.
Digital Video Library - Jacky Ma.
Deeply learned face representations are sparse, selective, and robust
Improving Chinese handwriting Recognition by Fusing speech recognition
FACE RECOGNITION TECHNOLOGY
Extracting Old Persian Cuneiform Font Out of
A language assistant system for smart glasses
Face Recognition and Detection Using Eigenfaces
Optical Character Recognition
Aline Martin ECE738 Project – Spring 2005
Presentation transcript:

LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang Siu Fung Department of Computer Science & Engineering The Chinese University of Hong Kong

Outline Introduction System Architecture Korean OCR Friend Reminder Conclusion Acknowledgement

Introduction – What is VTT? Smart Traveller with Visual Translator (VTT)  Mobile Device which is convenient for a traveller to carry Mobile Phone, Pocket PC, Palm, etc.  Recognize and translate the foreign text into native language  Detect and recognize the face into name

Introduction – Objective Two main features:  Korean to English Visual Translation  Remind Somebody’s Information with Face Image

Introduction – Objective (Cont.) Real Life Examples  Sometimes we lose the way, we need to know where we are.  Sometimes we forget somebody we met before.

System Architecture GUI Camera API Camera Korean OCRFace Recognizer Face Database Stroke Database & Dictionary Request Data RequestOutput User QueryResultQueryUpdateResult Request Response

Korean OCR (KOCR) Usage  Visual Translator from Korean to English Procedure for using KOCR  Text Area Detection  Character Identification  Translation

KOCR – Program Flow Initialization Capture Image Text Segmentation Recognition Translation

KOCR – Text Area Detection Edge Detection using Sobel Filter Horizontal Projection and Vertical Projection Find Potential Text Area by threshold Hor izon tal Proj ecti on Threshold Vertical Projection

KOCR – Text Area Detection (Cont.)

KOCR – Character Identification Features on Stroke  Extracted by Labeling Connected Component algorithm Proposed Feature Extraction  Five rays each side  Difference of adjacent rays (-1 or 0 or 1)  Has holes (0 or 1)  Dimension ratio of Stroke (width/height) (-1 or 0 or 1)

KOCR – Character Identification (Cont.)

KOCR – Translation Dictionary  Korean to English  About 1000 Korean Words Matching  Longest Match from left to right

KOCR – Translation (Cont.)

KOCR – Evaluations OCR Correctness  Training Set (3327 – 30% of all Character)  Testing Set (7845 – Others)  Result (64%)  Suggestion Train all Korean characters

KOCR – Evaluations (Cont.) Text Segmentation Correctness  45 Captured Images  99 Characters  Result Segment 83% characters correctly Segment 71% image correctly  Acceptable Result

KOCR – Evaluations (Cont.) OCR Correctness  45 Captured Images  99 Characters  Result 79% Characters correctly Recognized 69% Images correctly Recognized

Friend Reminder – Program Flow Initialization Capture Image Face Segmentation Recognition Show Profile

Friend Reminder (FR) Usage  Show the Profile of Friend by capturing a photo Procedure for using FR  Face Segmentation  Face Identification  Friend’s Profile

FR – Face Segmentation Eye Detection  Algorithm Gabor Wavelet Feature Log-Polar Sampling  Manual Selected (Suggest) Selected Eyes and Mouth Positions

FR – Face Segmentation

FR – Face Identification EigenFace  By using Principal Component Analysis (PCA)  Project the input face into the eigenvectors that pre-learned  Find the difference between the projection and the faces in database  Face determined to be ‘NEW’ if the difference is larger than a threshold

FR – Friend’s Profile

FR – Evaluations Eye Detection Correctness  40 Images  Result 22.5% Image Successfully Detected  Non-acceptable  Suggestion Manually Select Eyes and Mouth Positions

FR – Evaluations Face Identification  Evaluation Information 26 Test Persons’ Faces  16 faces is in database  10 faces is not in database 3 faces Trained per person 8 persons in face database  Result 77% Successfully Identified  63% Successfully Identified as person in database  100% Successfully Identified as person not in database

Conclusion Combined Modern Equipments  Digital camera  Personal Data Assistant (PDA) Techniques Learned  Image Processing  Optical Character Recognition  Face Recognition Techniques VTT Integrated  VTT for Korean to English OCR  VTT for Friend Reminder

Acknowledgement Thanks Professor Michael Lyu, Project Supervisor  Give us valuable advice  Provide us necessary equipments Thanks Edward Yau, Technical Manager of VIEW project  Give us many ideas

~The End~