Improving Chinese handwriting Recognition by Fusing speech recognition Zhang Xi-Wen CSE, CUHK and HCI Lab., ISCAS 2005.4.12.

Slides:



Advertisements
Similar presentations
Word Spotting DTW.
Advertisements

Abstract This article investigates the importance of the vocal source information for speaker recogni- tion. We propose a novel feature extraction scheme.
A Low-cost Attack on a Microsoft CAPTCHA Yan Qiang,
Confidence Measures for Speech Recognition Reza Sadraei.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Chapter 1: Introduction to Pattern Recognition
March 15-17, 2002Work with student Jong Oh Davi Geiger, Courant Institute, NYU On-Line Handwriting Recognition Transducer device (digitizer) Input: sequence.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
PLT 2007 CSIS Shorthand Handwriting Recognition for Pen-Centric Interfaces Charles C. Tappert 1 and Jean R. Ward 2 1 School of CSIS, Pace University, New.
Video summarization by graph optimization Lu Shi Oct. 7, 2003.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Spelling Checkers Daniel Jurafsky and James H. Martin, Prentice Hall, 2000.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
SKETCH-BASED USER INTERFACE STUDY Presented By Jin Xiangyu Department of Computer Science and Technology Nanjing University June 2002.
An Automatic Segmentation Method Combined with Length Descending and String Frequency Statistics for Chinese Shaohua Jiang, Yanzhong Dang Institute of.
(Off-Line) Cursive Word Recognition Tal Steinherz Tel-Aviv University.
On Timing- Independent False Path Identification Feng Yuan, Qiang Xu Cuhk Reliable Computing Lab, The Chinese University of Hong Kong ICCAD 2010.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
Chapter 14 Speaker Recognition 14.1 Introduction to speaker recognition 14.2 The basic problems for speaker recognition 14.3 Approaches and systems 14.4.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Learning Models for Object Recognition from Natural Language Descriptions Presenters: Sagardeep Mahapatra – Keerti Korrapati
© 2008 Chen Hao, Beijing Institute of Technology 1 Intelligent Parking System: Parking Guide Application in Beijing and Method for License Plate Localization.
Language Joviltė Beržanskytė PSbns Content: Elements of language Language development The Influence of language to thinking Do animals use language?
1 University of Palestine Topics In CIS ITBS 3202 Ms. Eman Alajrami 2 nd Semester
Recognition of spoken and spelled proper names Reporter : CHEN, TZAN HWEI Author :Michael Meyer, Hermann Hild.
1 Iterative Multimodel Subimage Binarization for Handwritten Character Segmentation Author: Amer Dawoud and Mohamed S. Kamel Source: IEEE TRANSACTIONS.
1 Sentence-extractive automatic speech summarization and evaluation techniques Makoto Hirohata, Yosuke Shinnaka, Koji Iwano, Sadaoki Furui Presented by.
出處: Signal Processing and Communications Applications, 2006 IEEE 作者: Asanterabi Malima, Erol Ozgur, and Miijdat Cetin 2015/10/251 指導教授:張財榮 學生:陳建宏 學號: M97G0209.
8.0 Search Algorithms for Speech Recognition References: of Huang, or of Becchetti, or , of Jelinek 4. “ Progress.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Learning Phonetic Similarity for Matching Named Entity.
Optimization of Line Segmentation Techniques for Thai Handwritten Document Olarik Surinta Mahasarakham University Thailand.
What are the advantages of using bar code scanner?  Fast  It is fast  It is fast for reading data  It is fast for data input  Accurate  The advantage.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
A Multilingual Hierarchy Mapping Method Based on GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
Confidence Measures As a Search Guide In Speech Recognition Sherif Abdou, Michael Scordilis Department of Electrical and Computer Engineering, University.
Pen Based User Interface II CSE 481b January 25, 2005.
Handwriting Recognition
Text Summarization using Lexical Chains. Summarization using Lexical Chains Summarization? What is Summarization? Advantages… Challenges…
Author :K. Thambiratnam and S. Sridharan DYNAMIC MATCH PHONE-LATTICE SEARCHES FOR VERY FAST AND ACCURATE UNRESTRICTED VOCABULARY KEYWORD SPOTTING Reporter.
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
Character Design and Stamp Algorithms for Character Projection Electron-Beam Lithography P. Du, W. Zhao, S.H. Weng, C.K. Cheng, and R. Graham UC San Diego.
EXAMPLE : Principle of Optimality and Dynamic Programming.
Ink Analysis Richard Anderson CSE 481b Winter 2007.
1 A Statistical Matching Method in Wavelet Domain for Handwritten Character Recognition Presented by Te-Wei Chiang July, 2005.
Date of download: 6/2/2016 Copyright © 2016 SPIE. All rights reserved. The Ottoman alphabet without diacritics and dots. Letters in the rectangles are.
From: Standardized Assessment of Reading Performance: The New International Reading Speed Texts IReST Invest. Ophthalmol. Vis. Sci ;53(9):
Machine Learning for Computer Security
Improving Chinese handwriting Recognition by Fusing speech recognition
Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas
8.0 Search Algorithms for Speech Recognition
Voice To Text Conversion
An Introduction to Supervised Learning
Compilers B V Sai Aravind (11CS10008).
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Dysgraphia.
Content Augmentation for Mixed-Mode News Broadcasts Mike Dowman
Recognizing Location Names from Chinese Texts
Self-supervised adaptation for on-line text recognition
汉语连续语音识别 年1月4日访北京工业大学 973 Project 2019/4/17 汉语连续语音识别 年1月4日访北京工业大学 郑 方 清华大学 计算机科学与技术系 语音实验室
EE 492 ENGINEERING PROJECT
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Visual Recognition of American Sign Language Using Hidden Markov Models 문현구 문현구.
Artificial Intelligence 2004 Speech & Natural Language Processing
ประกาศกระทรวงอุตสาหกรรม ฉบับที่ 5292 (พ.ศ. 2562)
Presentation transcript:

Improving Chinese handwriting Recognition by Fusing speech recognition Zhang Xi-Wen CSE, CUHK and HCI Lab., ISCAS

Outline 1 Chinese handwriting recognition 2 Chinese speech recognition 3 Information fusion 4 Experimental results

1.Handwriting Recognition Handwriting segmentation Character recognition

1.1 Handwriting segmentation It is more difficult for Chinese handwriting segmentation

Character extraction using histogram A histogram of between-stroke gaps. The dimidiate threshold of the histogram is to extract lines of strokes. The dimidiate threshold of the histogram of a line of strokes is to extract characters.

Figure 1. Handwriting segmentation

Problems remained A Chinese character may be mis-segmented into many characters. Many Chinese characters may be mis- grouped as a character. The segmentation error will inevitably result in handwriting recognition errors.

1.2 Character recognition –Isolated character recognizer from HW –Many candidates

Text recognized from the handwriting. Handwriting. The ground-truth text. Figure 2. Handwriting recognition

2 Speech recognition Chinese speech. On-line, microphone. Continuous speech recognizer from MS.

Text recognized from the speech corresponding to the handwriting. The ground-truth text. Figure 3. Speech recognition

3 Text fusion An optimization problem Dynamic Programming

3.1 Principles The fused text should contain more semantic information. Construct a text with the least characters and the most semantic information.

3.2 Four ways Text recognized from the handwriting. Text recognized from the speech corresponding to the handwriting. Figure 4. Texts to be fused

3.3 Dynamic Programming A directed graph. Optimal paths.

Figure 5. A directed graph with N levels.

(c) The optimal fused text corresponding to the optimal path. (d) The ground-truth text. Figure 6. Text fusion using DP. (b) Text recognized from the speech corresponding to the handwriting. (a) Text recognized from the handwriting.

3.4 A language model Lexicon Syntax Semantic

Lexicon

4 Experimental results

Thank you very much for your criticism, comments and suggestions! Tel: