Chinese Character Recognition for Video Presented by: Vincent Cheung Date: 25 October 1999.

Slides:



Advertisements
Similar presentations
Environmental Remote Sensing GEOG 2021
Advertisements

Segmentation of Touching Characters in Devnagari & Bangla Scripts Using Fuzzy MultiFactorial Analysis Presented By: Sanjeev Maharjan St. Xavier’s College.
Prénom Nom Document Analysis: Document Image Processing Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Identifying Image Spam Authorship with a Variable Bin-width Histogram-based Projective Clustering Song Gao, Chengcui Zhang, Wei Bang Chen Department of.
A Low-cost Attack on a Microsoft CAPTCHA Yan Qiang,
Hue-Grayscale Collaborating Edge Detection & Edge Color Distribution Space Jiqiang Song March 6 th, 2002.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
1Ellen L. Walker Recognizing Objects in Computer Images Ellen L. Walker Mathematical Sciences Dept Hiram College Hiram, OH 44234
Text Detection in Video Min Cai Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.
3. Introduction to Digital Image Analysis
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Robust Object Segmentation Using Adaptive Thresholding Xiaxi Huang and Nikolaos V. Boulgouris International Conference on Image Processing 2007.
LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang.
Traffic Sign Recognition Jacob Carlson Sean St. Onge Advisor: Dr. Thomas L. Stewart.
An Approach to Korean License Plate Recognition Based on Vertical Edge Matching Mei Yu and Yong Deak Kim Ajou University Suwon, , Korea 指導教授 張元翔.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Computer Vision Basics Image Terminology Binary Operations Filtering Edge Operators.
A Probabilistic Framework For Segmentation And Tracking Of Multiple Non Rigid Objects For Video Surveillance Aleksandar Ivanovic, Tomas S. Huang ICIP 2004.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
A Real-Time for Classification of Moving Objects
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
The Chinese University of Hong Kong Department of Computer Science and Engineering Lyu0202 Advanced Audio Information Retrieval System.
Gholamreza Anbarjafari, PhD Video Lecturers on Digital Image Processing Digital Image Processing Spatial Domain Filtering: Part II.
Feature extraction Feature extraction involves finding features of the segmented image. Usually performed on a binary image produced from.
Image Subtraction for Real Time Moving Object Extraction Shahbe Mat Desa, Qussay A. Salih, CGIV’04.
Brief overview of ideas In this introductory lecture I will show short explanations of basic image processing methods In next lectures we will go into.
Lesson 4: Formatting the Worksheet
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
CGMB214: Introduction to Computer Graphics
An efficient method of license plate location Pattern Recognition Letters 26 (2005) Journal of Electronic Imaging 11(4), (October 2002)
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
A Graph-based Friend Recommendation System Using Genetic Algorithm
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
Presented By: Abirami Poonkundran Authors: Jeff Yan, Ahmad El Ahmad.
NTIT IMD 1 Speaker: Ching-Hao Lai( 賴璟皓 ) Author: Hongliang Bai, Junmin Zhu and Changping Liu Source: Proceedings of IEEE on Intelligent Transportation.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Autonomous Robots Vision © Manfred Huber 2014.
Lecture # 19 Image Processing II. 2 Classes of Digital Filters Global filters transform each pixel uniformly according to the function regardless of.
By Pushpita Biswas Under the guidance of Prof. S.Mukhopadhyay and Prof. P.K.Biswas.
Igor Rosenberg Summer internship Creating a building detector June 16 th to September 15 th in Dublin City University, Ireland Supervisor: Alan Smeaton.
CS 376b Introduction to Computer Vision 03 / 31 / 2008 Instructor: Michael Eckmann.
Face Detection Final Presentation Mark Lee Nic Phillips Paul Sowden Andy Tait 9 th May 2006.
Tracking Groups of People for Video Surveillance Xinzhen(Elaine) Wang Advisor: Dr.Longin Latecki.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Instructor: Mircea Nicolescu Lecture 5 CS 485 / 685 Computer Vision.
Digital Image Processing Week V Thurdsak LEAUHATONG.
Automatic Caption Localization in Compressed Video By Yu Zhong, Hongjiang Zhang, and Anil K. Jain, Fellow, IEEE IEEE Transactions on Pattern Analysis and.
Course 3 Binary Image Binary Images have only two gray levels: “1” and “0”, i.e., black / white. —— save memory —— fast processing —— many features of.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
Optical Character Recognition
Goal: Predicting the Where and What of actors and actions through Online Action Localization Figure 1.
Environmental Remote Sensing GEOG 2021
OCR Reading.
Program Studi S-1 Teknik Informatika FMIPA Universitas Padjadjaran
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
Applications of AI Image Processing.
Introduction Computer vision is the analysis of digital images
Introduction to Computer and Human Vision
Histogram Histogram is a graph that shows frequency of anything. Histograms usually have bars that represent frequency of occuring of data. Histogram has.
Text Detection in Images and Video
Introduction Computer vision is the analysis of digital images
CS654: Digital Image Analysis
Handwritten Characters Recognition Based on an HMM Model
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Word Processing Software Photo credit: © 2007 JupiterImagesCorporation.
Introduction Computer vision is the analysis of digital images
INSTRUCTIONAL NOTES There are many similarities between Photoshop and Illustrator. We have attempted to place tools and commands in the context of where.
DIGITAL IMAGE PROCESSING Elective 3 (5th Sem.)
Presentation transcript:

Chinese Character Recognition for Video Presented by: Vincent Cheung Date: 25 October 1999

Introduction n Many dialects in Chinese, but Chinese Characters is common in anywhere. n Many video programs have Chinese subtitles nowadays n Extract text from digital video programs can help for indexing, searching and retrieval

Features of Subtitles n Characters are in foreground n They are monochrome n They are rigid, from frame to frame n They are upright n They have size restrictions n They contrast with the background n They appear in clusters at a limited distance aligned to a horizontal line

Steps to Recognise Text n Clearing the background, removing noise n Segmenting the characters n Recognising them by pattern matching

Demo Video n A piece of news from ATV about Airport Authority Hong Kong and is reported in Cantonese n In MPEG format n Action!

MPEG Video n Consisted of a video track and an audio track n Consisted of frames n For video part, a frame is representing a static image

Steps to Remove Background Agnihotri & Dimitrova Suggested 7 steps procedures: n Channel Separation n Image Enhancement n Edge Detection n Edge Filtering n Character Detection n Text Box Detection n Text Line Detection & Enhancement

Sample Frame n The 100th frame of the demo video

Channel Separation n Use Red Channel which gives higher contrast edges n More probably that natural environment are in blue or green Green Channel Red Channel Blue Channel

Image Enhancement n To filter salt and pepper noise n To sharpen the edges n Quality of our mpeg video is quite good that we no need to take this step

Edge Detection n Find out the edges from the image n Use a 3x3 matrice mask [ ] n Use Sobel Filter instead n edges around text may be broken and not connected

Sample Edge Image

Edge Filtering n To remove areas which possibly do not contain text n Characters would give high density of objects, hence high density of edges n Finding out areas with high density of edges which give hints of where the characters located

Density of edges in horizontal lines

Filtering the Irrelevant Edges

Density of Edges in Vertical

What if the length of subtitle is short?? n Cut the image into certain parts and calculate the density of edges in those areas n Prevent the case if the subtitle is short and cannot give an overall view

Sample Image Divided in Parts

Challenges in Chinese Characters Segmentation n Square? n Not Really, they are variable in size!! Having different height and width n e.g.: ( 日, 曰 ) n Lead to some problem in Fixed- Distance Approach Segmentation n More problems if mixed with English, Numbers, and Symbols n e.g. 18 部「 IBM 」電腦

n Usually written in horizontal way, like English. n Do segmentation like English? n English: each character is horizontally linked n Chinese: may not have such linkage n e.g.: 八, 川 Challenges in Chinese Characters Segmentation

Character Recognition Pattern Matching n most straight forward n two pattern are compared n by using pattern distance

Classification for Faster Matching n By blackness (e.g. 一, 鬱 ) n By projection profiles

Possible Enhancement n Picking out the moving objects by keeping track of a number of consecutive frames n Use of lexicon to choose the most possible character

Q & A