Automatic Caption Localization in Compressed Video By Yu Zhong, Hongjiang Zhang, and Anil K. Jain, Fellow, IEEE IEEE Transactions on Pattern Analysis and.

Slides:



Advertisements
Similar presentations
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Advertisements

INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
Fourier Transform – Chapter 13. Image space Cameras (regardless of wave lengths) create images in the spatial domain Pixels represent features (intensity,
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Error detection and concealment for Multimedia Communications Senior Design Fall 06 and Spring 07.
Chapter 7 End-to-End Data
A New Approach for Video Text Detection and Localization M. Cai, J. Song and M.R. Lyu VIEW Technologies The Chinese University of Hong Kong.
By Max Havir. Video Compression MPEG1 MPEG2 MPEG4 MPEG7 MPEG21 Cinepak Motion JPEG A Motion JPEG B H.261 H.263 Sorenson Video Divx.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Text Detection in Video Min Cai Background  Video OCR: Text detection, extraction and recognition  Detection Target: Artificial text  Text.
Issues for Multimedia Privacy & Security ---- Video Content Privacy Protection, Copyright Protection & Database Access Control Jianping Fan Dept of Computer.
3. Introduction to Digital Image Analysis
1 Data Compression Engineering Math Physics (EMP) Steve Lyon Electrical Engineering.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
JPEG Still Image Data Compression Standard
1 An Efficient Method for DCT- Domain Image Resizing with Mixed Field/Frame-Mode Macroblocks Changhoon Yim and Michael A. Isnardi IEEE TRANSACTION ON CIRCUITS.
©Brooks/Cole, 2003 Chapter 15 Data Compression. ©Brooks/Cole, 2003 Realize the need for data compression. Differentiate between lossless and lossy compression.
IT 342 : Fundamentals of Multimedia
Image and Video Compression
Trevor McCasland Arch Kelley.  Goal: reduce the size of stored files and data while retaining all necessary perceptual information  Used to create an.
JPEG C OMPRESSION A LGORITHM I N CUDA Group Members: Pranit Patel Manisha Tatikonda Jeff Wong Jarek Marczewski Date: April 14, 2009.
ECE472/572 - Lecture 12 Image Compression – Lossy Compression Techniques 11/10/11.
Video Coding. Introduction Video Coding The objective of video coding is to compress moving images. The MPEG (Moving Picture Experts Group) and H.26X.
The MPEG-7 Color Descriptors
MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.
MPEG: (Moving Pictures Expert Group) A Video Compression Standard for Multimedia Applications Seo Yeong Geon Dept. of Computer Science in GNU.
DATA COMPRESSION LOSSY COMPRESSION METHODS What it is… A compression of information that is acceptable in pictures or videos, but not texts or programs.
Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
Institute of Informatics and Telecommunications – NCSR “Demokritos” TEXT EXTRACTION FROM IMAGES AND VIDEOS Ινστιτούτο πληροφορικής και τηλεπικοινωνιών.
Methods of Video Object Segmentation in Compressed Domain Cheng Quan Jia.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Eyes detection in compressed domain using classification Eng. Alexandru POPA Technical University of Cluj-Napoca Faculty.
Spring 2000CS 4611 Multimedia Outline Compression RTP Scheduling.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Implementation, Comparison and Literature Review of Spatio-temporal and Compressed domains Object detection. By Gokul Krishna Srinivasan Submitted to Dr.
Text From Corners: A Novel Approach to Detect Text and Caption in Videos Xu Zhao, Kai-Hsiang Lin, Yun Fu, Member, IEEE, Yuxiao Hu, Member, IEEE, Yuncai.
Vehicle Detection in Aerial Surveillance Using Dynamic Bayesian Networks Hsu-Yung Cheng, Member, IEEE, Chih-Chia Weng, and Yi-Ying Chen IEEE TRANSACTIONS.
Wonjun Kim and Changick Kim, Member, IEEE
Introduction to JPEG m Akram Ben Ahmed
CS 376b Introduction to Computer Vision 03 / 17 / 2008 Instructor: Michael Eckmann.
Attila Kiss, Tamás Németh, Szabolcs Sergyán, Zoltán Vámossy, László Csink Budapest Tech Recognition of a Moving Object in a Stereo Environment Using a.
(B1) What are the advantages and disadvantages of digital TV systems? Hint: Consider factors on noise, data security, VOD etc. 1.
Ec2029 digital image processing
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
1. 2 What is Digital Image Processing? The term image refers to a two-dimensional light intensity function f(x,y), where x and y denote spatial(plane)
Content Based Coding of Face Images
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
An improved SVD-based watermarking scheme using human visual characteristics Chih-Chin Lai Department of Electrical Engineering, National University of.
Chapter 9 Image Compression Standards
Automatic Video Shot Detection from MPEG Bit Stream
Data Compression.
DCT IMAGE COMPRESSION.
Presenter: Ibrahim A. Zedan
A Simple Image Compression : JPEG
Image Segmentation Techniques
Text Detection in Images and Video
4. DIGITAL IMAGE TRANSFORMS 4.1. Introduction
Standards Presentation ECE 8873 – Data Compression and Modeling
JPEG Still Image Data Compression Standard
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Research Institute for Future Media Computing
Research Institute for Future Media Computing
Research Institute for Future Media Computing
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Review and Importance CS 111.
Presentation transcript:

Automatic Caption Localization in Compressed Video By Yu Zhong, Hongjiang Zhang, and Anil K. Jain, Fellow, IEEE IEEE Transactions on Pattern Analysis and Machine Intelligence Vol 22, No. 4, April 2000

Introduction Caption text on video General methods for caption extraction Proposed Method How it works Evaluation

Caption Text on Video Parse, index and abstract of Video Caption Text Information of Video Describe the content Catch “highlights”

General Extraction Methods Component-based Geometrical arrangement Homogeneous color Texture-based Contrast the background Horizontal intensity variation

Most published method Applied on uncompressed images Digital video and images Compressed (MPEG & JPEG) DCT (Discrete Cosine Transform) coding Reducing interframe redundancy (for MPEG)

Proposed Method Step 1 & 2 Detecting Blocks Step 3 Refinement Step 4 Segmentation of rows Step 1 Step 2 Step 3 Step 4

Proposed Method Source frame

Step 1 & 2 Detecting Blocks of High Horizontal Spatial Intensity Variation Operates in DCT domain Not necessary to decompress Unit: 8x8 blocks in I-frames (Intracoded) Quantized DCT coefficients Readily extracted Fast DCT blocks with high horizontal intensity variation

Step 3 Remove noise by applying Morphological Operations Step 1 & 2 Picked high contrast nontext blocks Disconnected text blocks Wide spacing, low contrast, large fonts Step 3 Remove most isolated blocks Merges nearby blocks Applying Morphological Operations

Step 4 Segmentation based on vertical intensity variation Detected text regions Large vertical intensity variation Local vertical harmonics Corresponding row of text High vertical spectrum energy After horizontal/vertical text energy test

Dilating the previous result by one block

Evaluation Not work properly when: Very big characters Too widely spaced text Image texture

Caption Text on Video Parse, index and abstract of Video Caption Text Information of Video Describe the content Catch “highlights”

Evaluation Commonly used caption NOT very big characters NOT too widely spaced text NOT image texture Therefore, important information retrieved!

Evaluation Future work Proposed to other transform-based compressions Use also color information to improve accuracy Combining DCT blocks to support larger fonts Solution to P- and B-frames

Summary Proposed caption localization method For compressed video Fast Further development is needed to improve: Accuracy Support other compression methods