D1 - 12/12/2000 Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document par son destinataire.

Slides:



Advertisements
Similar presentations
D1 - 09/04/2002 Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document par son destinataire.
Advertisements

(Nom du fichier) - D1 - 01/03/2000 Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document.
D1 - 16/05/2014 Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document par son destinataire.
(Nom du fichier) - D1 - 01/03/2000 France Télécom R&D Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation.
Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.
Visual Event Detection & Recognition Filiz Bunyak Ersoy, Ph.D. student Smart Engineering Systems Lab.
1 Texmex – November 15 th, 2005 Strategy for the future Global goal “Understand” (= structure…) TV and other MM documents Prepare these documents for applications.
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Recent Developments in Human Motion Analysis
ADVISE: Advanced Digital Video Information Segmentation Engine
A Study of Approaches for Object Recognition
Object-based Image Representation Dr. B.S. Manjunath Sitaram Bhagavathy Shawn Newsam Baris Sumengen Vision Research Lab University of California, Santa.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
SLIDE 1IS 202 – FALL 2002 Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am Fall 2002
A. Frank Multimedia Multimedia/Video Search. 2 A. Frank Contents Multimedia (MM) and search/retrieval Text-based MM search in General SEs Text-based MM.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Visual Information Systems visual information retrieval.
A structured learning framework for content- based image indexing and visual Query (Joo-Hwee, Jesse S. Jin) Presentation By: Salman Ahmad (270279)
Information Retrieval in Practice
Naresuan University Multimedia Paisarn Muneesawang
1 Faculty of Information Technology Generic Fourier Descriptor for Shape-based Image Retrieval Dengsheng Zhang, Guojun Lu Gippsland School of Comp. & Info.
Multimedia Databases (MMDB)
Multimedia Information Retrieval and Multimedia Data Mining Chengcui Zhang Assistant Professor Dept. of Computer and Information Science University of.
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
Subtask 1.8 WWW Networked Knowledge Bases August 19, 2003 AcademicsAir force Arvind BansalScott Pollock Cheng Chang Lu (away)Hyatt Rick ParentMark (SAIC)
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
Research Projects 6v81 Multimedia Database Yohan Jin, T.A.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.
Chittampally Vasanth Raja 10IT05F vasanthexperiments.wordpress.com.
Chittampally Vasanth Raja vasanthexperiments.wordpress.com.
MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).
M4 / September Integrating multimodal descriptions to index large video collections M4 meeting – Munich Nicolas Moënne-Loccoz, Bruno Janvier,
Skeleton Based Action Recognition with Convolutional Neural Network
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Yixin Chen and James Z. Wang The Pennsylvania State University
Multimedia Systems and Communication Research Multimedia Systems and Communication Research Department of Electrical and Computer Engineering Multimedia.
G. Marchionini, Univ. of Maryland Electronic Environments Cost Trends: Hardware cost < Software cost < Information cost < People time Virtuality (transcend.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
MPEG 7 &MPEG 21.
REAL-TIME DETECTOR FOR UNUSUAL BEHAVIOR
Digital Video Library - Jacky Ma.
Visual Information Retrieval
Introduction Multimedia initial focus
E-Commerce Lecture 8.
Multimedia Content Description Interface
Multimedia Information Retrieval
Presentation of MPEG Standardisation Roadmap Status: Approved, Public
Presentation transcript:

D1 - 12/12/2000 Le présent document contient des informations qui sont la propriété de France Télécom. L'acceptation de ce document par son destinataire implique, de la part de ce dernier, la reconnaissance du caractère confidentiel de son contenu et l'engagement de n'en faire aucune reproduction, aucune transmission à des tiers, aucune divulgation et aucune utilisation commerciale sans l'accord préalable écrit de France Télécom R&D France Telecom's expectations and research in Object Recognition Henri Sanson, Christophe Laurent, Olivier Bernier

D2 - 12/12/2000 France Télécom R&D Outline è France Telecom Markets and context evolution è Visual content indexing: from low-level to semantic description ‡ Overview of current research in Image retrieval and video annotation è Object recognition for Human Computer Interface

D3 - 12/12/2000 France Télécom R&D France Telecom Markets and Applications è France Telecom is a global Telecommunication operator: è from telephony to multimedia and audiovisual services ‡ Fixed networks/services: ‡ Mobiles networks/services: ‡ Internet access and services ‡ IP Data and communication services for corporations è 2 structuring trends: ‡ An increasing importance and presence of visual contents in the services: leveraging the higher bandwidth high value added services (people pay for content and the means to reach it easily) ‡ A need to compensate an increasing technological and functional complexity by providing natural Human Machine/Service Interfaces Vocal Interfaces Visual interfaces

D4 - 12/12/2000 France Télécom R&D Visual content indexing: from low-level to semantic description (1) è Context: ‡ more visual content, huge data volumes, temporal constraint for videos ‡ Need for efficient indexing methods enabling fast and relevant access è Applications: ‡ Media asset management: in addition to traditional audiovisual companies (TV, production), more and more enterprises own image or video assets and face management issues. Relevance has prime importance But cost effectiveness is becoming more and more accute ‡ Web search and filtering engines Huge volume, very variable: robust automatic indexing is the only solution Although surrounding text may be used, the visual content itself is the only reliable source to use ‡ Video surveillance: Specific environment and content type Automatic processing

D5 - 12/12/2000 France Télécom R&D Visual content indexing: from low-level to semantic description (2) è Traditionnally 2 radically opposite approaches: ‡ Accurate but manual Semantic Annotation: Ontologies Time consuming The indexing choices limit possible queries ‡ Low-level –based feature descriptions: aka "Color, Texture and Shape" MPEG-7 Visual Framework Automatic processing but very limited in practice (save for some classification purposes): relies on query-by-example, little usable è Emerging trend: convergence of both previous paradigms ‡ Automated knowledge-based semantic indexing using visual recognition ‡ Many advantages: Semantic and automatic No linguistic fence Indexing complementation is always possible ‡ But still difficult !

D6 - 12/12/2000 France Télécom R&D Visual content indexing: from low-level to semantic description (3) è Constraints of the application impacting the recognition: ‡ High variability of shooting conditions: same objects appear very differently: color, pose, scale, shadows, ‡ High variability in the content type (indoor, outdoor, News, movies, sports) ‡ Potentially huge number of objects or object categories to recognize concurrently ‡ Video Real time working targeted, even much faster for still image ‡ Recognition approach as flexible as possible is expected for generic objects è Qualification of the methods must in fine be done by real experimentation "on the ground" by true end users, and is measured by their satisfaction rate.

D7 - 12/12/2000 France Télécom R&D Current work (1) è Research: ‡ Color space invariance w.r.t shooting conditions ‡ Salient feature-based image retrieval and object description ‡ Face detection and recognition in generic images: è Development: Video indexing platform: ‡ Video: shot change detection, specific image labelling (news speaker, weather report/ commercal gingle), face detection, text detection ‡ Audio/speech: speech/music/other segmentation, keywords recognition, free vocabulary phonetic search

D8 - 12/12/2000 France Télécom R&D Saliency-based Color Image Indexing è Image signature is extracted from a limited number of perceptually important pixels called salient points è Salient points are computed by combining a discrete wavelet transform with a Zerotree representation of wavelet coefficients  Salient points are located on most sharp boundaries è The image signature is composed of a color correlogram computed in the neighborhood of each salient point  This signature can be completed with a texture signature computed around the salient points è An invariant color space (c1c2c3) is used to be robust to imaging conditions

D9 - 12/12/2000 France Télécom R&D Salient Points Extraction

D /12/2000 France Télécom R&D Experimental Results è Database containing ~2000 TV images è Extraction of 18 difficult requests è Computation of ranking metric è Comparaison with the MPEG-7 SCD (Scalable Color Descriptor) Outperforms SCD in 90% of cases !

D /12/2000 France Télécom R&D Object recognition for Human Computer Interface (1) è Context: ‡ Services functionalities are everyday more and more sophisticated ‡ End users are expecting simpler user interfaces ‡ Visual interactions appear to be a good complement to more usual vocal interfaces: Universality: much less constrained by linguistical variability Web cams are widespread Maybe less sensitive to environmental noise/ capturing conditions Permits fast interaction: an image worths 1000 words

D /12/2000 France Télécom R&D Visual recognition for Human-Computer Interfaces è Face detection and tracking ‡ Neural Network based face detection for still images. ‡ Extension to real time face detection in video streams. ‡ Real time face tracking for HCI using statistical models (EM, particle filtering). è Gesture recognition ‡ Static hand posture recognition based on neural networks. ‡ Dynamic gesture recognition (HMMS, IOHMMs and GIOHMMs). ‡ Body tracking in 2D. ‡ Body tracking in 3D using disparity cameras (Triclops).