SoLSTiCe Similarity of locally structured data in computer vision Université-Jean Monnet (Saint-Etienne) LIRIS (Lyon) (1/02/2014 -2018) Elisa Fromont,

Slides:

Advertisements

Similar presentations

Data mining for Web Video Auto Tagging TRAN Hoang Tung.

Advertisements

Towards a Quadratic Time Approximation of Graph Edit Distance Fischer, A., Suen, C., Frinken, V., Riesen, K., Bunke, H. Contents Introduction Graph edit.

Spatio-Temporal Relationship Match: Video Structure Comparison for Recognition of Complex Human Activities M. S. Ryoo and J. K. Aggarwal ICCV2009.

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.

Cornell Accelerating Belief Propagation in Hardware Skand Hurkat and José Martínez Computer Systems Laboratory Cornell University

Local Discriminative Distance Metrics and Their Real World Applications Local Discriminative Distance Metrics and Their Real World Applications Yang Mu,

Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Víctor Ponce Miguel Reyes Xavier Baró Mario Gorga Sergio Escalera Two-level GMM Clustering of Human Poses for Automatic Human Behavior Analysis Departament.

Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.

Robust Object Tracking via Sparsity-based Collaborative Model

Data Visualization STAT 890, STAT 442, CM 462

Computer and Robot Vision I

Local Descriptors for Spatio-Temporal Recognition

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

Presented by Marlene Shehadeh Advanced Topics in Computer Vision ( ) Winter

Beyond Actions: Discriminative Models for Contextual Group Activities Tian Lan School of Computing Science Simon Fraser University August 12, 2010 M.Sc.

Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,

CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.

Computer Vision & Biomimetic Object Recognition Bruce A. Draper Department of Computer Science January 28, 2008.

A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications Lucia Maddalena and Alfredo Petrosino, Senior Member, IEEE.

INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

INTRODUCTION TO Machine Learning 3rd Edition

Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.

Real-time Action Recognition by Spatiotemporal Semantic and Structural Forest Tsz-Ho Yu, Tae-Kyun Kim and Roberto Cipolla Machine Intelligence Laboratory,

Benjamin Gutierrez Becker, Loic Peter

Image Based Positioning System Ankit Gupta Rahul Garg Ryan Kaminsky.

Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.

CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.

Autonomous Learning of Object Models on Mobile Robots Xiang Li Ph.D. student supervised by Dr. Mohan Sridharan Stochastic Estimation and Autonomous Robotics.

Laboratoire d'InfoRmatique en Image et Systèmes d'information LIRIS UMR 5205 CNRS/INSA de Lyon/Université Claude Bernard Lyon 1/Université Lumière Lyon.

Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,

Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.

Multi-task Low-rank Affinity Pursuit for Image Segmentation Bin Cheng, Guangcan Liu, Jingdong Wang, Zhongyang Huang, Shuicheng Yan (ICCV’ 2011) Presented.

Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.

Demonstration Study for Applying AVED to Still Images from Station M Kick-off Meeting Demonstration Study for Applying AVED to Still Images from Station.

Different Features. Glasses vs. No Glasses Beard vs. No Beard.

Pedestrian Detection and Localization

Networked Audio Visual Systems and Home Platforms ADMIRE-P at Med-e-Tel 2005 April 6-8, Application of Video Technologies and Pattern Recognition.

Evaluation of Research Theme CogB. Objectives LEAR: LEArning and Recognition in vision Visual recognition and scene understanding –Particular objects.

Introduction to the Semantic Web and Linked Data

March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.

Lucent Technologies - Proprietary 1 Interactive Pattern Discovery with Mirage Mirage uses exploratory visualization, intuitive graphical operations to.

Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.

IEEE 2015 Conference on Computer Vision and Pattern Recognition Active Learning for Structured Probabilistic Models with Histogram Approximation Qing SunAnkit.

Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.

Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.

Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.

Machine learning & object recognition Cordelia Schmid Jakob Verbeek.

Age-invariant Face Recognition

Brief Intro to Machine Learning CS539

Term Project Proposal By J. H. Wang Apr. 7, 2017.

A. M. R. R. Bandara & L. Ranathunga

Knowledge Discovery, Machine Learning, and Social Mining

Eick: Introduction Machine Learning

Learning Mid-Level Features For Recognition

Performance of Computer Vision

Recognizing Deformable Shapes

Paper Presentation: Shape and Matching

By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,

What is Pattern Recognition?

Rob Fergus Computer Vision

Data Warehousing and Data Mining

AHED Automatic Human Emotion Detection

Christoph F. Eick: A Gentle Introduction to Machine Learning

Week 3 Volodymyr Bobyr.

Week 6 Presentation Ngoc Ta Aidean Sharghi.

Anirban Laha and Vikas C. Raykar, IBM Research – India.

Presentation transcript:

SoLSTiCe Similarity of locally structured data in computer vision Université-Jean Monnet (Saint-Etienne) LIRIS (Lyon) (1/02/ ) Elisa Fromont, Kick-off meeting, 14/02/2014

Présentation du consortium

Aim: design new models and tools for representing and managing images and videos Targeted applications: classification, recognition or indexing (in a context of occlusions and non rigid objects in 2D (+ t), 3D and 3D+t media) Proposal: explore locally structured data (LSD) = visual features + discrete structures to model local (spatio-temporal) relationships 3 main tasks: 1.[Extracting LSD from images and videos:] extract relevant visual features and structure them w.r.t. spatial and temporal relationships. 2.[Measuring the similarity of LSD:] design relevant similarity measures for comparing LSD, and efficient algorithms for computing these measures. 3.[Mining LSD:] characterize LSD by means of frequently (or infrequently) occurring patterns (itemsets, sequences or graphs) and use them to create discriminative features for solving computer vision tasks. Main ideas

The project : 4 tasks interconnected 1.[Task 0] will be dedicated to the project management; 2.[Task 1] will design LSD for describing images and videos, and will design tools for extracting these LSD; 3.[Task 2] will design kernels, similarity measures and matching algorithms for comparing LSD; 4.[Task 3] will design mining algorithms for extracting relevant patterns in LSD; 5.[Task 4] will be dedicated to the design and use of demo platforms to test (and demonstrate) on computer vision benchmarks and new datasets the models and tools designed in Tasks 1 to 3.

Livrables (1/2) Tâche 1 From images and Vidéos to LSD (LaHC) D1.1Research report describing new descriptors for images D1.2.2Survey of state-of-the-art approaches for structuring visual words by means of strings, trees or graphs D1.2.2Research report describing new LSD for images and 3D objects D1.3Research report on extensions of LSD of subtask 2.2 for videos, and evaluation Tâche 2: Mesuring the similarity of LSD (LIRIS) D 2.1.1Research report describing new matching algorithms D2.1.2Design of an open-source library of graph matching algorithms D2.2Research report on new kernel for combinatorial maps D2.3Research report on metric learning or deep learning on locally structured data

Livrables (2/2) Tâche 3: Mining LSD D3.1.1Research report on mining LSD in images and videos D3.1.2Research report on new algorithms to mine LSD in images and/or videos D3.2Research report on using frequent substructures to find relevant features for image classification D3.3.1Research report on mining approximate patterns in plane graph D3.3.2Research report on finding relevant spatio temporal patterns in videos Tâche 4: Demonstrations in computer vision D4.1.1Creation of the Solstice platform D4.1.2Activity recognition module for software platform LIRIS-VISION Tracking D4.1.3Demo in the Solstice platform D4.1.4Activity recognition module for robotics platform LIRIS-VOIR D4.2.1Object recognition module for software platform LIRIS-VISION D4.2.2Object recognition demo for the Solstice platform D4.2.3Object recognition module for robotics platform LIRIS-VOIR

Planning

Valorisation/Impact scientific communications submitted to major conferences and journals (CVPR, ECCV, ICCV, ICPR, AVSS, KDD, ICML, ECML, PKDD, ICPR, etc.) and journals (IEEE-T-PAMI, PR, IJCV, CVIU, MLJ, JMLR, etc.) in image processing, pattern recognition, combinatorial optimization, machine learning, and data mining. open source platforms developed in task 4 (and task 2) workshops co-located with major conferences in order to share ongoing research. design educational and recreational demos targeting a non specialist public to be presented during popular events such as “la fête de la science”.

Use of resources LaHC ( euros): – Staff ( euros) Ph.D Student: 36 months on « New matching strategies for data mining applied to computer vision problems » (tasks 2 and and 4) co-supervised with liris – Travels – Other expenses: master thesis grants + hardware LIRIS ( euros) – Staff ( euros): Ph.D Student: 36 months on « Analysis of complex scenes with structured models » (tasks 1 and 2 + 3) co-supervised with LaHC – Travels – Other expenses: master thesis grants + hardware

Points to discuss Website (Jean Monnet) Include some more members (Taygun, Romain?) How to spend the money for the second thesis (Remi, Marc, Damien ?) Demos? Next meetings