TagSense: A Smartphone-based Approach to Automatic Image Tagging - Ujwal Manjunath.

Slides:



Advertisements
Similar presentations
Multi-hop wireless networks Fact or fiction? Injong Rhee Department of Computer Science North Carolina State University.
Advertisements

By Zheng Sun, Aveek Purohit, Shijia Pan, Frank Mokaya, Raja Bose, and Pei Zhang final38.pdf.
TouchDevelop Chapter 5-7 Chapter 5 Audio Chapter 6 Camera, Graphics and Video Chapter 7 Sensors Mengfei Ren.
Irek Defée Signal Processing for Multimodal Web Irek Defée Department of Signal Processing Tampere University of Technology W3C Web Technology Day.
Topic Space: a mobile augmented reality recommendation app Jim Hahn and Ben Ryckman.
1 “Did you see Bob?”: Human Localization using Mobile Phones Ionut Constandache Co-authors: Xuan Bao, Martin Azizyan, and Romit Roy Choudhury Modified.
Chapter 5 Input and Output. What Is Input? What is input? p. 166 Fig. 5-1 Next  Input device is any hardware component used to enter data or instructions.
Bryan Donyanavard Nik Sumikawa. Project Description Transfer data between two mobile phones via Bluetooth. A unique cell phone movement will establish.
Person Re-Identification Application for Android
SurroundSense Mobile Phone Localization via Ambience Fingerprinting Scott Seto CS 495/595 November 1, 2011
InSight: Recognizing Humans without Face Recognition He Wang, Xuan Bao, Romit Roy Choudhury, Srihari Nelakuditi.
Urban Sensing Jonathan Yang UCLA CS194 Fall 2007 Jonathan Yang UCLA CS194 Fall 2007.
1 Discussion Class 10 Informedia. 2 Discussion Classes Format: Question Ask a member of the class to answer. Provide opportunity for others to comment.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Slides modified and presented by Brandon Wilson.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Haptic: Image: Audio: Text: Landmark: YesNo YesNo YesNo YesNo YesNo Haptic technology, or haptics, is a tactile feedback technology that takes advantage.
[Context to Make You More Aware] Presentation [Adrienne Andrew, Yaw Anokwa, Karl Koscher, Jonathan Lester, Gaetano Borriello Department of Computer Science.
MULTIMEDIA SYSTEM.
Firefighter Indoor Navigation using Distributed SLAM (FINDS) Major Qualifying Project Matthew Zubiel Nick Long Advisers: Prof. Duckworth, Prof. Cyganski.
Research Area B Leif Kobbelt. Communication System Interface Research Area B 2.
Digital Photography Vocabulary
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Presented by: Z.G. Huang May 04, 2011 Did You See Bob? Human Localization using Mobile Phones Romit Roy Choudhury Duke University Durham, NC, USA Ionut.
Satellites in Our Pockets: An Object Positioning System using Smartphones Justin Manweiler, Puneet Jain, Romit Roy Choudhury TsungYun
MULTIMEDIA M U A T H H U M A I D R a s h A t a l l a h.
Camera types. Megapixel  Equal to one million pixels (or 1 MP).  Higher the MP = higher resolution = nicer looking picture.
July 25, 2010 SensorKDD Activity Recognition Using Cell Phone Accelerometers Jennifer Kwapisz, Gary Weiss, Samuel Moore Department of Computer &
Behavior analysis based on coordinates of body tags Mitja Luštrek, Boštjan Kaluža, Erik Dovgan, Bogdan Pogorelc, Matjaž Gams Jožef Stefan Institute, Department.
SurroundSense: Mobile Phone Localization via Ambience Fingerprinting Martin Azizyan, Ionut Constandache, Romit Roy Choudhury Mobicom 2009.
Introduction to Photography: The Exposure Triangle Ms. Whiteside * Circle High School Photo by V Whiteside.
Chapter 7 Structuring System Process Requirements
1 SurroundSense: Mobile Phone Localization via Ambience Fingerprinting Ionut Constandache Co-authors: Martin Azizyan and Romit Roy Choudhury.
TEMPLATE DESIGN © Detecting User Activities Using the Accelerometer on Android Smartphones Sauvik Das, Supervisor: Adrian.
Presented by: Marcela D. Rodríguez CICESE/UABC, Ensenada, México 1st International Workshop on Ubiquitous Mobile Instrumentation.
LoCaF: Detecting Real-World States with Lousy Wireless Cameras Benjamin Meyer, Richard Mietz, Kay Römer 1.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
Content Sharing over Smartphone-Based Delay- Tolerant Networks.
Research Activities at Florida State Vision Group Xiuwen Liu Florida State Vision Group Department of Computer Science Florida State University
Final Year Projects Prof. Joseph Ng. Mobile Application Development Sensor Programming Android Platform Preferred Applications on the following domains:
Submitted by: Giorgio Tabarani, Christian Galinski Supervised by: Amir Geva CIS and ISL Laboratory, Technion.
The Second Life of a Sensor: Integrating Real-World Experience in Virtual Worlds using Mobile Phones Mirco Musolesi, Emiliano Miluzzo, Nicholas D. Lane,
Beyond the PC Kiosks & Handhelds Albert Huang Larry Rudolph Oxygen Research Group MIT CSAIL.
James Pittman February 9, 2011 EEL 6788 MoVi: Mobile Phone based Video Highlights via Collaborative Sensing Xuan Bao Department of ECE Duke University.
TEMPLATE DESIGN © E-Eye : A Multi Media Based Unauthorized Object Identification and Tracking System Tolgahan Cakaloglu.
Microsoft Innovative Teacher Awards READ THIS CAREFULLY The following slides provide you with guidelines for the content of your Innovative Teacher Awards.
+ Dancing Lights Areej and Aljoharah. + Contents Introduction Objective Approach Progress Implementation Future Works.
1 SurroundSense: Mobile Phone Localization via Ambience Fingerprinting.
Contents Introduction What are Location-based services Working of Location-based services Location Tracking Technologies Power profiling a mobile phone.
MACHINE VISION GROUP MOBILE FEATURE-CLOUD PANORAMA CONSTRUCTION FOR IMAGE RECOGNITION APPLICATIONS Miguel Bordallo, Jari Hannuksela, Olli silvén Machine.
Sensors For Mobile Phones  Ambient Light Sensor  Proximity Sensor  GPS Receiver Sensor  Gyroscope Sensor  Barometer Sensor  Accelerometer Sensor.
Eric Minner & James Pittman. Outline Project Statement / Motivation Concept overview Quick computer vision overview Demo Lessons Learned Future Work.
Pocket, Bag, Hand, etc. - Automatically Detecting Phone Context through Discovery Emiliano Miluzzoy, Michela Papandreax, Nicholas D. Laney, Hong Luy, Andrew.
Microsoft Innovative Teacher Awards READ THIS CAREFULLY The following slides provide you with guidelines for the content of your Innovative Teacher Awards.
1 Indoor Semantic Localization (SurroundSense). Many emerging location based apps do not care about the physical location Instead, they need the user’s.
1 Part A Multimedia Production Chapter 2 Multimedia Basics Digitization, Coding-decoding and Compression Information and Communication Technology.
QSREALM.BLOGSPOT.COM Input Output Devices. QSREALM.BLOGSPOT.COM Input – Output Devices Also known as Peripheral Devices. These surround a computer’s CPU.
What Do Sensors Do in a Smartphone? For Seniors Zarrin Begum.
Accelerometer based motion gestures for mobile devices Presented by – Neel Parikh Advisor Committee members Dr. Chris Pollett Dr. Robert Chun Dr. Mark.
Lens Gestures: Integrating Compound Gesture Inputs for Shortcut Activation.
CHAPTER 8 Sensors and Camera. Chapter objectives: Understand Motion Sensors, Environmental Sensors and Positional Sensors Learn how to acquire measurement.
Chapter 8 Sensors and Camera. Figure 08.01: The Accelerometer can gauge the orientation of a stationary device.
Introduction to Mobile-Cloud Computing. What is Mobile Cloud Computing? an infrastructure where both the data storage and processing happen outside of.
Automatic License Plate Recognition for Electronic Payment system Chiu Wing Cheung d.
Map for Easy Paths GIANLUCA BARDARO
Video-based human motion recognition using 3D mocap data
TagSense: A Smartphone-based Approach to Automatic Image Tagging
Activity 1.6: Different Ways to See The World
Xin Qi, Matthew Keally, Gang Zhou, Yantao Li, Zhen Ren
Presentation transcript:

TagSense: A Smartphone-based Approach to Automatic Image Tagging - Ujwal Manjunath

Overview Introduction Scope System Overview Design and Implementation Performance Evaluation Limitations Future of TagSense

Introduction sensor-assisted tagging. tags are systematically organized into a “when-where-who- what” format. better than image processing/face recognition??? Challenges faced? Goals

Introduction Envisioning an alternative, out-of-band opportunity towards automatic image tagging. Designing TagSense, an architecture for coordinating the mobile phone sensors, and processing the sensed information to tag images. Implementing and evaluating TagSense on Android phones.

Picture 1: November 21st afternoon, Nasher Museum, in-door, Romit, Sushma, Naveen, Souvik, Justin, Vijay,Xuan, standing, talking. Picture 2: December 4th afternoon, Hudson Hall, out-door, Xuan, standing, snowing.

Picture 3: November 21st noon, Duke Wilson Gym, indoor,Chuan, Romit, playing, music. Tags extracted using Location services, light- sensor readings, accelerometers and sound. TagSense tags each picture with the time, location, individual-name, and basic activity.

Scope of TagSense TagSense requires the content in the pictures to have an electronic footprint that can be captured over at least one of the sensing dimensions. Images of objects (e.g., bicycles, furniture, paintings), of animals, or of people without phones, cannot be recognized. TagSense narrows down the focus to identifying the individuals in a picture, and their basic activities.

System Overview TagSense architecture – the camera phone triggers sensing in participating mobile phones and gathers the sensed information. It then determines who is in the picture and tags the picture with the people and the context.

SYSTEM OVERVIEW the application prompts the user for a session password. password acts as a shared session key. Phone to phone communication is performed using the WiFi ad hoc mode. phones perform basic activity recognition on the sensed information, and send them back.

Design And Implementation Who are in the picture? Accelerometer based motion signatures Figure 3: The variance of accelerometer readings from phones of (a) those in the picture and (b) those outside the picture. Posing signature is evident in (a) and absent in (b).

Design And Implementation Complementary Compass Directions Figure 4: (a) Personal Compass Offset (PCO) (b) PCO distribution from 50 pictures where subjects are facing the camera. PCO calibration is necessary to detect people in a picture using compass.

Design And Implementation Moving Subjects Figure 5: Extracting motion vectors of people from two successive snapshots in (a) and (b): (c) The optical flow field showing the velocity of each pixel; (d) The corresponding color graph; (e) The result of edge detection; (f) The motion vectors for the two detected moving objects.

Design And Implementation WHAT are they doing? Accelerometer: Standing, Sitting, Walking, Jumping, Biking, Playing. Acoustic: Talking, Music, Silence. WHERE is the picture taken? TagSense utilizes the light sensor on the camera phone. WHEN is the picture taken? Tagging the picture with current time. TagSense adds to this by contacting an Internet weather service and fetching the weather conditions.

PERFORMANCE EVALUATION

Overall Performance Figure 10: The overall precision of TagSense is not as high as iPhoto and Picasa, but its recall is much better, while their fall-out is comparable

LIMITATIONS OF TAGSENSE TagSense vocabulary of tags is quite limited. TagSense does not generate captions. TagSense cannot tag pictures taken in the past. TagSense requires users to input a group password at the beginning of a photo session.

FUTURE OF TAGSENSE Smartphones are becoming context-aware with personal sensing. The granularity of localization will approach a foot. Smartphones are replacing point and shoot cameras.

Conclusion Mobile phones are becoming inseparable from humans and are replacing traditional cameras. TagSense leverages this trend to automatically tag pictures with people and their activities. TagSense has somewhat lower precision and comparable fall- out but significantly higher recall than iPhoto/Picasa.