Mor Naaman, Yee Jiun Song, Andreas Paepcke, Hector Garcia-Molina Digital Library Project, Database Group Stanford University Automatic Organization for.

Slides:



Advertisements
Similar presentations
Visualizing maps on the web. What is a Map? A map is a drawing that is the representation, on a certain scale, of a terrain.
Advertisements

Spelling Correction for Search Engine Queries Bruno Martins, Mario J. Silva In Proceedings of EsTAL-04, España for Natural Language Processing Presenter:
Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.
A Unified Framework for Context Assisted Face Clustering
Hybrid Context Inconsistency Resolution for Context-aware Services
Imbalanced data David Kauchak CS 451 – Fall 2013.
Sequence Clustering and Labeling for Unsupervised Query Intent Discovery Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: WSDM’12 Date: 1 November,
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
Chapter 18 Sampling Distribution Models
Business Process Modeling in Microsoft Visio® Interfacing’s BPMN Modeler: Overview.
Good and Bad Locality Descriptions Elements and Examples.
Return to Outline Copyright © 2009 by Maribeth H. Price 6-1 Chapter 6 Spatial Joins.
1 The SF Muni Map Project Maggie Law & Kaichi Sung SIMS 2003 Masters Project.
NCKU CSIE Visualization & Layout for Image Libraries Baback Moghaddam, Qi Tian IEEE Int’l Conf. on CVPR 2001 Speaker: 蘇琬婷.
1 CS 430 / INFO 430 Information Retrieval Lecture 15 Usability 3.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
1 CS 502: Computing Methods for Digital Libraries Lecture 20 Multimedia digital libraries.
ISP 433/633 Week 5 Multimedia IR. Goals –Increase access to media content –Decrease effort in media handling and reuse –Improve usefulness of media content.
Browsing Personal Images Using Episodic Memory Chufeng Chen School of Computing and Technology, University of Sunderland
Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling Shanshan Song, Ricky Kwok, and Kai Hwang University of Southern.
Generating Summaries and Visualization for Large Collections of Geo-referenced Photographs Alexander Jaffe*, Mor Naaman*, Tamir Tassa †, Marc Davis $ *Yahoo!
Lost in Memories Interacting With Large Photo Collections on PDAs Susumu Harada, Mor Naaman*, Yee Jiun Song, QianYing Wang, Andreas Paepcke Digital Library.
PeopleTones: a system for the detection and notification of buddy proximity on mobile phones Kevin A. Li Timothy Sohn Steven Huang William G. Griswold.
ZoneTag: Putting Your Photos in Context Mor Naaman Yahoo! Research Berkeley.
Projects in the Intelligent User Interfaces Group Frank Shipman Associate Director, Center for the Study of Digital Libraries.
Google Account Basics: Getting Started with free Google applications.
Prototyping and Designing New Assistive Technologies for People with Disabilities Shaun Kane Human-Centered UMBC
Systems Analysis and Design in a Changing World, 6th Edition
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
Universidade de São Paulo – School of Engineering Computing and Digital Systems Engineering Dept. Agricultural Automation Laboratory (LAA) BIODIVERSITY.
Georeferencing with Paper Maps  Map Basics  How to Georeference with Paper Maps  Latitude and Longitude Mathematically  Error Calculator.
Unsupervised Learning. CS583, Bing Liu, UIC 2 Supervised learning vs. unsupervised learning Supervised learning: discover patterns in the data that relate.
GPS usage and GeoReferencing A crash course to the eTrex Venture and Georeferencing your field data.
NR 422: Topology Jim Graham Fall 2010 See: odatabase-topology.pdf.
Electronic Visualization Laboratory, University of Illinois at Chicago PAVIS Pervasive Adaptive Visualization and Interaction Service Javid Alimohideen.
Exploring Map Layers in Google Earth Georeferencing Images.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Moving from Web-based Collaboration to the Mobile Arena Nimrod Geva Product Group Manager, KWizCom
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
Interactive Discovery and Semantic Labeling of Patterns in Spatial Data Thomas Funkhouser, Adam Finkelstein, David Blei, and Christiane Fellbaum Princeton.
Review of the paper: The Ubiquitous Camera: An In-Depth Study of Camera Phone Use Authors: Tim Kindberg, Mirjana Spasojevic, Rowanne Fleck, Abigail Sellen.
© 2000 OCLC Online Computer Library Center, Incorporated Dawn Lawson Manager, Electronic Products OCLC Forest Press Introduction to WebDewey.
Automated Construction of Parameterized Motions Lucas Kovar Michael Gleicher University of Wisconsin-Madison.
Copyright © 2006 by Maribeth H. Price 2-1 Chapter 2 Working with ArcMap.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
Tracker data quality monitoring based on event display M.S. Mennea – G. Zito University & INFN Bari - Italy.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
1 Shape Segmentation and Applications in Sensor Networks Xianjin Xhu, Rik Sarkar, Jie Gao Department of CS, Stony Brook University INFOCOM 2007.
11.2 Apply Proportion to Scale Models SWBAT solve problems involving scale drawings.
Map Projections. What is a Map Projection? A mathematical expression used to represent the 3D surface of the earth on a 2D map Always results in distortions.
GeoNames is … Gazetteer aggregator of open geo data I am... Marc Wick GeoNames.
1 CMPT 275 High Level Design Phase Modularization.
Example with shaded urban area: Davis, California  Linear extent- Defined as the distance from the geographic center of the location to the furthest point.
Co-funded by the European Union WeKnowIt Emerging, Collective Intelligence for personal, organisational and social use Event Detection.
Topology Relationships between features: Supposed to prevent:
Clustering Hongfei Yan School of EECS, Peking University 7/8/2009 Refer to Aaron Kimball’s slides.
Fast SLAM Simultaneous Localization And Mapping using Particle Filter A geometric approach (as opposed to discretization approach)‏ Subhrajit Bhattacharya.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
ParkNet: Drive-by Sensing of Road-Side Parking Statistics Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin,
The Geographer’s Tools (Chapter 1, Lesson 1, pp.11-25) (Chapter 1, Lesson 1, pp.11-25)
Building a Transportation Database WMATA Rail Station Mapping Project 3/18/2014.
IIIT HYDERABAD Techniques for Organization and Visualization of Community Photo Collections Kumar Srijan Faculty Advisor : Dr. C.V. Jawahar.
CYBER-GIS FOR SCIENTIFIC DISCOVERIES. Global Forest Change Hansen, M. C. et al (2013). High-Resolution Global Maps of 21st-Century Forest Cover Change.
Modeling Perspective Effects in Photographic Composition Zihan Zhou, Siqiong He, Jia Li, and James Z. Wang The Pennsylvania State University.
Chapter 6 Spatial Joins.
Connecting Interface Metaphors to Support Creation of Path-based Collections Unmil P. Karadkar, Andruid Kerne, Richard Furuta, Luis Francisco-Revilla,
Physical Structure of GDB
Professor John Canny Spring 2003
Dingding Liu* Yingen Xiong† Linda Shapiro* Kari Pulli†
CSc4730/6730 Scientific Visualization
Presentation transcript:

Mor Naaman, Yee Jiun Song, Andreas Paepcke, Hector Garcia-Molina Digital Library Project, Database Group Stanford University Automatic Organization for Digital Photographs with Geographic Coordinates

JCDL Geo-Referenced Photos April 8 th, :20:02pm Latitude: N Longitude: W

JCDL Geo-Photography Technology + 1)2)

JCDL Personal Photo Libraries Searching/browsing very difficult Little discernible structure to photo collections

JCDL Content-based retrieval –Basic, primitive (far from semantic) Manual labeling –Improved, yet cumbersome Visual methods for fast scanning (Zoom) –Don’t scale well Managing Personal Photos

JCDL Our Approach Absolutely no human effort required Utilize time and location –Automatically captured –Easy to get

JCDL Automatic Organization

JCDL Automatic Organization

JCDL Automatic Organization

JCDL Automatic Organization

JCDL Outline Requirements and challenges The algorithms Sample output Experiment results

JCDL Browsing by Location/Time Use a map/calendar –wwmx.org from MSR: Map issues –Lots of screen space –Sparse –Limited interaction? –Not intuitive for some

Using Hierarchies Time United States Yosemite N.P, Yosemite Valley, CA Location: Around: San Francisco, Berkeley, Sonoma CA San Francisco, Golden Gate Park, CA Seattle, WA … …… … Berkeley, Oakland CA : Yosemite N.P. (2 Days) : San Francisco (1 hour) Time:

JCDL Challenges Locations should be intuitive Events are tricky –3-days trip to NYC –The kid’s soccer game, followed by a birthday party Good names are important.

JCDL Outline Requirements and challenges The algorithms Sample output Experiment results

JCDL Process Diagram

JCDL Discovering Structure Location Hierarchy Initial Event Segmentation Location Clustering Final Event Segmentation Event Hierarchy Initial Event Segmentation Automatic Organization

JCDL Initial Event Segmentation Photos occur in bursts Identify bursts: semantically “connected”

JCDL Initial Event Segmentation Stream of photos More details: Graham et al, JCDL 2002 Tomorrow Proceedings

JCDL Discovering Structure Location Hierarchy Initial Event Segmentation Final Event Segmentation Event Hierarchy Location Clustering Automatic Organization

JCDL Location Clusters Cluster the bursts into locations A. Gionis and H. Mannila. Finding recurrent sources in sequences. In Proceedings, Computational molecular biology –Minimize: number of clusters –Minimize: error (distance to cluster centers)

Photo location Location Clusters: 2-D View

2-D View: with Bursts

JCDL Location Clusters Location 4 - Location 3 - Location 2 - Location 1 -

Location 4 - Location 3 - Location 2 - Location 1 - Location Clusters (breakdown) Some clusters may be overloaded: –Many bursts / picture-taking days in one location San Francisco

JCDL Discovering Structure Location Hierarchy Initial Event Segmentation Location Clustering Event Hierarchy Final Event Segmentation Automatic Organization

JCDL Final Event Segmentation Again scan sequence, new events detected: –Whenever location context changes –In the same location, use adaptive time threshold

JCDL 2004 Final Event Segmentation Overnight trip to Yosemite Soccer game and dinner

JCDL Next - names Detected location and event structure Need to choose names for each node

30 Assigning Names Photo location Stanford Palo Alto City Park Palo Alto Butano State Park Stanford42 Palo Alto30 Butano10 P.A. park8

31 Assigning Names – Nearby? San Jose, 20 miles San Francisco, 30 miles What if photos occur sparsely within cities or parks?

JCDL 2004 Assigning Names - Nearby Which city has stronger “gravity”?

JCDL 2004 Assigning Names - Nearby San Jose is Closer

JCDL 2004 Assigning Names - Nearby San Jose is bigger* *larger population

JCDL 2004 Assigning Names - Nearby But San Fran is more important!* *greater Google count Final name for location cluster: “Stanford, 30 miles South of SF”

JCDL Assigning Names - Alexandria Using polygon-based dataset of administrative areas Alexandria gazetteer can be used for other prominent geographic features

JCDL Outline The requirement and challenges of automatic organization The algorithms Sample output Experiment results

JCDL Location Hierarchy Photoshop Album (at least 4 man-hours) Our system (about 0 man- seconds)

39 Location Hierarchy (US) +San Francisco, Berkeley, Sonoma, CA -Stanford, Mountain View, Monterey, CA Monterey (58 miles S of San Jose) Mountain View (4 miles NW of San Jose) Stanford -Colorado (219 miles W of Denver) -Long Beach (35 miles S of Los Angeles, CA) -Philadelphia, PA -Seattle, WA -Sequoia N.P. (153 miles E of Fresno, CA) -South lake Tahoe; Bear Valley, CA -Yosemite N.P.; Yosemite Valley, CA

Events about 0 man- seconds: : Long Beach,CA (3 days) : San Francisco,CA (3 hours) : Colorado (3 days) : San Francisco,CA(1 hours) : Mountain View,CA (5 hours) : San Francisco,CA (1 hours) : Philadelphia,PA (1 hours) : Sequoia NP (3 days)... Photoshop Album (at least 4 man-hours)

JCDL Event Names LOCALE: share automatically Check personal calendar Event Gazetteer Easy interface

JCDL Experiment Tested on 3 real-world geo-referenced photo collections Our system automatically generated the structure and names Tested with the owners

JCDL Experiment - Locations Accepted the automatic hierarchy Only minor edits requested –Merge/split few of the locations

JCDL Experiment - Events Compared to events as annotated by users 80-85% in both recall and precision Other metrics proposed (see paper)

JCDL Experiment - Naming Naming location clusters –For 76% of clusters, system and users pick at least one name in common –For the rest, “automatic” name was useful

Not yet published: Paid 13 participants to “geo-reference” their photos Loaded to WWMX and our browser –Most liked the map better, but… –Performed the same for search/browse tasks –Event notion helps overcome location handicap –Organization “made sense” P.S. Some didn’t touch the map, yet used our location hierarchy. P.S.2 This was on a BIG screen!

JCDL Thank You! More details: Proceedings Google: Mor Naaman

JCDL Future Work User interface PDA Integrate with map Global photo libraries

JCDL Enhancing Personal Collections Browse/search for photos by location Detect events

JCDL Our System: PhotoCompas Location hierarchy Event hierarchy Our algorithm: –Creates them simultaneously Such that they inform each other –Assigns geo-names using gazetteers and Google

48 != hours in Seattle: One trip. 48 hours in San Francisco: Countless events. Monday 7am Monday 4pm Tuesday 8am Tuesday 1pm Monday 11am Monday 1pm Monday 7pm …

JCDL Why not use simple location hierarchy? What’s wrong with country state city? States division is usually perceived by users (in US) but not always City list likely to be too long Many countries do not have “states”

JCDL Initial Event Segmentation

JCDL Remember The Bursts?

JCDL Location Clusters

JCDL Location Clusters