ELPUB 2010, Helsinki, Finland1 A Collaborative Faceted Categorization System – User Interactions Kurt Maly; Harris Wu; Mohammad Zubair ; Contact:

Slides:



Advertisements
Similar presentations
The Biosafety Clearing-House of the Cartagena Protocol on Biosafety Tutorial – BCH Resources.
Advertisements

© 2011 Delmar, Cengage Learning Chapter 1 Getting Started with Dreamweaver.
Visual Scripting of XML
Beyond “The System Shall...” A Journey from Good to Great Requirements.
Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.
Using the Web-based Training Tool MyFloridaMarketPlace Revised Date: 12/14/06.
Comprehensive Continuous Improvement Plan(CCIP) Training Module 3 Planning Tool, Part 2.
Bring Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Date: 2011/11/21 Source: Claudiu S. Firan (CIKM’10)
The Complex Dynamics of Collaborative Tagging Harry Halpin University of Edinburgh Valentin Robu CWI, Netherlands Hana Shepherd Princeton University WWW.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement: Relevance Feedback Information Filtering.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Mobile Web Search Personalization Kapil Goenka. Outline Introduction & Background Methodology Evaluation Future Work Conclusion.
The Act Tab (Project). Please select a button to learn more. Welcome to the Project Act Tab. Here you can Add Team Members, provide and receive Feedback,
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
Presented by Zeehasham Rasheed
I n t e g r i t y - S e r v i c e - E x c e l l e n c e Business & Enterprise Systems Introduction to Hewlett Packard (HP) Application Lifecycle Management.
| Computer Science Department | Ubiquitous Knowledge Processing Lab | © Prof. Dr. Iryna Gurevych | 1 del.icio.us Knowledge Management in Web.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Delivering Knowledge for Health Edit mode to enable administrators to : Add widgets Add pages Set page format Publish to the website.
A Product of Enterprise Content Management System (CMS) Web & Portal Content Management Systems for faster web publishing Copyright.
LGU Document Management Solution. What is it? A Web-based Centralized Document Management Solution to keep track of digital documents Instantly search.
NAHAM Guide to Up-front Collections Welcome to the NAHAM Guide to Up-front Collections Demo CD! Please remember that this is only a demo of the Guide.
Databases and LINQ Visual Basic 2010 How to Program 1.
Query Relevance Feedback and Ontologies How to Make Queries Better.
Query Expansion.
Network and Systems Security By, Vigya Sharma (2011MCS2564) FaisalAlam(2011MCS2608) DETECTING SPAMMERS ON SOCIAL NETWORKS.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Social Curation of Large Multimedia Collections on a Microsoft Azure Cloud Dazhi Chong, Samuel Coppage, Xiangyi Gu, Harris Wu Kurt Maly, Mohammad Zubair.
Service Computation 2010November 21-26, Lisbon.
WORDPRESS TECHNOLOGY BY AMEER. WELCOME INTRODUCTION WordPress is an Open Source software system used by millions of people around the world to create.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Moodle (Course Management Systems). Blogs In this Lecture, we’ll cover how to use blogs, blog capablilities and efficive blog practices.
Features and Algorithms Paper by: XIAOGUANG QI and BRIAN D. DAVISON Presentation by: Jason Bender.
Using the Web-Based Training Tool MyFloridaMarketPlace.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
A STUDENT GUIDE TO USING NAVIGATE NAVIGATE STUDENT GUIDE.
With Windows 7 Introductory© 2011 Pearson Education, Inc. Publishing as Prentice Hall1 Windows 7 Introductory Chapter 3 Advanced File Management and Advanced.
6/2/20161 Database Systems Lecture # 3 By: Asma Ahmad Jan 21 st, 2011.
This eCPIC Quick Guide has been developed to assist System Administrators with creating Hierarchy Grids in eCPIC. The Hierarchy Grid functionality allows.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
CMPS 435 F08 These slides are designed to accompany Web Engineering: A Practitioner’s Approach (McGraw-Hill 2008) by Roger Pressman and David Lowe, copyright.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
Evaluating & Maintaining a Site Domain 6. Conduct Technical Tests Dreamweaver provides many tools to assist in finalizing and testing your website for.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Of 49 lecture 18: tagging and folksonomy. of 49 ece 627, winter ‘132 metadata is: “structured information that describes, explains, locates, or otherwise.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Hybrid Content and Tag-based Profiles for recommendation in Collaborative Tagging Systems Latin American Web Conference IEEE Computer Society, 2008 Presenter:
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Automatic Categorization of Patent Applications Presentation to the 3rd IPC Workshop, WIPO, Feb , The need for automatic categorization of.
ONBASE: HOW TO UPLOAD DOCUMENTATION EFFICIENTLY April 2, 2015.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Conceptual Overview For Understanding the New Paradigm Provided by: Web Services Section.
Profiling: What is it? Notes and reflections on profiling and how it could be used in process mining.
© 2017 by McGraw-Hill Education. This proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any manner.
WP4 Models and Contents Quality Assessment
About SharePoint Server 2007 My Sites
The IPT user interface and data quality tools
Adding Post Type Archive in WordPress Navigation Menus Guided By: wpglobalsupportwpglobalsupport.
Detecting Online Commercial Intention (OCI)
Lecture 1 File Systems and Databases.
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
The Product Service Code Selection Tool: A Quick Guide
Presentation transcript:

ELPUB 2010, Helsinki, Finland1 A Collaborative Faceted Categorization System – User Interactions Kurt Maly; Harris Wu; Mohammad Zubair ; Contact:

ELPUB 2010, Helsinki, Finland2 Outline Introduction –What is the problem we are addressing? –What is the approach we are taking? Facet schema evolution – user interactions –Schema enrichment –Anomaly detection –Visual schema presentation rearrangement –User feedback on classifications Conclusions –Future improvements

ELPUB 2010, Helsinki, Finland3 Introduction - Problem Problem –Navigating a large growing collection of digital objects, particularly non textual collection such as images and photographs Possible Approaches –Categorize and classify the collection manually using human experts Centralized, expensive, single perspective Static, rigid structure; does not evolve –Use of social tagging systems such as flickr.com Low precision and recall, lack of structure in tags, ambiguity and noise in tags

ELPUB 2010, Helsinki, Finland4 Introduction - Approach We built a system that improves access to a large, growing collection by supporting users to build a faceted classification collaboratively –Challenge: continuously classify new objects, modify the facet schema, and reclassify existing objects into the modified facet schema –Collaborative Approach: Enable users to collaboratively build a schema with facets and categories, and to classify documents into this schema Needs automated system support to create critical mass and make it easier for users to collaborate

The system front page ELPUB 2010, Helsinki, Finland5

Facet and Category Enrichment Statistical co-occurrence model –Subsumption parent-child relationship between x and y if all documents tagged with y are also tagged with x for an existing tagword t –identifies all documents with tag t –If these documents have a common category c, the rule of subsumption implies that t is a possible subcategory of c Example CategorySuggested sub-category American Civil Warmilitary life Chinaboxer rebellion ELPUB 2010, Helsinki, Finland6

Schema Cleansing Problem: –categories are created under the wrong facet –child categories might represent a broader concept than the parent category Solution: –Use WordNet’s hierarchical relationships among words to detect anomalies ELPUB 2010, Helsinki, Finland7

Schema Cleansing Hierarchy in WordNet (hyponymy: known as “is a” relationship) dog, domestic dog, Canis familiaris => canine, canid => carnivore => placental, placental mammal, eutherian, eutherian mammal => mammal => vertebrate, craniate => chordate => animal, animate being, beast, brute, creature, fauna =>... anomaly detection algorithm Category Parent Cat Grandparent Category Problem President Holiday Politics more closely related to grandparent than to parent ELPUB 2010, Helsinki, Finland8

Ordering of Schema Display Problem: –significant number of categories are created under a given facet (or another category) –large number of facets are created Solution: –limit number of child categories/facets displayed –configure administratively –order the display by a popularity measure ELPUB 2010, Helsinki, Finland9

Ordering of Schema Display Popularity (P) measure –favours the biggest, most used, and fastest growing facets and categories P = 0.5*f(PN*PC)+ 0.5*PR f – normalizing factor PN - total number of items in a category PR - growth rate of a category: number of new (recent) items for a unit of time PC - number of clicks on the category link in the browsing menu over a period of time ELPUB 2010, Helsinki, Finland10

Expanding category display using the “more…” link ELPUB 2010, Helsinki, Finland11

Limiting category display using the “more…” link ELPUB 2010, Helsinki, Finland12

Quality Assessment through User Feedback “thumb-up” and “thumb-down” buttons available for every association –vote up or down for the association between an image and a category based on how relevant and accurate they think it is Value of this explicit feedback determines when a classification can be deleted or, conversely, when it becomes “hard”, i.e., it is confirmed –Action will update the confidence value of an association by increasing or decreasing it by 0.05 based on whether a user believes it is a correct classification or not –confidence value reaches > association is hardened –confidence value falls below a threshold -> association is deleted ELPUB 2010, Helsinki, Finland13

Feedback on category associations ELPUB 2010, Helsinki, Finland14

ELPUB 2010, Helsinki, Finland15 Conclusions Schema enrichment, cleansing and ordering are effective tools to remedy problems introduced by collaborative schema evolution Future improvements include recording actual administrator actions for training purposes