PROGRESS REPORT, NOVEMBER 9, 2009 TOM SCHIMOLER Applications of NLP in determining Tag Redundancy in Folksonomies.

Slides:



Advertisements
Similar presentations
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Advertisements

Music Encoding Initiative (MEI) DTD and the OCVE
Semantic Web Thanks to folks at LAIT lab Sources include :
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Rock Music Work is made by: Olga Prudnikova Elena Anisimova Maria Solopova Teacher: E.A. Ermachenkova.
TOM SCHIMOLER 11/23/2009 CSC 549 A Semantic Approach to Tag Redundancy in Folksonomies Moving Forward.
A Music Search Engine Built upon Audio-based and Web-based Similarity Measures P. Knees, T., Pohle, M. Schedl, G. Widmer SIGIR 2007.
Blues Blues is a vocal-instrumental form of music which has origin in African American communities in southern U.S. Solo voice was later accompanied by.
Tagging Systems Austin Wester. Tags A keywords linked to a resource (image, video, web page, blog, etc) by users without using a controlled vocabulary.
Tagging Systems Mustafa Kilavuz. Tags A tag is a keyword added to an internet resource (web page, image, video) by users without relying on a controlled.
INTERNATIONAL INSTITUTE FOR GEO-INFORMATION SCIENCE AND EARTH OBSERVATION Conceptualization of Place via Spatial Clustering and Co- occurrence Analysis.
Subcultures English Presentation.
Overall Information Extraction vs. Annotating the Data Conference proceedings by O. Etzioni, Washington U, Seattle; S. Handschuh, Uni Krlsruhe.
Qualitative Data Analysis Systems
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
A New Web Semantic Annotator Enabling A Machine Understandable Web BYU Spring Research Conference 2005 Yihong Ding Sponsored by NSF.
IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.
PRAGMATICS. 3- Pragmatics is the study of how more gets communicated than is said. It explores how a great deal of what is unsaid is recognized. 4.
Teaching with Depth An Understanding of Webb’s Depth of Knowledge
Muse Презентацию подготовила Ученица 8а класса Гимназии №2 Соболева Надежда.
POTENTIAL RELATIONSHIP DISCOVERY IN TAG-AWARE MUSIC STYLE CLUSTERING AND ARTIST SOCIAL NETWORKS Music style analysis such as music classification and clustering.
Rock music is a genre of Pop music (popular music) which has its roots in 1940’s and 1950’s, being heavily influenced by Rhythm and Blues (R&B). Rock.
1 Folksonomy-Based Collabulary Learning Leandro Balby Marinho, Krisztian Buza, Lars Schmidt-Thieme
Key Stage 1 SATs Parent Information Meeting. The National Curriculum All maintained schools must follow the National Curriculum by law. It consists of.
Electronically Querying for the Provenance of Entities Simon Miles Provenance-Aware Service-Oriented Architectures.
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Sharing of Community Practice through Semantics: A Case Study in Academic.
Intégration Sémantique de l'Information par des Communautés d'Intelligence en Ligne ISICIL.
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. Collaborative Building of Controlled Vocabularies Crosswalks Mateusz.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Survey Of Music Information Needs, Uses, and Seeking Behaviors Jin Ha Lee J. Stephen Downie Graduate School of Library and Information Science University.
Knowledge based Learning Experience Management on the Semantic Web Feng (Barry) TAO, Hugh Davis Learning Society Lab University of Southampton.
Artificial intelligence project
Creating Collaborative Partnerships
Methods for the Automatic Construction of Topic Maps Eric Freese, Senior Consultant ISOGEN International.
JAZZ MUSICIAN. Miles Davis was into a lot of different genres.. Jazz, hard bop, bebop, cool Jazz, Modal, fusion…..
 Rock music is a genre of music that entered the mainstream in the 1960s however it also has its roots from the 1940s and 1950s rock and roll, rhythm.
Metadata in the Cloud Stephen White President, Gracenote.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
1990s Indie, Electronica & New age, Manufactured pop & Dance.
SEMANTICS VS PRAGMATICS Semantics is the study of the relationships between linguistic forms and entities in the world; that is how words literally connect.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Our music.
Genre theory. Andrew Goodwin (1992)- generic conventions of the music video Andrew Goodwin (1992) writing in ‘Dancing in the Distraction Factory’ argued.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Digital Library Project Plan Greg Ferguson LIU LIS 654 October 25, 2011.
Presented By- Shahina Ferdous, Student ID – , Spring 2010.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Research Methodology Class.   Your report must contains,  Abstract  Chapter 1 - Introduction  Chapter 2 - Literature Review  Chapter 3 - System.
MUSIC MAKES OUR LIFE BRIGHTER AND HAPPIER From the history of music styles.
Inferring Declarative Requirements Specification from Operational Scenarios IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, VOL. 24, NO. 12, DECEMBER, 1998.
Radio Station Formats. Music Formats What a radio station's music format sounds like is governed by four parameters: music style music style music time.
CCT 333: Imagining the Audience in a Wired World Class 9: Scenarios and Requirements.
Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.
Hosted by: Jordyne Hutson, Heather Tackett And Rob Fernandez.
Theory and Research Chapter 2. Concepts, Variables and Hypotheses Concepts W ords or signs that refer to phenomena that share common characteristics.
ALTERNATIVE / INDIE ROCK MAGAZINE Indie rock is a sub-genre of alternative rock that originated in the united kingdom and the united states in the 1980’s.
Building Trustworthy Semantic Webs
Social Knowledge Mining
Julius Information Extractor
Introduction to Semantic Metadata & Semantic Web
Esteban Guerrero David Vásquez Ponce Fernando Torres Adrián Salazar
The ultimate in data organization
Presentation transcript:

PROGRESS REPORT, NOVEMBER 9, 2009 TOM SCHIMOLER Applications of NLP in determining Tag Redundancy in Folksonomies

Big Question: What is redundancy?  Although I have previously demonstrated examples of redundancy in tag clouds, there must be a formal, measurable way of expressing redundancy.

A Relational Model of Folksonomies Folksonomies are comprised of 3 entity-types in a ternary relationship:  Users: generate annotation content (Subject)  Resources: items of interest (Object)  Tags: semantic “glue” tying users to resources (Predicate) Aside from the basic annotation relation (u, r, t), we can define a number of relations which impart deeper information

General Tagging Relations tag-tag: we can define 3 notions of “co-occurrence”  Annotation-level: the tags have been used by the same person on the same resource  User-level: the tags have been used by the same person for difference resources  Resource-level: the tags have been used by different people for the same resource resource-resource: analogous to the above, we can also define 3 “co-occurrence” relations for resources These relations are directly observable and do not impart explicit semantic information

Non-domain specific Semantic Relations A basic assumption of folksonomy research is that the explicit tagging relations imply deeper semantic relations tag-tag:  alternate spelling: (“rock and roll”, “rock ‘n’ roll”)  alias: (“nlp”, “natural language processing”)  sympathetic: (“awesome”, “cool”)  antithetic: (“cool”, “sucks”)

Semantic relations in the Music Domain Within Last.fm are semantic relations which are specific to the music domain tag-tag:  sub-genre: (“heavy metal”, “death metal”) resource-tag:  genre: (The Pixies, “indie rock”)  location: (The Pixies, “boston”)  era: (The Pixies, “80s”) resource-resource:  membership: (Frank Black, The Pixies)  label-mates: (Throwing Muses, The Pixies)  influence: (The Pixies, Nirvana)

Context-sensitive semantic relations Some relations are useful only within a specific context (e.g., a user or community of users)  judgment: (The Pixies, “genius”)  misinformation: (The Pixies, “japanese”)

Redundancy as Relation Redundancy: a resource-specific semantic relation between tags suggesting that both tags impart the same amount and style of information about a resource Are “cool” and “awesome” in a redundancy relation?

Redundancy as Relation Redundancy: a resource-specific semantic relation between tags suggesting that both tags impart the same amount and style of information about a resource Are “cool” and “awesome” in a redundancy relation?  In the context of, for instance, Metallica, this seems like a reasonable assertion

Redundancy as Relation Redundancy: a latent resource-specific semantic relation between tags in which both tags impart the same amount and style of information about the resource Are “cool” and “awesome” in a redundancy relation?  In the context of, for instance, Metallica, this seems like a reasonable assertion  Given another resource, Miles Davis, the question is not clear cut; “cool” has a particular meaning (it’s a sub-genre of jazz) which is entirely different than the judgment tag “awesome”

Rule-based Determination of Redundancy One way to methodically determine the redundancy relation is through rules in which the antecedents are given as explicit relations Examples:  alt.spelling(t1, t2)  redundant(t1,t2) w.r.t. any resource r  location(r,t1) and location(r,t2)  redundant(t1,t2) w.r.t. r Rules are learned and applied through ML

Problem We require a great deal of a priori semantic information in order to derive rules This information is embedded in the natural language text of wiki’s associated with both tags and resources Therefore, NLP is used to extract this information An alternative (augmented) approach is to defer to a full ontology; this is well beyond the scope of the current project

Data Example (and subsequent offshoots) is a > band founded in by members of the soul-collective. The band is led by guitarist and early in their career featured many musicians but by the line-up had coalesced with four core members and frequent vocal guests. The band have a reputation for > shows and releasing frequent albums on a number of international record labels, including the family record label which was established in to document the activities of the whole collective. Offshoots and permutations include: *