Music Recommendation A Data Mining Approach Daniel McEnnis 2nd year PhD Daniel McEnnis 2nd year PhD.

Slides:



Advertisements
Similar presentations
1 ©2009 MeeMix MeeMix – A personalized Experience.
Advertisements

Recommender Systems & Collaborative Filtering
Google News Personalization: Scalable Online Collaborative Filtering
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
A PowerPoint Presentation
A Graph-based Recommender System Zan Huang, Wingyan Chung, Thian-Huat Ong, Hsinchun Chen Artificial Intelligence Lab The University of Arizona 07/15/2002.
A Brief Introduction. Acknowledgements  The material in this tutorial is based in part on: Concurrency: State Models & Java Programming, by Jeff Magee.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Visualization of Mobile Agents Project Update Ken Deeter CPSC 533C.
1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System How to organize the information and make it accessible and useful ? Li-Yan Yuan.
Distributed Computing Group From Web to Map: Exploring the World of Music Olga Goussevskaia Michael Kuhn Michael Lorenzi Roger Wattenhofer Web Intelligence.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
- Confidential - MoodLogic Metadata MoodLogic © 2004 MoodLogic, Inc. – No reproduction or distribution without prior written permission. The promise and.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
21 21 Web Content Management Architectures Vagan Terziyan MIT Department, University of Jyvaskyla, AI Department, Kharkov National University of Radioelectronics.
INFORMATION RETRIEVAL WEEK 1 AND 2
Agent Technology for e-Commerce
Web Mining Research: A Survey
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
1 CS 430: Information Discovery Lecture 20 The User in the Loop.
Recommender Systems; Social Information Filtering.
Recommender systems Ram Akella November 26 th 2008.
Music Recommendation By Daniel McEnnis. Outline Sociology of Music Recommendation Infrastructure –Relational Analysis Toolkit Description Evaluation –GATE.
Algorithms for Data Mining and Querying with Graphs Investigators: Padhraic Smyth, Sharad Mehrotra University of California, Irvine Students: Joshua O’
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Department of Computer Science, University of California, Irvine Site Visit for UC Irvine KD-D Project, April 21 st 2004 The Java Universal Network/Graph.
Making Sense of the Information Age Presented by Kenan Doyle Branam Paradigm Communications, Inc. Paradigm Communications, Inc.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
1 Folksonomy-Based Collabulary Learning Leandro Balby Marinho, Krisztian Buza, Lars Schmidt-Thieme
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Graph-RAT Overview By Daniel McEnnis. 2/32 What is Graph-RAT  Relational Analysis Toolkit  Database abstraction layer  Evaluation platform  Robustly.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Since 2007, the sales of Digital Music has increased from 20% to 36%. While this increase is good, the sale of Physical Music (CDs) has fallen from 80%
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Adaptive News Access Daniel Billsus Presented by Chirayu Wongchokprasitti.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Overview of the Course Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412 at Rice University.
1 Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007)
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Presented By :Ayesha Khan. Content Introduction Everyday Examples of Collaborative Filtering Traditional Collaborative Filtering Socially Collaborative.
Math Information Retrieval Zhao Jin. Zhao Jin. Math Information Retrieval Examples: –Looking for formulas –Collect teaching resources –Keeping updated.
Page 1 Alliver™ Page 2 Scenario Users Contents Properties Contexts Tags Users Context Listener Set of contents Service Reasoner GPS Navigator.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Last.fm Hall of F/S Brian Morris Edward Nishihama.
RecBench: Benchmarks for Evaluating Performance of Recommender System Architectures Justin Levandoski Michael D. Ekstrand Michael J. Ludwig Ahmed Eldawy.
Introduction to LINQ Chapter 11. Introduction Large amounts of data are often stored in a database—an organized collection of data. A database management.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Introduction to the Semantic Web and Linked Data
Graph RAT A framework for integrating social and content data By Daniel McEnnis University of Waikato To what extent do artists cluster into genres Pattern.
Progress Report - Year 2 Extensions of the PhD Symposium Presentation Daniel McEnnis.
Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information.
EMEA Beat Schwegler Architect Microsoft EMEA HQ Ingo Rammer Principal Consultant thinktecture
A Unified Approach to Ranking in Probabilistic Databases Jian Li, Barna Saha, Amol Deshpande University of Maryland, College Park, USA VLDB
KMS & Collaborative Filtering Why CF in KMS? CF is the first type of application to leverage tacit knowledge People-centric view of data Preferences matter.
The Benefit of Using Tag-Based Profiles Claudiu Firan, Wolfgang Nejdl, Raluca Paiu 5 th Latin American Web Congress, 2007.
Visual Basic 2010 How to Program © by Pearson Education, Inc. All Rights Reserved.
Big Data Javad Azimi May First of All… Sorry about the language  Feel free to ask any question Please share similar experiences.
Chapter 11.  Large amounts of data are often stored in a database—an organized collection of data.  A database management system (DBMS) provides mechanisms.
Collaborative Filtering: Searching and Retrieving Web Information Together Huimin Lu December 2, 2004 INF 385D Fall 2004 Instructor: Don Turnbull.
Barthélémy von Haller CERN PH/AID For the ALICE Collaboration The ALICE data quality monitoring system.
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Recommender Systems & Collaborative Filtering
So, what was this course about?
Introduction to Spark.
Introduction to LINQ Chapter 11 10/28/2015 Lect 4 CT1411.
Introduction to LINQ Chapter 11.
Movie Recommendation System
Presentation transcript:

Music Recommendation A Data Mining Approach Daniel McEnnis 2nd year PhD Daniel McEnnis 2nd year PhD

Overview  High level overview  Toolkit Improvements  Experiments  Evaluation  Algorithms research  Data  Future work  High level overview  Toolkit Improvements  Experiments  Evaluation  Algorithms research  Data  Future work

Project Goals  Integrate social information  Make algorithms ‘culturally aware’  Implement existing algorithms  Systematic evaluation framework  Integrate social information  Make algorithms ‘culturally aware’  Implement existing algorithms  Systematic evaluation framework

Similarity Algorithms  Create new relations based on some aspect of similarity  6 different varieties of similarity  Each algorithm can use one of 6 distance functions  Create new relations based on some aspect of similarity  6 different varieties of similarity  Each algorithm can use one of 6 distance functions

Aggregator Algorithms  Takes data from one set of actors and moves it to another  6 different varierties  Each variety uses one of 7 aggregator functions  Basic building block of Graph-RAT applications  Takes data from one set of actors and moves it to another  6 different varierties  Each variety uses one of 7 aggregator functions  Basic building block of Graph-RAT applications

Graph Triples Census  Probable novel algorithm  Proof of Correctness Completed  Proof of Time Complexity Completed  Literature review in progress  Probable novel algorithm  Proof of Correctness Completed  Proof of Time Complexity Completed  Literature review in progress

SUCCESS!  Graph-RAT programming language now functioning  Graph-RAT integrates social, cultural, personal, and audio data into algorithms  Includes most commercial algorithms  Contains primitives for existing academic systems  Evaluation is entirely automated  Graph-RAT programming language now functioning  Graph-RAT integrates social, cultural, personal, and audio data into algorithms  Includes most commercial algorithms  Contains primitives for existing academic systems  Evaluation is entirely automated

PROBLEMS

Evaluation Exploration  9 types of music recommendation  Personalized versus generic  Open query versus targeted query  Dynamic versus static data  New music versus all music  9 types of music recommendation  Personalized versus generic  Open query versus targeted query  Dynamic versus static data  New music versus all music

Personalized Radio  Open query with personalized presentation  Static data vs dynamic data  New items prediction vs predict anything  Open query with personalized presentation  Static data vs dynamic data  New items prediction vs predict anything

Targeted Search  Not personalized  Similarity queries  Automatically generating targeted lists for a browsing hierarchy  New music vs all music  Static vs dynamic data  Not personalized  Similarity queries  Automatically generating targeted lists for a browsing hierarchy  New music vs all music  Static vs dynamic data

Personalized Tag Radio  Create a personalized play list matching a given query  New music vs all music  Static vs dynamic data  Create a personalized play list matching a given query  New music vs all music  Static vs dynamic data

Excluded Types  ‘Top 40’ prediction  Rendered obsolete by other types  ‘Top 40’ prediction  Rendered obsolete by other types

Existing Algorithms  Item-to-Item collaborative filtering  7 variations  User-to-user collaborative filtering  7 variations  Associative mining collaborative filtering  Direct machine learning playlist data  Direct machine learning audio data  Item-to-Item collaborative filtering  7 variations  User-to-user collaborative filtering  7 variations  Associative mining collaborative filtering  Direct machine learning playlist data  Direct machine learning audio data

Novel Algorithms  Machine learning over profile data  Machine learning over cultural and profile data  Machine learning on different concatenations  Audio  Playlist  Profile  Cultural  Machine learning over profile data  Machine learning over cultural and profile data  Machine learning on different concatenations  Audio  Playlist  Profile  Cultural

Initial Data  LiveJournal  Separating music data is difficult  No tag info or audio content  No enough musical data  LastFM by User  No audio content  Data cleaning is an issue  LiveJournal  Separating music data is difficult  No tag info or audio content  No enough musical data  LastFM by User  No audio content  Data cleaning is an issue

Current Data  40’s Jazz Recordings  1800 annotated recordings from 70 CDs  Covers nearly all 40’s popular music  LastFM by Song  Retrieves tag and user info by song  Data cleaning on user playcounts needed  40’s Jazz Recordings  1800 annotated recordings from 70 CDs  Covers nearly all 40’s popular music  LastFM by Song  Retrieves tag and user info by song  Data cleaning on user playcounts needed

Data Cleaning Tags  Polysemy  Synonomy  Disjoint  Hypersomny  Hyposomny  Initial algorithms developed  Polysemy  Synonomy  Disjoint  Hypersomny  Hyposomny  Initial algorithms developed

Future Work: Programming  Radically different programming environment  SQL  LINQ library package in C#  Radically different programming environment  SQL  LINQ library package in C#

Future Work: Scalability  Distributed SQL database implementation  Just-in-time compilation  Event-based recalculation of algorithm results  Parallel execution of algorithms  Multi-threaded algorithms  Distributed SQL database implementation  Just-in-time compilation  Event-based recalculation of algorithm results  Parallel execution of algorithms  Multi-threaded algorithms