LYRIC-BASED ARTIST NETWORK Derek Gossi CS 765 Fall 2014.

Slides:



Advertisements
Similar presentations
1 ©2009 MeeMix MeeMix – A personalized Experience.
Advertisements

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Music Recommendation by Unified Hypergraph: Music Recommendation by Unified Hypergraph: Combining Social Media Information and Music Content Jiajun Bu,
Prediction Modeling for Personalization & Recommender Systems Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
A PowerPoint Presentation
Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.
Analysis and Modeling of Social Networks Foudalis Ilias.
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Oct 14, 2014 Lirong Xia Recommender systems acknowledgment: Li Zhang, UCSC.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
Sean Blong Presents: 1. What are they…?  “[…] specific type of information filtering (IF) technique that attempts to present information items (movies,
1 / 22 Issues in Text Similarity and Categorization Jordan Smith – MUMT 611 – 27 March 2008.
LYRIC-BASED ARTIST NETWORK METHODOLOGY Derek Gossi CS 765 Fall 2014.
Data Mining and Text Analytics in Music Audi Sugianto and Nicholas Tawonezvi.
Rubi’s Motivation for CF  Find a PhD problem  Find “real life” PhD problem  Find an interesting PhD problem  Make Money!
Friend Recommendations in Social Networks using Genetic Algorithms and Network Topology Jeff Naruchitparames, Mehmet Gunes, Sushil J. Louis University.
The Research Project - Preliminary Proposal Presentation Contextual Suggestion Track: Travel Plan Recommendation System Based on Open-web Information Presenter:
Berenzweig - Music Recommendation1 Music Recommendation Systems: A Progress Report Adam Berenzweig April 19, 2002.
- Confidential - MoodLogic Metadata MoodLogic © 2004 MoodLogic, Inc. – No reproduction or distribution without prior written permission. The promise and.
Recommendations via Collaborative Filtering. Recommendations Relevant for movies, restaurants, hotels…. Recommendation Systems is a very hot topic in.
MusicSense: Contextual Music Recommendation using Emotional Allocation Modeling Rui Cai, Chao Zhang, Chong Wang, Lei Zhang, and Wei-Ying Ma Proceedings.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Music Recommendation By Daniel McEnnis. Outline Sociology of Music Recommendation Infrastructure –Relational Analysis Toolkit Description Evaluation –GATE.
CONTENT-BASED BOOK RECOMMENDING USING LEARNING FOR TEXT CATEGORIZATION TRIVIKRAM BHAT UNIVERSITY OF TEXAS AT ARLINGTON DATA MINING CSE6362 BASED ON PAPER.
Overview of Web Data Mining and Applications Part I
POTENTIAL RELATIONSHIP DISCOVERY IN TAG-AWARE MUSIC STYLE CLUSTERING AND ARTIST SOCIAL NETWORKS Music style analysis such as music classification and clustering.
References: Linden, G.; Smith, B.; York, J.; , "Amazon.com recommendations: item-to-item collaborative filtering,". Internet Computing, IEEE , vol.7,
Tag-based Social Interest Discovery
Unsupervised Learning. CS583, Bing Liu, UIC 2 Supervised learning vs. unsupervised learning Supervised learning: discover patterns in the data that relate.
Performance of Recommender Algorithms on Top-N Recommendation Tasks RecSys 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering.
MACHINE LEARNING TECHNIQUES FOR MUSIC PREDICTION S. Grant Lowe Advisor: Prof. Nick Webb.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
By Chris Zachor.  Introduction  Background  Changes  Methodology  Data Collection  Network Topologies  Measures  Tools  Conclusion  Questions.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
User Profiling based on Folksonomy Information in Web 2.0 for Personalized Recommender Systems Huizhi (Elly) Liang Supervisors: Yue Xu, Yuefeng Li, Richi.
Event Detection using Customer Care Calls 04/17/2013 IEEE INFOCOM 2013 Yi-Chao Chen 1, Gene Moo Lee 1, Nick Duffield 2, Lili Qiu 1, Jia Wang 2 The University.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Wang-Chien Lee i Pervasive Data Access ( i PDA) Group Pennsylvania State University Mining Social Network Big Data Intelligent.
1 Recommender Systems Collaborative Filtering & Content-Based Recommending.
TEMPLATE DESIGN © Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), Beijing, China.2.
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
1 Collaborative Filtering & Content-Based Recommending CS 290N. T. Yang Slides based on R. Mooney at UT Austin.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Music Recommendation On-line Survey Presented by Daniel Wu & Gordon Chang
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Tagging Systems and Their Effect on Resource Popularity Austin Wester.
KNN CF: A Temporal Social Network kNN CF: A Temporal Social Network Neal Lathia, Stephen Hailes, Licia Capra University College London RecSys ’ 08 Advisor:
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
Optimization Indiana University July Geoffrey Fox
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
Recommendation Systems ARGEDOR. Introduction Sample Data Tools Cases.
A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
ITunes Genius Presented By: Dibyendu Talukder (MT13063) Prerna Juneja (MT13099)
The Greek Audio Dataset
Data Mining: Concepts and Techniques
Introduction to Music Information Retrieval (MIR)
Brian Whitman Paris Smaragdis MIT Media Lab
Movie Recommendation System
CSE 491/891 Lecture 25 (Mahout).
Using Clustering to Make Prediction Intervals For Neural Networks
Presentation transcript:

LYRIC-BASED ARTIST NETWORK Derek Gossi CS 765 Fall 2014

The Big Problem How do we make better music recommendations?

The Big Problem How do we make better music recommendations? Personalized recommendations Anonymous recommendations based on similarity Playlist generation

The Big Problem How do we make better music recommendations? Ideally: Understand all the factors which link songs or artists together

Topics Background on Music Recommendation The Dataset Existing Research Proposed Research

BACKGROUND ON MUSIC RECOMMENDATION

Music Recommendation Systems

Approaches to Recommendation Collaborative Filtering Users that liked this artist/song also liked that artist/song Amazon, iTunes store, Spotify Tagging Categorization based on user-generated or pre-defined tags Calm, sad, romantic, cheerful, anxious, depressed Last.fm Content-based Look at the audio signal Not widely used in industry yet Pandora, Spotify (in progress) What can the lyrics tell us?

Approaches to Recommendation

The Problem with Tags

Care vs. Scale B Whitman, Co-Founder of The Echo Nest, “How music recommendation works—and doesn’t work”

Care vs. Scale B Whitman, Co-Founder of The Echo Nest, “How music recommendation works—and doesn’t work”

Comparison of Approaches Collaborative filtering is widely used in practice Precision vs. Profit Even though you might like x better, Amazon makes more money by recommending y Probably less of an issue for subscription services such as Spotify Existing recommendation systems largely do not take content of music into account Why? Possibility for large error Computational cost Still being researched

MIR (Music Information Retrieval) Emerging area of research Gathering information directly from audio signal Success in determining tempo, key, and loudness Research in time signature tracking, melody detection

MIR (Music Information Retrieval) What about trying to predict location on reduced-dimension latent space of users and songs using audio features? Deep learning methodologies

The Question Can lyrics be used to improve recommender systems? Benefits of lyrical analysis approach Known factors make for easy error checking Large-scale factors such as repetition or key words are easy to compute Nearly as scalable as pure audio analysis for most popular genres Disadvantages of lyrical analysis approach Not all songs have lyrics! Text analysis is a subtle and complex problem too Audio + lyrics make for new interpretations Reducing to artist level will “average out” some error A combined approach will likely be the best approach

Care vs. Scale B Whitman, Co-Founder of The Echo Nest, “How music recommendation works—and doesn’t work”

Care vs. Scale Lyrical analysis B Whitman, Co-Founder of The Echo Nest, “How music recommendation works—and doesn’t work”

Care vs. Scale Lyrical analysis Lyrical analysis + audio analysis + CF B Whitman, Co-Founder of The Echo Nest, “How music recommendation works—and doesn’t work”

THE DATASET The Million Song Dataset (MSD)

Million Song Dataset Open source dataset released in Feb 2011 Metadata and audio features for a million contemporary audio tracks

The Million Song Dataset Challenge Online competition Given full listening history for 1 million users Given half of the listening history for 110,000 users Goal: predict the other half of the listening history Metric: mean average precision Best ranked teams used some form collaborative filtering See F. Aiolli, “A Preliminary Study on a Recommender System for the Million Song Dataset Challenge”

The Million Song Dataset Challenge

EXISTING RESEARCH A Summary

Network Topology P. Cano, O. Celma, and M. Koppenberger. “The topology of music recommendation networks,” Feb Analyzes four music recommendation systems from a network perspective Directed edges n = 16,302 (Yahoo) to 51,616 (MSN) m = 158,866 (AMG) to 511,539 (Yahoo) Small-world properties in all networks Average shortest path < 8 Clustering coefficient from 0.14 (Amazon) to 0.54 (MSN)

Lyrical Analysis X. Hu, J. S. Downie, and A. F. Ehmann. “Lyric text mining in music mood classification,” ,829 unique audio tracks from last.fm with lyrics and tags Tags grouped into 18 distinct categories calm, comfort quiet, serene, mellow, chill out, … grief, heartbreak, mournful, sorrow, sorry, … Objective: predict tag category Lyrical model, audio feature model, and combined model Lyrical features were found to outperform audio in cases

Lyrical Analysis Y. Xia, K. Wong, L. Wang, and M. Xu. “Sentiment vector space model for lyric-based song sentiment classification,” June Custom sentiment vector space model (s-VSM) used to classify 2,653 Chinese pop songs Only two classes: light-hearted and heavy-hearted Lyrics found to outperform audio features in the classification problem

PROPOSED RESEARCH

Proposed Research Use the MSD to create a network of songs and artists linked by threshold lyrical similarity Metric of similarity will be based on: Use of key words or key word groups Word complexity and range of words used Sentiment Random sample will need to be used, as mapping full dataset would require ~750,000 2 iterations Cluster the network into n distinct “communities” Unsupervised approach

Research Questions Network properties? Scale, clustering, etc. What are the most natural communities? Genre, mood, complexity? How does it compare to existing models? How much error is introduced by using lyrics only? How does the network topology of artists linked by lyrical similarity possess compare to existing user-based collaborative filtering networks? Can it be used to improve music recommendation?

QUESTIONS?