Social Audio Features: An Intuitive Guide to the Music Galaxy. Michael Kuhn, ETH Zurich – Distributed Computing Group.

Presentation transcript:

Social Audio Features: An Intuitive Guide to the Music Galaxy. Michael Kuhn, Distributed Computing Group (DISCO), ETH Zurich

"Today, I would like to listen to something cheerful." "Something like Lenny Kravitz would be great." "Who can help me to discover my collection?"

"Half of the time I spend skipping songs..."

"On my shelf, AC/DC is next to ZZ Top..."

Similar or different???

Cover Flow looks better

This ordering does not represent perceived similarity well (artists shown on the slide: Miles Davis, Beatles, Fatboy Slim, Avril Lavigne).

We want to have something that:
– reflects perceived music similarity well
– is as convenient to use as an audio feature space
Social Audio Features

socially derived music similarity + mapping into Euclidean space = Social Audio Features

Advantages of a Feature Space
– Similar songs are close to each other
– Quickly find nearest neighbors
– Span (and play) volumes
– Create smooth playlists by interpolation
– Visualize a collection
– Low memory footprint (well suited for the mobile domain)
A convenient basis to build music software.
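A minimal Python sketch of two of the listed advantages, nearest-neighbor lookup and playlist interpolation. The function names, the toy 2D coordinates, and the straight-line interpolation are illustrative assumptions, not the implementation behind the slides.

```python
import math

def nearest_neighbors(point, songs, k=3):
    """Return the names of the k songs closest to `point` (Euclidean distance)."""
    return sorted(songs, key=lambda name: math.dist(point, songs[name]))[:k]

def interpolate_playlist(start, end, songs, steps=5):
    """Walk a straight line from `start` to `end` and, at each step,
    pick the nearest song that is not yet in the playlist.
    Assumes 2 <= steps <= len(songs)."""
    a, b = songs[start], songs[end]
    playlist = []
    for i in range(steps):
        t = i / (steps - 1)
        point = [x + t * (y - x) for x, y in zip(a, b)]
        unused = {n: v for n, v in songs.items() if n not in playlist}
        playlist.append(nearest_neighbors(point, unused, k=1)[0])
    return playlist

# Hypothetical coordinates; the spaces in the talk have 10 or 32 dimensions.
songs = {"a": (0.0, 0.0), "b": (1.0, 0.0), "c": (2.0, 0.0),
         "d": (3.0, 0.0), "e": (4.0, 0.0)}
print(interpolate_playlist("a", "e", songs))
```

A linear scan is enough for a toy collection; for hundreds of thousands of songs a spatial index would likely be needed, which this sketch leaves out.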

Creating Social Audio Features, Method 1: Collaborative Filtering and MDS

The similarity of songs A and B is derived from the number of common users (co-occurrences), normalized by the occurrences of song A and the occurrences of song B. "Users who listen to Muse also listen to Oasis..." Problem: Only pairwise similarity, but no global view!
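The co-occurrence idea can be sketched in a few lines of Python. The slide does not spell out the normalization, so the cosine-style formula used here (common users divided by the square root of the product of the two occurrence counts) is an assumption, as are the function name and the toy listening histories.

```python
from collections import Counter
from itertools import combinations
import math

def cooccurrence_similarity(listening_histories):
    """Map each song pair to common_users / sqrt(occ_A * occ_B).

    `listening_histories` holds one list of songs per user.
    The cosine-style normalization is an assumption of this sketch.
    """
    occurrences = Counter()   # number of users per song
    common = Counter()        # number of users per song pair
    for songs in listening_histories:
        unique = sorted(set(songs))
        occurrences.update(unique)
        common.update(combinations(unique, 2))
    return {
        (a, b): n / math.sqrt(occurrences[a] * occurrences[b])
        for (a, b), n in common.items()
    }

# Hypothetical data: three users.
histories = [["Muse", "Oasis"], ["Muse", "Oasis", "Blur"], ["Muse"]]
for pair, value in sorted(cooccurrence_similarity(histories).items()):
    print(pair, round(value, 3))
```

Pairs that never co-occur are simply absent from the result, which is the "no global view" problem named above: the dictionary says nothing about how far apart such songs are.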

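Getting a global view... (Figure: songs linked by pairwise similarities of length 1; the distance d between two songs without a direct link is unknown, d = ?)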

Classical Multidimensional Scaling (MDS)
Principal Component Analysis (PCA):
– Project on the hyperplane that maximizes variance.
– Computed by solving an eigenvalue problem.
Basic idea of MDS:
– Assume that the exact positions y_1, ..., y_N in a high-dimensional space are given.
– It can be shown that, knowing only the distances d(y_i, y_j) between points, we can calculate the same result as applying PCA to y_1, ..., y_N.
Problem: Complexity O(n^2 log n)
– Use an approximation: LMDS [de Silva and Tenenbaum, 2002]
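A minimal NumPy sketch of classical MDS, using the standard double-centering construction. It is the exact O(n^2)-memory variant, not the LMDS approximation mentioned on the slide; the function name and the toy data are assumptions.

```python
import numpy as np

def classical_mds(D, k):
    """Embed n points in k dimensions from an (n, n) distance matrix D."""
    D = np.asarray(D, dtype=float)
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    B = -0.5 * J @ (D ** 2) @ J              # Gram matrix of centered points
    eigvals, eigvecs = np.linalg.eigh(B)     # eigenvalue problem, as in PCA
    order = np.argsort(eigvals)[::-1][:k]    # k largest eigenvalues
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale

# Toy check: distances between random 2D points are reproduced by the embedding.
rng = np.random.default_rng(0)
Y = rng.normal(size=(6, 2))
D = np.linalg.norm(Y[:, None] - Y[None, :], axis=-1)
X = classical_mds(D, 2)
D_embedded = np.linalg.norm(X[:, None] - X[None, :], axis=-1)
print(np.allclose(D, D_embedded))
```

The recovered coordinates match the original points only up to rotation and reflection, which is why the check compares distances rather than positions.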

Problem: Some links erroneously shortcut certain paths.
Solution: Use the embedding as an estimator for distance. Remove the edges that get stretched most and re-embed.
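One way to read the prune-and-re-embed loop is sketched below in NumPy. Several details are assumptions of this sketch rather than statements from the slides: distances are taken as graph shortest paths, the embedding is exact classical MDS, "stretch" is the ratio of embedded distance to edge weight, one edge is dropped per round, and the graph is assumed to stay connected.

```python
import numpy as np

def shortest_paths(W):
    """Floyd-Warshall on a dense weight matrix (np.inf means no edge)."""
    D = W.copy()
    np.fill_diagonal(D, 0.0)
    for k in range(len(D)):
        D = np.minimum(D, D[:, [k]] + D[[k], :])
    return D

def classical_mds(D, dims):
    n = len(D)
    J = np.eye(n) - 1.0 / n                  # centering matrix I - (1/n) 11^T
    B = -0.5 * J @ (D ** 2) @ J
    vals, vecs = np.linalg.eigh(B)
    top = np.argsort(vals)[::-1][:dims]
    return vecs[:, top] * np.sqrt(np.clip(vals[top], 0.0, None))

def prune_stretched_edges(W, dims=2, rounds=1):
    """Each round: embed, then remove the edge with the largest stretch."""
    W = W.astype(float).copy()
    for _ in range(rounds):
        Y = classical_mds(shortest_paths(W), dims)
        embedded = np.linalg.norm(Y[:, None, :] - Y[None, :, :], axis=-1)
        is_edge = np.isfinite(W) & ~np.eye(len(W), dtype=bool)
        stretch = np.full(W.shape, -np.inf)
        stretch[is_edge] = embedded[is_edge] / W[is_edge]
        i, j = np.unravel_index(np.argmax(stretch), stretch.shape)
        W[i, j] = W[j, i] = np.inf           # drop the most stretched edge
    return W

# Toy graph: 3x3 grid with unit edges plus one hypothetical shortcut (0, 8).
W = np.full((9, 9), np.inf)
for r in range(3):
    for c in range(3):
        i = 3 * r + c
        if c < 2:
            W[i, i + 1] = W[i + 1, i] = 1.0
        if r < 2:
            W[i, i + 3] = W[i + 3, i] = 1.0
W[0, 8] = W[8, 0] = 1.0
pruned = prune_stretched_edges(W, dims=2, rounds=1)
print(int(np.isfinite(W).sum() // 2), "->", int(np.isfinite(pruned).sum() // 2), "edges")
```

The sketch only demonstrates the mechanics of a round. Which edge is removed depends on the data, and a real run would need a guard against disconnecting the graph.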

Figure: original embedding vs. the embedding after 30 rounds of iterative embedding.

Example Neighborhoods in 10D Space (0.5M songs)
10 dimensions give a reasonable quality.
Neighborhood 1:
– Pink Floyd - Time
– Pink Floyd - On the Run
– Pink Floyd - Any Colour you Like
– Pink Floyd - The Great Gig in the Sky
– Pink Floyd - Eclipse
– Pink Floyd - Us and Them
– Pink Floyd - Brain Damage
– Pink Floyd - Speak to Me
– Pink Floyd - Money
– Pink Floyd - Breathe
– Pink Floyd - One of These Days
Neighborhood 2:
– Miles Davis - So What
– Horace Silver - Song For My Father
– Bill Evans - All of You
– Miles Davis - Freddie Freeloader
– Nat King Cole - The More I See You
– Miles Davis - So Near
– Miles Davis - Flamenco Sketches
– Charles Mingus - Eat That Chicken
– Jimmy Smith - On the Sunny Side
– Julie London - Daddy
– Bill Evans - My Man's Gone Now

Creating Social Audio Features, Method 2: Social Tags and PLSA

Social tags: meaningful labels, but sparse data.
Usage data: good similarity information, but no labels.
Let's combine this information.

Combining Usage Data and Social Tags

Probabilistic Latent Semantic Analysis (PLSA)
Words shown on the slide: art, painting, artist, music, collection, approach, psychology, feeling, female, subjective, audio, signal, music, beat, timbre.
Each word of a document d is generated as follows:
1) Select latent class z with probability P(z|d)
2) Select word w with probability P(w|z)
PLSA: find the probabilities that best approximate the observed word distribution.

Probabilistic Latent Semantic Analysis (PLSA), example document: "Everyone has a photographic memory... some just don't have film."

PLSA: Interpretation as Space
– The class probabilities P(z|d) of a document d can be seen as a vector that defines a point in space [Hofmann, 1999].
– K small: dimensionality reduction.
– In the music setting: documents are songs, latent classes are latent music style classes, and words are tags.
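A compact NumPy sketch of PLSA fitted with the standard EM updates, to make the "point in space" reading concrete: each row of the returned P(z|d) matrix is one song's coordinate vector. The function name, the random initialization, the fixed iteration count, and the toy song-by-tag counts are assumptions; the dense 3D posterior array is only practical for small inputs.

```python
import numpy as np

def plsa(counts, n_classes, n_iter=50, seed=0):
    """Fit PLSA by EM. counts[d, w] = how often tag w was applied to song d.

    Returns P(z|d) with shape (docs, classes) and P(w|z) with shape (classes, words).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_z_d = rng.random((n_docs, n_classes))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    p_w_z = rng.random((n_classes, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    eps = 1e-12  # guards against division by zero
    for _ in range(n_iter):
        # E-step: posterior[d, z, w] = P(z | d, w)
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]
        posterior = joint / (joint.sum(axis=1, keepdims=True) + eps)
        # M-step: re-estimate both distributions from expected counts
        weighted = counts[:, None, :] * posterior
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + eps
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + eps
    return p_z_d, p_w_z

# Hypothetical counts: 4 songs x 4 tags.
counts = np.array([[4, 3, 0, 0],
                   [3, 5, 0, 1],
                   [0, 0, 4, 4],
                   [0, 1, 5, 3]], dtype=float)
p_z_d, p_w_z = plsa(counts, n_classes=2)
print(np.round(p_z_d, 2))  # each row is a song's point in the latent space
```

With K = 2 classes every song becomes a 2D point; the talk uses 32 classes for 1.1M songs.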

Applying PLSA to Music and Tags
– Greenday - Basket Case: rock, punk, pop-punk
– Madonna - Like a Prayer: pop, dance, female vocalists
– Beatles - Hey Jude: 60's, classic rock, british
– ...
32 latent classes (= dimensions), 1.1M songs

Evaluation
– Artist clustering
– Comparison to collaborative filtering
– Tag consistency

LMDS vs. PLSA Space
Advantages of LMDS:
– Same accuracy at lower dimensionality (10 vs. 32)
Advantages of PLSA:
– Natural meaning of tags
– Assignment of tags to songs (probabilistic)
Current sizes (approx.):
– LMDS: 600K tracks
– PLSA: 1.1M tracks

Using the Social Audio Features

High-dimensional!

Visualization in 2D
– Identify relevant tags
– Find centroids of these tags in high-dimensional space
– Apply Principal Component Analysis (PCA) to these centroids
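The three steps can be sketched with NumPy as follows. The function name and the toy data are assumptions, and so is the final step of projecting every song onto the two principal axes of the centroids, which the slide implies but does not state. Each tag is assumed to have at least one song, and at least three tags are needed for two meaningful axes.

```python
import numpy as np

def project_with_tag_centroids(coords, song_tags, tags):
    """Project songs to 2D using PCA axes fitted on tag centroids.

    coords:    (n_songs, dims) high-dimensional song coordinates
    song_tags: one set of tags per song
    tags:      the relevant tags
    """
    coords = np.asarray(coords, dtype=float)
    # Step 2: centroid of every relevant tag in the high-dimensional space.
    centroids = np.array([
        coords[[i for i, ts in enumerate(song_tags) if tag in ts]].mean(axis=0)
        for tag in tags
    ])
    # Step 3: PCA on the centroids via SVD of the centered matrix.
    mean = centroids.mean(axis=0)
    _, _, vt = np.linalg.svd(centroids - mean, full_matrices=False)
    axes = vt[:2]                            # two leading principal axes
    return (coords - mean) @ axes.T, (centroids - mean) @ axes.T

# Hypothetical data: 6 songs in 4 dimensions, 3 relevant tags.
rng = np.random.default_rng(1)
coords = rng.normal(size=(6, 4))
song_tags = [{"rock"}, {"rock", "pop"}, {"pop"}, {"jazz"}, {"jazz"}, {"pop", "jazz"}]
song_xy, tag_xy = project_with_tag_centroids(coords, song_tags, ["rock", "pop", "jazz"])
print(song_xy.shape, tag_xy.shape)
```

Fitting the axes on tag centroids rather than on all songs spreads the chosen tags across the map, so the 2D view stays readable in terms of those labels.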

What people have chosen during the Researchers' Night in Zurich

YouJuke – The YouTube Jukebox

– YouTube as media source
– Social Audio Features to create smart playlists

apps.facebook.com/youjuke

"Half of the time I spend skipping songs"

I only want to listen to songs that match my mood...

After only a few skips, we know pretty well which songs match the user's mood.

Work in Progress: Who is Dancing? (AC/DC, Beatles, Prodigy)

Browsing Covers: "On my shelf, AC/DC is next to ZZ Top..."

Video

Selected Comments from museek Users
– "Your software is a pathetic piece of crap! […]"
– "Does a good job learning my tastes […]"
– "[…] easy browse and make playlists. Auto play related music is very good."
– "Runs well on the Nexus One without stuttering, good. The UI is good too!" (translated from Korean)
– "[...] Love the ability to automatically play similar music. [...]"
– "Good potential, but album art is tiny & blurry […]"
– "Just got it and want to put more music on my sd card now. Pretty cool once you get the hang of it."
– "The algorithm that selects playlists according to how your mood evolves is a real gem. Congratulations […]" (translated from French)
– "Awesome app beating the ipod genius feature and coverflow. […]"

Questions?
Thanks to:
– Lukas Bossard
– Mihai Calin
– Matthias Flückiger
– Olga Goussevskaia
– Michael Lorenzi
– Roger Wattenhofer
– Samuel Welten
– Martin Wirz
URLs:
–
–
– apps.facebook.com/youjuke
(Michael Kuhn)

Publications
– Sensing Dance Engagement for Collaborative Music Control. Michael Kuhn, Martin Wirz, Matthias Flückiger, Roger Wattenhofer, Gerhard Tröster. (accepted at ISWC 2011)
– Social Audio Features for Advanced Music Retrieval Interfaces. Michael Kuhn, Roger Wattenhofer, and Samuel Welten. ACM Multimedia, Florence, October
– Visually and Acoustically Exploring the High-Dimensional Space of Music. Lukas Bossard, Michael Kuhn, and Roger Wattenhofer. IEEE International Conference on Social Computing (SocialCom), Vancouver, Canada, August
– From Web to Map: Exploring the World of Music. Olga Goussevskaia, Michael Kuhn, Michael Lorenzi, and Roger Wattenhofer. IEEE/WIC/ACM International Conference on Web Intelligence (WI), Sydney, Australia, December
– Exploring Music Collections on Mobile Devices. Olga Goussevskaia, Michael Kuhn, and Roger Wattenhofer. International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI), Amsterdam, Netherlands, September 2008.