Tim Pohle, Peter Knees, Markus Schedl, Elias Pampalk, and Gerhard Widmer. IEEE Transactions on Multimedia, Vol. 9, No. 3, April 2007. Presented by Yi-Tang Wang.

Presentation transcript:

Outline  Introduction  Audio-Based Similarity  Web-Based Similarity  Problem Modeling  Evaluation and Results  Conclusion & future work

Introduction  A novel music player interface using a wheel  Generating a circular playlist from personal repositories  Keeps on playing similar tracks  Not only audio-based similarity is used, but also text-based similarity

Audio-Based Similarity  MFCCs (Mel-frequency cepstral coefficients)  Discarding the higher-order MFCCs is beneficial for the ability to compare different frames, but possibly at the cost of discarding musically meaningful information.

Audio-Based Similarity  The wave files were downsampled to 22 kHz  19 MFCCs per frame  The temporal order of the frames is ignored  The distribution of MFCC coefficients is modeled with a Gaussian mixture model (GMM)

Audio-Based Similarity  Similarity between songs  Compute the distance between two GMMs  Likelihood  Compute the probability that the MFCCs of song A were generated by the model of song B  Drawback: all MFCC coefficients need to be stored

Audio-Based Similarity  Sampling  Only store the GMM parameters, instead of storing MFCCs  Sample from one GMM and compute the likelihood of the samples given the other GMM  Sampling corresponds roughly to re-creating a song
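The sampling idea can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes one-dimensional mixtures (the real models cover 19 MFCC dimensions), and the symmetric combination of the four log-likelihoods is a common choice in the timbre-similarity literature that the slides do not spell out. The names `sample`, `avg_log_likelihood` and `sampling_distance` are hypothetical.

```python
import math
import random

# A 1-D Gaussian mixture is a list of (weight, mean, std) triples.
# One dimension is an assumption made to keep the sketch short.

def sample(gmm, n, rng):
    """Draw n points from the mixture ("re-create" the song)."""
    weights = [w for w, _, _ in gmm]
    points = []
    for _ in range(n):
        _, mean, std = rng.choices(gmm, weights=weights, k=1)[0]
        points.append(rng.gauss(mean, std))
    return points

def avg_log_likelihood(points, gmm):
    """Mean log-likelihood of the points under the mixture."""
    total = 0.0
    for x in points:
        density = sum(
            w * math.exp(-0.5 * ((x - m) / s) ** 2) / (s * math.sqrt(2 * math.pi))
            for w, m, s in gmm
        )
        # Floor the density so that log() never receives zero.
        total += math.log(max(density, 1e-300))
    return total / len(points)

def sampling_distance(gmm_a, gmm_b, n=500, seed=0):
    """Assumed symmetric distance: how much worse each model explains
    the other song's samples than its own."""
    rng = random.Random(seed)
    s_a = sample(gmm_a, n, rng)
    s_b = sample(gmm_b, n, rng)
    return (avg_log_likelihood(s_a, gmm_a) + avg_log_likelihood(s_b, gmm_b)
            - avg_log_likelihood(s_a, gmm_b) - avg_log_likelihood(s_b, gmm_a))

song_a = [(0.5, 0.0, 1.0), (0.5, 3.0, 1.0)]
song_b = [(1.0, 10.0, 1.0)]
print(sampling_distance(song_a, song_a))  # close to zero
print(sampling_distance(song_a, song_b))  # clearly positive
```

Only the mixture parameters are needed at comparison time, which is the storage advantage named on the slide.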

Web-Based Similarity  Cultural, social, historical, and contextual aspects should be taken into account  WWW information  Query Google with the artist's name + "music"  The 50 top-ranked pages are retrieved  Remove all terms that occur on fewer than c pages  Such that about terms remain

Web-Based Similarity  Term frequency $tf_{ta}$  a: artist, t: term  Number of occurrences of t in documents related to a  Document frequency $df_t$  Number of pages t occurred in  Term weight per artist  Term frequency × inverse document frequency
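A small sketch of the term weighting. The slide only states "term frequency × inverse document frequency"; the logarithmic damping used here is a common variant and is an assumption of this example, as are the function name `term_weight` and the parameter `n_pages` (total number of retrieved pages).

```python
import math

def term_weight(tf, df, n_pages):
    """TF x IDF weight of a term for one artist.

    tf      -- occurrences of the term in the artist's pages (tf_ta)
    df      -- number of pages the term occurs in (df_t)
    n_pages -- total number of pages (assumed parameter)
    The log-damped form below is an assumed variant, not confirmed by the slides.
    """
    if tf == 0 or df == 0:
        return 0.0
    return (1.0 + math.log2(tf)) * math.log2(n_pages / df)

# A term that is frequent for the artist but rare overall gets a high weight.
print(term_weight(tf=4, df=2, n_pages=8))  # 6.0
print(term_weight(tf=4, df=8, n_pages=8))  # 0.0, the term occurs on every page
```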

Web-Based Similarity  Each artist is described by a vector of term weights  Cosine normalization is applied to the vector  Euclidean distance between the normalized vectors is a simple similarity measure  In this paper, a SOM is used to measure similarity instead
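For illustration, cosine normalization and the Euclidean distance between two artist vectors might look like this. The function names are hypothetical and the vectors are toy data.

```python
import math

def cosine_normalize(vec):
    """Scale a term-weight vector to unit length."""
    norm = math.sqrt(sum(v * v for v in vec))
    if norm == 0.0:
        return list(vec)  # an all-zero vector cannot be normalized
    return [v / norm for v in vec]

def euclidean(u, v):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

artist_a = cosine_normalize([3.0, 4.0, 0.0])
artist_b = cosine_normalize([30.0, 40.0, 0.0])  # same profile, more text
artist_c = cosine_normalize([0.0, 0.0, 5.0])    # no shared terms

print(euclidean(artist_a, artist_b))  # about 0: vector length no longer matters
print(euclidean(artist_a, artist_c))  # about 1.414, the maximum for non-negative weights
```

After normalization the distance depends only on the direction of the vectors, so an artist with many retrieved words is not pushed away from an artist with few.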

Web-Based Similarity - SOM  SOM: Self-Organizing Map  A subtype of artificial neural networks  Trained using unsupervised learning  Produces a low-dimensional representation of the training samples while preserving the topological properties of the input space  A rectangular 2-D grid is used in this paper for text-based similarity between songs

Web-Based Similarity - SOM  A SOM consists of units  A model vector in the high-dimensional input data space is assigned to each of the units  Model vectors that belong to units close to each other on the 2-D grid are also close to each other in the data space  Training chooses the model vectors

Web-Based Similarity - SOM  Batch-SOM algorithm  Initialization  Randomly initialize the model vectors  1st step  For each data item $x_i$, the Euclidean distance between $x_i$ and each model vector is calculated  Each data item $x_i$ is assigned to the unit $c_i$ that represents it best

Web-Based Similarity - SOM  2nd step  The neighborhood relationship between two units j and k is usually defined by a Gaussian-like function  $h_{jk} = \exp(-d_{jk}^2 / r_t^2)$  $d_{jk}$ = distance between units j and k on the map, $r_t$ = neighborhood radius in iteration t  $r_t$ decreases with each iteration (the adaptation strength decreases gradually)
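The two steps can be sketched as one training loop. This is an illustration under assumptions: the update of each model vector to the neighborhood-weighted mean of the data is the standard batch-SOM rule and is not written out on the slides; the linear radius schedule, the initialization from random data items, and all names (`train_batch_som`, `r_start`, `r_end`) are hypothetical.

```python
import math
import random

def best_unit(x, models):
    """Index of the model vector closest to x (squared Euclidean distance)."""
    return min(range(len(models)),
               key=lambda j: sum((a - b) ** 2 for a, b in zip(x, models[j])))

def train_batch_som(data, rows, cols, iterations=10,
                    r_start=2.0, r_end=0.5, seed=0):
    rng = random.Random(seed)
    units = [(r, c) for r in range(rows) for c in range(cols)]
    # Assumed initialization: each model vector starts as a random data item.
    models = [list(rng.choice(data)) for _ in units]
    dim = len(data[0])
    for t in range(iterations):
        # Assumed schedule: the radius r_t shrinks linearly.
        r_t = r_start + (r_end - r_start) * t / max(iterations - 1, 1)
        # 1st step: assign each data item x_i to its best unit c_i.
        assigned = [best_unit(x, models) for x in data]
        # 2nd step: move each model vector to the mean of all data items,
        # weighted by h_jk = exp(-d_jk^2 / r_t^2).
        new_models = []
        for j, (rj, cj) in enumerate(units):
            num, den = [0.0] * dim, 0.0
            for x, c in zip(data, assigned):
                d2 = (rj - units[c][0]) ** 2 + (cj - units[c][1]) ** 2
                h = math.exp(-d2 / r_t ** 2)
                den += h
                for k in range(dim):
                    num[k] += h * x[k]
            # Keep the old vector if every weight underflowed to zero.
            new_models.append([v / den for v in num] if den > 0 else models[j])
        models = new_models
    return units, models

data = [[0.0, 0.0], [0.2, 0.1], [5.0, 5.0], [5.1, 4.9]]
units, models = train_batch_som(data, rows=2, cols=2)
for unit, vec in zip(units, models):
    print(unit, [round(v, 2) for v in vec])
```

A large initial radius pulls all units toward the global mean; as $r_t$ shrinks, each unit is shaped mostly by the data items assigned to it and its closest neighbors.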

Web-Based Similarity - SOM  Two artists are considered similar if they are mapped to the same or adjacent units  Newer experiments have actually shown that a 6 × 6 grid might be better for this collection

Combining the two approaches  A constant value is added to the audio-based distance matrix for all songs of dissimilar artists  The constant is half of the maximum audio-based distance  This adds a penalty to transitions between songs by dissimilar artists
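A minimal sketch of the combination step, assuming the audio distances are given as a square matrix and that each artist has already been mapped to a SOM unit. Whether diagonal neighbors count as "adjacent" is not stated on the slides; this example assumes they do. The names `combine_distances`, `track_artist` and `artist_unit` are hypothetical.

```python
def artists_similar(unit_a, unit_b):
    """Same or adjacent SOM unit (diagonal adjacency is an assumption)."""
    return abs(unit_a[0] - unit_b[0]) <= 1 and abs(unit_a[1] - unit_b[1]) <= 1

def combine_distances(audio_dist, track_artist, artist_unit):
    """Add half the maximum audio distance to every pair of tracks
    whose artists are not similar on the SOM."""
    n = len(audio_dist)
    penalty = 0.5 * max(max(row) for row in audio_dist)
    combined = [row[:] for row in audio_dist]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            unit_i = artist_unit[track_artist[i]]
            unit_j = artist_unit[track_artist[j]]
            if not artists_similar(unit_i, unit_j):
                combined[i][j] += penalty
    return combined

audio = [[0.0, 2.0, 4.0],
         [2.0, 0.0, 6.0],
         [4.0, 6.0, 0.0]]
track_artist = ["x", "x", "y"]
artist_unit = {"x": (0, 0), "y": (3, 3)}
print(combine_distances(audio, track_artist, artist_unit))
# [[0.0, 2.0, 7.0], [2.0, 0.0, 9.0], [7.0, 9.0, 0.0]]
```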

Previous work  Audio-based similarity: Fluctuation Patterns  Using a SOM only on audio-based data  Labeling the SOM with information from the WWW  A 3-D browsing system  P. Knees, M. Schedl, T. Pohle, and G. Widmer, "An Innovative Three-Dimensional User Interface for Exploring Music Collections Enriched with Meta-Information from the Web," ACM MM'06

Problem Modeling  Map the playlist generation problem to the Traveling Salesman Problem (TSP)  The cities correspond to the tracks in the collection  The distances are determined by the similarities between the tracks  Finding an optimal route = producing a circular playlist

TSP Problem  Greedy Algorithm  All edges are examined in order of increasing length and added to the route whenever the route stays valid  Minimum Spanning Tree  Find a minimum spanning tree and traverse it depth-first (DFS)  Connect the nodes in the order they are first visited  LKH  Lin-Kernighan algorithm proposed in 1971  Start with a randomly generated tour  Delete edges from the route and recombine the remaining tour fragments
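As an illustration of the greedy heuristic, the sketch below reads "added to the route properly" in the usual way: an edge is accepted only if neither endpoint already has two route edges and the edge does not close a cycle early. That reading, and the names `greedy_playlist` and `circular_length`, are assumptions of this example.

```python
def greedy_playlist(dist):
    """Order tracks with the greedy edge heuristic; the playlist is
    circular, so the last track leads back to the first."""
    n = len(dist)
    if n <= 1:
        return list(range(n))
    edges = sorted((dist[i][j], i, j) for i in range(n) for j in range(i + 1, n))
    parent = list(range(n))  # union-find forest to detect early cycles

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    neighbors = [[] for _ in range(n)]
    added = 0
    for _, i, j in edges:
        if added == n - 1:
            break  # a path through all tracks is complete
        if len(neighbors[i]) >= 2 or len(neighbors[j]) >= 2:
            continue  # a track may have only two route edges
        ri, rj = find(i), find(j)
        if ri == rj:
            continue  # the edge would close a cycle too early
        parent[ri] = rj
        neighbors[i].append(j)
        neighbors[j].append(i)
        added += 1
    # Walk the path from one of its two end points.
    start = next(v for v in range(n) if len(neighbors[v]) == 1)
    order, prev, cur = [start], None, start
    while len(order) < n:
        nxt = next(v for v in neighbors[cur] if v != prev)
        order.append(nxt)
        prev, cur = cur, nxt
    return order

def circular_length(dist, order):
    """Total length of the closed tour."""
    return sum(dist[order[k]][order[(k + 1) % len(order)]]
               for k in range(len(order)))

positions = [0, 10, 1, 11]  # toy "tracks" on a line
dist = [[abs(a - b) for b in positions] for a in positions]
order = greedy_playlist(dist)
print(order, circular_length(dist, order))  # [0, 2, 1, 3] 22
```

The greedy route links close tracks first, which fits the later observation that it gives good small-scale coherence while leaving a few long jumps.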

TSP Problem  One-Dimensional SOM  Train a 1-D cyclic SOM  The order of the units gives a circular playlist  As many units as tracks?  Recursive approach  Combining subtours in a greedy manner

Evaluation & Results  Collection 1  2545 tracks, 13 genres  A Cappella (4.4%), Acid Jazz (2.7%), Blues (2.5%), Bossa Nova (2.8%), Celtic (5.2%), Electronica (21.1%), Folk Rock (9.4%), Italian (5.6%), Jazz (5.3%), Metal (16.1%), Punk Rock (10.2%), Rap (12.9%), and Reggae (1.8%)  103 artists  for each artist, minimum - 8 tracks, maximum - 61 tracks

Evaluation & Results  Collection 2  3456 tracks, 7 genres  Classical (14.7%), Dance (15.0%), Hip-Hop (14.5%), Jazz (13.6%), Metal (14.9%), Pop (11.6%), and Punk (15.6%)  339 artists  for each artist, minimum - 1 track, maximum tracks

Fluctuations Between Genres  A Cappella, Acid Jazz, Blues, Bossa Nova, Celtic, Electronica, Folk Rock, Italian, Jazz, Metal, Punk Rock, Rap, and Reggae (collection 1)

Shannon Entropy  Estimates how locally coherent a playlist is  Count how many of n consecutive tracks belong to each genre  n = 2…12  A typical album contains about 12 tracks  Average over the whole playlist  SOM yields better results on web-enhanced data than LKH on audio-only data
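The entropy measure can be sketched as below, assuming that the window of n consecutive tracks wraps around the end of the circular playlist and that entropy is measured in bits. Both points and the function names are assumptions of this example.

```python
import math
from collections import Counter

def window_entropy(genres):
    """Shannon entropy (in bits) of the genre distribution in one window."""
    n = len(genres)
    return -sum((c / n) * math.log2(c / n) for c in Counter(genres).values())

def mean_playlist_entropy(playlist_genres, n):
    """Average entropy over all windows of n consecutive tracks,
    wrapping around because the playlist is circular."""
    size = len(playlist_genres)
    total = 0.0
    for start in range(size):
        window = [playlist_genres[(start + k) % size] for k in range(n)]
        total += window_entropy(window)
    return total / size

grouped = ["jazz", "jazz", "metal", "metal"]
mixed = ["jazz", "metal", "jazz", "metal"]
print(mean_playlist_entropy(grouped, 2))  # 0.5
print(mean_playlist_entropy(mixed, 2))    # 1.0
```

Lower values mean that neighboring tracks tend to share a genre, so the grouped playlist scores better than the alternating one.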

Shannon Entropy

Long-Term Consistency  SOM algorithm on combined data

Long-Term Consistency  MinSpan algorithm on audio similarity data

Long-Term Consistency  Greedy algorithm on audio similarity data

Long-Term Consistency

User Study  10 test persons using collection 2  Create a large playlist  Extract 10 seed tracks  Randomly choose a start point  Select tracks at intervals of 3 degrees  Generate two playlists per seed track  One by adding the next nine tracks  One by randomly choosing tracks from the same genre

User Study  Users rate each playlist from 1 to 5  Sum up the rating scores  Calculate the difference $tsp_{i,j} - gen_{i,j}$  i: playlist number, j: user

User Interface

 The user interface is very intuitive and its handling extremely easy  Apple’s iPod  Users’ opinion  A scanning function to skip 10 seconds when pressing  Genres containing only a few tracks are quite difficult to locate  Not usable when finding a specific track

Summary of Evaluation Results  All TSP algorithms provided better results with respect to our playlist evaluation criteria when using the web-based extension  The combined similarity measure reduces the number of unexpected placements of tracks in the playlist

Summary of Evaluation Result  LKH and greedy algorithm  best small-scale genre entropy values  large-scale genre distributions are quite fragmented  SOM-based algorithm  highest entropy values  the least fragmented long-term genre distributions  MinSpan algorithm  in the middle field regarding the entropy values

Conclusion & future work  A new approach to conveniently access the music stored in mobile sound players  The whole collection is ordered in a circular playlist and thus accessible with only one input wheel  Two different similarity measures: one relying on timbre information, the other on a combination of timbre and community metadata gathered from artist-related web pages

Conclusion & future work  Problems to solve  It is not possible to precisely select a desired piece  Only tracks that are representative of a region are selectable  Zooming or hierarchical structuring techniques could help  The user does not know in advance which region on the wheel contains which style of music

Conclusion & future work  M. Schedl, T. Pohle, P. Knees, and G. Widmer, "Assigning and visualizing music genres by web-based co-occurrence analysis," in Proc. 7th Int. Conf. Music Information Retrieval (ISMIR'06), Victoria, Canada, Oct

Thank You