A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.

Slides:



Advertisements
Similar presentations
Matthias Gruhne, Page 1 Fraunhofer Institut Integrierte Schaltungen Robust Audio Identification for Commercial Applications Matthias.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Mining customer ratings for product recommendation using the support vector machine and the latent class model William K. Cheung, James T. Kwok, Martin.
LYRIC-BASED ARTIST NETWORK Derek Gossi CS 765 Fall 2014.
Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
LYRIC-BASED ARTIST NETWORK METHODOLOGY Derek Gossi CS 765 Fall 2014.
ARNOLD SMEULDERS MARCEL WORRING SIMONE SANTINI AMARNATH GUPTA RAMESH JAIN PRESENTERS FATIH CAKIR MELIHCAN TURK Content-Based Image Retrieval at the End.
Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
Berenzweig - Music Recommendation1 Music Recommendation Systems: A Progress Report Adam Berenzweig April 19, 2002.
Chorus cluster meeting, Vilamoura April SAPIR Search in Audio-visual content using P2p IR Yosi Mass, Raul Santos.
- Confidential - MoodLogic Metadata MoodLogic © 2004 MoodLogic, Inc. – No reproduction or distribution without prior written permission. The promise and.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
MusicSense: Contextual Music Recommendation using Emotional Allocation Modeling Rui Cai, Chao Zhang, Chong Wang, Lei Zhang, and Wei-Ying Ma Proceedings.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
Map-Reduce and Parallel Computing for Large-Scale Media Processing Youjie Zhou.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
김덕주 (Duck Ju Kim). Problems What is the objective of content-based video analysis? Why supervised identification has limitation? Why should use integrated.
POTENTIAL RELATIONSHIP DISCOVERY IN TAG-AWARE MUSIC STYLE CLUSTERING AND ARTIST SOCIAL NETWORKS Music style analysis such as music classification and clustering.
Sound Applications Advanced Multimedia Tamara Berg.
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
Theoretic and artistic research studying opportunities of symbiosis of Western and non-Western musical idiom Olmo Cornelis.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
Recommendation system MOPSI project KAROL WAGA
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
A Regression Approach to Music Emotion Recognition Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, and Homer H. Chen, Fellow, IEEE IEEE TRANSACTIONS ON AUDIO,
Universit at Dortmund, LS VIII
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
TEMPLATE DESIGN © Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), Beijing, China.2.
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
You Are What You Tag Yi-Ching Huang and Chia-Chuan Hung and Jane Yung-jen Hsu Department of Computer Science and Information Engineering Graduate Institute.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
AUDIO TONALITY MODE CLASSIFICATION WITHOUT TONIC ANNOTATIONS Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), China.
JACQUELINE JACKSON COURSE: DIGITAL MUSIC MUSIC APP PROJECT.
Combining Audio Content and Social Context for Semantic Music Discovery José Carlos Delgado Ramos Universidad Católica San Pablo.
[The Band SIG] MPEG7 - Audio 손우람 2007 년 12 월 1 일.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Flickr Tag Recommendation based on Collective Knowledge BÖrkur SigurbjÖnsson, Roelof van Zwol Yahoo! Research WWW Summarized and presented.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
Music Recommendation On-line Survey Presented by Daniel Wu & Gordon Chang
MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Content-Based MP3 Information Retrieval Chueh-Chih Liu Department of Accounting Information Systems Chihlee Institute of Technology 2005/06/16.
Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department to Information and Computer Science –University of Trento.
Improving Music Genre Classification Using Collaborative Tagging Data Ling Chen, Phillip Wright *, Wolfgang Nejdl Leibniz University Hannover * Georgia.
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
Musical Genre Categorization Using Support Vector Machines Shu Wang.
R ESEARCH P ROGRESS R EPORT – C OVER S ONGS I DENTIFICATION Ken.
BASS TRACK SELECTION IN MIDI FILES AND MULTIMODAL IMPLICATIONS TO MELODY gPRAI Pattern Recognition and Artificial Intelligence Group Computer Music Laboratory.
Metamidi: a tool for automatic metadata extraction from MIDI files Tomás Pérez-García, Jose M. Iñesta, and David Rizo Computer Music Laboratory University.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
Deep content-based music recommendation A¨aron van den Oord, Sander Dieleman, Benjamin Schrauwen (NIPS 2013)
MULTIMEDIA SYSTEMS CBIR & CBVR. Schedule Image Annotation (CBIR) Image Annotation (CBIR) Video Annotation (CBVR) Video Annotation (CBVR) Few Project Ideas.
TRECVID IES Lab. Intelligent E-commerce Systems Lab. 1 Presented by: Thay Setha 05-Jul-2012.
Genre Classification of Music by Tonal Harmony Carlos Pérez-Sancho, David Rizo Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante,
How to Get Reposts on SoundCloud Reposts are the fastest way to post songs to your Profile page and to share your favorite new discoveries on SoundCloud.
Artist Identification Based on Song Analysis
Introduction to Music Information Retrieval (MIR)
Multimodal Caricatural Mirror
Recommender Systems: Collaborative & Content-based Filtering Features
Presentation transcript:

A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand Fuhrmann Anna Xambo, Emilia Gomez, Perfecto Herrera Music Technology Group Universitat Pompeu Fabra Roc Boronat, 138, Barcelona, Spain Presented By: Thay Setha Content-Based Multimedia Indexing (CBMI), th International Workshop 1

Outline 1.Introduction 2.Last.fm & SoundCloud 3.System Architecture 4.Implementation 5.Conclusion 2

Introduction (1/2) Rapid growth of music available in the Internet stores already exceeds 10 millions. This requires effective means for browsing and search in music collections. This paper present a content- based system for music recommendation and visualization of user preferences 3

Introduction (2/2) System exploits content-based information extract from audio signal. – High level semantic descriptions of music inferred from low-level timbral, temporal, and tonal feature. System obtains preference set of music by giving user profile on popular online music services, Last.fm and SoundCloud. Semantic descriptors including genres, musical culture, moods, instrumentation types, rhythm, and tempo information are extracted from preference set of a user and then exploited to recommend similar music and visualize musical preferences. 4

Last.fm & SoundCloud (1/2)  Last.fm  Established music recommender with an extensive number of users and large playable music collection.  Provides means for both monitoring listening statistics and social tagging.  Last.fm API  5

Last.fm & SoundCloud (2/2)  SoundCloud  Platform which allows users (mostly musicians) to collaborate, promote, and distribute their music.  Allows users to upload their own tracks or mark tracks as their favorite.  SoundCloud API  6

System Architecture (1/11) There are four parts of system for music recommendation and visualization  Data gathering  Audio analysis  Music recommendation  Preference visualization 7

System Architecture (2/11) 8

System Architecture (3/11)  Data gathering i.Specify his/her account name in both Last.fm & SoundCloud ii.Using Last.fm API & Sound Cloud API to retrieve the tracks. 9

System Architecture (4/11) Different types of tracks can be used to infer the user’s preference set: – Tracks marked as favorite by the user on Last.fm – Tracks listened most by the user according to their Last.fm’s statistics. – Tracks marked as favorites by the user on SoundCloud – Tracks uploaded by the user on SoundCloud Result:URLs of the tracks to be included in the preference set using Last.fm and SoundCloud APIs. 10

System Architecture (5/11)  Audio Analysis i.Extraction of low level audio features such as timbral, temporal, tonal,... Then running a number of classification tasks using support vector machines trained on ground truth information about genres, musical culture, moods, instrumentation, rhythm, and tempo. ii.Technically, we use Canoris API we can obtain semantic description for each track from the user’s preference set such as genres, musical culture, moods, instrumentation, rhythm, and tempo. 11

System Architecture (6/11) Canoris offers functionality in both the analysis and synthesis of sound and music. – Analysis  High and low level analysis: upload any sound file and retrieve high level descriptions like genre, mood,...  Similarity Search: Put all your files in a collection and see which sound and songs are similar to each other. – Synthesis  Voice synthesis  Voice Transform 12

System Architecture (7/11)  Music Recommendation i.We employ an in-house music collection of music excerpts. ii.Using Canoris API to retrieve the same semantic descriptions as used for the preference set. iii.The system searches for tracks inside the in-house collection with the smallest semantic distance to any of the tracks in the preference set. iv.Recommendation outcomes are presented to user including metadata of tracks, audio previews, and the reason why this track was recommended 13

System Architecture (8/11) Semantic Distance (Classifier-Based Distance) – First Step:  Apply multi-class support vector machines (SVMs) to infer different groups of musical dimensions such as genre and musical culture, moods and instruments, and rhythm and tempo.  As an output, the classifier provides probability values of classes (musical dimension) on which it was trained. – Second Step:  Define distance operating on a formed high-level semantic space.  We select cosine distance (CLAS-Cos), Pearson correlation distance (CLAS-Pears) and Spearman’s rho correlation distance (CLAS- Spear). 14

System Architecture (9/11) Fig.1. General Schema of CLAS distance. Given two songs X and Y, low-level audio descriptors are extracted, a number of SVM classifications are run based on ground truth music collections, and high-level representations, containing probabilities of classes for each classifier, are obtained. A distance between X and Y is calculated with correlation distances such as Pearson correlation distance. 15

System Architecture (10/11)  Preference Visualization 16  To visualize a user’s musical preferences in the form of a humanoid cartoon character, the Musical Avatar.  Selected descriptors of each track are summarized across all tracks in the preference set.  Mapping descriptors to the visual elements of the avatar, which are implemented using Processing.

System Architecture (11/11) 17 Figure 1: Block-diagram of the proposed methodology. The user’s music tracks are analyzed and represented with low-level audio features which are Later transformed, by means of classifiers, into semantic descriptors. The individual track descriptors are furthermore summarized into the user profile Which is finally mapped to a series of graphical features of the Musical Avatar. Calculate low level feature by extract frame by frame Summarized means & variance across all frames Using SVM perform regression by suitable classifier produce semantic dimension. High semantic descriptor space contain probability estimate for each class

Implementation Demo System: 18

Conclusion  They presented a system for music recommendation and user preference visualization. Operating on content based information extracted from audio with high-level semantic description  System employs the Last.fm and SoundCloud APIs to generate semantic user models.  System generates musical recommendations, relying on a semantic similarity measure between music tracks.  Musical preference of the users are visualized in the form of Musical Avatars. 19

Thank You! 20