A Regression Approach to Music Emotion Recognition Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, and Homer H. Chen, Fellow, IEEE IEEE TRANSACTIONS ON AUDIO,

Slides:



Advertisements
Similar presentations
V-Detector: A Negative Selection Algorithm Zhou Ji, advised by Prof. Dasgupta Computer Science Research Day The University of Memphis March 25, 2005.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.
Geometric Representation of Regression. ‘Multipurpose’ Dataset from class website Attitude towards job –Higher scores indicate more unfavorable attitude.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Biointelligence Laboratory, Seoul National University
CS Statistical Machine learning Lecture 13 Yuan (Alan) Qi Purdue CS Oct
Particle swarm optimization for parameter determination and feature selection of support vector machines Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen,
« هو اللطیف » By : Atefe Malek. khatabi Spring 90.
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Principal Component Analysis
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Ensemble Learning: An Introduction
1 Ensembles of Nearest Neighbor Forecasts Dragomir Yankov, Eamonn Keogh Dept. of Computer Science & Eng. University of California Riverside Dennis DeCoste.
A Study of the Relationship between SVM and Gabriel Graph ZHANG Wan and Irwin King, Multimedia Information Processing Laboratory, Department of Computer.
What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.
CONTENT-BASED BOOK RECOMMENDING USING LEARNING FOR TEXT CATEGORIZATION TRIVIKRAM BHAT UNIVERSITY OF TEXAS AT ARLINGTON DATA MINING CSE6362 BASED ON PAPER.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
Classification and Prediction: Regression Analysis
Introduction to machine learning
Introduction to Data Mining Engineering Group in ACL.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Summarized by Soo-Jin Kim
Presented By Wanchen Lu 2/25/2013
Performance of Recommender Algorithms on Top-N Recommendation Tasks RecSys 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering.
Multimodal Interaction Dr. Mike Spann
General Tensor Discriminant Analysis and Gabor Features for Gait Recognition by D. Tao, X. Li, and J. Maybank, TPAMI 2007 Presented by Iulian Pruteanu.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting Huang, C. L. & Tsai, C. Y. Expert Systems with Applications 2008.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
An Introduction to Support Vector Machines (M. Law)
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
Diversifying Search Result WSDM 2009 Intelligent Database Systems Lab. School of Computer Science & Engineering Seoul National University Center for E-Business.
1 Chapter 6. Classification and Prediction Overview Classification algorithms and methods Decision tree induction Bayesian classification Lazy learning.
DATA CLUSTERING WITH KERNAL K-MEANS++ PROJECT OBJECTIVES o PROJECT GOAL  Experimentally demonstrate the application of Kernel K-Means to non-linearly.
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
Ensemble Methods in Machine Learning
Classification Ensemble Methods 1
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
MACHINE LEARNING 7. Dimensionality Reduction. Dimensionality of input Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1)
Data Mining and Decision Support
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
Competition II: Springleaf Sha Li (Team leader) Xiaoyan Chong, Minglu Ma, Yue Wang CAMCOS Fall 2015 San Jose State University.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Online Evolutionary Collaborative Filtering RECSYS 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering Seoul National University.
Combining multiple learners Usman Roshan. Decision tree From Alpaydin, 2010.
Incremental Reduced Support Vector Machines Yuh-Jye Lee, Hung-Yi Lo and Su-Yun Huang National Taiwan University of Science and Technology and Institute.
Music Emotion Classification: A Fuzzy Approach
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Data Summit 2016 H104: Building Hadoop Applications Abhik Roy Database Technologies - Experian LinkedIn Profile:
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
Principal Component Analysis (PCA)
Chapter 7. Classification and Prediction
CSE 4705 Artificial Intelligence
Basic machine learning background with Python scikit-learn
Final Year Project Presentation --- Magic Paint Face
Biointelligence Laboratory, Seoul National University
Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen, Zne-Jung Lee
Model generalization Brief summary of methods
Analysis for Predicting the Selling Price of Apartments Pratik Nikte
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Presentation transcript:

A Regression Approach to Music Emotion Recognition Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, and Homer H. Chen, Fellow, IEEE IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 2, FEBRUARY 2008 Sung Eun Park Intelligent Database Systems Lab School of Computer Science & Engineering Seoul National University, Seoul, Korea

Copyright  2008 by CEBT Contents  Introduction Simple concept of the model  Body Regression approach Model Explanation Evaluation  Conclusion Discussion Contribution Q&A 2

Copyright  2008 by CEBT Brief Concept of the Model Thayer’s arousal-valence emotion plane. ♬♬♬♬ ♬ 3

Copyright  2008 by CEBT An application using this concept  Musicovery based on the same concept of this model. click Find relevant music of the point 4

Copyright  2008 by CEBT  Many good regressor(regression algorithms ) are readily available.  Given N inputs (x i, y i ), 1≤ i ≤ N, where x i is a feature vector for the ith input sample, and y i ∈ R is the real value to be predicted for the ith sample, the regression system trains a regression algorithm(regressor) R(∙) such that the mean squared error ε is minimized. Regression Approach 5 minimizea feature vectorReal Value Find this!! Predicted Value

Copyright  2008 by CEBT The model ♬ ♬ ♬ ♬ ♬ Ground Truth Musical Features Regressor Reg.A and Reg.V Subjective test Feature Extraction Regression Emotion Visualization 6

Copyright  2008 by CEBT The model in detail Preprocessing Regressor Training Training Data Subjective Test Feature extraction Reg.AReg.V Preprocessing Emotion Visualization Test Data Reg.AReg.V Feature extraction 7

Copyright  2008 by CEBT  The dependency between the two dimensions, arousal and valence What is the positive music? Then what is the energetic music? Principle Component Analysis(CPA) is common way of reducing the correlation between variables. An Issue of the Continuous Perspective 8 energetic calm Computed by PCA Original data Principle component

Copyright  2008 by CEBT Reducing Correlation Between Variables 9 AV plane: some dependency exists PC plane: no dependency exists Train regressor R p,R q Test in PQ plane and compare with AV plane Details follow in the later presentation

Copyright  2008 by CEBT Dataset Preprocessing Regressor Training Training Data Subjective Test Feature extraction Reg.AReg.V 195 popular songs selected from a number of Western, Chinese, and Japanese albums. 1) These songs should be distributed uniformly in each quadrant of the emotion plane. 2) Each music sample should express a certain dominant emotion. 10

Copyright  2008 by CEBT  253 volunteers from the campus Subjective Test Preprocessing Regressor Training Training Data Subjective Test Feature extraction Reg.AReg.V  Is asked to listen to ten music samples randomly drawn from the music database and to label the AV values from –1.0 to 1.0 in 11 ordinal levels.  Label the evoking emotion rather than the perceived one 11  Standard deviation of evaluation to the same song is 0.3( which is okay)  Same person tend to label same with same music.

Copyright  2008 by CEBT Feature Extraction Preprocessing Regressor Training Training Data Subjective Test Feature extraction Reg.AReg.V 12

Copyright  2008 by CEBT Feature Extraction Preprocessing Regressor Training Training Data Subjective Test Feature extraction Reg.AReg.V Psysound aims to model parameters of Auditory sensation based on some psychoacoustic models. Earlier research found that 15 of the features are more closely related to emotion perception. 13

Copyright  2008 by CEBT Feature Extraction Preprocessing Regressor Training Training Data Subjective Test Feature extraction Reg.AReg.V  Select features from all extracted features which is related to Emotion.  RReliefF is used as a feature extraction algorithm(FSA).  RRF m,n is a space with top-m and top-n selected features. 14

Copyright  2008 by CEBT Regression Algorithms Preprocessing Regressor Training Training Data Subjective Test Feature extraction Reg.AReg.V Three regression algorithms: 1.Multiple linear regression (MLR) Assumes lineal relationship Simple method 2.Support vector regression (SVR) Nonlinearly maps input features into higher dimensional feature space In many cases superior to existing machine learning methods 3. AdaBoost.RT (BoostR) Nonlinear regression algorithm A number of regression trees are trained iteratively and weighted according to the prediction accuracy 15

Copyright  2008 by CEBT  Method R 2 Statistics : showing how much prediction and real value are close.  AV and PC Plane comparison : The effect of variance dependency Evaluation 16 The best combination No significant difference <<

Copyright  2008 by CEBT Evaluation 17 Regressor Comparison A plane with no correlation Selected feature space

Copyright  2008 by CEBT Evaluation – The Prediction Accuracy + Ground Truth Prediction Result The best performance of the regression approach reaches 58.3% for arousal and 28.1% for valence by using PC RRF SVR 18

Copyright  2008 by CEBT Performance Evaluation  Using same ground truth data and feature data 19 =100.3 =117.7

Copyright  2008 by CEBT  Subjectivity issue Individual difference : influence of many factors. Cultural background, generation, sex, and personality.  GWMER(Group-wise MER scheme)  Personalization can be an alternative way. Discussion 20 R1 R… R2 R3 R4 Regressor G1 G… G2 G3 G4 Users Regressor Choosing

Copyright  2008 by CEBT Contribution  One of the first attempts that develop an MER system from a continuous perspective.(Each song maps to a point in the emotion plane)  A sound theoretical foundation is proposed. Regression theory.  Extensive performance study. Several algorithms are tested  Dealing with subjectivity issues of Music Emotion Retrieval(MER). Emotion is different from person to person Two demensions in emotion plane are not dependent. 21

Thank you… Q&A