Artist Identification Based on Song Analysis

Slides:

Advertisements

Similar presentations

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Advertisements

CVPR2013 Poster Modeling Actions through State Changes.

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,

Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.

Dual-domain Hierarchical Classification of Phonetic Time Series Hossein Hamooni, Abdullah Mueen University of New Mexico Department of Computer Science.

Bring Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Date: 2011/11/21 Source: Claudiu S. Firan (CIKM’10)

A Comprehensive Study on Third Order Statistical Features for Image Splicing Detection Xudong Zhao, Shilin Wang, Shenghong Li and Jianhua Li Shanghai Jiao.

Berenzweig - Music Recommendation1 Music Recommendation Systems: A Progress Report Adam Berenzweig April 19, 2002.

Language and Speaker Identification using Gaussian Mixture Model Prepare by Jacky Chau The Chinese University of Hong Kong 18th September, 2002.

FACE RECOGNITION, EXPERIMENTS WITH RANDOM PROJECTION

A Supervised Approach for Detecting Boundaries in Music using Difference Features and Boosting Douglas Turnbull Computer Audition Lab UC San Diego, USA.

Semi-supervised protein classification using cluster kernels Jason Weston, Christina Leslie, Eugene Ie, Dengyong Zhou, Andre Elisseeff and William Stafford.

FYP0202 Advanced Audio Information Retrieval System By Alex Fok, Shirley Ng.

Automatic Gender Identification using Cell Phone Calling Behavior Presented by David.

SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.

1 Template-Based Classification Method for Chinese Character Recognition Presenter: Tienwei Tsai Department of Informaiton Management, Chihlee Institute.

Study of Word-Level Accent Classification and Gender Factors

Active Learning for Class Imbalance Problem

International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.

Evaluation of Speaker Recognition Algorithms. Speaker Recognition Speech Recognition and Speaker Recognition speaker recognition performance is dependent.

Polyphonic Music Transcription Using A Dynamic Graphical Model Barry Rafkind E6820 Speech and Audio Signal Processing Wednesday, March 9th, 2005.

Dan Rosenbaum Nir Muchtar Yoav Yosipovich Faculty member : Prof. Daniel LehmannIndustry Representative : Music Genome.

MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.

Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.

I can be You: Questioning the use of Keystroke Dynamics as Biometrics —Paper by Tey Chee Meng, Payas Gupta, Debin Gao Presented by: Kai Li Department of.

Singer similarity / identification Francois Thibault MUMT 614B McGill University.

Singer Similarity Doug Van Nort MUMT 611. Goal Determine Singer / Vocalist based on extracted features of audio signal Classify audio files based on singer.

Performance Comparison of Speaker and Emotion Recognition

Introduction to Pattern Recognition (การรู้จํารูปแบบเบื้องต้น)

CS378 Final Project The Netflix Data Set Class Project Ideas and Guidelines.

Arlindo Veiga Dirce Celorico Jorge Proença Sara Candeias Fernando Perdigão Prosodic and Phonetic Features for Speaking Styles Classification and Detection.

Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.

Improving Support Vector Machine through Parameter Optimized Rujiang Bai, Junhua Liao Shandong University of Technology Library Zibo , China { brj,

Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S

Musical Genre Categorization Using Support Vector Machines Shu Wang.

Statistical techniques for video analysis and searching chapter Anton Korotygin.

ADAPTIVE BABY MONITORING SYSTEM Team 56 Michael Qiu, Luis Ramirez, Yueyang Lin ECE 445 Senior Design May 3, 2016.

Research Methodology Proposal Prepared by: Norhasmizawati Ibrahim (813750)

Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.

A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.

Predictive Automatic Relevance Determination by Expectation Propagation Y. Qi T.P. Minka R.W. Picard Z. Ghahramani.

Automatic Classification of Audio Data by Carlos H. L. Costa, Jaime D. Valle, Ro L. Koerich IEEE International Conference on Systems, Man, and Cybernetics.

Detection Of Anger In Telephone Speech Using Support Vector Machine and Gaussian Mixture Model Prepared By : Siti Marahaini Binti Mahamood.

Topic Modeling for Short Texts with Auxiliary Word Embeddings

My Smartphone knows what you print exploring smartphone-based side-channel attacks against 3d Printers Chen Song, feng lin, zongjie ba, kui ren, chi zhou,

University of Rochester

Bag-of-Visual-Words Based Feature Extraction

Presentation on Artificial Neural Network Based Pathological Voice Classification Using MFCC Features Presenter: Subash Chandra Pakhrin 072MSI616 MSC in.

Computational NeuroEngineering Lab

Brian Whitman Paris Smaragdis MIT Media Lab

Using Transductive SVMs for Object Classification in Images

VAD (Voice Activity Detector)

Presented by Steven Lewis

CRANDEM: Conditional Random Fields for ASR

Christophe Dubach, Timothy M. Jones and Michael F.P. O’Boyle

Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611

Somi Jacob and Christian Bach

Presentation on Timbre Similarity

A maximum likelihood estimation and training on the fly approach

Anthor: Andreas Tsiartas, Prasanta Kumar Ghosh,

Speaker Identification:

Jia-Bin Huang Virginia Tech

Kostas Kolomvatsos, Christos Anagnostopoulos

Automatic Handwriting Generation

Measuring the Similarity of Rhythmic Patterns

Report 4 Brandon Silva.

Auditory Morphing Weyni Clacken

Presentation transcript:

Artist Identification Based on Song Analysis

Motivation People have a large collection of digital music Although metadata about songs are available from other sources it would be nice to be able to recognize an artist from the song itself.

Previous Work Michael Mandel and Dan Ellis have come up with a scheme to create Gaussian Model using the mean and variances of 20 features of MFCC vectors calculated for all the frames of a song . Once this Gaussian model is formed they compare the similar Gaussian Model of the test song using a SVM classifier.

Previous Work… With this method they achieved 69% to 84% accuracy in detecting artists http://www.ee.columbia.edu/~dpwe/pubs/ismir05-svm.pdf

Proposed Extension The MFCC frames of a song that were chosen to form a Gaussian model were random. I would like to use the Music similarity measures to select the MFCC frames of a song. The paper by Matthew Cooper and Jonathan Foote on Automatic Music Summarization by Similarity analysis provides a way for getting an audio thumbnail for a song http://www.fxpal.com/publications/FXPAL-PR-02-171.pdf

Proposed Extension… For artist identification purpose the frames which has the artists voice has the most information MFCC features are good at identifying the spectral characteristics of speech The assumption is that the frames of songs having high similarity scores will probably have artists voice in it.

Proposed Extension… I plan to build a similarity matrix comparing the MFCC vectors of each frame of the song with every other frame The distance between the frames will be calculated based on the dot product of the feature vectors The Frames having the highest similarity scores across frames of a song will be chosen to build a Gaussian model

Proposed Extension… I plan to experiment with how many frames I select using this similarity metric for song level features and then using a SVM classifier . Mandel & Ellis in their paper had tried to build an artist model based on all the songs from a training set and did not get good results

Proposed Extension… I believe that a successful artist model can be built if we select the right frames from the songs in the training set. I plan to build a Gaussian model for an artist by selecting the frames from the song set with highest similarity scores.

Evaluation I plan to evaluate my techniques on USPOP2002 dataset http://www.ee.columbia.edu/~dpwe/research/musicsim/uspop2002.html Evaluation criteria based on training times and the success of identifying artists.

Challenges Computing similarity matrix may be computationally inefficient

References http://www.geocities.com/sivaram@snet.net