Brian Whitman Paris Smaragdis MIT Media Lab

Presentation transcript:

Combining Musical and Cultural Features for Intelligent Style Detection. Brian Whitman, Paris Smaragdis, MIT Media Lab

Background: Music classification by style. Style is a “human” concept and is hard to model. It defines subclasses of genres. A recommendation engine can use it for high-confidence results. 11/6/2018 ISE599 - by Frances Kao

Approach: An automatic style detection system that operates on both (1) the acoustic content of the audio and (2) community metadata: a vector space of descriptive textual terms crawled from the web. Dataset: 5 styles, each with 5 different artists.

Audio-based Classification: Convert each song into a representation, then train a neural network to classify a song. Representation: randomly choose 12 songs from each artist -> downsample -> extract the power spectral density (PSD) -> use principal components analysis (PCA) to reduce the dimensionality -> representation of each artist. Classifier: a feedforward time-delay neural network.
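The PSD -> PCA part of the representation above can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the frame length, the Hann window, the synthetic noise "songs", and the helper names `average_psd` and `pca_reduce` are all assumptions, and the downsampling step and the time-delay neural network are left out.

```python
import numpy as np

def average_psd(signal, frame_len=256):
    """Estimate a PSD by averaging periodograms of non-overlapping,
    Hann-windowed frames (frame_len is an assumed value)."""
    n_frames = len(signal) // frame_len
    frames = np.reshape(signal[: n_frames * frame_len], (n_frames, frame_len))
    window = np.hanning(frame_len)
    spectra = np.fft.rfft(frames * window, axis=1)
    power = np.abs(spectra) ** 2 / np.sum(window ** 2)
    return power.mean(axis=0)  # one value per frequency bin

def pca_reduce(features, n_components):
    """Project the rows of `features` onto their top principal components."""
    centered = features - features.mean(axis=0)
    # Rows of vt are the principal directions, sorted by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

# Synthetic stand-in for 12 songs by one artist (random noise, not real audio).
rng = np.random.default_rng(0)
songs = rng.standard_normal((12, 4096))

psd = np.array([average_psd(song) for song in songs])  # shape (12, 129)
reduced = pca_reduce(psd, n_components=4)               # shape (12, 4)
```

The reduced vectors would then serve as input features for whatever classifier is trained on top.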

Audio-based Classification - Result (styles: Heavy Metal, Contemporary Country, Hardcore Rap, Intelligent Dance Music (IDM), R&B): The audio-based classifier fails to overcome intra-style auditory inconsistency. It performs particularly poorly on IDM, because this style has huge auditory variance.

Community Metadata-based Classification (1): Cultural feature. Each artist is associated with the terms that appear in the same web documents as the artist’s name. Each term has a score calculated from its position and frequency of occurrence.

Community Metadata-based Classification (2): Similarity. For every pair of artists, calculate an overlap weight, which is a sum over every term the two artists share. Form a similarity matrix and use it to predict the style of each artist.
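The overlap computation above might look something like the sketch below. Several details are assumptions, since the slides do not give them: each shared term contributes the product of the two artists' scores, the style is predicted by copying the label of the most similar other artist, and the artist names and term scores are made-up toy data.

```python
def overlap_weight(terms_a, terms_b):
    """Sum a contribution for every term shared by two artists.
    Assumption: the contribution is the product of the two term scores."""
    shared = terms_a.keys() & terms_b.keys()
    return sum(terms_a[t] * terms_b[t] for t in shared)

def similarity_matrix(artists):
    """Return sorted artist names and the pairwise overlap matrix."""
    names = sorted(artists)
    matrix = [[overlap_weight(artists[a], artists[b]) for b in names]
              for a in names]
    return names, matrix

def predict_style(name, artists, styles):
    """Assumed prediction rule: take the style of the most similar other artist."""
    others = [other for other in artists if other != name]
    best = max(others, key=lambda o: overlap_weight(artists[name], artists[o]))
    return styles[best]

# Hypothetical artists with made-up term scores.
artists = {
    "metal_1": {"loud": 0.9, "guitar": 0.8, "dark": 0.5},
    "metal_2": {"loud": 0.7, "guitar": 0.6, "fast": 0.4},
    "country_1": {"guitar": 0.3, "twang": 0.9, "ballad": 0.6},
    "country_2": {"twang": 0.8, "ballad": 0.5, "loud": 0.1},
}
styles = {
    "metal_1": "Heavy Metal",
    "metal_2": "Heavy Metal",
    "country_1": "Contemporary Country",
    "country_2": "Contemporary Country",
}

names, matrix = similarity_matrix(artists)
print(predict_style("metal_1", artists, styles))
```

Because the overlap weight is symmetric, only half of the matrix actually needs to be computed in practice.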

Community Metadata-based Classification - Result (styles: Heavy Metal, Contemporary Country, Hardcore Rap, Intelligent Dance Music, R&B): Performance was imperfect for two styles, Hardcore Rap and R&B.

Combined Classification (styles: Heavy Metal, Contemporary Country, Hardcore Rap, Intelligent Dance Music, R&B): Posterior probabilities from the two classifiers, and their average value.
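One plausible reading of this slide is that each classifier yields a posterior probability per style and the two are averaged. The sketch below illustrates that reading only; the function name `combine_posteriors` and the probability values are invented for the example and are not results from the presentation.

```python
import numpy as np

STYLES = ["Heavy Metal", "Contemporary Country", "Hardcore Rap",
          "Intelligent Dance Music", "R&B"]

def combine_posteriors(audio_post, metadata_post):
    """Normalize each classifier's per-style scores and average them."""
    audio = np.asarray(audio_post, dtype=float)
    meta = np.asarray(metadata_post, dtype=float)
    audio = audio / audio.sum()
    meta = meta / meta.sum()
    return (audio + meta) / 2.0

# Made-up posteriors for one artist: the audio model is unsure,
# while the metadata model is confident about one style.
audio_post = [0.10, 0.15, 0.30, 0.25, 0.20]
metadata_post = [0.05, 0.05, 0.10, 0.70, 0.10]

combined = combine_posteriors(audio_post, metadata_post)
predicted = STYLES[int(np.argmax(combined))]
print(predicted)
```

In this toy case the audio posterior alone would pick a different style than the averaged posterior does, which is the kind of correction a combined classifier is meant to provide.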

Conclusion & Future Work: Combined classification overcomes the problems seen with each individual classifier. Future development could use a “culture ratio” to tell recommendation engines which classification method to use.