Recognition of bumblebee species by their buzzing sound

Slides:



Advertisements
Similar presentations
ECG Signal processing (2)
Advertisements

Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Designing Facial Animation For Speaking Persian Language Hadi Rahimzadeh June 2005.
Dual-domain Hierarchical Classification of Phonetic Time Series Hossein Hamooni, Abdullah Mueen University of New Mexico Department of Computer Science.
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
AN IMPROVED AUDIO Jenn Tam Computer Science Dept. Carnegie Mellon University SOAPS 2008, Pittsburgh, PA.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
1 Music Classification Using SVM Ming-jen Wang Chia-Jiu Wang.
Eng. Shady Yehia El-Mashad
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Age and Gender Classification using Modulation Cepstrum Jitendra Ajmera (presented by Christian Müller) Speaker Odyssey 2008.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling V. Karjigi , P. Rao Dept. of Electrical Engineering,
Behavior analysis based on coordinates of body tags Mitja Luštrek, Boštjan Kaluža, Erik Dovgan, Bogdan Pogorelc, Matjaž Gams Jožef Stefan Institute, Department.
Artificial Intelligence Lecture No. 29 Dr. Asad Ali Safi ​ Assistant Professor, Department of Computer Science, COMSATS Institute of Information Technology.
Multimodal Information Analysis for Emotion Recognition
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Hidden Markov Classifiers for Music Genres. Igor Karpov Rice University Comp 540 Term Project Fall 2002.
Look who’s talking? Project 3.1 Yannick Thimister Han van Venrooij Bob Verlinden Project DKE Maastricht University.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Ivica Dimitrovski 1, Dragi Kocev 2, Suzana Loskovska 1, Sašo Džeroski 2 1 Faculty of Electrical Engineering and Information Technologies, Department of.
Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.
Predicting Voice Elicited Emotions
Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.
Chenn-Jung Huang a*, Yi-Ju Yang b, Dian-Xiu Yang a, You-Jia Chen a a Department of Computer and Information Science b Institute of Ecology and Environmental.
Neural Network Recognition of Frequency Disturbance Recorder Signals Stephen Tang REU Final Presentation July 22, 2014.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
CS 445/656 Computer & New Media
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning with Spark MLlib
Applying Deep Neural Network to Enhance EMPI Searching
Debesh Jha and Kwon Goo-Rak
Action-Grounded Push Affordance Bootstrapping of Unknown Objects
Recognition of biological cells – development
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
An Artificial Intelligence Approach to Precision Oncology
Spoken Digit Recognition
Efficient Image Classification on Vertically Decomposed Data
Presentation on Artificial Neural Network Based Pathological Voice Classification Using MFCC Features Presenter: Subash Chandra Pakhrin 072MSI616 MSC in.
Hybrid Features based Gender Classification
Schizophrenia Classification Using
Machine Learning Feature Creation and Selection
Massachusetts Institute of Technology
Machine Learning Week 1.
Efficient Image Classification on Vertically Decomposed Data
Musical Style Classification
National Conference on Recent Advances in Wireless Communication & Artificial Intelligence (RAWCAI-2014) Organized by Department of Electronics & Communication.
Behrouz Minaei, William Punch
CSCI N317 Computation for Scientific Applications Unit Weka
Audio and Speech Computers & New Media.
Digital Systems: Hardware Organization and Design
Yi Zhao1, Yanyan Shen*1, Yanmin Zhu1, Junjie Yao2
AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
John H.L. Hansen & Taufiq Al Babba Hasan
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Lecture 10 – Introduction to Weka
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Presenter: Shih-Hsiang(士翔)
Measuring the Similarity of Rhythmic Patterns
Keyword Spotting Dynamic Time Warping
Combination of Feature and Channel Compensation (1/2)
Minimax Pathology and Real-Number Minimax Model
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Earthquake Prediction
Practical Hidden Voice Attacks against Speech and Speaker Recognition Systems NDSS 2019 Hadi Abdullah, Washington Garcia, Christian Peeters, Patrick.
Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017
Presenter: Donovan Orn
Presentation transcript:

Recognition of bumblebee species by their buzzing sound Mukhiddin Yusupov, Mitja Luštrek, Janez Grad and Matjaž Gams Jožef Stefan Institute Department of Intelligent Systems

Bumblebees 35 species in Slovenia Important pollinators, faster than honeybees, fly even in bad weather Sometimes bred in captivity for pollination in greenhouses One queen, a few hundred workers in a nest Endangered by intensive farming resulting in lack of flowers and safe nesting places

Our task Build a classifier: Input: recording of bumblebee sound Output: species and queen/worker type

Presentation structure Related work Dataset Preprocessing Feature extraction Classification Experimental results

Related work No work on classification of bumblebees

Related work No work on classification of bumblebees Some work on: General insect recognition for pest control A small competition for the best distance metric for insect sounds Birds Frogs ...

Dataset 71 recordings of bumblebees in the wild Queens of 8 species Workes of 7 species 15 classes in total

Preprocessing Manual segmentation Easy to see 3 relevant segments Not so easy to see that only 20% buzzing

Preprocessing Manual segmentation Volume normalization Pre-emphasis: boost high frequencies to emphasize buzzing over environmental noise Easy to see 3 relevant segments Not so easy to see that only 20% buzzing

Feature extraction (1) Segment the signal into short windows

Feature extraction (1) Segment the signal into short windows Mel-frequency cepstral coefficients (MFCC): Fourier transform: time → frequency domain Psycho-acoustic transformation of the frequency-domain data (appropriate binning etc.) Use 13 coefficients (bins)

Feature extraction (2) Linear predictive coefficients (LPC): Signal at a given time expressed as a linear combination of signals at previous times Select coefficients so that the difference between the signal expressed this way and the real signal within the window is minimized Use 10 coefficients

Feature extraction (2) Linear predictive coefficients (LPC): Signal at a given time expressed as a linear combination of signals at previous times Select coefficients so that the difference between the signal expressed this way and the real signal within the window is minimized Use 10 coefficients Compute mean coefficient values over all the windows in a recording

Classification Three-fold cross validation (too few recordings per class for more folds)

Classification Three-fold cross validation (too few recordings per class for more folds) Three machine-learning algorithms: J48 decision trees Multi-layer perceptron (MLP) Support vector machine (SVM)

Experimental results (1) 15 classes (species and queen/worker type) Two classes > 95 %, five > 80 % Algorithm \ Features MFCC LPC J48 56 % MLP 60 % SVM 64 % 57 %

Experimental results (2) 8 classes (species) Algorithm \ Features MFCC LPC J48 51 % 48 % MLP 50 % 49 % SVM 52 %

Experimental results (2) 8 classes (species) 2 classes (queen/worker type) Algorithm \ Features MFCC LPC J48 51 % 48 % MLP 50 % 49 % SVM 52 % Algorithm \ Features MFCC LPC J48 90 % 88 % MLP 93 % 87 % SVM 96 % 91 %

Conclusion Recognition of bumblebees by their buzzing sound with accuracy 64 % More data needed

Conclusion In the future: Recognition of bumblebees by their buzzing sound with accuracy 64 % More data needed In the future: Web application to classify unlabeled data Accept submissions of new labeled data