Kaggle: Whale Challenge

Slides:



Advertisements
Similar presentations
Feature Selection for Pattern Recognition J.-S. Roger Jang ( 張智星 ) CSIE Dept., National Taiwan University ( 台灣大學 資訊工程系 )
Advertisements

Sensor-Based Abnormal Human-Activity Detection Authors: Jie Yin, Qiang Yang, and Jeffrey Junfeng Pan Presenter: Raghu Rangan.
Pitch Tracking (音高追蹤) Jyh-Shing Roger Jang (張智星) MIR Lab (多媒體資訊檢索實驗室)
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Coin Counter Andres Uribe. what Find out the amount of money in a coin picture.
Speaker Recognition Sharat.S.Chikkerur Center for Unified Biometrics and Sensors
Vision Based Control Motion Matt Baker Kevin VanDyke.
Automatic in vivo Microscopy Video Mining for Leukocytes * Chengcui Zhang, Wei-Bang Chen, Lin Yang, Xin Chen, John K. Johnstone.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
A Comprehensive Study on Third Order Statistical Features for Image Splicing Detection Xudong Zhao, Shilin Wang, Shenghong Li and Jianhua Li Shanghai Jiao.
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Medical Imaging Mohammad Dawood Department of Computer Science University of Münster Germany.
Medical Imaging Mohammad Dawood Department of Computer Science University of Münster Germany.
Assuming normally distributed data! Naïve Bayes Classifier.
A Data-Driven Approach to Quantifying Natural Human Motion SIGGRAPH ’ 05 Liu Ren, Alton Patrick, Alexei A. Efros, Jassica K. Hodgins, and James M. Rehg.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
Speaker Adaptation for Vowel Classification
Color a* b* Brightness L* Texture Original Image Features Feature combination E D 22 Boundary Processing Textons A B C A B C 22 Region Processing.
A new predictive search area approach for fast block motion estimation Kuo-Liang Chung ( 鍾國亮 ) Lung-Chun Chang ( 張隆君 ) 國立台灣科技大學資訊工程系暨研究所 IEEE TRANSACTIONS.
Warped Linear Prediction Concept: Warp the spectrum to emulate human perception; then perform linear prediction on the result Approaches to warp the spectrum:
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
國立屏東商業技術學院 資訊工程系 ( 所 ) 多媒體技術發展實驗室 Laboratory of Multimedia Technology Development Department of Computer Science and Information Engineering Nation Pingtung.
Introduction For some compiler, the intermediate code is a pseudo code of a virtual machine. Interpreter of the virtual machine is invoked to execute the.
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
南台科技大學 資訊工程系 Automatic Website Summarization by Image Content: A Case Study with Logo and Trademark Images Evdoxios Baratis, Euripides G.M. Petrakis, Member,
Under Supervision of Dr. Kamel A. Arram Eng. Lamiaa Said Wed
南台科技大學 資訊工程系 A web page usage prediction scheme using sequence indexing and clustering techniques Adviser: Yu-Chiang Li Speaker: Gung-Shian Lin Date:2010/10/15.
Digital Linear Filters 張智星 (Roger Jang) 多媒體資訊檢索實驗室 清華大學 資訊工程系.
Video Based Palmprint Recognition Chhaya Methani and Anoop M. Namboodiri Center for Visual Information Technology International Institute of Information.
DEVELOPMENT OF ALGORITHM FOR PANORAMA GENERATION, AND IMAGE SEGMENTATION FROM STILLS OF UNDERVEHICLE INSPECTION Balaji Ramadoss December,06,2002.
Experimental Results ■ Observations:  Overall detection accuracy increases as the length of observation window increases.  An observation window of 100.
資訊工程系智慧型系統實驗室 iLab 南台科技大學 1 A Static Hand Gesture Recognition Algorithm Using K- Mean Based Radial Basis Function Neural Network 作者 :Dipak Kumar Ghosh,
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Forward-Scan Sonar Tomographic Reconstruction PHD Filter Multiple Target Tracking Bayesian Multiple Target Tracking in Forward Scan Sonar.
A Comparative Study of Kernel Methods for Classification Applications Yan Liu Oct 21, 2003.
Spam Detection Ethan Grefe December 13, 2013.
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
TRAFFIC SIGN SEGMENTATION AND RECOGNITION IN SCENE IMAGES Fei Qin1, Bin Fang1, Hengjun Zhao1 1. Department of Computer Science, Chongqing University, Chongqing.
Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments 張智星
數位影像處理概論 課程名稱數位影像處理概論 課程編碼 30N06701 系所代碼 / 名稱 03 / 電子系 開課班級夜四技電子四甲 夜四技電子四乙 開課教師賴培淋 學分 3.0 時數 3 必選修選修 南台科技大學 課程資訊.
Intelligent Space 國立台灣大學資訊工程研究所 智慧型空間實驗室 Service Behavior Consistency in the OSGi Platform Authors Y.Qin, H.Hao,L.Jun, G.Jidong and L.Jian Proceedings.
國立交通大學 電信工程研究所 National Chiao Tung University Institute of Communication Engineering 1 Phone Boundary Detection using Sample-based Acoustic Parameters.
Counting How Many Words You Read
Applying the Resonance Frequencies of Mechanical System in the Analysis of Bearing Vibration 沈毓泰 南台科技大學 機械系 副教授.
Medical Image Analysis
Predicting Voice Elicited Emotions
October 16, 2014Computer Vision Lecture 12: Image Segmentation II 1 Hough Transform The Hough transform is a very general technique for feature detection.
哼唱檢索用於嵌入式系統 張智星 多媒體資訊檢索實驗室 台灣大學 資訊工程系.
Intelligent Space 國立台灣大學資訊工程研究所 智慧型空間實驗室 Brainstorming Principles Reporter Chun-Feng Liao Sep 12,2005 Source D.Bellin and S.S.Simone, ”Brainstorming: A.
DTW for Speech Recognition J.-S. Roger Jang ( 張智星 ) MIR Lab ( 多媒體資訊檢索實驗室 ) CS, Tsing Hua Univ. ( 清華大學.
Notes on HW 1 grading I gave full credit as long as you gave a description, confusion matrix, and working code Many people’s descriptions were quite short.
Wire Detection Version 2 Joshua Candamo Friday, February 29, 2008.
Digital Image Processing Lecture 17: Segmentation: Canny Edge Detector & Hough Transform Prof. Charlene Tsai.
資訊工程系智慧型系統實驗室 iLab 南台科技大學 1 A new social and momentum component adaptive PSO algorithm for image segmentation Expert Systems with Applications 38 (2011)
Performance Indices for Binary Classification 張智星 (Roger Jang) 多媒體資訊檢索實驗室 台灣大學 資訊工程系.
Beat Tracking (節拍追蹤) 張智星 (Roger Jang)
Discrete Fourier Transform (DFT)
莊 永 裕 國立台灣大學 資訊工程學系 通訊與多媒體實驗室
Spoken Digit Recognition
Detection of discontinuity using
Intro. to Audio Signals Jyh-Shing Roger Jang (張智星)
Sharat.S.Chikkerur S.Anand Mantravadi Rajeev.K.Srinivasan
Fitting Curve Models to Edges
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Visualizing Audio for Anomaly Detection
Endpoint Detection ( 端點偵測)
Wavelet transform application – edge detection
Longest Common Subsequence (LCS)
Introduction to Artificial Intelligence Lecture 22: Computer Vision II
Edit Distance 張智星 (Roger Jang)
Presentation transcript:

Kaggle: Whale Challenge 張智星 jang@cs.nthu.edu.tw http://www.cs.nthu.edu.tw/~jang 多媒體資訊檢索實驗室 台灣大學 資訊工程系

Whale Challenge Problem definition Characteristics: Imbalance data Identify the existence of whales from sensor recordings Characteristics: Imbalance data Some recordings are hardly recognizable by non-experts

Dataset Training set Test set Recording format 47,844 recordings of 2 seconds 88.97% (42,565 recordings): w/o whales 11.03% (5,276 recordings): with whales Test set 25,468 recordings of 2 seconds Recording format 2000-Hz sample rate, 16-bit resolution

Preprocessing Potential preprocessing Trend removal Noise removal Trend estimation via polynomial fitting Noise removal Band-pass filter Removal of “non-whale” part Linear prediction?

Spectrogram kwcPreprocess.m W/o band-pass filter W/ band-pass filter

Potential Features Acoustic features Volume Pitch Spectrum MFCC … Visual features (obtained from spectrogram) Radon transform Hough transform Gabor filters …

Pitch Tracking kwcPitchTracking.m

Volume kwcVolume.m

Spectrogram kwcSpectrogram.m

Visual Features via Radon Transform Projection onto lines at various angles For grayscale images only Detection objects at a specific angle

Example of Radon Transform Source http://www.mathworks.com/help/images/ref/radon.html Output Code: goRadon.m

Example of Radon Transform (2) Source image Output Code: goRadon2.m

Visual Features via Hough Transform Commonly used for detection lines and circles For BW images only (after edge detection)

Visual Features via Hough Transform (2) Point to curve mapping Two points  Two sine curves The intersection is the right θ and ρ for the line connecting these two points

Example of Hough Transform Source http://www.ebsd-image.org/documentation/reference/ops/hough/op/houghtransform.html Image Hough space and its maxima Detected lines

Example of Hough Transform (2) Source http://www.mathworks.com/help/images/analyzing-images.html (MATLAB code available) Image Edge image Hough space and its maxima Detected lines

Methods Thresholding Static classifiers Sequence classifiers Volume variance Pitch variance Static classifiers Naïve Bayes classifiers GMM SVM … Sequence classifiers HMM CRF …

HMM Training kwcHmmTrain.m

HMM Evaluation kwcHmmEval.m

HMM Basic models Advanced models Class 1: sil Class 2: sil-whale-sil sil-whale-sil-whale-sil … 1.0 sil 0.9 0.4 1.0 sil w sil 0.1 0.6

HMM (2) Other approach Train HMM models Align each recording with the HMM Extract features from the whale part for other static classifiers Duration (no. of frames) Average log likelihood per frame 0.9 0.4 1.0 sil w sil 0.1 0.6

Performance Evaluation Performance evaluation of methods based on thresholding (http://en.wikipedia.org/wiki/Receiver_operating_characteristic): ROC, DET AUC