Mining Mouse Vocalizations Jesin Zakaria Department of Computer Science and Engineering University of California Riverside.

Slides:



Advertisements
Similar presentations
Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Advertisements

Content-Based Image Retrieval
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Color Image Processing
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
The Statistics of Fingerprints A Matching Algorithm to be used in an Investigation into the Reliability of the Use of Fingerprints for Identification Bob.
Quadtrees, Octrees and their Applications in Digital Image Processing
Clustered alignments of gene- expression time series data Adam A. Smith, Aaron Vollrath, Cristopher A. Bradfield and Mark Craven Department of Biosatatistics.
Classification Dr Eamonn Keogh Computer Science & Engineering Department University of California - Riverside Riverside,CA Who.
User Manual of Mining Mouse Vocalizations Prepared by Jesin Zakaria and Eamonn Keogh.
Jessica Lin, Eamonn Keogh, Stefano Loardi
Quadtrees, Octrees and their Applications in Digital Image Processing
Geometric Crossovers for Supervised Motif Discovery Rolv Seehuus NTNU.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
A String Matching Approach for Visual Retrieval and Classification Mei-Chen Yeh* and Kwang-Ting Cheng Learning-Based Multimedia Lab Department of Electrical.
INTRODUCTION Problem: Damage condition of residential areas are more concerned than that of natural areas in post-hurricane damage assessment. Recognition.
Motif finding: Lecture 1 CS 498 CXZ. From DNA to Protein: In words 1.DNA = nucleotide sequence Alphabet size = 4 (A,C,G,T) 2.DNA  mRNA (single stranded)
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Space-Efficient Sequence Alignment Space-Efficient Sequence Alignment Bioinformatics 202 University of California, San Diego Lecture Notes No. 7 Dr. Pavel.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Quadtrees, Octrees and their Applications in Digital Image Processing.
Automatic Minirhizotron Root Image Analysis Using Two-Dimensional Matched Filtering and Local Entropy Thresholding Presented by Guang Zeng.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
In Defense of Nearest-Neighbor Based Image Classification Oren Boiman The Weizmann Institute of Science Rehovot, ISRAEL Eli Shechtman Adobe Systems Inc.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
Limitations of Cotemporary Classification Algorithms Major limitations of classification algorithms like Adaboost, SVMs, or Naïve Bayes include, Requirement.
Lei Zhang and Guoning Chen, Department of Computer Science, University of Houston Robert S. Laramee, Swansea University David Thompson and Adrian Sescu,
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Fast Shapelets: All Figures in Higher Resolution.
1 Regions and Binary Images Hao Jiang Computer Science Department Sept. 24, 2009.
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
Learning Analogies and Semantic Relations Nov William Cohen.
VizTree Huyen Dao and Chris Ackermann. Introducing example
Intro. ANN & Fuzzy Systems Lecture 16. Classification (II): Practical Considerations.
Cancer Metastases Classification in Histological Whole Slide Images
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Color Image Processing
Color Image Processing
Automatic Video Shot Detection from MPEG Bit Stream
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
Color Image Processing
Here is a plot which we call a raster plot
Human-level control through deep reinforcement learning
Historic Document Image De-Noising using Principal Component Analysis (PCA) and Local Pixel Grouping (LPG) Han-Yang Tang1, Azah Kamilah Muda1, Yun-Huoy.
REMOTE SENSING Multispectral Image Classification
Scale-Space Representation of 3D Models and Topological Matching
Mark Sayles, Ian M. Winter  Neuron 
Color Image Processing
(A) Graph depicting interaction between the five regulatory motifs measured by synergy and cooccurrence. (A) Graph depicting interaction between the five.
Three-Dimensional Structure of the Human DNA-PKcs/Ku70/Ku80 Complex Assembled on DNA and Its Implications for DNA DSB Repair  Laura Spagnolo, Angel Rivera-Calzada,
A Similarity Retrieval System for Multimodal Functional Brain Images
Structure of the Yeast RNA Polymerase II Holoenzyme
network of simple neuron-like computing elements
Mismatch Receptive Fields in Mouse Visual Cortex
Object Detection Creation from Scratch Samsung R&D Institute Ukraine
Pejman Mohammadi, Niko Beerenwinkel, Yaakov Benenson  Cell Systems 
Color Image Processing
Visual Search and Attention
Volume 36, Issue 4, Pages (November 2002)
Volume 54, Issue 6, Pages (June 2007)
The Spatiotemporal Organization of the Striatum Encodes Action Space
Volume 19, Issue 6, Pages (March 2009)
Structural Flexibility of CaV1. 2 and CaV2
Raghav Rajan, Allison J. Doupe  Current Biology 
by Khaled Nasr, Pooja Viswanathan, and Andreas Nieder
Lecture 16. Classification (II): Practical Considerations
Feature computation and classification of grating pitch.
Temporal profiles of eye velocity and neural activity for normal saccades. Temporal profiles of eye velocity and neural activity for normal saccades. Green.
Volume 11, Issue 7, Pages (May 2015)
Presentation transcript:

Mining Mouse Vocalizations Jesin Zakaria Department of Computer Science and Engineering University of California Riverside

124 Time (second) kHz 100 laboratory mice Mouse Vocalizations Figure 1: top) A waveform of a sound sequence produced by a lab mouse, middle) A spectrogram of the sound, bottom) An idealized version of the spectrogram

A X Q X P A X C X P The intution behind symbolizing the spectrogram Figure 3: The two fragments of data shown in Figure 2.bottom aligned to produce the maximum overlap. (Best viewed in color) Figure 4: The data shown in Figure 2 augmented by labeled syllables Figure1: top ) Two 0.5 second spectrogram representations of fragments of the vocal output of a male mouse. bottom ) Idealized (by human intervention) versions of the above 2

Time (second) kHz original idealized Background Figure 6: top) Original spectrogram, bottom) Idealized spectrogram (after thresholding and binarization) Figure 7: left) A real spectrogram of a mouse vocalization can be approximated by samples of handwritten Farsi digits (right). Some Farsi digits were rotated or transposed to enhance the similarity Time (sec) Figure 5: A snippet spectrogram that has seven syllables

I L SP connected components Figure 8: from left to right)snippet spectrogram, matrix corresponding to an idealized spectrogram I, matrix corresponding to the set of connected components L, mbrs of the candidate syllables Extracting syllables from spectrogram

I J K L M N O P A B C D E F G H a b c d e f g h i j k New Class Editing Ground truth I J K L M N O P A B C D E F G H Figure 9: Sixteen syllables provided by domain experts Figure 11: Ambiguity reduction of the original set of syllable classes. Representative examples from the reduced set of eleven classes are labeled as small letters

Editing Ground truth Figure 10: Thick/red curve represents the accuracy of classifying syllables of edited ground truth. Thin/blue curve represents the accuracy of classifying 692 labeled syllables using edited ground truth Adding more instances Classification Accuracy for edited ground truth for all the labeled syllables

Data mining Mouse Vocalizations ccccccgc eccccccc ecccccc ciaciaci dcibfcd ddcibfcd ccccccgc Figure 12: A clustering of eight snippets of mouse vocalization spectrograms using the string edit distance on the extracted syllables (spectrograms are rotated 90 degrees for visual clarity) Figure 13: A clustering of the same eight snippets of mouse vocalization shown in Figure 12 using the correlation method. The result appears near random Clustering mouse vocalizations

c c c c query image Data mining Mouse Vocalizations Similarity search / Query by content Figure 15: top) The query image from [2] was transcribed to cccc. Similar patterns are found in CT (first row) and KO (second row) mouse vocalizations in our collection Figure 14: top) A query image from [1], The syllable labels have been added by our algorithm to produce the query ciabqciacia, bottom) the two best matches found in our dataset; corresponding symbolic strings are ciafqcicia and ciqbqcaacja, with edit distance 2 and 3, respectively [1] J. M. S. Grimsley, et al., Development of Social Vocalizations in Mice. PLoS ONE 6(3): e17460 (2011). [2] T. E. Holy, Z. Guo, Ultrasonic songs of male mice, PLoS Biol 3(12): e386, (2005). query image c c c a a i b q a ii ciafqcicia Edit dist 2 ciqbqcaacja Edit dist 3

944.7 – sec – sec motif Data mining Mouse Vocalizations Assessing Motif Significance using z-score 16 17

Overrepresented in Control Overrepresented in Knock-out Figure 18: Examples of contrast set phrases. top) Three examples of a phrase ciacia that is overrepresented in KO, appearing 24 times in KO but never in CT. bottom) Two examples of a phrase dccccc that appears 39 times in CT and just twice in KO Contrast set mining using information gain