Michael H. Coen MIT Computer Science and Artificial Intelligence Laboratory AAAI’07 Talk July 25, 2007 Learning to Sing Like a Bird: The Self-Supervised.

Slides:



Advertisements
Similar presentations
Supplementary information for XII International Conference Speech and Computer (SPECOM'2007) October 15-18, 2007 for the submitted paper A Computational.
Advertisements

The Helmholtz Machine P Dayan, GE Hinton, RM Neal, RS Zemel
 Employed to develop french versions of questions on english intelligence tests  Became intrigued with the reasons children gave for wrong answers.
Mustafa Cayci INFS 795 An Evaluation on Feature Selection for Text Clustering.
Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.
Unsupervised Learning Clustering K-Means. Recall: Key Components of Intelligent Agents Representation Language: Graph, Bayes Nets, Linear functions Inference.
Copyright © Allyn & Bacon 2004 Development Through the Lifespan Chapter 5 Cognitive Development in Infancy and Toddlerhood This multimedia product and.
 Course  Name  Student ID  Discipline  Introduction to computing  xxxxxx  xxxxx.
A Brief History of Artificial Intelligence
Module 14 Thought & Language. INTRODUCTION Definitions –Cognitive approach method of studying how we process, store, and use information and how this.
Overview S-Learning Rotary Robot Reaching Grasping Context-Based Similarity Natural Language Processing Perception Bio-Inspiration Applications BECCA BECCA—A.
Unsupervised Feature Selection for Multi-Cluster Data Deng Cai et al, KDD 2010 Presenter: Yunchao Gong Dept. Computer Science, UNC Chapel Hill.
F behavioral plasticity F CH8: development of learning in songbirds F behavioral analysis of birdsong F singing in the brain F summary PART 4: BEHAVIORAL.
Hybrid Pipeline Structure for Self-Organizing Learning Array Yinyin Liu 1, Ding Mingwei 2, Janusz A. Starzyk 1, 1 School of Electrical Engineering & Computer.
Michale Fee McGovern Institute for Brain Research Department of Brain and Cognitive Sciences MIT Jerusalem in Motion Workshop Jerusalem, Israel December.
EE141 1 Broca’s area Pars opercularis Motor cortexSomatosensory cortex Sensory associative cortex Primary Auditory cortex Wernicke’s area Visual associative.
Expectation Maximization Algorithm
Lecture #1COMP 527 Pattern Recognition1 Pattern Recognition Why? To provide machines with perception & cognition capabilities so that they could interact.
Baysian Approaches Kun Guo, PhD Reader in Cognitive Neuroscience School of Psychology University of Lincoln Quantitative Methods 2011.
Biointelligence Laboratory School of Computer Science and Engineering Seoul National University Cognitive Robots © 2014, SNU CSE Biointelligence Lab.,
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
INTELLIGENT SYSTEMS Artificial Intelligence Applications in Business.
Vedrana Vidulin Jožef Stefan Institute, Ljubljana, Slovenia
Resilient Machines Through Continuous Self-Modeling Pattern Recognition Seung-Hyun Lee Soft Computing Lab. Josh Bongard,Victor Zykov, and Hod.
The biological basis of bird song production. Bird song facts: 1.There are about 4000 species of song birds each of which usually produce 1 to many bird.
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
Jean Piaget ( ) was a biologist who originally studied molluscs but moved into the study of the development of children's understanding, through.
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 16 Nov, 3, 2011 Slide credit: C. Conati, S.
Babies and Computers Are They Related? – Abel Nyamapfene.
Study Guide Questions Describe neurological maturation and the plasticity viewpoints. Does thought precede language (Piaget) or do they interact (Vygotsky)?
A Regression Approach to Music Emotion Recognition Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, and Homer H. Chen, Fellow, IEEE IEEE TRANSACTIONS ON AUDIO,
IRCS/CCN Summer Workshop June 2003 Speech Recognition.
ON REAL-TIME MEAN-AND-VARIANCE NORMALIZATION OF SPEECH RECOGNITION FEATURES Pere Pujol, Dušan Macho, and Climent NadeuNational ICT TALP Research Center.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Fig.1. Flowchart Functional network identification via task-based fMRI To identify the working memory network, each participant performed a modified version.
How to evaluate (and compare/contrast) an approach
Chapter 6: Theories of Cognitive Development. Chapter 6: Theories of Cognitive Development Chapter 6 has three modules: Module 6.1 Setting the Stage:
Exploiting cross-modal rhythm for robot perception of objects Artur M. Arsenio Paul Fitzpatrick MIT Computer Science and Artificial Intelligence Laboratory.
Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.
© 2011 The McGraw-Hill Companies, Inc. Instructor name Class Title, Term/Semester, Year Institution Intelligence Introductory Psychology Concepts.
The Neurology of Speech and Language: Avians to Humans David B. Rosenfield, M.D. Director, Speech/Language Center Director, EMG/Motor Control Lab. Professor.
Introduction to Deep Learning
Language Development Theories: Practical Implications
Vedrana Vidulin Jožef Stefan Institute, Ljubljana, Slovenia
Miss. Mona AL-Kahtani.  Basic assumption:  Language acquisition is one example of the human child’s remarkable ability to learn from experience and.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Thanh Le, Katheleen J. Gardiner University of Colorado Denver
Cognitive Science Overview Introduction, Syllabus
SPARSITY & SPEECH SCIENCE? TOWARDS A DATA-DRIVEN CHARACTERIZATION OF SPEECH MOTOR CONTROL V IKRAM R AMANARAYANAN University of Southern California, Los.
Robot Intelligence Technology Lab. 10. Complex Hardware Morphologies: Walking Machines Presented by In-Won Park
UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
V k equals the vector difference between the object and the block across the first and last frames in the image sequence or more formally: Toward Learning.
CPH Dr. Charnigo Chap. 14 Notes In supervised learning, we have a vector of features X and a scalar response Y. (A vector response is also permitted.
Big data classification using neural network
Nicolas Alzetta CoNGA: Cognition and Neuroscience Group of Antwerp
Goodfellow: Chap 1 Introduction
A BRIEF OVERVIEW FOR CRITICAL PERIODS OF VOCAL LEARNING
Done Done Course Overview What is AI? What are the Major Challenges?
Machine Learning Basics
Information-Theoretic Listening
Goodfellow: Chap 1 Introduction
Michael Meredith Biological Science
Multiple Change Point Detection for Symmetric Positive Definite Matrices Dehan Kong University of Toronto JSM 2018 July 30, 2018.
Deep learning Introduction Classes of Deep Learning Networks
Quaid –e- azam university
Toward a Great Class Project: Discussion of Stoianov & Zorzi’s Numerosity Model Psych 209 – 2019 Feb 14, 2019.
Presentation transcript:

Michael H. Coen MIT Computer Science and Artificial Intelligence Laboratory AAAI’07 Talk July 25, 2007 Learning to Sing Like a Bird: The Self-Supervised Acquisition of Birdsong &

AAAI’07 TalkM.H. Coen Outline Why do this research? Background: Cross-Modal Clustering (+ demo)  A biologically-inspired algorithm for machine learning (Coen 2005, 2006a, 2006b, Coen et al. 2007) A brief introduction to the zebra finch An architecture for sensorimotor learning (+ demo)  A simple, recursive application of cross-modal clustering  Views motor control as perception backwards Discussion IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion (Taeniopygia guttata)

AAAI’07 TalkM.H. Coen In the grand scheme of things… AIEngineering Applied Math Science Statistical NLP Deep Blue/Chinook DARPA Grand Challenge Optimization Operations Research Statistical Machine Learning Physiology Neuroscience Cognitive Science IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen A fundamental question Animals solve extremely difficult non-parametric and distribution free learning problems during development. How? IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion Belief: Answering this lets us: 1)Better understand learning in animals 2)Build new types of machine learning systems

AAAI’07 TalkM.H. Coen Cross-modal clustering briefly… Use multiple viewpoints (or datasets) describing the same events makes learning easier Biological motivation:  Perceptual systems share information constantly during “ordinary” perception (Stein and Meredith 1993, Shimojo and Shams 2001, Calvert et al. 2004, Spence and Driver 2004) IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion In a nutshell, CMC exploits redundancy within correlated datasets to discover unknown categories

AAAI’07 TalkM.H. Coen How does it work? A simple example Assume two events in the world: red and blue Events in the world: IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen How does it work? A simple example Assume two events in the world: red and blue Assume two datasets: Mode A and Mode B Events in the world: Mode A Mode B Thought experiment creature: IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen The view from the inside the creature… Mode AMode B Can we learn the red and blue events by sharing internal perspectives? IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion Note: We will call these datasets slices

AAAI’07 TalkM.H. Coen Recovering the categories 1)Iteratively project regions in each dataset onto the other dataset. 2) Merge regions in each dataset whose projections are the closest. 3)Continue… To play with online, Google: MIT Artificial Intelligence Demonstrations IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion Mode AMode B

Acquire language Understand fMRI data Learn to sing Sensorimotor learning What can you learn when you know nothing?

Acquire language Understand fMRI data What can you learn when you know nothing? Learn to sing Sensorimotor learning

AAAI’07 TalkM.H. Coen The zebra finch Small, unusually social oscine songbird Perhaps the most studied bird in neuroscience Complex vocal harmonics  People often mistake spectrograms for human speech Almost identical FoxP2 gene with humans  Governs vocal generation (Taeniopygia guttata) IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Dynamics of song acquisition Day 1: Fledgling is born! First month: Father sings to his children ~Day 20: Males begin singing to themselves Day 90: Song crystallizes at sexual maturity IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

An Architecture for Sensorimotor Learning Sensory CortexMotor Cortex Perceptual Processing Perceptual Slices Motor Control Innate Exploratory Motor Behaviors Sensory Organs Muscles/Effectors Afferent Processing Efferent Processing External World Perceptual Slices Events in the world IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion Cross-Modal Clustering happens here!

Sensory CortexMotor Cortex Perceptual Processing Perceptual Slices Motor Control Innate Exploratory Motor Behaviors Sensory Organs Muscles/Effectors Motor Slices Internal Perception (Cartesian Theater) Afferent Processing Efferent Processing An Architecture for Sensorimotor Learning External World Motor Slices Innate Exploratory Motor Behaviors Cross-Modal Clustering now happens here!

Sensory CortexMotor Cortex Perceptual Processing Perceptual Slices Motor Slices Motor Control Sensory Organs Muscles/Effectors Internal Perception (Cartesian Theater) Afferent Processing Efferent Processing An Architecture for Sensorimotor Learning External World Innate Exploratory Motor Behaviors

AAAI’07 TalkM.H. Coen Parental training: a simple example IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Self-observation of innate activity Internal self-observation (Cartesian Theater) IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion External self-observation (Perceptual channels)

Recursive cross-modal clustering

AAAI’07 TalkM.H. Coen Acquired intentional motor control IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

Sensory CortexMotor Cortex Perceptual Processing Perceptual Slices Motor Control Sensory Organs Muscles/Effectors Afferent Processing Efferent Processing An Architecture for Sensorimotor Learning External World Innate Exploratory Motor Behaviors Motor Slices Internal Perception (Cartesian Theater) Articulatory Synthesizer Perceptual Slices

Sensory CortexMotor Cortex Perceptual Processing Perceptual Slices Motor Control Sensory Organs Muscles/Effectors Afferent Processing Efferent Processing An Architecture for Sensorimotor Learning External World Innate Exploratory Motor Behaviors Articulatory Synthesizer Internal Perception (Cartesian Theater) Motor Slices

Sensory CortexMotor Cortex Perceptual Processing Perceptual Slices Motor Control Sensory Organs Muscles/Effectors Afferent Processing Efferent Processing An Architecture for Sensorimotor Learning External World Innate Exploratory Motor Behaviors Articulatory Synthesizer Internal Perception (Cartesian Theater) Motor Slices

AM FM Entropy Amplitude Mean Frequency Pitch Goodness Pitch Pitch Weight

AM FM Entropy Amplitude Mean Frequency Pitch Goodness Pitch Pitch Weight Defining Songemes

AAAI’07 TalkM.H. Coen A learner for birdsong Lower level features Higher level features IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion A 15 dimension, highly compact manifold

AAAI’07 TalkM.H. Coen Some zebra finch slices Goodness of pitch Pitch Mean frequency Wiener Entropy IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Early “bird” babbling IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Samba “Samba’s son” Birdsong mimicry IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion A word about evaluating empirical experiments…

AAAI’07 TalkM.H. Coen Contributions A new architecture for sensorimotor learning  Entirely self-supervised  Biologically inspired  Extremely simple, dimensionally compact Wide range of applications  Robotics  Sensor arrays  Computational learning  Dynamic control systems  Skill acquisition based on observation IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Acknowledgments Ofer Tchernichovski Whitman Richards Rodney Brooks Howard Shrobe Patrick Winston Robert Berwick Gerald Sussman Adam Kraft Kobi Gal Krzysztof Gajos To play with online, Google: MIT Artificial Intelligence Demonstrations IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Extra slides follow

AAAI’07 TalkM.H. Coen Acquisition of harmonic complexity IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Related work Unsupervised clustering: Language  de Marcken (1996)  de Sa and Ballard (1997)  Lin (2004) Vision  Bartlett (2001)  Stauffer (2002) Statistical clustering  Dempster et al. (1977)  Smyth (1999) Blind signal separation  Hyvärinen (2001) Neuroscience  Becker and Hinton (1995), Becker (2005)  Granger (2003) Auditory scene analysis  Slaney et al. (2001) Minimal supervision  Blum and Mitchell (1998) Co-Clustering (Bi-Clustering, Block Clustering)  Friedman, Mosenzon, Slonim, and Tishby (2001)  Taskar, Segal, and Koller (2001)  Madeira and Oliveira (2004) Analysis of animal vocalizations: Birds (finches and buntings)  Kogan and Margoliash (1997) Bowhead Whales  Mellinger and Clark (1993) African elephants  Clemins and Johnson (2003) Humans  Guenther and Perkell (2004) Primary distinctions of our approach: 1.Fully unsupervised 2.Non-parametric:  Distribution free  Unknown number of clusters 3.Presumes no domain knowledge 4.Neurologically motivated IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion

AAAI’07 TalkM.H. Coen Current and future work Human protolinguistic babbling  Proficiency of an eight month old child  Entire phonetic structure of English Building an atlas of modular brain function  From human and rat fMRI data  New approaches to clinical treatments for autism Theoretical investigations  Convergence properties IntroductionBackgroundZebra FinchesSensorimotor LearningDiscussion