
SYNTHESIS BY EXAMPLE Creation of sound synthesis algorithms using evolutionary methods By: Ricardo A. García MIT - Media Lab Machine Listening Group © 2001

INTRODUCTION
- How is that sound coming out of your soundcard being generated?
- Is that a real violin?
- What kind of controls do we have over it?
  - pitch, length, other performance parameters
  - sound attributes: brightness, harmonic content, etc.
The questions:
- How are those sound synthesis algorithms conceived?
- Can this design process be automated?

In a nutshell...
- User supplies a set of examples (inputs/target output)
- The system "reverse engineers" the sound and returns a sound synthesis technique that is capable of rendering a sound "similar" to the target sound!
[Diagram labels: Inputs, "?", Output, Example TARGET]

Talk outline
- Introduction
- Sound Synthesizers
- Design of Sound Synthesis Techniques (SST)
- Proposed approach for design
- Genetic Programming
- Automatic Generation of Sound Synthesizers (AGeSS) system
- Examples

What is a sound synthesizer?
- Any method or device used to produce sound
- Sound Synthesis Technique (SST): algorithm that produces sound samples, usually implemented in a digital computer
  - Inputs:
    - internal parameters (init time)
    - control signals (time varying)
  - Output:
    - sound samples (waveform)
- Structured audio, symbolic representation
[Diagram labels: Model, analysis]

Inside a SST
- Functional form of the algorithm
  - Hard to do!
  - Prefabricated formulas and algorithms
  - Y = A*sin(x + B) + C
- Internal parameters
  - "Easy to do"
  - Parameter estimation
  - Analysis tools
  - A = 0.4, B = 0.1, C = -0.3
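The split between functional form and internal parameters can be illustrated with the slide's own formula. The following is a minimal sketch, not the method used in the talk: the functional form Y = A*sin(x + B) + C is fixed by hand, and the parameters are recovered from a target by a grid search over B combined with a closed-form least-squares fit for A and C. The function names, the grid resolution, and the sample points are assumptions made for illustration.

```python
import math


def sst(x, A, B, C):
    """Functional form from the slide: Y = A*sin(x + B) + C."""
    return A * math.sin(x + B) + C


def fit_gain_and_offset(s, y):
    """Least-squares fit of y ~ A*s + C for a fixed basis signal s."""
    n = len(s)
    mean_s = sum(s) / n
    mean_y = sum(y) / n
    var = sum((si - mean_s) ** 2 for si in s)
    cov = sum((si - mean_s) * (yi - mean_y) for si, yi in zip(s, y))
    A = cov / var
    C = mean_y - A * mean_s
    return A, C


def estimate_parameters(xs, target):
    """Grid search over B in [0, pi); A and C are solved in closed form."""
    best = None
    for i in range(314):
        B = i / 100  # hypothetical grid step of 0.01 rad
        s = [math.sin(x + B) for x in xs]
        A, C = fit_gain_and_offset(s, target)
        err = sum((A * si + C - yi) ** 2 for si, yi in zip(s, target))
        if best is None or err < best[0]:
            best = (err, A, B, C)
    return best[1], best[2], best[3]


# Target rendered with the parameter values shown on the slide.
xs = [i * 0.05 for i in range(200)]
target = [sst(x, 0.4, 0.1, -0.3) for x in xs]
A, B, C = estimate_parameters(xs, target)
```

Parameter estimation is tractable here only because the functional form was chosen in advance, which is the "easy" half of the problem described on the slide.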

Traditional design
- Pick a Functional Form
- Do parameter estimation to fit your target
[Diagram labels: Human selects Functional Form (LPC, FM, Additive, …); Inputs; TARGET; Parameter Estimation; USE IT]

Proposed approach
- "Suggest" functional form (suggest "new" SST)
- Do parameter estimation
- Compare with target
- Suggest new SST
[Diagram labels: Functional Elements; Suggest SST functional form + Initial Parameters; Inputs; TARGET; Parameter Estimation; Compare with Target]

Representation of SSTs
- Pseudocode
- Formula
- Topology graph
Pseudocode example:
  assign 3 to x
  sum x and 0.4
  return x
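The three representations listed on this slide can be derived from one underlying structure. The sketch below assumes an expression tree encoded as nested tuples (an encoding chosen for illustration, not taken from the talk) and shows how the same tree can be evaluated, printed as a formula, and flattened into the node and edge lists of a topology graph. The operator set and all function names are hypothetical.

```python
import operator

# Hypothetical set of functional elements: name -> (symbol, implementation).
OPS = {"add": ("+", operator.add), "mul": ("*", operator.mul)}


def evaluate(node, env):
    """Evaluate the tree with variable values taken from env."""
    kind = node[0]
    if kind == "const":
        return node[1]
    if kind == "var":
        return env[node[1]]
    _, fn = OPS[kind]
    return fn(evaluate(node[1], env), evaluate(node[2], env))


def to_formula(node):
    """Render the tree as an infix formula string."""
    kind = node[0]
    if kind in ("const", "var"):
        return str(node[1])
    symbol, _ = OPS[kind]
    return f"({to_formula(node[1])} {symbol} {to_formula(node[2])})"


def to_graph(node):
    """Flatten the tree into node labels and (source, destination) edges."""
    nodes, edges = [], []

    def visit(n):
        node_id = len(nodes)
        nodes.append(n[0] if n[0] in OPS else str(n[1]))
        if n[0] in OPS:
            for child in n[1:]:
                edges.append((visit(child), node_id))
        return node_id

    visit(node)
    return nodes, edges


# The slide's pseudocode: assign 3 to x, sum x and 0.4, return the result.
tree = ("add", ("var", "x"), ("const", 0.4))
value = evaluate(tree, {"x": 3})
formula = to_formula(tree)
graph = to_graph(tree)
```

In the graph, each edge points from an input node to the element that consumes it, so signal flows from the leaves toward the root.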

SST space
- SST space: all the possible valid combinations of a set of functional elements and their connections! HUGE!
- Hypothesis: "given a set of inputs, target and fitness metrics, it is possible to find the functional form and internal parameters of a SST capable of synthesizing a sound 'close' to the target".

Design of SST as a search
- Specs:
  - Inputs
  - Output (TARGET)
  - Fitness metric
- Search: find a point (SST, functional form + internal parameters) that fulfills the specs
[Diagram label: EXAMPLE]

The Genetic Programming Loop
[Diagram label: HUMAN vs. MACHINE]
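The slide's diagram is not preserved in this transcript, so the following is only a generic sketch of an evolutionary loop over expression trees, not a reconstruction of the loop used in the talk. It is deliberately simplified: it uses subtree mutation and truncation selection with elitism, and it omits crossover. The element set (`add`, `mul`, `sin`), the population sizes, and every function name are assumptions.

```python
import math
import random


def random_tree(rng, depth=3):
    """Grow a random expression tree over the time variable t."""
    if depth == 0 or rng.random() < 0.3:
        return ("t",) if rng.random() < 0.5 else ("const", rng.uniform(-2, 2))
    op = rng.choice(["add", "mul", "sin"])
    if op == "sin":
        return ("sin", random_tree(rng, depth - 1))
    return (op, random_tree(rng, depth - 1), random_tree(rng, depth - 1))


def evaluate(tree, t):
    kind = tree[0]
    if kind == "t":
        return t
    if kind == "const":
        return tree[1]
    if kind == "sin":
        return math.sin(evaluate(tree[1], t))
    a, b = evaluate(tree[1], t), evaluate(tree[2], t)
    return a + b if kind == "add" else a * b


def fitness(tree, ts, target):
    """Mean squared error against the target (lower is better)."""
    return sum((evaluate(tree, t) - y) ** 2 for t, y in zip(ts, target)) / len(ts)


def mutate(tree, rng, depth=3):
    """Replace a randomly chosen subtree with a freshly grown one."""
    if rng.random() < 0.2 or tree[0] in ("t", "const"):
        return random_tree(rng, depth)
    children = list(tree[1:])
    i = rng.randrange(len(children))
    children[i] = mutate(children[i], rng, max(depth - 1, 0))
    return (tree[0], *children)


def evolve(ts, target, generations=30, pop_size=40, seed=0):
    rng = random.Random(seed)
    population = [random_tree(rng) for _ in range(pop_size)]
    history = []  # best error seen at each generation
    for _ in range(generations):
        population.sort(key=lambda tr: fitness(tr, ts, target))
        history.append(fitness(population[0], ts, target))
        survivors = population[: pop_size // 4]  # elitism: the best are kept
        offspring = [mutate(rng.choice(survivors), rng)
                     for _ in range(pop_size - len(survivors))]
        population = survivors + offspring
    best = min(population, key=lambda tr: fitness(tr, ts, target))
    return best, history
```

Because the survivors are carried over unchanged, the best error recorded in `history` can never increase from one generation to the next.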

Comparing sounds --fitness function--
- Analytical distance metrics: MSE of complex spectrogram (magnitude and phase)
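One way to read "MSE of complex spectrogram" is the mean of the squared magnitude of the difference between corresponding complex short-time Fourier coefficients, which penalizes both magnitude and phase mismatches. The sketch below follows that reading using only the standard library; the frame size, hop size, Hann window, and function names are assumptions, and the exact formulation used in the talk is not given in the transcript.

```python
import cmath
import math


def stft(signal, frame_size=64, hop=32):
    """Naive short-time Fourier transform (Hann window, direct DFT)."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_size)
              for n in range(frame_size)]
    frames = []
    for start in range(0, len(signal) - frame_size + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_size)]
        spectrum = [
            sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_size)
                for n in range(frame_size))
            for k in range(frame_size // 2 + 1)
        ]
        frames.append(spectrum)
    return frames


def spectrogram_mse(a, b, frame_size=64, hop=32):
    """Mean squared complex difference between two spectrograms.

    Both signals are assumed to have the same length and to be at least
    one frame long.
    """
    spec_a = stft(a, frame_size, hop)
    spec_b = stft(b, frame_size, hop)
    errors = [abs(x - y) ** 2
              for frame_a, frame_b in zip(spec_a, spec_b)
              for x, y in zip(frame_a, frame_b)]
    return sum(errors) / len(errors)


# Same frequency and amplitude, 90 degrees apart in phase.
tone = [math.sin(2 * math.pi * 8 * n / 256) for n in range(256)]
shifted = [math.cos(2 * math.pi * 8 * n / 256) for n in range(256)]
```

With this metric, `tone` and `shifted` are reported as different even though their magnitude spectra are nearly identical, because the complex difference retains the phase offset.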

Perceptual Fitness Function
- A perceptual distance metric
  - Perceptually relevant components: full error
  - Above masking threshold: error up to M.T.
  - Below masking threshold: no error
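The three cases on this slide are terse, so the sketch below is one possible interpretation rather than the author's definition. It assumes that both the target and the candidate magnitudes are floored at the masking threshold before they are compared. Under that assumption, components that are both audible contribute their full error, a component that is audible in only one signal contributes error only down to the threshold, and components that are both masked contribute nothing. The masking threshold itself is taken as an input; computing it from a psychoacoustic model is outside the scope of this example.

```python
def perceptual_error(target_mag, candidate_mag, masking_threshold):
    """Mean squared error with magnitudes floored at the masking threshold.

    All three arguments are equal-length sequences of non-negative
    magnitudes, one value per spectral component.
    """
    total = 0.0
    for t, c, m in zip(target_mag, candidate_mag, masking_threshold):
        # Anything below the threshold is treated as sitting at the threshold.
        total += (max(t, m) - max(c, m)) ** 2
    return total / len(target_mag)


# Both audible: full error, (1.0 - 0.5)^2.
both_audible = perceptual_error([1.0], [0.5], [0.25])
# Candidate masked: error only down to the threshold, (1.0 - 0.25)^2.
one_masked = perceptual_error([1.0], [0.1], [0.25])
# Both masked: no error.
both_masked = perceptual_error([0.1], [0.2], [0.25])
```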

Development
- Mapping from expression tree to topology graph

AGeSS system
- Automatic Generation of Sound Synthesizers
- Matlab scripts and C++ programs

Examples
- A synthetic tone (des35comp2.avi)
- A piano note (des44comp2.avi)
- FM synthesizer (chow727.avi)

FM synthesis example
- Target formula
- Output (generation 220+)

FM synthesis example (scale)
- Training example (A880, B988)
- Output (best of generation 850)
- Scale
- Scale with best of generation 850
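The target formula for the FM example is not preserved in this transcript. For orientation only, the sketch below renders a tone with the textbook simple FM formula, y(t) = A*sin(2*pi*fc*t + I*sin(2*pi*fm*t)), and should not be read as the formula shown on the slide. The carrier frequencies 880 Hz and 988 Hz are taken from the training-example labels A880 and B988; the modulator ratio, modulation index, duration, and sample rate are assumptions.

```python
import math


def fm_tone(carrier_hz, modulator_hz, index,
            duration=0.01, sample_rate=44100, amplitude=1.0):
    """Simple FM: a sine carrier whose phase is modulated by a sine."""
    n_samples = round(duration * sample_rate)
    samples = []
    for i in range(n_samples):
        t = i / sample_rate
        phase = (2 * math.pi * carrier_hz * t
                 + index * math.sin(2 * math.pi * modulator_hz * t))
        samples.append(amplitude * math.sin(phase))
    return samples


# Two notes at the training-example pitches, with a hypothetical 1:1
# carrier-to-modulator ratio and a modulation index of 2.
note_a = fm_tone(880.0, 880.0, index=2.0)
note_b = fm_tone(988.0, 988.0, index=2.0)
```

Setting `index` to 0 removes the modulation and reduces the output to a plain sine at the carrier frequency, which is a quick way to check an implementation.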

For more information…
Ricardo A. Garcia
Web: