Carnegie Mellon Music Understanding and the Future of Music Performance Roger B. Dannenberg Professor of Computer Science, Art, and Music Carnegie Mellon.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

A Musical Data Mining Primer CS235 – Spring ’03 Dan Berger
Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.
VET Music: Work Opportunity Gus Campbell. Trends and emerging technologies in the music industry introduction As technology gets better and better in.
KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Musique concrète at secondary school: Asking the ‘right’ questions Anna-Marie Higgins PhD Student (Education) Cambridge University.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.
AUTOMATIC ORGANIZING AND FORMATTING FOR LECTURE NOTES SHIQING (LICIA) HE ADIVISOR: PROF.KRISTINA STRIEGNITZ SPRING 2014 STRUCTURING THE UNSTRUCTURED NOTE:
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
Guitar Trainer Adam Janke CS 470 Final Presentation.
Everyone likes music Most just listen, but many play: Music Merchants: $8B, 5M instruments (US, 2006) Sound reinforcement: $1.5B (US, 2006) Audacity Audio.
Digital audio and computer music COS 116: 2/26/2008.
Visual Programming Via The Squeak Car Demo Dan Grossman University of Washington CS4HS July 11-13, 2008.
Music Processing Roger B. Dannenberg. Overview  Music Representation  MIDI and Synthesizers  Synthesis Techniques  Music Understanding.
Outline of Presentation Introduction of digital video libraries Introduction of the CMU Informedia Project Informedia: user perspective Informedia:
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Chapter 14 Recording and Editing Sound. Getting Started FAQs: − How does audio capability enhance my PC? − How does your PC record, store, and play digital.
Digital audio and computer music COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Introduction to MIR Course Overview 1.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Digital Sound and Video Chapter 10, Exploring the Digital Domain.
Skills: none Concepts: introduction to and history of speech (with and without text) and music processing, audio file formats, the audio processing workflow,
An Introduction to Podcasting Texas Rural Education Association June 15 th, 2007 Kyle Kuehler.
Introduction CSE 1310 – Introduction to Computers and Programming
Educational Software using Audio to Score Alignment Antoine Gomas supervised by Dr. Tim Collins & Pr. Corinne Mailhes 7 th of September, 2007.
Multimedia Databases (MMDB)
Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.
Technology in Art & Music Education Created by: Erica Cupp, Desiree Dizon, Heather Smith & Tasha Smith.
(CMSC5720-1) MSC projects by Prof K.H. Wong (21 July2014) (shb907) MSC projects supervised by Prof.
Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.
Audio Editing and Podcasting October 9, Objectives for Today’s Workshop Define Podcasting Classroom Applications Recording and Editing Audio See.
Polyphonic Music Transcription Using A Dynamic Graphical Model Barry Rafkind E6820 Speech and Audio Signal Processing Wednesday, March 9th, 2005.
Chapter 15 Recording and Editing Sound. 2Practical PC 5 th Edition Chapter 15 Getting Started In this Chapter, you will learn: − How sound capability.
ENOMA - European Network of Online Musical Archives ENOMA Workshop – The Grieg Academy, UiB 26 May 2006 Leif Arne Rønningen and Lars Erik Løvhaug NTNU.
Rhythmic Transcription of MIDI Signals Carmine Casciato MUMT 611 Thursday, February 10, 2005.
Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.
ITEC 830 Podcasting Workshop Presenters: Erica Chan Brian Murakami.
Computer Science 1 Week 11. This Week... QBasic While Loops QBasic While Loops Audio Basics Audio Basics.
MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES INTRODUCTION 6/1/ A.Aruna, Assistant Professor, Faculty of Information Technology.
Audio Communications: Sound Mr. Butler Communication Systems John Jay High School Wappingers Central School District UPDATED 11/2011.
Digital audio and computer music COS 116, Spring 2010 Adam Finkelstein Slides and demo thanks to Rebecca Fiebrink.
What is Audacity? Audacity is a free audio editor and recording program which is classified as open source software. It is easily downloaded to one’s.
Unit 01- setting up and operate a digital audio workstation By Alfie fennemore.
Virtual Instruments and Sequencing. Proses Pembuatan Lagu/Musik TrackingMixingMastering.
ICMC 2004 – Nov. 5 1 Andante: Composition and Performance with Mobile Musical Agents Leo Kazuhiro Ueda Fabio Kon
CSCI-100 Introduction to Computing Hardware Part II.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Issues in Automatic Musical Genre Classification Cory McKay.
In the General Music Classroom February 13, 2009.
Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.
Transcription Software Amazing Slow Downer & Transcribe! Rick Lollar Amazing Slow Downer & Transcribe! Rick Lollar.
Artificial Intelligence
XP Practical PC, 3e Chapter 14 1 Recording and Editing Sound.
1 Tempo Induction and Beat Tracking for Audio Signals MUMT 611, February 2005 Assignment 3 Paul Kolesnik.
CS 591 S1 – Computational Audio -- Spring, 2017
Lesson 2 – Video & Audio Editing
Chapter 15 Recording and Editing Sound
CS 445/656 Computer & New Media
Rhythmic Transcription of MIDI Signals
Audio to Score Alignment for Educational Software
A Personal Tour of Machine Learning and Its Applications
Image Recognition. Contents: Motivation Objective Definition Introduction Preprocessing / Edge Detection Neural Networks in Image Recognition Practical.
SPECIALIZED APPLICATION SOFTWARE
Music Computer & New Media.
Audio and Speech Computers & New Media.
Institute of New Media Development and Research
Presentation transcript:

Carnegie Mellon Music Understanding and the Future of Music Performance Roger B. Dannenberg Professor of Computer Science, Art, and Music Carnegie Mellon University

Carnegie Mellon 2 © 2013 Roger B. Dannenberg Why Computers and Music? Music in every human society! Computing can make music:  More Fun  More Available  Higher Quality  More Personal

Carnegie Mellon My Background Always interested in math and music and making things Discovered synthesizers in high school Discovered computers about the same time Discovered computer music in college Research motivated by musical experience:  Computer accompaniment  Expressive programming languages for music  Audacity  … current work 3 © 2013 Roger B. Dannenberg

Carnegie Mellon 4 Overview Introduction How Is Computation Used in Music Today? New Capabilities: What Can Computers Do Tomorrow? What Will Music Be Like in the Future? © 2013 Roger B. Dannenberg

Carnegie Mellon 5 How Is Computation Used in Music Today? Indabamusic.com © 2013 Roger B. Dannenberg

Carnegie Mellon 6 Music Computation Today Production: digital recording, editing, mixing Nearly all music production today...  Records audio to (digital) disk  Edit/manipulate audio digitally  Equalization  Reverberation  Convert to media:  CD  MP3  Etc. protools.com © 2013 Roger B. Dannenberg

Carnegie Mellon 7 Music Computation Today Musical Instruments: synthesizers and controllers Sonic Spring (Tomas Henriques) Linnstrument (Roger Linn) Drum Machine (Yamaha) Synthesizer (Solaris) © 2013 Roger B. Dannenberg

Carnegie Mellon 8 Music Computation Today Distribution: compression, storage, networks Napster Apple iPod Apple iTunes Amazon Cloud Player © 2013 Roger B. Dannenberg

Carnegie Mellon 9 Music Computation Today Search, recommendation, music fingerprinting Google Music China Music Fingerprinting Pandora Music Recommendation © 2013 Roger B. Dannenberg

Carnegie Mellon 10 Overview Computer Music Introduction How Is Computation Used in Music Today? New Capabilities: What Can Computers Do Tomorrow? What Will Music Be Like in the Future? © 2013 Roger B. Dannenberg

Carnegie Mellon 11 New Capabilities: What Can Computers Do Tomorrow? Computer accompaniment Style classification Score alignment Onset detection Sound synthesis © 2013 Roger B. Dannenberg

Carnegie Mellon 12 Accompaniment Video © 2013 Roger B. Dannenberg

Carnegie Mellon 13 Computer Accompaniment Performance Input Processing Matching Score for Performer Score for Accompaniment Performance Music Synthesis Accompaniment © 2013 Roger B. Dannenberg

Carnegie Mellon 14 Computer Accompaniment Performance Input Processing Matching Score for Performer Score for Accompaniment Performance Music Synthesis Accompaniment Performance  ABA A11 B122 B122 A123 C23 B3 G Score  Dynamic Programming, plus... On-line, column-by-column evaluation Windowing for real-time evaluation Heuristics for best-yet matching Penalty for skipping notes © 2013 Roger B. Dannenberg

Carnegie Mellon 15 Computer Accompaniment Performance Input Processing Matching Score for Performer Score for Accompaniment Performance Music Synthesis Accompaniment Rule-based system: E.g. If matcher is confident and accompaniment is ahead < 0.1s, stop until synchronized. If matcher is confident and accompaniment is behind <0.5s, speed up until synchronized. © 2013 Roger B. Dannenberg

Carnegie Mellon 16 Vocal Accompaniment Lorin Grubb’s Ph.D. (CMU CSD) Machine learning used to:  Learns what kinds of tempo variation are likely  Characterize sensors  When is a notated G sensed as a G#? Machine learning necessary for good performance © 2013 Roger B. Dannenberg

Carnegie Mellon 17 Vocal Accompaniment © 2013 Roger B. Dannenberg

Carnegie Mellon 18 How It Works Probability Score Position Score position modeled as a probability density function Bayesian update rule: P(s|o)  P(o|s)P(s) P(o|s) is e.g. "probability of observing pitch G if the score says play an A." Simple statistics on labeled training data. Prior P(s) by fast convolution with a log normal (describes tempo and tempo variation) © 2013 Roger B. Dannenberg

Carnegie Mellon 19 Commercial Implementation rtsp://qt.partner-streaming.com/makemusic/wm_03_l.mov rtsp://qt.partner-streaming.com/makemusic/wm_04_l.mov © 2013 Roger B. Dannenberg

Carnegie Mellon 20 Style Classification: Listening to Jazz Styles ? Lyrical Pointilistic Syncopated Frantic © 2013 Roger B. Dannenberg

Carnegie Mellon 21 Jazz Style Recognition © 2013 Roger B. Dannenberg

Carnegie Mellon 22 Techniques Extract features from audio:  Note density  Mean & Std. Dev. of pitch range  Mean & Std. Dev. of pitch intervals  Silence vs. Sounding ("duty factor") ... and many more Features over 5-second windows Standard Classifiers (Naive Bayes, Linear, Neural Net) © 2013 Roger B. Dannenberg

Carnegie Mellon 23 Polyphonic Audio-to-Score Alignment vs © 2013 Roger B. Dannenberg

Carnegie Mellon 24 © 2013 Roger B. Dannenberg Audacity Editor with Automatic Audio-to-MIDI Alignment

Carnegie Mellon 25 Finding Note Onsets (How to segment music audio into notes.) Not all attacks are clean Slurs do not have obvious (or fast) transitions We can use score alignment to get a rough idea of where the notes are (~1/10 second) Then, machine learning can create programs that do an even better job (bootstrap learning). © 2013 Roger B. Dannenberg

Carnegie Mellon 26 Expressive Performance © 2013 Roger B. Dannenberg

Carnegie Mellon 27 Phrase-based Synthesis Note-by-Note Synthesis Phrase-based Synthesis © 2013 Roger B. Dannenberg

Carnegie Mellon 28 Example Envelopes Tongued Note Slurred Note © 2013 Roger B. Dannenberg

Carnegie Mellon 29 Synthesis Examples Good trumpet sounds, mechanically performed: Same sounds, but performed with AI-based model of trumpet performance: Another example: Trumpet example from Ning Hu’s thesis: Bassoon example from Ning Hu’s thesis: © 2013 Roger B. Dannenberg

Carnegie Mellon 30 Overview Computer Music Introduction How Is Computation Used in Music Today? New Capabilities: What Can Computers Do Tomorrow? What Will Music Be Like in the Future? © 2013 Roger B. Dannenberg

Carnegie Mellon 31 Human Computer Music Performance The most advanced computer music research is applied to esoteric art music.  There is a widespread practice of interactive computer (art) music  … but relatively little sophistication in popular music OPPORTUNITY  State-of-the-art computer music systems for popular music performance  Autonomous Intelligent Machine Musicians © 2013 Roger B. Dannenberg

Carnegie Mellon 32 bass ? Example Suppose you want to get together and play music... BUT, you're missing a _______ player. credit: Green Day © 2013 Roger B. Dannenberg

Carnegie Mellon 33 What Research Is Needed? Synchronization  Signal processing  Machine learning  Human interface Sketchy notation  Representation issues Improvisation  Models of style Sound Production  Phrase-based synthesis? Modularity/Systems issues  Real-time systems  Software architecture Interaction  HCI © 2013 Roger B. Dannenberg

Carnegie Mellon 34 Is There a Market? What's the Impact? $8B annual US music sales  Excluding recordings, education, performances 5 million musical instruments per year Performance revenue is on the order of $10B Recording revenue is similar; order of $10B Approximately 1/2 of all US households have a practicing musician... so very roughly $10+B and 100M people! © 2013 Roger B. Dannenberg

Carnegie Mellon 35 Rock Prodigy Guitar Hero for Real Guitars Game design, content, animation, etc. by others (Play Video)Video Unsolicited comment: "The best part about it is polyphonic pitch detection" © 2013 Roger B. Dannenberg

Carnegie Mellon 36 An Example © 2013 Roger B. Dannenberg

Carnegie Mellon Online, collaborative development of creative content is already here… 37 © 2013 Roger B. Dannenberg

Carnegie Mellon 38 What Will People Do With HCMP? Practice with virtual bands. Create their own arrangements. Post machine-readable music online, share. Blend conventional performance with algorithmic composition, new sounds, new music. Robot performers. Eventually... new art forms Think of the electric guitar, drum machine in music, camera in visual art,... © 2013 Roger B. Dannenberg

Carnegie Mellon 39 Another Example © 2013 Roger B. Dannenberg

Carnegie Mellon 40 Conclusion Music Understanding and Human Computer Music Performance will enrich musical experiences for millions of people, including both amateurs and professionals. If we build computers that can perform popular music interactively with intelligence, great music will be made. That is the future of music performance. © 2013 Roger B. Dannenberg