Segmenting Popular Music Sentence by Sentence Wan-chi Lee.

Slides:



Advertisements
Similar presentations
An Active contour Model without Edges
Advertisements

Change-Point Detection Techniques for Piecewise Locally Stationary Time Series Michael Last National Institute of Statistical Sciences Talk for Midyear.
Scale & Affine Invariant Interest Point Detectors Mikolajczyk & Schmid presented by Dustin Lennon.
Shapelets Correlated with Surface Normals Produce Surfaces Peter Kovesi School of Computer Science & Software Engineering The University of Western Australia.
KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)
Approximations of points and polygonal chains
Standard Data Analysis - Integration
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
AES 120 th Convention Paris, France, 2006 Adaptive Time-Frequency Resolution for Analysis and Processing of Audio Alexey Lukin AES Student Member Moscow.
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,
Ideal Filters One of the reasons why we design a filter is to remove disturbances Filter SIGNAL NOISE We discriminate between signal and noise in terms.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
So far all our functions were linear. In many situations, it might not be the case. Example: Production cost. – c 1 = $11/unit for first 5 items – c 2.
Biomedical signal processing: Wavelets Yevhen Hlushchuk, 11 November 2004.
Pole Zero Speech Models Speech is nonstationary. It can approximately be considered stationary over short intervals (20-40 ms). Over thisinterval the source.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Lyric alignment in popular songs Luong Minh Thang WING group meeting 12 Oct, 2007.
Feature extraction Feature extraction involves finding features of the segmented image. Usually performed on a binary image produced from.
Brief overview of ideas In this introductory lecture I will show short explanations of basic image processing methods In next lectures we will go into.
Learning and Recognizing Activities in Streams of Video Dinesh Govindaraju.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
Sound Targeting Platform Andrew Lenharth Michael Schaffer Quang Luu CSE 477 May 22, 2001.
1 Live Sound Reinforcement Equalizers and other signal processing equipment.
LE 460 L Acoustics and Experimental Phonetics L-13
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Infrasound detector for Apatity group Asming V.E., Kola Regional Seismological Center, Apatity, Russia.
Multiresolution STFT for Analysis and Processing of Audio
Adapting matched filtering searches for compact binary inspirals in LSC detector data. Chad Hanna – For the LIGO Scientific Collaboration.
Sound Detection Derek Hoiem Rahul Sukthankar (mentor) August 24, 2004.
Basics of Neural Networks Neural Network Topologies.
Similarity Matrix Processing for Music Structure Analysis Yu Shiu, Hong Jeng C.-C. Jay Kuo ACM Multimedia 2006.
Experimental research in noise influence on estimation precision for polyharmonic model frequencies Natalia Visotska.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Adaptive Mining Techniques for Data Streams using Algorithm Output Granularity Mohamed Medhat Gaber, Shonali Krishnaswamy, Arkady Zaslavsky In Proceedings.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
23 November Md. Tanvir Al Amin (Presenter) Anupam Bhattacharjee Department of Computer Science and Engineering,
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
Sound Targeting Platform Andrew Lenharth Michael Schaffer Quang Luu CSE 477 April 24, 2001.
Section 4.3 Riemann Sums and Definite Integrals. To this point, anytime that we have used the integral symbol we have used it without any upper or lower.
Singer Similarity Doug Van Nort MUMT 611. Goal Determine Singer / Vocalist based on extracted features of audio signal Classify audio files based on singer.
Data QC and filtering Bryce HutchinsonSumit Verma Objective: Consider the frequency range of different seismic features Look for low frequency and high.
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
The Story of Wavelets Theory and Engineering Applications
Topic: Pitch Extraction
© 2001 Dr. Laura Snodgrass, Ph.D.1 Psychophysics Mathematical formula for relationship between changes in the physical stimulus and changes in conscious.
 What is Filter ? A Filter is an electrical network that can transmit signal within a specified frequency range. This Frequency range is called PASS BAND.
CS 445/656 Computer & New Media
CS 591 S1 – Computational Audio -- Spring, 2017
Maths Mastery Order of Operations.
CS 591 S1 – Computational Audio
Session 7: Face Detection (cont.)
The Choral Experience Cover Slide
CS 591 S1 – Computational Audio -- Spring, 2017
A Smartphone App-Based
Fitting Curve Models to Edges
Filter Design by Windowing
IEEE Regulatory AHC Power Measurement Method
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei.
Ideal Filters One of the reasons why we design a filter is to remove disturbances Filter SIGNAL NOISE We discriminate between signal and noise in terms.
Chapter 4: Representing sound
Pass The Ball.
Audio and Speech Computers & New Media.
8.5 Modulation of Signals basic idea and goals
Slope Fields and Euler’s Method
Marian Stewart Bartlett, Gwen C. Littlewort, Mark G. Frank, Kang Lee 
Shared Features in Log-Linear Models
Time Series Filtering Time Series
Signal Detection & Processing
CS5760: Computer Vision Lecture 9: RANSAC Noah Snavely
Presentation transcript:

Segmenting Popular Music Sentence by Sentence Wan-chi Lee

Basic Idea In a song, The energy of audio signal will be low in the gap between sentences. Trying to detect the energy gap. Problem: There will be accompaniment sound. The dynamic range of audio signal varies a lot: hard to choose threshold.

Examples of audio signal

Methods Band-pass Filtering the signal: Here I use 6 order elliptic filter with pass band 800Hz~1.6KHz. For a short sliding window, calculating the average energy of the signal I use a 0.1 second window. Detecting the valley of average energy by piecewise linear approximation.

Piece-wise Linear Approximation I used a top-down method in determining the approximation. Specify an error bound. Find a segmentation point that best improve the approximation. Calculate linear regression for each segment as the approximation. If the error bound is not achieved, repeat above steps.

Segmentation Point After finding the linear approximation, choose points representing the gap in energy. Place some restrictions to make the segments be in reasonable length.

Demo and Discussion I only used one feature. Other features can be incorporated. Heuristic method: no training needed, but lots of parameters to tune. It should be integrated with onset detection to let the segmenting points coincide with the onset.