Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Sung-Won Yoon David Choi February 8 th, 2001.

Slides:



Advertisements
Similar presentations
Low-Complexity Transform and Quantization in H.264/AVC
Advertisements

A brief review of non-neural-network approaches to deep learning
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 1 IEEE Speech Coding Workshop Sept 17–20, 2000 Lake Lawn Resort Delavan, WI Jean-Marc Valin,
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
11/11/02 IDR Workshop Dealing With Location Uncertainty in Images Hasan F. Ates Princeton University 11/11/02.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
« هو اللطیف » By : Atefe Malek. khatabi Spring 90.
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
“Random Projections on Smooth Manifolds” -A short summary
Psychology 202b Advanced Psychological Statistics, II February 10, 2011.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
2005/11/101 KOZ Scalable Audio Speaker: 陳繼大 An Introduction.
DEVON BRYANT CS 525 SEMESTER PROJECT Audio Signal MIDI Transcription.
Analysis & Synthesis The Vocoder and its related technology.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Design of Expanded Constellations for PAPR Reduction in OFDM Systems Speaker: Dr. Ali Al-Shaikhi Assistant Professor, EE department.
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
Classification and Prediction: Regression Analysis
A Full Frequency Masking Vocoder for Legal Eavesdropping Conversation Recording R. F. B. Sotero Filho, H. M. de Oliveira (qPGOM), R. Campello de Souza.
Dr Mark Cresswell Statistical Forecasting [Part 1] 69EG6517 – Impacts & Models of Climate Change.
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
1 CS 551/651: Structure of Spoken Language Lecture 8: Mathematical Descriptions of the Speech Signal John-Paul Hosom Fall 2008.
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
Using Neural Networks to Predict Claim Duration in the Presence of Right Censoring and Covariates David Speights Senior Research Statistician HNC Insurance.
SPEECH CODING Maryam Zebarjad Alessandro Chiumento.
EEG Classification Using Maximum Noise Fractions and spectral classification Steve Grikschart and Hugo Shi EECS 559 Fall 2005.
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.
1 PATTERN COMPARISON TECHNIQUES Test Pattern:Reference Pattern:
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Basics of Neural Networks Neural Network Topologies.
Correlational Research Designs. 2 Correlational Research Refers to studies in which the purpose is to discover relationships between variables through.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
Using Feed Forward NN for EEG Signal Classification Amin Fazel April 2006 Department of Computer Science and Electrical Engineering University of Missouri.
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
CpSc 881: Machine Learning
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
Presented by Rajatha Raghavendra
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Autoregressive (AR) Spectral Estimation
Feature Selection and Extraction Michael J. Watts
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
AN EXPECTATION MAXIMIZATION APPROACH FOR FORMANT TRACKING USING A PARAMETER-FREE NON-LINEAR PREDICTOR Issam Bazzi, Alex Acero, and Li Deng Microsoft Research.
Chapter Seventeen Copyright © 2004 John Wiley & Sons, Inc. Multivariate Data Analysis.
Information Processing by Neuronal Populations Chapter 6: Single-neuron and ensemble contributions to decoding simultaneously recoded spike trains Information.
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
Presentation III Irvanda Kurniadi V. ( )
HIGH-RESOLUTION SINUSOIDAL MODELING OF UNVOICED SPEECH GEORGE P. KAFENTZIS, YANNIS STYLIANOU MULTIMEDIA INFORMATICS LABORATORY DEPARTMENT OF COMPUTER SCIENCE.
Data statistics and transformation revision Michael J. Watts
High Quality Voice Morphing
David Sears MUMT November 2009
Objective and Subjective Audio Assessment of MP3 Players’ Quality
Scalable Speech Coding for IP Networks
Speech Signal Processing
Vocoders.
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Correlation and Regression-II
Outline Multilinear Analysis
Frequency Domain Perceptual Linear Predicton (FDPLP)
Linear Regression.
Linear regression Fitting a straight line to observations.
Today (2/23/16) Learning objectives:
Bandwidth Extrapolation of Audio Signals
Govt. Polytechnic Dhangar(Fatehabad)
Regression Part II.
Presentation transcript:

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Sung-Won Yoon David Choi February 8 th, 2001

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Outline Motivation Proposed system Lapped orthogonal transform (LOT) High frequency regeneration Experiment Expected results Workplan

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Bandwidth Extrapolation Results should be –Similar to original wideband signal –Perceptually better quality Narrowband LOT coefficients Wideband LOT coefficients nonlinear system XXY

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Proposed System LOT LOT -1 High Frequency Regeneration 2 Narrowband signal Wideband signal LPF LOT

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Lapped Orthogonal Transform H. Malvar & D. Staelin, N 50% Overlap 2NN LOT coefficient Avoids blocking artifact No increase in bit rate

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals High Frequency Regeneration Trained system Parameters p are chosen to fit the training data Mapping from narrowband signal to wideband signal –Estimate LOT coefficients –Magnitude only XY From narrowband signal Regenerated high frequency

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Linear Estimation Each estimated high frequency coefficient is a weighted combination of the low freq. coefficients Possible sparse representation of weights Weights possibly chosen to exploit psychoacoustic phenomena (masking)

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Principal Components Analysis, Quasi-stationarity of windowed audio signals PCA applied on the LOT coefficients Classification of LOT blocks may be necessary

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Experiment 1.For simplicity of analysis, initiate study with single instrument audio signals 2.Investigate the correlation among frequency components 3.Implement linear estimator and PCA 4.Compare results -Perceptual quality -Mean square error

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Expected Results Extrapolation should improve the quality However: Extensions may be necessary “Several approaches were considered to extrapolate the high frequency spectral envelope. In all cases, the subjective quality was not satisfactory. This suggests that the high frequency formant structure of speech cannot be accurately predicted from the narrowband formants.” - J. Valin & R. Lefebvre, Bandwidth Extension of Narrowband Speech for Low Bit-rate Wideband Coding, IEEE, 2000

Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Workplan Week 1: –Investigate the relationship between the low and high LOT coefficients –Quantify the relations that can be exploited Week 2: –Carry out the linear estimation based on the knowledge of the LOT coefficients Week 3: –Extend to PCA Week 4: –Compare the results –Prepare writeup and presentation