Bandwidth Extrapolation of Audio Signals

Slides:



Advertisements
Similar presentations
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Advertisements

11/11/02 IDR Workshop Dealing With Location Uncertainty in Images Hasan F. Ates Princeton University 11/11/02.
Speech Enhancement through Noise Reduction By Yating & Kundan.
EE513 Audio Signals and Systems Digital Signal Processing (Synthesis) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
Sampling and quantization Seminary 2. Problem 2.1 Typical errors in reconstruction: Leaking and aliasing We have a transmission system with f s =8 kHz.
A 12-WEEK PROJECT IN Speech Coding and Recognition by Fu-Tien Hsiao and Vedrana Andersen.
Extracting Noise-Robust Features from Audio Data Chris Burges, John Platt, Erin Renshaw, Soumya Jana* Microsoft Research *U. Illinois, Urbana/Champaign.
Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
Independent Component Analysis (ICA) and Factor Analysis (FA)
Lecture 4 Measurement Accuracy and Statistical Variation.
Traffic modeling and Prediction ----Linear Models
EE513 Audio Signals and Systems Digital Signal Processing (Systems) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
Multiresolution STFT for Analysis and Processing of Audio
1 CS 551/651: Structure of Spoken Language Lecture 8: Mathematical Descriptions of the Speech Signal John-Paul Hosom Fall 2008.
Minimum Mean Squared Error Time Series Classification Using an Echo State Network Prediction Model Mark Skowronski and John Harris Computational Neuro-Engineering.
EEG Classification Using Maximum Noise Fractions and spectral classification Steve Grikschart and Hugo Shi EECS 559 Fall 2005.
Basics of Neural Networks Neural Network Topologies.
Digital Media Lab 1 Data Mining Applied To Fault Detection Shinho Jeong Jaewon Shim Hyunsoo Lee {cinooco, poohut,
Chapter 6 Spectrum Estimation § 6.1 Time and Frequency Domain Analysis § 6.2 Fourier Transform in Discrete Form § 6.3 Spectrum Estimator § 6.4 Practical.
Speech Recognition Feature Extraction. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
1 Blind Channel Identification and Equalization in Dense Wireless Sensor Networks with Distributed Transmissions Xiaohua (Edward) Li Department of Electrical.
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
Experiments on Noise CharacterizationRoma, March 10,1999Andrea Viceré Experiments on Noise Analysis l Need of noise characterization for  Monitoring the.
Singer Similarity Doug Van Nort MUMT 611. Goal Determine Singer / Vocalist based on extracted features of audio signal Classify audio files based on singer.
CSC321: Lecture 7:Ways to prevent overfitting
Over-fitting and Regularization Chapter 4 textbook Lectures 11 and 12 on amlbook.com.
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
Autoregressive (AR) Spectral Estimation
Sung-Won Yoon, David ChoiEE368C Project Proposal Bandwidth Extrapolation of Audio Signals Sung-Won Yoon David Choi February 8 th, 2001.
Feature Selection and Extraction Michael J. Watts
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
Neural Network Recognition of Frequency Disturbance Recorder Signals Stephen Tang REU Final Presentation July 22, 2014.
Chapter 15: Classification of Time- Embedded EEG Using Short-Time Principal Component Analysis by Nguyen Duc Thang 5/2009.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Descriptive Statistics The means for all but the C 3 features exhibit a significant difference between both classes. On the other hand, the variances for.
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Speech Enhancement Summer 2009
Objective and Subjective Audio Assessment of MP3 Players’ Quality
Scalable Speech Coding for IP Networks
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
Digital Communications Chapter 13. Source Coding
Vocoders.
Spread Spectrum Audio Steganography using Sub-band Phase Shifting
Research in Computational Molecular Biology , Vol (2008)
Sampling rate conversion by a rational factor
Final Year Project Presentation --- Magic Paint Face
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Probability and Statistics for Computer Scientists Second Edition, By: Michael Baron Section 11.1: Least squares estimation CIS Computational.
Object Modeling with Layers
Historic Document Image De-Noising using Principal Component Analysis (PCA) and Local Pixel Grouping (LPG) Han-Yang Tang1, Azah Kamilah Muda1, Yun-Huoy.
Data Analysis Learning from Data
PCA based Noise Filter for High Spectral Resolution IR Observations
PCM & DPCM & DM.
8-Speech Recognition Speech Recognition Concepts
Chapter 6 Discrete-Time System
Texture Image Extrapolation for Compression
Analysis of Audio Using PCA
ERROR ENTROPY, CORRENTROPY AND M-ESTIMATION
Linear Prediction.
6. Time and Frequency Characterization of Signals and Systems
Somi Jacob and Christian Bach
Recursively Adapted Radial Basis Function Networks and its Relationship to Resource Allocating Networks and Online Kernel Learning Weifeng Liu, Puskal.
EE359 – Lecture 6 Outline Review of Last Lecture
A Data Partitioning Scheme for Spatial Regression
Presentation transcript:

Bandwidth Extrapolation of Audio Signals March 15th, 2001 David Choi Sung-Won Yoon Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals Outline Motivation Characteristics of audio data Proposed system Linear estimation Principal component analysis Results Conclusions Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation Y Narrowband MDCT coefficients Wideband MDCT coefficients nonlinear system Results should be Similar to original wideband signal Perceptually better quality than narrowband Bandwidth Extrapolation of Audio Signals

High Frequency Components At 5.5 kHz and above, the components: Constitute small fraction of total energy Effects of phase distortion almost negligible Envelope is still important Can be hidden using error concealment Often uncorrelated with low frequency components Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals Correlation Cello (single instrument) Voice (one person) Cello exhibits patterned correlation Voice largely uncorrelated Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals System Diagram Wideband Training Data NarrowbandTest Data MDCT MDCT-1 Estimation LOW HIGH Training Reconstructed Wideband Estimation Parameters Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals Linear Estimation Y : low frequency coefficients (zero mean) X : high frequency coefficients (zero mean) Want to estimate X given Y (stationary) Bandwidth Extrapolation of Audio Signals

Principal Component Analysis , Taking m eigenvectors, Bandwidth Extrapolation of Audio Signals

Results (Linear Estimation) Cello Cutoff frequency: from 2.75kHz to 10kHz Test/training data subsets of single sample Signal energy Noise energy Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals Overfitting Same weights tested on new song Same instrument, same performer Setting the weights to zero Gave much better results Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals Reducing Overfit Low-order estimator was trained Limited number of non-zero weights Overfitting is reduced but poor S/N ratio results Cutoff freq: 4.125 kHz Bandwidth Extrapolation of Audio Signals

Results (PCA & Linear Estimation) Energy concentration well captured by PCA Magnitude sufficient Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals S/N Ratio using PCA (1) Cello Trained on one sample Test data from new sample Overfit begins around 60 eigenvectors Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals S/N Ratio using PCA (2) Vega Trained & tested on disjoint subsets of sample Y : 0 – 5.5 kHz Y : 3.48 – 5.5 kHz Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals Conclusions MSE criteria and perceptual criteria were not equivalent MDCT produced poorly correlated features which were difficult to predict Estimation degrades further when applied to data with inaccurate knowledge of statistics PCA provided poor description of low frequency for estimation Bandwidth Extrapolation of Audio Signals

Bandwidth Extrapolation of Audio Signals Future Directions Better transform to capture relevant characteristics of audio signals Employ models based on the audible physics of audio signals Divide signal windows into different classes Bandwidth Extrapolation of Audio Signals