2005/11/101 KOZ Scalable Audio Speaker: 陳繼大 An Introduction.

Slides:



Advertisements
Similar presentations
DCSP-4: Modem Jianfeng Feng Department of Computer Science Warwick Univ., UK
Advertisements

DCSP-3: Fourier Transform Jianfeng Feng Department of Computer Science Warwick Univ., UK
Copyright 2001, Agrawal & BushnellVLSI Test: Lecture 181 Lecture 18 DSP-Based Analog Circuit Testing  Definitions  Unit Test Period (UTP)  Correlation.
From
Quantification of Nonlinearity and Nonstionarity Norden E. Huang With collaboration of Zhaohua Wu; Men-Tzung Lo; Wan-Hsin Hsieh; Chung-Kang Peng; Xianyao.
Speech Compression. Introduction Use of multimedia in personal computers Requirement of more disk space Also telephone system requires compression Topics.
0 - 1 © 2010 Texas Instruments Inc Practical Audio Experiments using the TMS320C5505 USB Stick “Sine Waves” Texas Instruments University Programme Teaching.
Fourier Transform – Chapter 13. Image space Cameras (regardless of wave lengths) create images in the spatial domain Pixels represent features (intensity,
AES 120 th Convention Paris, France, 2006 Adaptive Time-Frequency Resolution for Analysis and Processing of Audio Alexey Lukin AES Student Member Moscow.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
Dr. Jie ZouPHY Chapter 8 (Hall) Sound Spectra.
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
Intro to Fourier Analysis Definition Analysis of periodic waves Analysis of aperiodic waves Digitization Time-frequency uncertainty.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Effective Bits. An ideal model of a digital waveform recorder OffsetGain Sampling Timebase oscillator Fs ADC Waveform Memory Address counter Compute Engine.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Dr. Leon O Chua – Director 151M Cory Hall University of California, Berkeley August 30 th 2007 Chua’s Circuit: The Paradigm.
Human Psychoacoustics shows ‘tuning’ for frequencies of speech If a tree falls in the forest and no one is there to hear it, will it make a sound?
Basics of Signal Processing. frequency = 1/T  speed of sound × T, where T is a period sine wave period (frequency) amplitude phase.
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
Q factor of an underdamped oscillator large if  is small compared to  0 Damping time or "1/e" time is  = 1/   (>> 1/   if  is very small)
Goals For This Class Quickly review of the main results from last class Convolution and Cross-correlation Discrete Fourier Analysis: Important Considerations.
Digital Audio What do we mean by “digital”? How do we produce, process, and playback? Why is physics important? What are the limitations and possibilities?
Vibrationdata 1 Unit 5 The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
Digital Sound and Video Chapter 10, Exploring the Digital Domain.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Lecture 1 Signals in the Time and Frequency Domains
Basics of Signal Processing. SIGNALSOURCE RECEIVER describe waves in terms of their significant features understand the way the waves originate effect.
Lecture 18 DSP-Based Analog Circuit Testing
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
CSC361/661 Digital Media Spring 2002
Multiresolution STFT for Analysis and Processing of Audio
Fourier series. The frequency domain It is sometimes preferable to work in the frequency domain rather than time –Some mathematical operations are easier.
Module 2 SPECTRAL ANALYSIS OF COMMUNICATION SIGNAL.
Wireless and Mobile Computing Transmission Fundamentals Lecture 2.
Under Supervision of Dr. Kamel A. Arram Eng. Lamiaa Said Wed
SPEECH CODING Maryam Zebarjad Alessandro Chiumento.
EE210 Digital Electronics Class Lecture 2 March 20, 2008.
Introduction to Quantum Chaos
Lecture 1B (01/07) Signal Modulation
Signals CY2G2/SE2A2 Information Theory and Signals Aims: To discuss further concepts in information theory and to introduce signal theory. Outcomes:
UNIVERSITY OF MONTENEGRO ELECTRICAL ENGINEERING DEPARTMENT Igor DJUROVIĆ Vesna RUBEŽIĆ Time-Frequency Representations in Detection of Chaotic State in.
Vibrationdata 1 Unit 5 The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Week 7 Lecture 1+2 Digital Communications System Architecture + Signals basics.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
421 Pendulum Lab (5pt) Equation 1 Conclusions: We concluded that we have an numerically accurate model to describe the period of a pendulum at all angles.
Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.
Vibrationdata 1 Unit 6a The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
Novel Cascaded Chaotic Masking for Secure Communications
Frequency Modulation ECE 4710: Lecture #21 Overview:
Control and Synchronization of Chaos Li-Qun Chen Department of Mechanics, Shanghai University Shanghai Institute of Applied Mathematics and Mechanics Shanghai.
COMP135/COMP535 Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 2 Lecture 2 – Digital Representations.
1 Signals. 2 Signals Introduction Introduction Analog and Digital Analog and Digital.
Analysis of Traction System Time-Varying Signals using ESPRIT Subspace Spectrum Estimation Method Z. Leonowicz, T. Lobos
Speech Processing Using HTK Trevor Bowden 12/08/2008.
The Spectrum n Jean Baptiste Fourier ( ) discovered a fundamental tenet of wave theory.
ECE3340 Review of Numerical Methods for Fourier and Laplace Transform Applications – Part 1 Fourier Spring 2016 Prof. Han Q. Le Note: PPT file is the.
Introduction to Audio Watermarking Schemes N. Lazic and P
CS 591 S1 – Computational Audio
COMPUTER NETWORKS and INTERNETS
Unit 5 The Fourier Transform.
Continuous-Time Signal Analysis
For a periodic complex sound
Speech Pathologist #10.
EE210 Digital Electronics Class Lecture 2 September 03, 2008
Introduction to Csound 2.
Localizing the Chaotic Strange Attractors of Multiparameter Nonlinear Dynamical Systems using Competitive Modes A Literary Analysis.
Finding the Popular Frequency!
Presentation transcript:

2005/11/101 KOZ Scalable Audio Speaker: 陳繼大 An Introduction

2005/11/10P 22 References  K. M. Short et al, "An Introduction to the KOZ Scalable Audio Compression Technology", AES 118th Convention Paper, Barcelona, May 2005, Preprint 6446  M. K. Johnson, "Controlled Chaos and Other Sound Synthesis Techniques," Thesis for the Degree of Bachelor of Science, University of New Hampshire, May 2000  Douglas J. Nelson, and Kevin M. Short, “A channelized cross spectral method for improved frequency resolution.”, Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time- Scale Analysis. IEEE Press, October 1998.

2005/11/10P 33 References (cont.)  “KOZ scalable audio compression” SO/IEC JTC 1/SC 29/WG11 M12253

2005/11/10P 44 Outline  Introduction  Double-scroll oscillator  High Freq. Resolution Analysis  Unified Domain  KOZ Scalable Audio  Results  Conclusions

2005/11/10P 55 Introduction  Traditional transform/subband based codecs encode data by quantizing the coefficients according to psychoacoustic model  Parametric coding is another way for coding – it records the parameters of models, rather than coefficients.  KOZ scalable audio belongs to parametric coding methods  KOZ scalable audio takes Chaos system as the model  The Chaotic system is a nonlinear system

2005/11/10P 66 Introduction (cont.)  Features of the KOZ codec Flexibility over a wide range of bitrates Both small-step and large-step scalability High resolution objects allows easy decoder-side post-processing Integrated Digital Rights Management

2005/11/10P 77 Double-scroll oscillator  Chaotic system: nonlinear dynamical systems deterministic mathematical object sensitive dependence on initial conditions predictable over a short period of time unpredictable in terms of long-term behavior

2005/11/10P 88 Double-scroll oscillator (cont.)  Cupolets: Output periodic waveforms of Chaotic system control process requires only on the order of 16 bits of information but the cupolets can be as simple as a sine wave or so complex that they have more than 200 harmonics in their spectrum

2005/11/10P 99 Double-scroll oscillator (cont.)  A chaotic system will settle down onto a complicated structure called an attractor – settle down onto the same attractor no matter what initial conditions are used  A chaotic system in its natural state is aperiodic.  To stabilize these orbits – simply perturbing the state of the system in certain fixed locations by a tiny amount.

2005/11/10P 1010 Double-scroll oscillator (cont.)  Double-scroll oscillator: One kind of chaotic system Nonlinear differential equations

2005/11/10P 1111 Double-scroll oscillator (cont.) where Parameters:  C, L, G, m, B

2005/11/10P 1212 Double-scroll oscillator (cont.)

2005/11/10P 1313 Double-scroll oscillator (cont.)  Double-scroll attractor can be controlled in such a way that the trajectories around it become periodic.  Control perturbing: a bit string, generally of 16 bits applied at an intersection with the control line  periodic orbits are in one-to-one correspondence with the control string used, independent of the initial state of the system

2005/11/10P 1414 Double-scroll oscillator (cont.)

2005/11/10P 1515 High Freq. Resolution Analysis  Detect (the accurate freq.) of tones  IF-based methods Differentiation of the signal phase fail completely if the signal environment consists of more than one sinusoid  CPS (Cross Power Spectral) Time-averaged IF method Phase differentiation is applied to a time- varying Fourier transform Fourier transform is used to “channelize” the signal isolating the tones

2005/11/10P 1616 High Freq. Resolution Analysis (cont.)  Improved (channelized) CPS estimator CPS can not detect and estimate tones which are not well separated Employ a second Fourier transform  TVFT: Time varying Fourier transform

2005/11/10P 1717 High Freq. Resolution Analysis (cont.)  Channelized CPS if f(t) is tone 

2005/11/10P 1818 High Freq. Resolution Analysis (cont.)

2005/11/10P 1919 Unified Domain  Convert the multiple channels into Special unitary group  Special unitary group group of n×n unitary matrices subgroup of the unitary group SU(2)

2005/11/10P 2020 KOZ Scalable Audio

2005/11/10P 2121 KOZ Scalable Audio (cont.)  Prioritize the components Psychoacoustics are used in order of perceptual importance  Classes of objects are then sorted in order of their “perceptual relevance”  Objects are segregated and written to the floating-point.CCA file format.  Scalability of KOZ is fulfilled by sorting.

2005/11/10P 2222 KOZ Scalable Audio (cont.)

2005/11/10P 2323 KOZ Scalable Audio (cont.)

2005/11/10P 2424 Results

2005/11/10P 2525 Conclusions  KOZ scalable audio takes chaotic system to model audio signal  CPS is applied to find tones and their accurate freq.  Scalability is fulfilled by sorting classes of objects with order of their “perceptual relevance”