2nd Workshop on Wideband Speech Quality - June 2005 1 Perceptual Wideband Audio Quality Assessments Using PEAQ Christian Schmidmer Opticom GmbH, Erlangen.

Slides:



Advertisements
Similar presentations
IP Cablecom and MEDIACOM 2004 Prediction and Monitoring of Quality for VoIP services Quality for VoIP services Vincent Barriac – France Télécom R&D SG12.
Advertisements

Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.
VMR-WB – Operation of the 3GPP2 Wideband Speech Coding Standard M. Jelinek†, R. Salami‡ and S. Ahmadi * †University of Sherbrooke, Canada ‡VoiceAge Corporation,
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Part II (MPEG-4) Audio TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jörgen Ahlberg.
Department of Computer Engineering University of California at Santa Cruz MPEG Audio Compression Layer 3 (MP3) Hai Tao.
Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew
Psycho-acoustics and MP3 audio encoding
MPEG/Audio Compression Tutorial Mike Blackstock CPSC 538a January 11, 2004.
CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
Time-Frequency Analysis Analyzing sounds as a sequence of frames
Digital Audio Coding – Dr. T. Collins Standard MIDI Files Perceptual Audio Coding MPEG-1 layers 1, 2 & 3 MPEG-4.
AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 
2nd Workshop on Wideband Speech Quality - June nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd.
1 TAC2000/ IP Telephony Lab Perceptual Evaluation of Speech Quality (PESQ) Speaker: Wen-Jen Lin Date: Dec
Christian Schmidmer, OPTICOM1 Subjective Quality Testing - Voice & Audio.
Speech & Audio Processing
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
Audio CompressiontMyn1 Audio Compression Audio compression has become well entrenched in consumer and professional digital audio products such as the compact.
8th and 9th June 2004 Mainz, Germany Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 1 Vincent Barriac, Jean-Yves.
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
Ni.com Data Analysis: Time and Frequency Domain. ni.com Typical Data Acquisition System.
Digital Audio Watermarking: Properties, characteristics of audio signals, and measuring the performance of a watermarking system نيما خادمي کلانتري
DIGITAL WATERMARKING OF AUDIO SIGNALS USING A PSYCHOACOUSTIC AUDITORY MODEL AND SPREAD SPECTRUM THEORY By: Ricardo A. Garcia University of Miami School.
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Media Representations - Audio
Subjective Sound Quality Assessment of Mobile Phones for Production Support Thorsten Drascher, Martin Schultes Workshop on Wideband Speech Quality in Terminals.
Tratamiento Digital de Voz Prof. Luis A. Hernández Gómez ftp.gaps.ssr.upm.es/pub/TDV/DOC/ Tema2c.ppt Dpto. Señales, Sistemas y Radiocomunicaciones.
Colombia, September 2013 The importance of models and procedures for planning, monitoring and control in the provision of communications services.
Dhatchaini Rajendran Student ID: Date :
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 9 This presentation © 2004, MacAvon Media Productions Sound.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
Chapter 7: Loudness and Pitch. Loudness (1) Auditory Sensitivity: Minimum audible pressure (MAP) and Minimum audible field (MAF) Equal loudness contours.
1 Presented by Jari Korhonen Centre for Quantifiable Quality of Service in Communication Systems (Q2S) Norwegian University of Science and Technology (NTNU)
Department of Communication and Electronic Engineering University of Plymouth, U.K. Lingfen Sun Emmanuel Ifeachor New Methods for Voice Quality Evaluation.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
SwissQual AG – Your QoS Partner Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 1 8th and 9th June Mainz,
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba, Japan.
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
AIMS’99 Workshop Heidelberg, May 1999 Assessing Audio Visual Quality P905 - AQUAVIT Assessment of Quality for audio-visual signals over Internet.
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
Audio Streaming © Nanda Ganesan, Ph.D.. Audio File Features Audio file is a record of captured sound that can be played back –The WAV File is an example.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.
Alan Clark Telchemy Modeling the effects of Burst Packet Loss and Recency on Subjective Voice Quality Alan Clark Telchemy
1 Video and Voice over IP performance over a Satellite link Bob Dixon, Ohio State University/OARnet Prasad Calyam, OARnet Joint Techs Workshops, Columbus,
Session 18 The physics of sound and the manipulation of digital sounds.
EE5359 Multimedia Processing Project Study and Comparison of AC3, AAC and HE-AAC Audio Codecs Dhatchaini Rajendran Student ID: Date :
Fletcher’s band-widening experiment (1940)
SOUND PRESSURE, POWER AND LOUDNESS
Fletcher’s band-widening experiment (1940) Present a pure tone in the presence of a broadband noise. Present a pure tone in the presence of a broadband.
Using Speech Recognition to Predict VoIP Quality
VoIP over Wireless Networks
Fletcher’s band-widening experiment (1940)
PSYCHOACOUSTICS A branch of psychophysics
Objective and Subjective Audio Assessment of MP3 Players’ Quality
Spread Spectrum Audio Steganography using Sub-band Phase Shifting
– Workshop on Wideband Speech Quality in Terminals and Networks
Speech Perception (acoustic cues)
Govt. Polytechnic Dhangar(Fatehabad)
Presentation transcript:

2nd Workshop on Wideband Speech Quality - June Perceptual Wideband Audio Quality Assessments Using PEAQ Christian Schmidmer Opticom GmbH, Erlangen

2nd Workshop on Wideband Speech Quality - June Contents  Quality, definitions  User expectation  Subjective tests  Psychoacoustics  PEAQ  PESQ vs. PEAQ

2nd Workshop on Wideband Speech Quality - June Aspects of Perceived Quality  Conversational Quality = ...

2nd Workshop on Wideband Speech Quality - June What is “Quality”? “Quality is the difference between what we perceive and what we expect.” From habilitation thesis of Prof. Ute Jekosch “…they are used to phones that sound like a phone.” Frank Meier, Infineon Maybe more important: …is for free.

2nd Workshop on Wideband Speech Quality - June Differences in Perception of Voice and Audio  Experience, a priori knowledge  Expectation  Cognitive effects  “Error correction”  Different subjective tests require different models

2nd Workshop on Wideband Speech Quality - June The Problem of Subjective Scales BitrateMOS 256kBit/s5 128kBit/s4 … 64kBit/s1 BitrateMOS 128Bit/s5 64kBit/s4 … 16kBit/s1 High Quality: Intermediate Quality: The range of qualities in the subjective test defines the subjective scale!

2nd Workshop on Wideband Speech Quality - June MOS acc. To P.800  Standardized Listening Test Procedure acc. to ITU-T P.800ff  Absolute Category Rating Test (ACR), no comparison to reference signal (original)  „How good does it sound?“  5-point grading scale ‚opinion scale‘  Averaging over test Subjects: MOS ‚Mean Opinion Score‘  Language dependent! Excellent Good Fair Poor Bad ImpairmentGrade

2nd Workshop on Wideband Speech Quality - June Standardised assessment procedure for 'small impairments' in audio systems (ITU-R 1994) Comparison between reference and test signal Very sensitive to subtle distortions double-blind triple-stimulus with hidden reference Subjective Assessment in ITU-R BS.1116 OriginalAB original / coded coded / original

2nd Workshop on Wideband Speech Quality - June Continuous grading scale with “anchors” “Subjective Difference Grade“ (SDG) Question: „How different do the files sound“ Subjective Assessment in ITU-R BS.1116

2nd Workshop on Wideband Speech Quality - June Subjective Testing of Intermediate Audio Quality (IAQ) “MUSHRA” Multi Stimulus Test with Hidden Reference and Anchors  developed by EBU working group B/AIM  targets at IAQ  ITU-R BS.1534

2nd Workshop on Wideband Speech Quality - June MUSHRA Test Training of Subjects subjects can randomly access all types of codecs at similar bitrate comparison with CD quality reference two low-pass 'anchors' (7kHz, 3.5kHz) incl.

2nd Workshop on Wideband Speech Quality - June MUSHRA Test Scoring Phase comparison with CD reference, hidden reference inc.. two low-pass 'anchors' (7kHz, 3.5kHz) inc.. subjects can randomly assess all codecs under test of similar bitrate at the same time subjects adjust slider, no score involved slider mapped to

2nd Workshop on Wideband Speech Quality - June Comparison of Subjective Test Methods

2nd Workshop on Wideband Speech Quality - June

2nd Workshop on Wideband Speech Quality - June Temporal Masking t [ms] SL [dB] Pre-Simultaneous-Postmasking Premasking: 2-5ms Postmasking: 120ms Depending on the signal characteristics of the masker Masker

2nd Workshop on Wideband Speech Quality - June Pitch Scale / Critical Bands A sine tone and a noise of critical bandwidth with the same center frequency and energy density are perceived equally loud.

2nd Workshop on Wideband Speech Quality - June Threshold in Quiet - Masked Threshold Threshold in Quiet

2nd Workshop on Wideband Speech Quality - June PEAQ is based on: –PAQM KPN Research, Netherlands / OPTICOM –NMR Fraunhofer, Germany / OPTICOM –DIX TU Berlin / Deutsche Telekom Berkom –POM CCETT, France –PERCEVAL CRC, Canada –"Tool box" IRT, Germany ITU-R TG 10/4: Call for proposals (1995) Jan released as ITU-R Rec. BS.1387 PEAQ

2nd Workshop on Wideband Speech Quality - June Intrusive Testing Network X A Network Y B Comparison with known stimulus: + Very high accuracy +Black box approach – no knowledge of DUT - Requires a reference signal -Generates traffic Alternatively both signals may be captured by the test system!

2nd Workshop on Wideband Speech Quality - June Two Versions of PEAQ:  PEAQ „Basic“  computational efficiency  realtime performance  PEAQ „Advanced“  highest possible accuracy

2nd Workshop on Wideband Speech Quality - June Structure of a perceptual measurement tool Reference (=sent file) Feature- Extractor Perceptual Model Test (=received file) Cognitive Model MOS (Quality Measure) Perceptual Model a b a b

2nd Workshop on Wideband Speech Quality - June Excitation Listening Level (dB SPL) Input Signal 1 FFT & Scaling 2048 Punkte 42.6ms/23.4Hz Outer and Middle Ear Weighting Grouping into Critical Bands ¼ Bark “Pitch” Internal Noise Spreading Temporal Masking Forward masking 2 + fs=48kHz (fs=44.1kHz) a b Perceptual Model, PEAQ “Basic”

2nd Workshop on Wideband Speech Quality - June MOVs used in PEAQ “Basic” Version

2nd Workshop on Wideband Speech Quality - June Perceptual Model, PEAQ “Advanced”

2nd Workshop on Wideband Speech Quality - June

2nd Workshop on Wideband Speech Quality - June PEAQ vs. MUSHRA Microsoft Windows Media 4 MPEG-4 AAC (Fraunhofer) MP3 (Fraunhofer) Quicktime 4, Music-Codec 2 (Qdesign) Real Audio 5.0 RealAudio G2 MPEG-4 TwinVQ (Yahama) EBU Tests of Internet Audio Codecs

2nd Workshop on Wideband Speech Quality - June Constraints of MUSHRA Testing no absolute scores: -> scores depend on the test condition low-pass anchors are only one quality dimension -> disturbance of artefacts is another one spreading of the scale from best to worst -> what about adding new items to an existing test? In order to verify PEAQ performance we must adjust the best and worst item (not the anchors!)

2nd Workshop on Wideband Speech Quality - June PEAQ vs. MUSHRA (EBU Test)

2nd Workshop on Wideband Speech Quality - June Results

2nd Workshop on Wideband Speech Quality - June Results

2nd Workshop on Wideband Speech Quality - June Results

2nd Workshop on Wideband Speech Quality - June Results

2nd Workshop on Wideband Speech Quality - June Results

2nd Workshop on Wideband Speech Quality - June When to use PEAQ or PESQ  Is it a BS.1116 or MUSHRA Experiment?  Use PEAQ!  Is the subjective test P.800?  Is it speech? Yes: –Is the bandwidth <= 8kHz? »Yes: Use PESQ! »No: Use PEAQ with care! No: »Use PEAQ with care!

2nd Workshop on Wideband Speech Quality - June Final Question:  Can I use PESQ instead of PEAQ?  Perception of voice differs from perception of music  PESQ time alignment fails on music  PEAQ and PESQ are modelling different subjective tests No!

2nd Workshop on Wideband Speech Quality - June OPTICOM Germany More Information: Thank you!