The Good, the Bad and the Muffled: the Impact of Different Degradations on Internet Speech Anna Watson and M. Angela Sasse Department of CS University.

Slides:



Advertisements
Similar presentations
While you are waiting for this session to begin please make sure your audio is working. Go to the Tools menu, select Audio and then Audio setup wizard.
Advertisements

Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew
Measuring Perceived Quality of Speech and Video in Multimedia Conferencing Applications Anna Watson and M. Angela Sasse Dept. of CS University College.
QoS Impact on User Perception and Understanding of Multimedia Video Clips G. Ghinea and J.P. Thomas Department of Computer Science University of Reading,
Advanced Audio Setup Troubleshooting your audio Users’ Reference.
RTP: A Transport Protocol for Real-Time Applications Provides end-to-end delivery services for data with real-time characteristics, such as interactive.
The Essentials of 2-Level Design of Experiments Part I: The Essentials of Full Factorial Designs The Essentials of 2-Level Design of Experiments Part I:
Receiver-driven Layered Multicast S. McCanne, V. Jacobsen and M. Vetterli University of Calif, Berkeley and Lawrence Berkeley National Laboratory SIGCOMM.
Perception of syllable prominence by listeners with and without competence in the tested language Anders Eriksson 1, Esther Grabe 2 & Hartmut Traunmüller.
IT-101 Section 001 Lecture #8 Introduction to Information Technology.
Speech codecs and DCCP with TFRC VoIP mode Magnus Westerlund
A Basic Multimedia Quality Model Majid Bagheri
Bolo – A Simple Audioconference CS525u Multimedia Computing Due date: Project 2.
CS525u Multimedia Computing Introduction. Introduction Purpose Brief introduction to: –Digital Audio –Digital Video –Perceptual Quality –Network Issues.
The Good, the Bad and the Muffled: the Impact of Different Degradations on Internet Speech Anna Watson and M. Angela Sasse Dept. of CS University College.
Evaluation of Speak Project 2b Due March 24th. Overview Experiments to evaluate performance of your audioconference (proj2) Focus not only on how your.
Evaluation of Speak Project 2b Due March 30 th. Overview Experiments to evaluate performance of your audioconference (proj2) Focus not only on how your.
CS525z Multimedia Networking Review. Analog to Digital What is the relationship between –Fidelity and Sample Size –Fidelity and Sample Rate.
CS 360 – Spring 2007 Pacific University Multimedia Content (Streaming Media) Session “Layer” section Feb 2007.
Successful Multiparty Audio Communication over the Internet Vicky Hardman, M. Angela Sasse and Isidor Kouvelas Department of Computer Science University.
User Interface Testing. Hall of Fame or Hall of Shame?  java.sun.com.
An Empirical Study of Delay Jitter Management Policies D. Stone and K. Jeffay Computer Science Department University of North Carolina, Chapel Hill ACM.
CSc 461/561 CSc 461/561 Multimedia Systems Part C: 1. RTP/RTCP.
IE 429 DESIGNS OF EXPERIMENTS
Nov. 3, 2000 Adaptive Playout Scheduling in Packet Voice Communications.
TCP/IP Protocol Suite 1 Chapter 25 Upon completion you will be able to: Multimedia Know the characteristics of the 3 types of services Understand the methods.
The Effects of Jitter on the Perceptual Quality of Video Mark Claypool and Jonathan Tanner Computer Science Department Worcester Polytechnic Institute.
Using Redundancy and Interleaving to Ameliorate the Effects of Packet Loss in a Video Stream Yali Zhu, Mark Claypool and Yanlin Liu Department of Computer.
Successful Multiparty Audio Communication over the Internet Vicky Hardman, M. Angela Sasse and Isidor Kouvelas Department of Computer Science University.
Using Interleaving to Ameliorate the Effects of Packet Loss in a Video Stream Mark Claypool and Yali Zhu Computer Science Department Worcester Polytechnic.
1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.
V Telecommunications Industry AssociationTR XXX.
ETSI STQ, Taiwan Workshop, February 13th, Recent improvements in transmission quality assessment : Background noise transmission Results of STF.
Raymond S. Pastore, Ph.D..  What is multimedia?  Verbal and Non Verbal representations better for learning than just one (Mayer, 2005)  Modality.
Subjective Sound Quality Assessment of Mobile Phones for Production Support Thorsten Drascher, Martin Schultes Workshop on Wideband Speech Quality in Terminals.
Computer Networks: Multimedia Applications Ivan Marsic Rutgers University Chapter 3 – Multimedia & Real-time Applications.
GSM voice quality Final report Tunis, 15 th of July 2004 Customer: Tunisiana.
Enhancing Teaching and Learning with Podcasts Mico e-Learning Workshop.
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google Talk, and MSN Messenger Chen-Chi Wu, Kuan-Ta Chen, Yu-Chun Chang, and Chin-Laung.
Week Five: Assure Case Study Ruth Vega CMP/555 September 26, 2005.
RESEARCH METHODS 2. Psychological research methods The type of data collected in psychological research is used as the basis of classifying research methods.
BPS - 3rd Ed. Chapter 81 Producing Data: Experiments.
Definition and Coordination of Signal Processing Functions for telephone connections involving automotive speakerphones Scott Pennock Senior Hands-Free.
Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.
1 Presented by Jari Korhonen Centre for Quantifiable Quality of Service in Communication Systems (Q2S) Norwegian University of Science and Technology (NTNU)
University of Plymouth United Kingdom {L.Sun; ICC 2002, New York, USA1 Lingfen Sun Emmanuel Ifeachor Perceived Speech Quality.
Internet Measurment Multimedia 1. Properties Challenges Tools State of the Art 2.
Analogue & Digital. Analogue Sound Storage Devices.
Page 1 The department of Information & Communications Engineering Dong-uk, kim A Survey of Packet Loss Recovery Techniques for Streaming.
Research Design Week 6 Part February 2011 PPAL 6200.
Chapter 13: Experiments and Observational Studies AP Statistics.
SOUND Sound Design is extremely important to the success of a show and for the audience to have a “total” experience Sound can tell its own story.
CHAPTER 9 Producing Data: Experiments BPS - 5TH ED.CHAPTER 9 1.
>> HIGHERVIEW Team: A. Sasse J. D. McCarthy D. Miras J. Riegelsberger Presentation to UCL Network Group: 3rd March 2004.
Ch 6. Multimedia Networking Myungchul Kim
 Face to face  Oral  Written  Visual  Electronic Communication in Administration 2.
Evaluation of Speak Project 2b Due November 4 th.
RTP and playout delay compensation Henning Schulzrinne Dept. of Computer Science Columbia University Fall 2003.
Producing Data: Experiments BPS - 5th Ed. Chapter 9 1.
1 Video and Voice over IP performance over a Satellite link Bob Dixon, Ohio State University/OARnet Prasad Calyam, OARnet Joint Techs Workshops, Columbus,
Olfactory Cues Modulate Facial Attractiveness Dematte, Osterbauer, & Spence (2007)
1 Topic 14 – Experimental Design Crossover Nested Factors Repeated Measures.
Using Speech Recognition to Predict VoIP Quality
VoIP over Wireless Networks
Between-Subjects, within-subjects, and factorial Experimental Designs
Analogue & Digital.
Kouhei Fujimoto Graduate School of Engineering Science
BUS 308 MENTOR Lessons in Excellence--bus308mentor.com.
Overview What is Multimedia? Characteristics of multimedia
Digital Audio Application of Digital Audio - Selected Examples
Presentation transcript:

The Good, the Bad and the Muffled: the Impact of Different Degradations on Internet Speech Anna Watson and M. Angela Sasse Department of CS University College London, London, UK Proceedings of ACM Multimedia November 2000

Introduction Multimedia conferencing is a growing area Well-known that need good quality audio for conferencing to be successful Much research focused on improving delay, jitter, loss Many think bandwidth will cure all problems –But bandwidth has been increasing exponentially while quality has not!

Motivation Large field trial from –13 UK institutions –150 participants Recorded user Perceptual Quality (PQ) –Beginning, Middle, End –(Why not only at end?) –(Why not continuously?) Matched with objective network performance metrics Analysis suggested that network was not primary influence on PQ!

Example: Missing Words Throughout But loss usually far less than 5%! - 1 hour Meeting - UCL to Glasgow -Super Janet -RTP reports 2-5 secs

Problems Cited Missing words –Likely causes: packet loss, poor speech detection, machine glitches Variation in volume –Likely causes: insufficient volume settings (mixer), poor headset quality Variation in quality among participants –Likely causes: high background noise, poor headset quality Experiments to measure which affect quality

Outline Introduction Experiments Results Conclusions

Audioconference Fixed Parameters Robust Audio Tool (RAT) –Home brewed in UCL –Does some repair of packets lost Coded in DVI 40 ms sample size Use “repetition” to repair lost packets –Good for small packets (20ms) –Not as good for large packets (80ms) +(Why not?)

Audioconference Variables Packet loss rates –5% (typical of multicast) and 20% (upper limit of tolerance) “Bad” microphone –Hard to measure, but Altai A087F Volume differences –Quiet, normal, loud through “pilot studies” –(Why can’t users just adjust volume?) Echo –From open microphone –(What is this?)

Measurement Method: Perceptual Quality Not ITU standard (paper at ACM MM ‘99) –Text labels bad –Built for Television quality Subjective through “slightly” labeled scale “Fully subscribe that … speech quality should not be treated as a unidimensional phenomenon…” But …

Measurement Method: Physiological User “cost” –Fatigue, discomfort, physical strain Measure user stress –Using a sensor on the finger Blood Volume Pulse (BVP) –Decreases under stress Heart Rate (HR) –Increases under stress (“Fight” or “Flight”)

Experimental Material Take script from “real” audioconference Act-out by two males without regional accents Actors on Sun Ultra workstations on a LAN –Only audio recorded –16 bit samples (DVI compresses to 4 bits) –Used RAT –Used silence deletion (hey, project 1!) Vary volume and feedback (speakers to mic) Split into 2-minute files, 8Khz, 40 ms packets Repetition when loss

Experimental Conditions Reference – non-degraded 5% loss – both voices, with repetition 20% loss – both voices, with repetition Echo – one had open mic, no headset Quiet – one recorded low volume, other norm Loud – one recorded high volume, other norm Bad mic – one had low quality mic, other norm  Determined “Intelligibility” not affected by above

Subjects 24 subjects –12 men –12 women All had good hearing Age 18 – 28 –(Probably students) None had previous experience in Internet audio or videoconferencing

Procedure Each listened to seven 2-minute test files twice –Played out by their audio tool –Used slider First file had no degradations (“Perfect”) –Users adjusted volume –Were told it was “best” Randomized order of files –Except “perfect” was 1 st and 8 th –So, 7 conditions heard once than another order Baseline physiological readings for 15 min When done, explain rating (tape-recorded)

Outline Introduction Experiments Results Conclusions

Quality Under Degradation Statistically significant?

Statistical Significance Tests Anova Test –For comparing means of two groups: first hearing and second hearing –No statistical difference between the two groups Analysis of variance –Degradation effect significant +Reference and mean of all others are different –Reference and 5% loss the same –Reference and Quiet the same –5% Loss and Quiet the same –20% Loss and Echo and Loud the same

Physiological Results: HR

Physiological Results: BVP Statistically significant?

Physiological Statistical Significance Tests Bad mic, loud and 20% loss all significantly more stressful than quiet and 5% loss Echo significantly more stressful than quiet in the HR data only Contrast to quality! –Bad Mic worse than 20% loss –Least stressful were quiet and 5% loss

Qualitative Results Asked subjects to describe why each rating Could clearly identify –quiet, loud and echo Bad mic –‘distant’, ‘far away’ or ‘muffled’ –‘on the telephone’, ‘walkie-talkie’ or ‘in a box’

Qualitative Results of Loss 5% loss –‘fuzzy’ and ‘buzzy’ (13 of 24 times) +From waveform changing in the missing packet and not being in the repeated packet –‘robotic’, ‘metallic’, ‘electronic’ (7 times) 20% loss –‘robotic’, ‘metallic’, ‘digital’, ‘electronic’ (15 times) –‘broken up’ and ‘cutting out’ (10 times) –‘fuzzy’ and ‘buzzy’ infrequently (2 times) 5 said ‘echo’, 10 described major volume changes –Not able to reliably see the cause of the degradation

Discussion 5% loss is different than reference condition (despite stats) because of descriptions –But subjects cannot identify it well –Need a tool to identify impairments 20% loss is worse than bad mic based on quality, but is the same based on physiological results –need to combine physiological and subjective Methodology of field trials to design controlled experiments can help understand media quality issues

Conclusion Audio quality degradation not primarily from loss –Volume, mic and echo are worse –And these are easy to fix! Educating users harder. By getting descriptions, should be easier to allow users to diagnose problems –Ex: ‘fuzzy’ or ‘buzzy’ to repetition for repair Volume changes harder –Could be reflected back to the user –Could do expert system to make sure certain quality before being allowed in

Future Work Delay and jitter compared with other degradations Interactive environments rather than just listening –Ex: echo probably worse Combination effects –Ex: bad mic plus too loud