Digital Audio Watermarking: Properties, characteristics of audio signals, and measuring the performance of a watermarking system نيما خادمي کلانتري Email:

Slides:

Advertisements

Similar presentations

Speech Compression. Introduction Use of multimedia in personal computers Requirement of more disk space Also telephone system requires compression Topics.

Advertisements

Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.

4.2 Digital Transmission Pulse Modulation (Part 2.1)

Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.

SIMS-201 Characteristics of Audio Signals Sampling of Audio Signals Introduction to Audio Information.

IT-101 Section 001 Lecture #8 Introduction to Information Technology.

1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 

Quantization Prof. Siripong Potisuk.

8/16/20021 Digital Transmission Key Learning Points Fundamentals of Voice Digitization Pulse Code Modulation Quantification Noise Multiplexed Digital Lines.

1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.

MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.

Digital Watermarking. Introduction Relation to Cryptography –Cryptography is Reversibility (no evidence) Established –Watermarking (1990s) Non-reversible.

© 2006 Cisco Systems, Inc. All rights reserved. 2.2: Digitizing and Packetizing Voice.

Fundamentals of Digital Audio. The Central Problem n Waves in nature, including sound waves, are continuous: Between any two points on the curve, no matter.

Chapter 4 Digital Transmission

SIMS-201 Audio Digitization. 2  Overview Chapter 12 Digital Audio Digitization of Audio Samples Quantization Reconstruction Quantization error.

Digital Audio Multimedia Systems (Module 1 Lesson 1)

 Principles of Digital Audio. Analog Audio  3 Characteristics of analog audio signals: 1. Continuous signal – single repetitive waveform 2. Infinite.

A Full Frequency Masking Vocoder for Legal Eavesdropping Conversation Recording R. F. B. Sotero Filho, H. M. de Oliveira (qPGOM), R. Campello de Souza.

Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.

Fundamentals of Multimedia, Chapter 6 Sound Intro Tamara Berg Advanced Multimedia 1.

Formatting and Baseband Modulation

DSP for Dummies aka How to turn this (actual raw sonar trace) Into this.. (filtered sonar data)

Digital audio. In digital audio, the purpose of binary numbers is to express the values of samples that represent analog sound. (contrasted to MIDI binary.

LE 460 L Acoustics and Experimental Phonetics L-13

Digital Audio What do we mean by “digital”? How do we produce, process, and playback? Why is physics important? What are the limitations and possibilities?

Ni.com Data Analysis: Time and Frequency Domain. ni.com Typical Data Acquisition System.

Sampling Terminology f 0 is the fundamental frequency (Hz) of the signal –Speech: f 0 = vocal cord vibration frequency (>=80Hz) –Speech signals contain.

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

DTC 354 Digital Storytelling Rebecca Goodrich. Wave made up of changes in air pressure by an object vibrating in a medium—water or air.

DIGITAL WATERMARKING OF AUDIO SIGNALS USING A PSYCHOACOUSTIC AUDITORY MODEL AND SPREAD SPECTRUM THEORY By: Ricardo A. Garcia University of Miami School.

Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.

CSC361/661 Digital Media Spring 2002

Media Representations - Audio

Digital Recording Theory Using Peak. Listening James Tenney, Collage #1 (“Blue Suede”),  Available in Bracken Library, on James Tenney Selected.

MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.

Hearing Chapter 5. Range of Hearing Sound intensity (pressure) range runs from watts to 50 watts. Frequency range is 20 Hz to 20,000 Hz, or a ratio.

© 2006 Cisco Systems, Inc. All rights reserved. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations.

ECE 4710: Lecture #6 1 Bandlimited Signals  Bandlimited waveforms have non-zero spectral components only within a finite frequency range  Waveform is.

1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.

Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.

Georgia Institute of Technology Introduction to Processing Digital Sounds part 1 Barb Ericson Georgia Institute of Technology Sept 2005.

1 Introduction to Information Technology LECTURE 6 AUDIO AS INFORMATION IT 101 – Section 3 Spring, 2005.

Basic Concepts of Audio Watermarking. Selection of Different Approaches Embedding Domain  time domain  frequency domain DFT, DCT, etc. Modulation Method.

CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.

Encoding and Simple Manipulation

By Sarita Jondhale 1 The process of removing the formants is called inverse filtering The remaining signal after the subtraction of the filtered modeled.

Chapter 2 Basic Science: Analog and Digital Audio.

Intro-Sound-part1 Introduction to Processing Digital Sounds part 1 Barb Ericson Georgia Institute of Technology Oct 2009.

CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.

Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.

Multimedia Sound. What is Sound? Sound, sound wave, acoustics Sound is a continuous wave that travels through a medium Sound wave: energy causes disturbance.

Session 18 The physics of sound and the manipulation of digital sounds.

Digital Audio I. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul.

Fundamentals of Multimedia Chapter 6 Basics of Digital Audio Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.

Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.

[1] National Institute of Science & Technology Technical Seminar Presentation 2004 Suresh Chandra Martha National Institute of Science & Technology Audio.

COMPUTER NETWORKS and INTERNETS

Spread Spectrum Audio Steganography using Sub-band Phase Shifting

4.1 Chapter 4 Digital Transmission Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

CS 4594 Data Communications

Basic Concepts of Audio Watermarking

Chapter 2 Signal Sampling and Quantization

4.2 Digital Transmission Pulse Modulation (Part 2.1)

Increasing Watermarking Robustness using Turbo Codes

Bits and Pieces November 6, 2007.

Digital Control Systems Waseem Gulsher

COMS 161 Introduction to Computing

Analog to Digital Encoding

Govt. Polytechnic Dhangar(Fatehabad)

Presentation transcript:

Digital Audio Watermarking: Properties, characteristics of audio signals, and measuring the performance of a watermarking system نيما خادمي کلانتري

Properties (1) Inaudibility ◦ Similarity between the original and watermarked signal Robustness ◦ Ability to detect the watermark after common signal processing and malicious attacks Data Payload ◦ The number of embedded bits per second 2

Properties (2) Statistical invisibility ◦ Performing statistical tests on a set of watermarked files should not reveal any information about the nature of the embedded information, nor about the technique used for watermarking Redundancy ◦ To ensure robustness the watermark information is embedded in multiple places on audio file 3

Different types of watermarks Robust ◦ Watermarks that are robust against attacks Fragile ◦ Have only very limited robustness Semi-Fragile ◦ Robust to some limited attacks Perceptible ◦ Watermark that can be easily perceived by the user 4

Different types of watermarks Bitstream watermark ◦ Marks that embedded directly into compressed audio Fingerprinting ◦ A special application of watermarking in which information such as recipient of the data is used to form the watermark 5

6 How Sound Perceived The cochlea, an organ in our inner ears, detects sound. The cochlea is joined to the eardrum by three tiny bones. It consists of a spiral of tissue filled with liquid and thousands of tiny hairs. The hairs get smaller as you move down into the cochlea. Each hair is connected to a nerve which feeds into the auditory nerve bundle going to the brain. The longer hairs resonate with lower frequency sounds, and the shorter hairs with higher frequencies. Thus the cochlea serves to transform the air pressure signal experienced by the ear drum into frequency information which can be interpreted by the brain as sound.

7 Digitization of Sound Sampling ◦ Most humans can’t hear anything over 20 kHz. ◦ The sampling rate must be more than twice the highest frequency component of the sound (Nyquist Theorem). ◦ CD quality is sampled at 44.1 kHz. ◦ Frequencies over kHz are filtered out before sampling is done. Quantization ◦ Telephone quality sound uses 8 bit samples. ◦ CD quality sound uses 16 bit samples (65,536 quantization levels) on two channels for stereo.

8 Encoder Design A. Apply bandlimiting filter to remove high frequency components. B. Sample at regular time intervals. C. Quantize each sample.

9 Sampling Error (Undersampling) If you undersample, one frequency will alias as another. For CD quality, frequencies above kHz are filtered out, and then the sound is sampled at 44.1 kHz.

10 Quantization Interval If V max is the maximum positive and negative signal amplitude and n is the number of binary bits used, then the magnitude of the quantization interval, q, is defined as follows: For example, what if we have 8 bits and the values range from –1000 to +1000?

11 Quantization Error (Noise) Any values within a quantization interval will be represented by the same binary value. Each code word corresponds to a nominal amplitude value that is at the center of the corresponding quantization interval. The actual signal may differ from the code word by up to plus or minus q/2, where q is the size of the quantization interval.

12 Quantization Intervals and Resulting Error

13 Insufficient Quantization Levels Insufficient quantization levels result from not using enough bits to represent each sample. Insufficient quantization levels force you to represent more than one sound with the same value. This introduces quantization noise. Dithering can improve the quality of a digital file with a small sample size (relatively few quantization levels).

14 Linear Vs. Non-Linear Quantization In linear quantization, each code word represents a quantization interval of equal length. In non-linear quantization, you use more digits to represent samples at some levels, and less for samples at other levels. For sound, it is more important to have a finer-grained representation (i.e., more bits) for low amplitude signals than for high because low amplitude signals are more sensitive to noise. Thus, non-linear quantization is used.

15 u-Law Used in North America and Japan

16 A-Law Used in Europe and the rest of the world and international routes

17 Discrete Fourier Transform

18 Fourier Transform of rect(t/τ)

19 Window function (1) Rectangular window

20 Window function (2) hamming window

21 Window function (3) hanning window

Critical bands 22

Bark to frequency conversion 23

Critical bands by Zwicker 24

Absolute Threshold of Hearing (ATH) 25

Frequency masking (1) 26

Frequency masking (2) 27

Cepstrum domain 28

Discrete Cosine Transform 29

Wavelet Transform 30

Measuring transparency (1) 31 Subjective tests ◦ Discriminative test ◦ Mean Opinion Score (MOS)

Measuring transparency (2) 32 Objective measures

Measuring transparency 33 Feature Extraction Feature Comparison Quality Estimation ODG Original Signal Watermarke d Signal O bjective D ifference G rade

Measuring transparency Objective test ◦ Perceptual Audio Quality Measurement (PAQM) ◦ Noise to Mask Ratio (NMR) ◦ Perceptual Evaluation of Audio Quality (PEAQ)  Report a value between 0 and -4. higher values show more transparency and vice versa ◦ Perceptual Evaluation of Speech Quality (PESQ)  Report a value between 4.5 and 0.5. higher values show more transparency and vice versa 34

Measuring Robustness 1.Embed a random watermark W on the audio signal A. does not diminish the fidelity of the cover below a specified minimum 2.Apply a set of relevant signal processing operations to the watermarked audio signal A’. 3.Extract the watermark W using the corresponding detector and measure the success of the recovery process ※ Bit-error rate(BER): ratio of incorrect extracted bits to the total number of embedded bits 35

Measuring Robustness 36 Normalized Correlation False Negative Alarm ◦ Detecting no watermark in a work that actually contain one False Positive Alarm ◦ Detection of a watermark in a work that does not actually contain one