Codec 2 open source speech codec

Slides:



Advertisements
Similar presentations
Speech Coding Techniques
Advertisements

Analog Communications
Bandwidth part 1 What does it mean? Different things to different people.
Introduction to Digital Voice over HF
Analogue to Digital Conversion (PCM and DM)
Signal Encoding Techniques (modulation and encoding)
Storey: Electrical & Electronic Systems © Pearson Education Limited 2004 OHT 5.1 Signals and Data Transmission  Introduction  Analogue Signals  Digital.
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
Digital Data Transmission ECE 457 Spring Information Representation Communication systems convert information into a form suitable for transmission.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
Digital Voice Communication on HF
A Full Frequency Masking Vocoder for Legal Eavesdropping Conversation Recording R. F. B. Sotero Filho, H. M. de Oliveira (qPGOM), R. Campello de Souza.
1/21 Chapter 5 – Signal Encoding and Modulation Techniques.
Modulation, Demodulation and Coding Course Period Sorour Falahati Lecture 2.
LE 460 L Acoustics and Experimental Phonetics L-13
Lecture 1 Signals in the Time and Frequency Domains
AUDIO COMPRESSION msccomputerscience.com. The process of digitizing audio signals is called PCM PCM involves sampling audio signal at minimum rate which.
Wireless PHY: Modulation and Demodulation Y. Richard Yang 09/6/2012.
Basic Encoding Techniques
Technician License Course Chapter 2 Radio and Electronics Fundamentals
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.
Outline Transmitters (Chapters 3 and 4, Source Coding and Modulation) (week 1 and 2) Receivers (Chapter 5) (week 3 and 4) Received Signal Synchronization.
1 Composite Signals and Fourier Series To approximate a square wave with frequency f and amplitude A, the terms of the series are as follows: Frequencies:
Technician License Course Chapter 2 Lesson Plan Module 3 – Modulation and Bandwidth.
Write-Log Click her to open this box. Set the frequency and mode
Modulation What is modulation?
When a signal is transmitted over a channel, the frequency band and bandwidth of the channel must match the signal frequency characteristics. Usually,
Modulation.
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
Encoding How is information represented?. Way of looking at techniques Data Medium Digital Analog Digital Analog NRZ Manchester Differential Manchester.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.
Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
Signals Definition of Terms Modes – ways of conveying information. Modulation – the process of impressing or overlaying information to be conveyed onto.
Modulation and Multiplexing ICS 620. Overview Frequency Spectrum Modulation techniques Multiplexing--TDM vs FDM Multiple Access Signal formats.
Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.
FOR MORE CLASSES VISIT  ECET 310 Week 1 Homework 1_1  ECET 310 Week 1 Homework 1_2  ECET 310 Week 2 Assignment Homework 2_1.
Fundamentals of Communications. Communication System Transmitter: originates the signal Receiver: receives transmitted signal after it travels over the.
Codec 2 ● open source speech codec ● low bit rate (2400 bit/s down to 1400 bit/s) ● applications include digital speech for HF and VHF radio ● fills gap.
Codec 2 ● open source speech codec ● low bit rate (2400 bit/s and below) ● applications include digital speech for HF and VHF radio ● fills gap in open.
ECET 310 Entire Course For more classes visit ECET 310 Week 1 Homework 1_1 ECET 310 Week 1 Homework 1_2 ECET 310 Week 2 Assignment.
Motivation ● The (Ham) world needs an open source, patent free speech codec at bit rates of less than 5000 bit/s ● I know how to build one!
Open Source Digital Radio David Rowe. Topics ● Digital Radio and VOIP ● Applications ● Open Source Digital Radio – who cares? ● Codec 2 explained ● Digital.
IT-101 Section 001 Lecture #15 Introduction to Information Technology.
Codec 2 at 450 bit/s Codec 2 Digital Voice over High Frequency (HF) Radio The Mission Why HF is Hard How Analog Does it Demo.
Basics of Small Scale Fading: Towards choice of PHY
Analog and Digital Modulation Techniques
Signal Encoding Techniques
Scalable Speech Coding for IP Networks
KOMUNIKASI DATA Materi Pertemuan 10.
Digital modulation techniques
Digital Communications Chapter 13. Source Coding
Vocoders.
Introduction to electronic communication systems
Overview Communication is the transfer of information from one place to another. This should be done - as efficiently as possible - with as much fidelity/reliability.
Analog Operating Modes
Introduction King Saud University
Signal Encoding Techniques
Speech and Audio Processing
ANALOG COMMUNICATION SYSTEMS
Mobile Systems Workshop 1 Narrow band speech coding for mobile phones
EEC4113 Data Communication & Multimedia System Chapter 3: Broadband Encoding by Muhazam Mustapha, October 2011.
Chapter 10. Digital Signals
Chapter 3: PCM Noise and Companding
Direct Sequence Spread Spectrum Modulation and Demodulation using Compressive Sensing Under the guidance of M.Venugopala Rao Submitted by K.Y.K.Kumari.
Transmitters Advanced Course requires a detailed knowledge of Transmitters and Receivers This session covers Transmitter Block Diagrams, Oscillators and.
Which of the following is a form of amplitude modulation?
REVIEW Physical Layer.
Recap In previous lessons we have looked at how numbers can be stored as binary. We have also seen how images are stored as binary. This lesson we are.
Introduction 1st semester King Saud University
Presentation transcript:

Codec 2 open source speech codec low bit rate (2500 bit/s down to 1200 bit/s) main application is digital speech for HF and VHF radio fills gap in open source speech codecs beneath 5000 bit/s work in progress samples & source: rowetel.com/codec2.html

Ham Radio and Open Source Ham radio is an experimental service Key to the hobby is the need to be able to experiment, understand, and modify Evolving from analog to digital communications But current digital speech modes use proprietary codecs Understanding them is discouraged, experimenting with them is illegal

Patents and Codecs The authors of proprietary/patented codecs borrowed heavily from the public domain Perhaps 5% of the algorithms they use are original and patented 95% of the algorithms in these codecs are public domain algorithms For that 5% we are being locked into their codec and being told to pay them royalties forever. To build an equivalent codec, we simply need alternatives for the 5% that is patented

Speech Coding Take speech samples (e.g. 16 bit samples at 8 kHz sampling rate) Compress to 1400 to 2400 bit/s What can we throw away? Retain intelligible speech Retain natural speech Use a model of speech, send model parameters, more efficient than coding waveform

Model Parameter example of a model parameter is pitch for humans in the range 50 to 500 Hz can be accurately represented with 7 bits updated every 20 ms so 7/0.02 = 350 bit/s to represent pitch

Sinusoidal Speech Model Amplitude 1 Phase 1 Frequency 1 Amplitude 2 Phase 2 Frequency 2 In practice speech consists of voiced speech (like vowels) and unvoiced speech (like consonants) For voiced speech the frequencies are harmonics of the pitch frequency However if the number of harmonics L is high, noise can be accurately generated using harmonics with random phases So instead of sending the speech waveform, we send the pitch frequency, and the harmonic amplitudes and phases These parameters are time varying, so we update them every 10-20ms Amplitude L Phase L Frequency L

Octave Demo

Bit Allocation Alpha codec, subject to rapid change 56 bits per 40ms frame, or 1400 bit/s Parameter Bits/frame Amplitudes 36 Frame Energy + Pitch 16 Voicing 4 Total 56

Sinusoidal Model Samples Original Sinusoidal Model Sinusoidal Model - Low Freq Amplitudes Sinusoidal Model - High Freq Amplitudes Only Sinusoidal Model – All Voiced Sinusoidal Model – All Un-Voiced Sinusoidal Model – Double Pitch

Voice over Digital Radio RF spectrum is extremely limited, noise, bit errors traditional analog speech systems (FM, SSB) are really efficient in power and bandwidth use but there is interest in using digital techniques .. if the right codec is available compressed speech requires less bit/s over channel this means less bandwidth and maybe improved speech quality ....

Power Efficiency: Tighter Compression = Better Speech 1 Watt Noise 1400 bit/s 2 x 1400 = 2800 bit/s You also use less Bandwidth...

Demo: Digital Voice Radio System codec2 enc FDMDV mod mic HF/VHF radio codec2 dec FDMDV demod spkr

Demo Input Speech (300 to 2600 Hz) Modem signal at Tx Modem signal at Rx (SNR = 5dB, 50Hz offset) Decoded Speech using Codec 2 SSB over same channel Decoded Speech (high SNR)

Voice is not like data A single error in a data packet make the entire packet is useless With a few errors in a voice packet it probably sounds OK (previous demo was 1% BER, NO FEC) If you discard or lose a voice packet every now and again it's probably OK In speech packets some bits are more important than others You may not need FEC, or only require it on some bits

Roadmap 2500 b/s 1400 b/s Quality improvements FDMDV modem GUI VOIP VHF Digital Radio FDMDV modem GUI