Digital Voice Communication on HF

Slides:



Advertisements
Similar presentations
Q-MAC Electronics Pty Ltd
Advertisements

Analog Communications
Introduction to Digital Voice over HF
Copyright 2001, Agrawal & BushnellVLSI Test: Lecture 181 Lecture 18 DSP-Based Analog Circuit Testing  Definitions  Unit Test Period (UTP)  Correlation.
Analogue to Digital Conversion (PCM and DM)
Chapter-3-1CS331- Fakhry Khellah Term 081 Chapter 3 Data and Signals.
CMP206 – Introduction to Data Communication & Networks Lecture 2 – Signals.
GBRA 5 WEEK TECHNICIAN LICENSE CLASS - WEEK 2 OPERATING MODES Ham’sAreRadioactive!
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
EE302 Lesson 21: Transmission of Binary Data in Communication Systems
DIGITAL COMMUNICATIONS.  The modern world is dependent on digital communications.  Radio, television and telephone systems were essentially analog in.
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
Outline Transmitters (Chapters 3 and 4, Source Coding and Modulation) (week 1 and 2) Receivers (Chapter 5) (week 3 and 4) Received Signal Synchronization.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Digital Data Transmission ECE 457 Spring Information Representation Communication systems convert information into a form suitable for transmission.
Computer Networks Chapter 3: Digital transmissions fundamentals Part 1.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
A Software Defined Radio Implementation for Voice Transmission over Wireless Ad-hoc Networks Jason Tran SURF-IT 2009 Fellow Mentors: Dr. Homayoun Yousefi’zadeh.
Chapter 2 Fundamentals of Data and Signals
Chapter 2: Fundamentals of Data and Signals. 2 Objectives After reading this chapter, you should be able to: Distinguish between data and signals, and.
3.1 Chapter 3 Data and Signals Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Sep 08, 2005CS477: Analog and Digital Communications1 Example Systems, Signals Analog and Digital Communications Autumn
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
1 Chapter 2 Fundamentals of Data and Signals Data Communications and Computer Networks: A Business User’s Approach.
IT-101 Section 001 Lecture #15 Introduction to Information Technology.
Data Communication and Networking 332 Hardware Components of Data Communication.
1 Summary of SDR Analog radio systems are being replaced by digital radio systems for various radio applications. SDR technology aims to take advantage.
DIGITAL VOICE NETWORKS ECE 421E Tuesday, October 02, 2012.
Data Transmission The basics of media, signals, bits, carries, and modems (Part II)
Review: The application layer. –Network Applications see the network as the abstract provided by the transport layer: Logical full mesh among network end-points.
Data Communications & Computer Networks, Second Edition1 Chapter 2 Fundamentals of Data and Signals.
PSK31 Digital Mode.
Wireless PHY: Modulation and Demodulation Y. Richard Yang 09/6/2012.
Digital Communications
Amateur Radio Digital Modes
Radio Signals Modulation Defined The purpose of radio communications is to transfer information from one point to another. The information to be sent.
When to Code WHEN NOT TO CODE James D. (jj) Johnston Chief Scientist DTS, Inc.
1 Chapter 2 Fundamentals of Data and Signals Data Communications and Computer Networks: A Business User’s Approach.
Technician License Course Chapter 2 Radio and Electronics Fundamentals
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
TI Cellular Mobile Communication Systems Lecture 4 Engr. Shahryar Saleem Assistant Professor Department of Telecom Engineering University of Engineering.
Technician License Course Chapter 2 Lesson Plan Module 3 – Modulation and Bandwidth.
Write-Log Click her to open this box. Set the frequency and mode
Encoding How is information represented?. Way of looking at techniques Data Medium Digital Analog Digital Analog NRZ Manchester Differential Manchester.
Chapter Two Fundamentals of Data and Signals Data Communications and Computer Networks: A Business User's Approach Eighth Edition.
Signals Definition of Terms Modes – ways of conveying information. Modulation – the process of impressing or overlaying information to be conveyed onto.
DIGITAL COMMUNICATION. Introduction In a data communication system, the output of the data source is transmitted from one point to another. The rate of.
Radio Equipment. Review: On the Transmitter Side The purpose of radio communications is to transfer information from one point to another. The information.
DATA AND COMPUTER COMMUNICATIONS Eighth Edition by William Stallings Lecture slides by Lawrie Brown Chapter 9 – Spread Spectrum.
Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.
Codec 2 ● open source speech codec ● low bit rate (2400 bit/s down to 1400 bit/s) ● applications include digital speech for HF and VHF radio ● fills gap.
Codec 2 ● open source speech codec ● low bit rate (2400 bit/s and below) ● applications include digital speech for HF and VHF radio ● fills gap in open.
Motivation ● The (Ham) world needs an open source, patent free speech codec at bit rates of less than 5000 bit/s ● I know how to build one!
Open Source Digital Radio David Rowe. Topics ● Digital Radio and VOIP ● Applications ● Open Source Digital Radio – who cares? ● Codec 2 explained ● Digital.
IT-101 Section 001 Lecture #15 Introduction to Information Technology.
Codec 2 open source speech codec
Codec 2 at 450 bit/s Codec 2 Digital Voice over High Frequency (HF) Radio The Mission Why HF is Hard How Analog Does it Demo.
Virginia DX Century Club
Chapter Two Fundamentals of Data and Signals
디지털통신 Bandpass Modulation 1 임 민 중 동국대학교 정보통신공학과.
Vocoders.
Digital Communications
Pulse Code Modulation (PCM)
Overview Communication is the transfer of information from one place to another. This should be done - as efficiently as possible - with as much fidelity/reliability.
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Introduction King Saud University
Govt. Polytechnic Dhangar(Fatehabad)
Introduction 1st semester King Saud University
Presentation transcript:

Digital Voice Communication on HF Ford Amateur Radio League March 12, 2015 David Treharne, N8HKU

Overview J2E emissions (part of telephony). Need to have protocols that work, 2 or more, and not considered encryption or a cypher. Amateurs have working on it since 2000, or even before that A lot of proprietary algorithms that do not work together and are not open for experimentation Can Codec 2 work? Where will it be useful? (Rag chewing, emergency communications, contests?)

Why digital? Analog has done quite well for SSB and FM. The commercial world uses a lot of digital signals for both voice and data. Digital can improve signal to noise ratio by eliminating the channel noise. The Challenge: How to get a digital compression code down below our 2,700 Hz SSB bandwidth

Conventional compression Normal Digital sampling Need 2x the data minimum when a signal is digitized 3000 Hz of voice needs 6,000 samples per second, more reasonably 8,000 samples per second. Compression algorithms: Use patterns to reduce the size Use lossy compression techniques. If you lose some of the compressed data, then you lose the complete signal.

From: David Rowe Presentations on Codec2 The following slides were from two Codec2 presentations by David Rowe, VK5DGR. 2010 codec2_tapr_2010_v0.2 2012 Ica_2012_codec2

Codec 2 Make a digital encoding that models speech, not just compress it. Similar to commercial versions, but made simpler, and designed for voice work only. open source speech codec low bit rate (2400 bit/s down to 1400 bit/s) applications include digital speech for HF and VHF radio fills gap in open source speech codecs beneath 5000 bit/s work in progress samples & source: rowetel.com/codec2.html

This is Not a DSP talk Pitch Estimation Linear Prediction Line Spectrum Pairs Voicing Estimation Vector Quantisation Source-Filter Model of Speech production Inverse Discrete Fourier Transform Overlap-add Synthesis

Codec2 @ 1400 bit/s The benefits of a compressed voice signal: Send 45 phone calls in standard 64 kbit/s phone channel 175 bytes/s 30 second voice mail in 5250 bytes 30 minute pod cast in 308 Kbytes (Normal podcast: 20,000 Kbytes)

Main Application – Voice over Digital Radio RF spectrum is extremely limited, noise, bit errors Traditional analog speech systems (FM, SSB) are really efficient in power and bandwidth use But there is interest in using digital techniques .. if the right codec is available Compressed speech requires less bit/s over channel This means less bandwidth Less transmitter power, saving battery and improving speech quality

Power Efficiency: Tighter Compression = Better Speech 1 Watt 2 at 0.5 Watt Noise 1400 bit/s 2800 bit/s

Digital Voice Radio System codec2 enc FEC enc mod mic HF/VHF radio D/A codec2 dec FEC dec demod spkr

Voice is not like data If you get a single error in a data packet the entire packet is useless If you get a few errors in a voice packet it probably sounds OK (10% errors are OK) If you discard or lose a voice packet every now and again it's probably OK (unlike data, where all data must be perfect) In speech packets some bits are more important than others (protect the most important data only) these factors can be used to build a better voice system (less power, less spectrum, more robust)

Codec 2 Author - David Rowe Ham Radio operator, VK5DGR, first licensed in 1981 at age 13 (first computer in 1982) 20 years experience in speech coding Built some of the first real time speech codecs in the late 1980's on early DSP chips Now work full time on open software/open hardware for developing world communications (Wants efficient communications in parts of Africa not covered by phones or even by cell phone towers. Use a type of Mesh network to transmit efficient voice over long distances with little power or equipment)

Proprietary Codecs come in hardware or licensed software form difficult to distribute they cannot be modified understanding how they work is discouraged modification is actually illegal under the license D-Star uses the AMBE coding system. We cannot modify it, we can just purchase and use the hardware chip that performs this function.

Speech Coding Take speech samples (e.g. 16 bit samples at 8 kHz sampling rate) Compress to 1400 to 2400 bit/s What can we throw away? Retain intelligible speech Retain natural speech Use a model of speech, send model parameters, more efficient than coding waveform (Not just using compression, but looking at how human voice works. (These are not good for music, noises, etc.)

Model Parameter example of a model parameter is pitch for humans in the range 50 to 500 Hz (100Hz for males, 500 Hz for children) can be accurately represented with 7 bits updated every 20 ms so 7/0.02 = 350 bit/s to represent pitch

Sinusoidal Speech Coding Amplitude (16 bit samples) (female speaker) Notice a lot of repetition of the signals, this gives us a pitch signal for this 40mS sample) Pitch Period 35 samples or 4.4ms at 8kHz sample rate (230 Hz) 40ms of female speech Time (samples)

Sinusoidal Speech Coding Pitch 230Hz or 4.3ms Amplitude (dB) See how the speech spectrum is made up of peaks spaced by about 230Hz? Well 240Hz happens to be the pitch of the speech at this instant in time. Each peak can be thought of as a sine wave. A sinusoidal codec models the speech as set of sine waves, each with it’s own frequency, and phase, and amplitude. So instead of sending the speech waveform like a regular telephone, a sinusoidal encoder sends the sinusoid parameters over the channel to the decoder which then reconstructs the speech. The parameters change over time so we update them at regular intervals, like every 20ms. It turns out that if you do all of this right the speech at the decoder sounds pretty close to the original. Blue – spectrum of previous time domain segment Red – amplitude estimates of each sine wave Harmonics of 230Hz Frequency (Hz)

Sinusoidal Speech Model Amplitude 1 Phase 1 Frequency 1 Amplitude 2 Phase 2 Frequency 2 In practice speech consists of voiced speech (like vowels) and unvoiced speech (like consonants) For voiced speech the frequencies are harmonics of the pitch frequency However if the number of harmonics L is high, noise can be accurately generated using harmonics with random phases So instead of sending the speech waveform, we send the pitch frequency, and the harmonic amplitudes and phases These parameters are time varying, so we update them every 10-20ms Amplitude L Phase L Frequency L

1 Sine Wave

Male speaker, 80 Hz nominal pitch) 3 sine waves

10

25

50 sine wave harmonics, just changing phase and amplitude of each, produces a very close signal.

2550 bit/s quantised model parameters Encoder Block Diagram 7 bits of data for the pitch or fundamental frequency of the speaker at that point in time. Pitch est Pitch Quant 2550 bit/s quantised model parameters 16 bit, 8kHz samples FFT MBE Voicing est 2 bits (sampled 10mS intervals) of voicing of a vowel or a consonant) LPC Analysis LPC to LSP LSP Quant 36 bits (The model parameters of the speech) MBE voicing algorithm looks at the first 1kHz of the voice spectrum. Compares it to an ideal voiced spectrum and makes a voiced/unvoiced decision. LPC tends to make errors near 0Hz. A single bit is used to correct low frequency LPC errors. Makes a big different to speech quality, especially for males. Although necessary for high quality speech, phase information is discarded. We regenerate the phase information at the decoder using a rule based approach. Line Spectrum Pairs (LSPs) are a convenient way to encode LPCs for transmission over a channel. LPC Correction 1 bit to help with the low pitch frequencies Energy Quant 5 bits of energy signal (how loud)

Bit Allocation From one of the first versions of the Codec 51 bits per 20ms frame, or 2550 bit/s Parameter Bits/frame Spectral magnitudes (LSPs) 36 Low frequency LPC correction 1 Energy 5 Voicing (updated each 10ms) 2 Pitch 7 Total 51

Decoder Block Diagram LSP to LPC FFT Recover Harm Amps LSPs Energy LPC Correction Phase Synthesis Post Filter Voicing Phases are synthesised using a rule based approach. More work needed here to improve speech quality. Post filter helps with background noise. This codec can distort background noise and make it sound unpleasant. The post filter tracks average background noise level. The phase of any harmonics at less than this level is randomised so they sound more like noise. The post filter still needs some work. It is an alternative to mixed voicing models used in MBE and MELP codecs that uses zero bits to handle background noise. An inverse FFT is used for synthesis. We create a synthetic speech spectrum then IFFT to create the time domain signal Overlap add is a way to overlap adjacent IFFTs to create a continuous time domain signal. Inverse FFT Overlap Add 16 bit, 8kHz samples

FreeDV: The Digital voice for HF Speech is compressed down to 1600 bit/s then modulated onto a 1.25 kHz wide 16QPSK signal which is sent to the Mic input of a SSB radio. On receive, the signal is received by the SSB radio, then demodulated and decoded by FreeDV. Communications should be readable down to 2 dB S/N, and long-distance contacts are reported using 1-2 watts power. FreeDV was built by an international team of Radio Amateurs working together on coding, design, user interface and testing. FreeDV is open source software, released under the GNU Public License version 2.1. The FDMDV modem and Codec 2 Speech codec used in FreeDV are also open source.

Running Codec2 May 21, 2014: SmartMic announced! An embedded hardware product that allows you to run FreeDV without a PC. Plug SmartMic into your SSB or FM radio, and you now have Digital Voice (DV). (allows function with only 1 soundcard) $195.00

Issues over HF Fading: The signal is lost, and data is lost. Can handle some fading by use of the Forward Error Correction (FEC) bits. Ham Radio HF packet radio communication uses FEC. Codec2 also tries to decode signals even after fading. Most modems are designed to throw out data when signals are missing, since they are designed for data, where everything must be perfect. Codec2 is designed to try to decode even if there are errors. This allows us to still hear the rest of the speech even after a dropout. Since speech has a lot of redundancy, this is not an issue. (We do this all the time on HF) Group delay: When signals bounce off of the ionesphere at different heights, causing some early signals to arrive later, mixing with the later signals. 5mS is common delay in HF. Need a Codec that sends one packet of bits for longer than 5mS to make sure that it does not get affected by the delay. (Ham Radio RTTY at 45 baud sends the same signal for over 5mS before switching to a new signal to handle this delay. That is why that speed works so well for HF communications!!) Lining up the signal right in the passband. A problem for us already in HF SSB. Codec 2 starts a transmission by sending a sequence of known tones. The receiver knows the frequencies of these tones and lines up the decoder to match them.

What does this sound like? http://www.rowetel.com/blog/?page_id=452 Male vs Female (pitch differences, Codec2 has trouble with low frequencies, so it adds in an extra bit) background noise and speech codecs (if the noise does not sound like a voice with the harmonics, then it does not bother the coding very much)

Conclusions Digital HF will continue in experimentation phase into the future Use more bits to make it sound better, or to add in more error correction? Find a way to do this with one sound card? (I am not sure why it requires two cards.) Best used when a communication link has been established, then switch to digital. When it does work, it eliminates all of the QRM and QRN on SSB. It could be the ragchew method of the future! Maybe, just maybe, even for contests!

Bibliography freeDV: http://freedv.org/tiki-index.php Codec 2: http://www.rowetel.com/blog/?page_id=452