Download presentation
Presentation is loading. Please wait.
Published byMoris Bradford Modified over 8 years ago
2
Evolutionary Fuzzy Volume Tuner for Cellular Phones
3
2 Contents l Introduction l Evolutionary Fuzzy Volume Tuner(EFVT) –Obtaining Noise Levels –Fuzzy Noise Classifier –Personalization of FVT –Evolutionary Algorithm l Simulation using FuzzyControl++ l Quality of Speech l Summary and Conclusions
4
3 Introduction: Quality of Speech Speech Quality (QoS) Sound quality and Naturalness Intelligibility Conversational Effort Background Noise Environmental Conditions Listening and Talking Effort Speech Characteristics Network Conditions
5
4 Introduction: PsychoAcoustics l Ear has very complex hearing mechanism l Loudness Perception äFrequency dependent( Ear is most sensitive to sounds of 4Khz) äDuration Dependent ( If the duration of a sound is smaller than approx 200ms, it will be less loud than a sound of same intensity with a duration greater than 200ms) l Masking äWhen two tones are presented simultaneously, the weaker tone may not be heard äNoise signals mask the Audio signals l Hearing Impairments
6
5 Introduction: Volume Tuner l Current day Mobile Phones have manual volume Tuner settings l Background noise Affects the Quality of Speech( QoS) äUsers have to manually change the acoustic Volume levels of the mobile handset äUsers tend to bring the handset very close to their ears l Background noise classes { cars, busses, trains, factory …} l Evolutionary Fuzzy Volume Tuner adjusts the volume setting of the Mobile handset based on äNoise Level derived from the Voice Activity Detector äNoise Class derived from the Fuzzy Background classification system äPersonalization based on the individual’s hearing requirements äEvolutionary algorithm tunes IO Scaling factors, membership functions and optimizes the fuzzy rule-base äWe call the Mobile Phone which uses the Evolutionary Fuzzy Volume Tuner as a Smart Cellular Phone (SCP)
7
6 Example Speech File
8
7 Speech File Analysis
9
8 Fuzzy Sets l Fuzzy sets were introduced by Lotfi Zadeh in 1965 l A Fuzzy set is any point in the cube l The elements of the fuzzy set have degree of membership m(x) from 0 to 1. l Fit vector represent a fuzzy set l eg A= ( 0.25, 0.75)= ( x1, x2) In this example, element x1 belongs to, or fits in, subset A to degree 0.25= l We define fuzzy-set intersection fitwise by pairwise minimum, union by pairwise maximum and complementation by order reversal:
10
9 Fuzzy Systems Block Diagram
11
10 Fuzzy System( Mamdani)
12
11 Fuzzy Volume Tuner
13
12 Volume Level Very Low High Very High Degree of Membership Volume 0 1 Zero
14
13 Background Noise Level l GSM VAD computes the noise levels during the noise- only periods. An adaptive noise-suppressor filter is used to filter the input signal frame. The co-efficients of this filter is computed during noise-only periods determined by special measures taken to identify noise- only frames. These include signal stationarity and periodicity measures. l Fuzzy set values for Noise level : { Very Low, Medium Low, Low, Zero, High, Medium High, Very High}.
15
14 Background Noise Class l Fuzzy Noise Classifier (FNC) classifies the background noise into 7 types i.e., Stationary ( Car, Train, Bus-Dump), Non-stationary (Street, Factory, Construction, Babble). l The volume change to be applied is based on the noise class. E.g., the volume increase may not be the same for Car noise and the factory noise. l The Fuzzy rule base contains IF/THEN rules such as: äIf noise level is High and Volume Level is Low and noise class is Car Then volume level Change is LP äIf noise level is Low and Volume Level is High and noise class is Train Then Volume level Change is MN
16
15 Fuzzy Noise Classifier (FNC) l Based on work by F. Beritelli et al., l Feature Extraction 15 parameters of speech input l FNC operates in 4 levels with 6 Fuzzy Systems trained to match the features l 7 classes of Background noise Street Feature Extraction Matching (Fuzzy System n.1) Fuzzy System n.2 Fuzzy System n.3 Fuzzy System n.4 Fuzzy System n.5 Fuzzy System n.6 Stationary Non-stationary Bus CarTrain Constr Babble Factory
17
16 Personalization of FVT The hearing loss is measured with an audiogram. An Audiogram of a person shows the amount of hearing loss in each of the frequency bands as shown in the fig. A person with hearing loss will perceive different frequencies at different levels.
18
17 Personalization of FVT The speech intelligibility and quality of hearing can vary and also are dependent on the background noise. l Hearing loss is measured using Audiogram l Translate Audiogram data to Fuzzy rule base for personalization of FVT l Fuzzy Volume Tuner uses this rule-base
19
18 Evolutionary Algorithm l The evolutionary algorithm for fuzzy volume tuner performs 3 functions.: ä Tunes the input-output(IO) scaling factors ä It tunes the membership functions –Optimizes the fuzzy rule-base. Evolutionary Algorithm Fuzzy Volume Tuner User/Trainer Input Mobile Environment Inputs Fuzzy Sets Fuzzy Rules Fitness Output
20
19 FuzzyControl++ l This is a tool which can be easily used for configuring and simulating a fuzzy system. l The inputs, outputs and IF/THEN block are easily configured. However, the membership functions are fixed and no choice is available. The rules can be easily edited. l There is also a provision for editing rules by matrix. The ranges for the linguistic terms can be easily defined. l It provides an impressive 3D graphics display to view the decision surface. l A rule activity window and a simulation window enable simulation studies of the fuzzy system. The choices for the waveform are restricted compared to ECANSE. However, this tool can generate code for target systems.
21
20 FuzzyControl++ Simulation
22
21 FuzzyControl++ Simulation
23
22 MIPS and Memory Estimation Based on the benchmark results: For Fuzzy Mobile Phone application ( 15-25 fuzzy rules, 2 inputs, 1 output, 4-6 labels per variable), we may need less than 2Kbytes for storing the code. The execution cycles may not exceed 15,000 cycles. This is for a conventional 8 bit microcontroller such as 68HC11. Processing Delay = 80ms Speech Frame ( 160 Samples) Speech Codec Fuzzy Noise Classifier VAD Fuzzy Volume Tuner 20ms dscdvad dfnc dFV T 80ms for 100MIPS processor
24
23 Quality of Speech (QoS) l Attributes of Speech Quality for Fuzzy Mobile Phone: äStress, Intelligibility, Pleasantness, Loudness l Overall Quality Evaluation by Mean Opinion Score(MOS): äITU-T Rec. P.800 Most recognized methodology for evaluating subjective quality of speech. –MOS is a five level scale {(1, bad), (2,Poor), (3, Fair), (4, Good), (5, Excellent)} –The listener’s task is to simply evaluate the tested speech with MOS scale. –The use of simple five level scale is easy and provides some instant explicit information. l ITU-T Rec. P.861 – Perceptual Speech Quality Measurement (PSQM) objective measurement of speech quality ( Tools such as Opera for PSQM)
25
24 Summary and Conclusion l Cellular phones with EFVT will have several benefits: l Improved QoS for stationary and non-stationary noise in mobile environments. As the fuzzy volume Tuner uses the information on background noise level and class to adjust the volume level. l Some classes of noise such as car noise fall into low-frequency noise. They do not affect the intelligibility of speech compared to noise classes such as factory noise. Hence the fuzzy volume Tuner has to be dependent on noise classes for effective volume adjustments. l The fuzzy volume Tuner is easily embedded in the mobile handset as it has very less memory and computational requirements. The computations are carried out by the microcontroller within the baseband chip. l The fuzzy volume Tuner can be personalized based on the audiogram for a hearing impaired person. l The Evolutionary algorithm tunes the scaling factors, membership functions and optimizes the fuzzy rule-base for improving and optimizing Cellphone performance.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.