Masters Presentation at Griffith University Master of Computer and Information Engineering Magnus Nilsson 2000 - 2001.

Slides:

Advertisements

Similar presentations

David Hansen and James Michelussi

Advertisements

© 2003 Xilinx, Inc. All Rights Reserved Course Wrap Up DSP Design Flow.

SirenDetect Alerting Drivers about Emergency Vehicles Jennifer Michelstein Department of Electrical Engineering Adviser: Professor Peter Kindlmann May.

Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),

Digital Kommunikationselektronik TNE027 Lecture 5 1 Fourier Transforms Discrete Fourier Transform (DFT) Algorithms Fast Fourier Transform (FFT) Algorithms.

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Use of Frequency Domain Telecommunication Channel |A| f fcfc Frequency.

Dual-domain Hierarchical Classification of Phonetic Time Series Hossein Hamooni, Abdullah Mueen University of New Mexico Department of Computer Science.

A Matlab Playground for JPEG Andy Pekarske Nikolay Kolev.

F 鍾承道 Acoustic Features for Speech Recognition: From Mel-Frequency Cepstrum Coefficients (MFCC) to BottleNeck Features(BNF)

1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.

Characterization Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik

Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Parallel Programming in C with MPI and OpenMP Michael J. Quinn.

Department of Electrical and Computer Engineering Texas A&M University College Station, TX Abstract 4-Level Elevator Controller Lessons Learned.

Fourier theory made easy (?). 5*sin (2  4t) Amplitude = 5 Frequency = 4 Hz seconds A sine wave.

Introduction to Computer Programming CSC 1401: Introduction to Programming with Java Lecture 2 Wanda M. Kunkle.

1 Music Classification Using SVM Ming-jen Wang Chia-Jiu Wang.

Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.

A PRESENTATION BY SHAMALEE DESHPANDE

Field Programmable Gate Array (FPGA) Layout An FPGA consists of a large array of Configurable Logic Blocks (CLBs) - typically 1,000 to 8,000 CLBs per chip.

1 DSP Implementation on FPGA Ahmed Elhossini ENGG*6090 : Reconfigurable Computing Systems Winter 2006.

Sarah Middleton Supervised by: Anton van Wyk, Jacques Cilliers, Pascale Jardin and Florence Nadal 3 December 2010.

© 2011 Xilinx, Inc. All Rights Reserved Intro to System Generator This material exempt per Department of Commerce license exception TSU.

Representing Acoustic Information

Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling V. Karjigi , P. Rao Dept. of Electrical Engineering,

Student : Andrey Kuyel Supervised by Mony Orbach Spring 2011 Final Presentation High speed digital systems laboratory High-Throughput FFT Technion - Israel.

Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.

VBS Documentation and Implementation The full standard initiative is located at Quick description Standard manual.

Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.

CS 6068 Parallel Computing Fall 2013 Lecture 10 – Nov 18 The Parallel FFT Prof. Fred Office Hours: MWF.

International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.

Implementing a Speech Recognition System on a GPU using CUDA

Voice Recognition All Talk No Walk.

Jacob Zurasky ECE5526 – Spring 2011

Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.

Radix-2 2 Based Low Power Reconfigurable FFT Processor Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Gin-Der Wu and Yi-Ming Liu Department.

Speaker Recognition by Habib ur Rehman Abdul Basit CENTER FOR ADVANCED STUDIES IN ENGINERING Digital Signal Processing ( Term Project )

Speech Recognition Feature Extraction. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.

ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Under-Graduate Project Case Study: Single-path Delay Feedback FFT Speaker: Yu-Min.

Hardware Accelerator for Hot-word Recognition Gautam Das Govardan Jonathan Mathews Wasim Shaikh Mojes Koli.

Designing a Voice Activated Compartmentalized Safe with Speech Processing using Matlab Preliminary Design Review Amy Anderson Ernest Bryant Mike Joyner.

Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.

Copyright © 2004, Dillon Engineering Inc. All Rights Reserved. An Efficient Architecture for Ultra Long FFTs in FPGAs and ASICs  Architecture optimized.

BY KALP SHAH Sentence Recognizer. Sphinx4 Sphinx4 is the best and versatile recognition system. Sphinx4 is a speech recognition system which is written.

Designing a Voice Activated Compartmentalized Safe with Speech Processing using Matlab Final Presentation Amy Anderson Ernest Bryant Mike Joyner Collins.

Speaker: Darcy Tsai Advisor: Prof. An-Yeu Wu Date: 2013/10/31

Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.

Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.

A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.

1 Electrical and Computer Engineering Binghamton University, State University of New York Electrical and Computer Engineering Binghamton University, State.

Speaker Verification System Middle Term Presentation Performed by: Barak Benita & Daniel Adler Instructor: Erez Sabag.

FFTC: Fastest Fourier Transform on the IBM Cell Broadband Engine David A. Bader, Virat Agarwal.

Introduction to Programming 1 1 2Introduction to Java.

Sound Controlled Smoke Detector Group 67 Meng Gao, Yihao Zhang, Xinrui Zhu 1.

 presented by- ARPIT GARG ISHU MISHRA KAJAL SINGHAL B.TECH(ECE) 3RD YEAR.

CORDIC Based 64-Point Radix-2 FFT Processor

BIOMETRICS VOICE RECOGNITION. Meaning Bios : LifeMetron : Measure Bios : LifeMetron : Measure Biometrics are used to identify the input sample when compared.

The content of lecture This lecture will cover: Fourier Transform

Spectral and Temporal Modulation Features for Phonetic Recognition Stephen A. Zahorian, Hongbing Hu, Zhengqing Chen, Jiang Wu Department of Electrical.

ARTIFICIAL NEURAL NETWORKS

Artificial Intelligence for Speech Recognition

Presentation on Artificial Neural Network Based Pathological Voice Classification Using MFCC Features Presenter: Subash Chandra Pakhrin 072MSI616 MSC in.

Embedded Systems Design

Introduction CSE 1310 – Introduction to Computers and Programming

Isolated word, speaker independent speech recognition

Ala’a Spaih Abeer Abu-Hantash Directed by Dr.Allam Mousa

A maximum likelihood estimation and training on the fly approach

Speaker: Chris Chen Advisor: Prof. An-Yeu Wu Date: 2014/10/28

Measuring the Similarity of Rhythmic Patterns

Keyword Spotting Dynamic Time Warping

♪ Embedded System Design: Synthesizing Music Using Programmable Logic

Presentation transcript:

Masters Presentation at Griffith University Master of Computer and Information Engineering Magnus Nilsson

Masters Presentation FFT, Realization and Implementation in FPGA Speaker Verification in JAVA Demonstration of FFT and JAVA application

FFT, Realization and Implementation in FPGA Technical Function History of Fourier Transform Discrete Fourier Transform Fast Fourier Transform The Radix-2 Algorithm The Radix-4 Algorithm FPGA Complex FFT Bit Length and Implementation Results Conclusion

Technical Function Background Ericsson Microwave System XILINX FPGA 1024-point FFT Task Study, Implement and evaluate FFT in FPGA Technical Function Collect data, execute a FFT and output the data. The implementation shall be optimized on execution time, area and cost.

History of Fourier Transform Jean-Baptiste-Joseph Fourier Problem of flow of heat in solid bodies The analytical theory of heat, 1815 Universal problem solving technique Radar Speaker Verification Medical Science

Discrete Fourier Transform  N 2 complex multiplications  (N)(N-1) complex additions  ~ 2N 2 additions and multiplications

Fast Fourier Transform Early 1960’s John W. Tukey and James W Cooley An algorithm for the machine calculation of complex Fourier series Runge and König Radix - 2 algorithm

The Radix-2 Algorithm ~ 2N 2 additions and multiplications for DFT, Vs. N 2 /2 for FFT

The Radix-4 Algorithm Base 4 i.e. N = 4 x More complex but less computation power utilized The Radix-4 algorithm consists of v steps (log(N)/log(4)) Each step involves N/4 number of butterflies 3*v*N/4 = (3N/8)log2N number of complex multiplications (3N/2)log2N complex additions Radix-2 Vs. Radix-4: 25% reduction of complex multiplications Complex additions will increase by 50%

Radix-4 Dragonfly

FPGA Field Programmable Gate Array For fast time to market hardware implementation Xilinx Virtex-E PCB with a Xilinx Virtex-E 1000

Complex FFT Specification Minimum: 16 complex samples Maximum: 1024 complex samples Typical: 16 or 64 Number of bits for the input signal Minimum: 10 bits Maximum: 16 bits Typical: 12

Construction configuration

Bit Length Implementing in hardware Multiplications N bits x N bits = 2 N bits Minimize the phase and amplitude error Realizable Different types of bit lengths

Implementation Ease, Eale, Modelsim and Symplify Virtex-E F c = 55 MHz, Computation phase = 640 ns my FFT Xilinx Virtex LogiCore 110 MHz, Computation phase = 1.92 ms LogiCore = Serial My FFT = Parallel

Result Tested using a Logical Analyser, Hewlett Pacard HP16555D (2.0 M Samples, 110/500 MHz) and a Pattern Generator

Result

Conclusion As the Radix-4 FFT algorithm utilizes less complex multipliers than the Radix-2 FFT algorithm, the Radix-4 algorithm is preferable for hardware implementation. A parallel programming approach seems to be the model when a real time system with high sampling rate is desired. To reach an acceptable level of phase error, it is desirable to use 16 bits precision on the input signal and the phase factor

Speaker Verification in JAVA Technical Function JAVA Speaker Verification and Speaker Recognition VQ Speaker Verification Hardware Parallel Port Extension and access Results Conclusion

Technical Function Background It would be interesting to develop a Speaker Verification system/software in JAVA, since the JAVA language is said to be platform independent and would be interesting as a research language. Task To study, implement and evaluate a VQ (Vector Quantization) Speaker Verification system in JAVA, using MFCC’s (Mel Frequency Cepstral Coefficients). Technical Function A graphical software implementation which shall record speech from a person through a microphone, verify the person as true speaker or false speaker.

JAVA Sun Microsystems by James Gosling & Co Address the problem of building software for network consumer devices Survive transport across networks Operate on any client Safe to run Capability to work on a wide range of platforms and CPU's Virtual Machine Applet and Application Platform portability

Speaker Verification and Speaker Recognition

Accessing confidential information areas Access to remote computers Voice dialing Banking by telephone Telephone shopping Database access services Information services Voice mail PIN code for your ATM

VQ Speaker Verification Speech Feature Extraction

Mel Frequency Cepstral Coefficients

Mel Frequency Wrapping

Cepstral Coefficients Power of the triangular filter = summarized Log calculated Convert them to time domain using the Discrete Cosine Transform (DCT) Result is called the mel frequency cepstral coefficients (MFCC).

Feature Matching Dynamic Time Warping (DTW) Hidden Markov Modeling (HMM) Gaussian Mixture Model (GMM) Vector Quantization (VQ) High Accuracy Interesting to implement

LBG algorithm

MFCC’s and Codebook

Verification Threshold Cohort Speakers Ratio

Graphical Implementation of application in JAVA

Results

Conclusion Since Java is said to be platform independent, some experiments has been conducted that has showed this is not always the case. As soon as you would like to access the hardware through Java you will face problems that will make your software to become platform dependent. It is interesting to note that all the algorithm developed for the speaker verification system is platform independent, except for those parts accessing the sound card of the users computer, and can easily be executed under both Microsoft Windows and Linux. Testing, simulation and verification of the speaker verification program show a total error rate of four percent.