Research & Technology Experiments on different feature sets; comparison with DC baseline system RESPITE workshop Jan.25-27 2001 Martigny Joan Mari Hilario.

Slides:



Advertisements
Similar presentations
Advances in WP1 Trento Meeting January
Advertisements

Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,
In collaboration with Hualin Gao, Richard Duncan, Julie A. Baca, Joseph Picone Human and Systems Engineering Center of Advanced Vehicular System Mississippi.
Topics Recognition results on Aurora noisy speech databaseRecognition results on Aurora noisy speech database Proposal of robust formant.
Pitch Prediction From MFCC Vectors for Speech Reconstruction Xu shao and Ben Milner School of Computing Sciences, University of East Anglia, UK Presented.
HIWIRE MEETING Paris, February 11, 2005 JOSÉ C. SEGURA LUNA GSTC UGR.
AN INVESTIGATION OF DEEP NEURAL NETWORKS FOR NOISE ROBUST SPEECH RECOGNITION Michael L. Seltzer, Dong Yu Yongqiang Wang ICASSP 2013 Presenter : 張庭豪.
Advances in WP1 Turin Meeting – 9-10 March
HIWIRE MEETING Nancy, July 6-7, 2006 José C. Segura, Ángel de la Torre.
Communications & Multimedia Signal Processing Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC Esfandiar Zavarehei Department.
Speaker Adaptation in Sphinx 3.x and CALO David Huggins-Daines
MODULATION SPECTRUM EQUALIZATION FOR ROBUST SPEECH RECOGNITION Source: Automatic Speech Recognition & Understanding, ASRU. IEEE Workshop on Author.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
Authors: Anastasis Kounoudes, Anixi Antonakoudi, Vasilis Kekatos
Audio classification Discriminating speech, music and environmental audio Rajas A. Sambhare ECE 539.
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
Bringing Inverse Modeling to the Scientific Community Hydrologic Data and the Method of Anchored Distributions (MAD) Matthew Over 1, Daniel P. Ames 2,
Handwritten Signatures Authentication using ANNs Committee Machines M.Heinen, F. Osório and P. Engel October Handwritten Signatures Authentication.
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
Modeling speech signals and recognizing a speaker.
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
Reporter: Shih-Hsiang( 士翔 ). Introduction Speech signal carries information from many sources –Not all information is relevant or important for speech.
Zhiphone: A Mobile Phone that Learns Context and User Preferences Jing Michelle Liu Raphael Hoffmann CSE567 class project, AUT/05.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Supervisor: Dr. Eddie Jones Co-supervisor: Dr Martin Glavin Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification.
LOG-ENERGY DYNAMIC RANGE NORMALIZATON FOR ROBUST SPEECH RECOGNITION Weizhong Zhu and Douglas O’Shaughnessy INRS-EMT, University of Quebec Montreal, Quebec,
Dan Rosenbaum Nir Muchtar Yoav Yosipovich Faculty member : Prof. Daniel LehmannIndustry Representative : Music Genome.
Documentation: What we might do in an ideal world Ian McCrea.
Preprocessing for Data Mining Vikram Pudi IIIT Hyderabad.
Overview ► Recall ► What are sound features? ► Feature detection and extraction ► Features in Sphinx III.
A Baseline System for Speaker Recognition C. Mokbel, H. Greige, R. Zantout, H. Abi Akl A. Ghaoui, J. Chalhoub, R. Bayeh University Of Balamand - ELISA.
Look who’s talking? Project 3.1 Yannick Thimister Han van Venrooij Bob Verlinden Project DKE Maastricht University.
Sea Ice Mapping Systems Archive Browser Interface Distribution IngestProduction Ice Analyst Application Database Henrik Steen Anderson DMI Virtual System.
Learning Long-Term Temporal Feature in LVCSR Using Neural Networks Barry Chen, Qifeng Zhu, Nelson Morgan International Computer Science Institute (ICSI),
Quantile Based Histogram Equalization for Noise Robust Speech Recognition von Diplom-Physiker Florian Erich Hilger aus Bonn - Bad Godesberg Berichter:
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
Speaker Identification by Combining MFCC and Phase Information Longbiao Wang (Nagaoka University of Technologyh, Japan) Seiichi Nakagawa (Toyohashi University.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
DNA Microarray Data Analysis using Artificial Neural Network Models. by Venkatanand Venkatachalapathy (‘Venkat’) ECE/ CS/ ME 539 Course Project.
Combining Speech Attributes for Speech Recognition Jeremy Morris November 9, 2006.
Performance Comparison of Speaker and Emotion Recognition
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
Research & Technology Progress in the framework of the RESPITE project at DaimlerChrysler Research & Technology Dr-Ing. Fritz Class and Joan Marí Sheffield,
Copyright  2004 limsoon wong Using WEKA for Classification (without feature selection)
1 Electrical and Computer Engineering Binghamton University, State University of New York Electrical and Computer Engineering Binghamton University, State.
1 Voicing Features Horacio Franco, Martin Graciarena Andreas Stolcke, Dimitra Vergyri, Jing Zheng STAR Lab. SRI International.
January 2001RESPITE workshop - Martigny Multiband With Contaminated Training Data Results on AURORA 2 TCTS Faculté Polytechnique de Mons Belgium.
1 Experiments with Detector- based Conditional Random Fields in Phonetic Recogntion Jeremy Morris 06/01/2007.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
The Speech Solution BABEL DEMONSTRATOR RESPITE PROJECT.
There are many leading online sources that are providing reliable encryption solution for your online as well as offline file security through smart software.
1 LOW-RESOURCE NOISE-ROBUST FEATURE POST-PROCESSING ON AURORA 2.0 Chia-Ping Chen, Jeff Bilmes and Katrin Kirchhoff SSLI Lab Department of Electrical Engineering.
The three basis vectors are mutually perpendicular.
Behavioral Interactive and Introspective Objects
Spectral and Temporal Modulation Features for Phonetic Recognition Stephen A. Zahorian, Hongbing Hu, Zhengqing Chen, Jiang Wu Department of Electrical.
SQL Server 2000 and Access 2000 limits
Hierarchical Multi-Stream Posterior Based Speech Recognition System
Data Mining: Concepts and Techniques Course Outline
Automatic Speech Recognition: Conditional Random Fields for ASR
Virtual Ice Charting System
AN ANALYSIS OF TWO COMMON REFERENCE POINTS FOR EEGS
Ala’a Spaih Abeer Abu-Hantash Directed by Dr.Allam Mousa
ورود اطلاعات بصورت غيربرخط
Speaker Identification:
Speech / Non-speech Detection
Learning Long-Term Temporal Features
Presented by Chen-Wei Liu
3 EGR Identification Service
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Research & Technology Experiments on different feature sets; comparison with DC baseline system RESPITE workshop Jan Martigny Joan Mari Hilario Fritz Class Experiments on AURORA 2000 database: Features of DC baseline system: training on N1... N4 sets (multi-condition training) NSPS (nonlinear spectral subtraction) VTN (vocaltract length normalization) MFCC features with cepstral mean normalization „cepstral“ interface: preprocessing, feature extraction „cepstral“ data files SCHMM recognizer „cepstral“ interface

Research & Technology RESPITE workshop Jan Martigny Joan Mari Hilario Fritz Class Experiments on different feature sets; comparison with DC baseline system % WER Comparison DC-baseline / ICSI-Tandem features

Research & Technology RESPITE workshop Jan Martigny Joan Mari Hilario Fritz Class Experiments on different feature sets; comparison with DC baseline system Comparison DC-baseline / ICSI-Tandem features

Research & Technology RESPITE workshop Jan Martigny Joan Mari Hilario Fritz Class Experiments on different feature sets; comparison with DC baseline system Comparison DC-baseline / FPM‘s FPM‘s: word models trained on clean speech DC: word models multi-condition training % WER

Research & Technology RESPITE workshop Jan Martigny Joan Mari Hilario Fritz Class Experiments on different feature sets; comparison with DC baseline system Comparison DC-baseline / FPM‘s FPM‘s: word models trained on clean speech DC: word models multi-condition training

Research & Technology RESPITE workshop Jan Martigny Fritz Class Discussion about demonstrators RESPITE demonstrators Statements: our demonstrator strategy: „show project achivements (possibility of online application of the new techniques), not commercially relevant“!! a demonstrator makes sence only, if there are better techniques than in the baseline system ==> if we have really found such techniques (compared to the baseline system in offline simulations), we can build a demonstrator a full integration of the new techniques means a redesign of the complete system ==> not possible within RESPITE ==> combination of different modules (processes) via interfaces (files) or using DLL‘s under windows a demonstration could be done e.g. in a car using a laptop

Research & Technology RESPITE workshop Jan Martigny Fritz Class Discussion about demonstrators possible demonstration system: TANDEM features with DC system Feature calculatio n Neural net classifier file with „cep“- features DC-system PLP MSG TANDEM feature vectors process 1process 2 architecture 1 Feature calculation (ICSI-software, TANDEM‘s) DC-system TANDEM feature vectors architecture 2 1 process under Windows NT

Research & Technology RESPITE workshop Jan Martigny Fritz Class Discussion about demonstrators RESPITE demonstrators Questions: sources (ICSI) for TANDEM features ? missing data demonstrator ? What are „potential users with respect to the demonstrators“? Is anywhere a online system available ? Portable? Under which system?