#SummitNow Yes, I'm able to index audio files within Alfresco 2013 Fernando González @fegorama.

Slides:



Advertisements
Similar presentations
Visit the ccScan Website Scan, Import, and Automatically File documents to the Cloud SCAN, IMPORT, AND AUTOMATICALLY FILE DOCUMENTS TO SALESFORCE ® Introduction.
Advertisements

Samsung Smart TV is a web-based application running on an application engine installed on digital TVs connected to the Internet.
USA AREA CODES APPLICATION by Koffi Eddy Ihou May 6,2011 Florida Institute of Technology 1.
                      Digital Audio 1.
Using Multimedia on the Web Enhancing a Web Site with Sound, Video, and Applets.
INSTRUCTOR:Dr.Veton Kepuska STUDENT:Dileep Narayan.Koneru YES/NO RECOGNITION SYSTEM.
Sean Powers Florida Institute of Technology ECE 5525 Final: Dr. Veton Kepuska Date: 07 December 2010 Controlling your household appliances through conversation.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
SPEECH RECOGNITION Kunal Shalia and Dima Smirnov.
1 Component Description Multimodal Interface Carnegie Mellon University Prepared by: Michael Bett 3/26/99.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
ITCS 6010 Spoken Language Systems: Architecture. Elements of a Spoken Language System Endpointing Feature extraction Recognition Natural language understanding.
1 CS 502: Computing Methods for Digital Libraries Lecture 22 Repositories.
Macromedia Dreamweaver 4 Advanced Level Course. Add Rollovers Rollovers or mouseovers are possibly the most popular effects used in designing Web pages.
Microsoft Office Illustrated Inserting Illustrations, Objects, and Media Clips.
Outline of Presentation Introduction of digital video libraries Introduction of the CMU Informedia Project Informedia: user perspective Informedia:
CALO Decoder Progress Report for June Arthur (Decoder, Trainer, ICSI Training) Yitao (Live-mode Decoder) Ziad (ICSI Training) Carnegie Mellon University.
Tutorial 7 Working with Multimedia. XP Objectives Explore various multimedia applications on the Web Learn about sound file formats and properties Embed.
Free Sound Recorder By FreeAudioVideoSoft. Pricing & Installation Software is absolutely FREE With agreement to terms and conditions Installation Requirements:
Automatic Transcript Generation Helmer Strik A 2 RT Dept. of Language & Speech University of Nijmegen.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
Using an Ipad to teach English. Why use an Ipad Using an Ipad to teach is a good Idea as teachers can access more information in order to help get the.
Temple University Speech Recognition using Sphinx 4 (Ti Digits test) Jaykrishna shukla,Amir Harati,Mubin Amehed,& cara Santin Department of Electrical.
© Cheltenham Computer Training 2001 Macromedia Dreamweaver 4 - Slide No 1 Macromedia Dreamweaver 4 Advanced Level Course.
How Spread Works. Spread Spread stands for Speech and Phoneme Recognition as Educational Aid for the Deaf and Hearing Impaired Children It is a game used.
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
CapturaTalk4Android Demonstration Abi James
Tutorial 7 Working with Multimedia. XP Objectives Explore various multimedia applications on the Web Learn about sound file formats and properties Embed.
Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.
CMU Shpinx Speech Recognition Engine Reporter : Chun-Feng Liao NCCU Dept. of Computer Sceince Intelligent Media Lab.
1 CS 430 / INFO 430 Information Retrieval Lecture 23 Non-Textual Materials 2.
A brief overview of Speech Recognition and Spoken Language Processing Advanced NLP Guest Lecture August 31 Andrew Rosenberg.
By: Meghal Bhatt.  Sphinx4 is a state of the art speaker independent, continuous speech recognition system written entirely in java programming language.
Tutorial 7 Working with Multimedia. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Explore various multimedia applications.
Tutorial 7 Working with Multimedia. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Explore various multimedia applications.
Voice Recognition (Presentation 2) By: Priya Devi A. S/W Developer, Xsys technologies Bangalore.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
LandGrantMEDIA The Digital Media Library. Genesis Distance Diagnostics through Digital Imaging Need to retain current media assets Need for centralized.
DataMAPPER - Applied Database Tech. 이화여대 과학기술대학원 석사 3 학기 992COG08 김지혜.
Information Architecture & Design Week 3 Schedule -Syllabus Updates -Group Project Deliverables -IA Methodologies -Research Topic Presentations.
Animation Liveliness Simulation of motions A video made from a series of drawings/images simulating motions by means of slight progressive changes.
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
© 2013 by Larson Technical Services
Basic structure of sphinx 4
BY KALP SHAH Sentence Recognizer. Sphinx4 Sphinx4 is the best and versatile recognition system. Sphinx4 is a speech recognition system which is written.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Reducing uncertainty in speech recognition Controlling mobile devices through voice activated commands Neil Gow, GWXNEI001 Stephen Breyer-Menke, BRYSTE003.
Cross Language Clone Analysis Team 2 February 3, 2011.
1 CS 430 / INFO 430 Information Retrieval Lecture 17 Metadata 4.
Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.
ALPHABET RECOGNITION USING SPHINX-4 BY TUSHAR PATEL.
O dyssey Collaboration System: OCS. What is Distributed Collaboration? Work by teams whose members are separated by space and time.
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
#SummitNow Fighting viruses with Alfviral 2013 Fernando González @fegorama.
23 November 1999Sticky Technology for Augmented Reality Systems Rachel I. Goldstein Repair Team STARS Project Carnegie Mellon University 23.
Christoph Prinz / Automatic Speech Recognition Research Progress Hits the Road.
#SummitNow Alfresco Workdesk – Technical Insights November 12, 2013 Martin Kappel.
Visual Information Retrieval
Reza Yazdani Albert Segura José-María Arnau Antonio González
Yes, I'm able to index audio files within Alfresco
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
Alfresco Workdesk – Technical Insights
Tutorial 7 Working with Multimedia
Lab 2: Isolated Word Recognition
EPIC INFOTECH CONSULTING GROUP
Speech Capture, Transcription and Analysis App
Lab 3: Isolated Word Recognition
PROJ2: Building an ASR System
Add content to the Library
The Application of Hidden Markov Models in Speech Recognition
Presentation transcript:

#SummitNow Yes, I'm able to index audio files within Alfresco 2013 Fernando González @fegorama

#SummitNow Why? A lot of audio/video files in many companies The need to seek words in audio files Transcription of important conversations Efficiency in DAM @fegorama

#SummitNow AAT (Alfresco Audio Transcriber) Alfresco Action (Java) for audio transcription with Sphinx-4 from Carnegie Mellon University What is it? @fegorama

#SummitNow A group of speech recognition systems developed at Carnegie Mellon University. These include a series of speech recognizers (Sphinx 2 - 4) and an acoustic model trainer (SphinxTrain). What is Sphinx-4? @fegorama

#SummitNow Language model: Grammars Dictionaries Acoustic models: Hidden Markov Model (HMM) Elements of Sphinx-4 @fegorama

#SummitNow How does the action work? The action… Transcribes by direct execution Transcribes using content rules Transcribes using UI-Actions Transcribes with Alfresco Scheduler @fegorama

#SummitNow Features Use of Sphinx-4 and JSAPI2 for recognition Use of "policies" to transcribe uploaded content Use of "scheduler" to transcribe spaces programmatically Use of action “Audio Transcriber" in user interfaces (Alfresco Explorer and Share) List of available Audio Files Assignment of "aspects" to control transcriptions @fegorama

#SummitNow Architecture Alfresco API (Actions) Share API (UI-Actions) JSAPI2 Sphinx-4 API @fegorama

#SummitNow Transcriber Action Upload the file (WAV,…) Run the Action Call to transcriber and recognizer Capture words and other properties Indexing…

#SummitNow Model for audio-indexing Aspect: Transcriber Property: Words Index: Atomic and Tokenized Property: Frames Index: No Words and Frames are multiple

#SummitNow Ways to transcribe Automatic transcription Upload/Create and Load documents Actions/Rules Programming transcription Scheduled Actions Interactive transcription Repository action running UI Action running

#SummitNow Fields of application DAM (Digital Asset Management) Trials recording Movies and Songs Radio and TV Education

#SummitNow To Do… New formats of audio files for transcriptions Internationalization (Grammars and Acoustic models) Specialized Dictionaries Refactoring, refactoring and refactoring…

#SummitNow