Reproducible ASR workflows (RASR) - Final report
David Risinamhodzi – North-west University – South Africa (David.Risinamhodzi@nwu.ac.za)
e-Research Summer Hackfest – Catania (Italy)
Outline
- Scientific problem
- Work done during the Hackfest
- Results
- Future plans
- Summary and conclusions
Requirements & Scientific problem
Speech recognition requirements:
- Sufficient storage space for large audio and text datasets
- High Performance Computing (HPC):
  - Many CPUs, as most training and recognition is performed in parallel
  - GPUs to train deep neural networks (DNNs)
- High-speed internet to move data between compute nodes
- A mechanism to manage datasets
Requirements & Scientific problem
- Lack of collaboration
- Lack of exploitation of the available distributed computing
- Long hours of training systems on personal computers
Research questions to be answered:
- Are speech recognition models reproducible?
- How do speech recognition models vary with different dictionaries and training?
Corollary issues:
- Provenance and publication of models
- Ease of exploring ASR models: access for researchers
- Making access to national language resources easier
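One way to make the reproducibility and provenance questions above concrete is to record a checksum manifest of the inputs (audio, transcripts, dictionaries) used for each training run, so that two runs can be shown to have started from identical data. The following is a minimal sketch under that assumption, not the method used in RASR; the function names `sha256_of_file`, `build_manifest` and `manifest_fingerprint` are hypothetical and chosen only for illustration.

```python
import hashlib
import json
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in fixed-size chunks so large audio files are not loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict:
    """Map each file under root (relative POSIX path) to its SHA-256 digest."""
    return {
        p.relative_to(root).as_posix(): sha256_of_file(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def manifest_fingerprint(manifest: dict) -> str:
    """Hash the manifest itself so two runs can be compared with one string."""
    canonical = json.dumps(manifest, sort_keys=True).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()
```

Publishing the fingerprint alongside a trained model would let another researcher confirm they are training from the same dataset and dictionary before comparing results. Identical inputs are a precondition for reproducibility, not a guarantee of it, since training itself may be non-deterministic.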
Work done during the Hackfest
- Website developed with Jekyll (90% finished); it displays a few pages
- Application ported to CODE-RADE
- WebApp architecture completed and development commenced (Rails, using MVC)
- gLibrary installed
- Access to FutureGateway test server obtained
- Access to OAR obtained
Results
- Website developed
- WebApp architecture designed to completion
- Test case of the application successfully executed locally
Future plans
End of July:
- Configure authentication and authorization
- Data ingestion into OAR
- Complete the website
- Test the job submission workflow
Workshop in Dar es Salaam (5 September):
- WebApp complete
- Use case complete
- Test cases run
- Results obtained
- Authoring of paper on the use case in progress
Summary and conclusions
- Excellent initiative in building capacity and bridging the digital divide
- Educational and practically oriented (learnt a lot)
- Good exposure and a platform to network with other researchers
- The support was good
- Organizers were ever present to assist
Gratitude goes out to:
- Simon, Roberto, Bruce and the rest of the organizers, and the EU
- The (geeks): Marco, Mario, Riccardo
- The administration
- Everyone else who contributed but whom I did not mention
- Lastly, my University for assisting in making this possible
Sicily was great! The food awesome! Thank you guys.
Thank you!
sci-gaia.eu
info@sci-gaia.eu