Introduction to BLaRKs Helmer Strik Dept. of Linguistics Centre for Language and Speech Technology (CLST) Radboud University Nijmegen, the Netherlands.

Slides:



Advertisements
Similar presentations
Don Priebe - November New York and the 1099-R Everyone knows that... If its on a 1099-R NY doesnt tax it So where do I put it on the IT-201?
Advertisements

June 11, Florida Ready to Work Bayside High School
Syracuse PBT - Tax Year Introduction & AARP Major 2008 Changes.
1 Mesures détalement Mesures détalement par SiProt avec TimePix CEA Saclay Réunion RESIST 7 avril 2008 David ATTIÉ
Z cross-section with simultaneous fit Luca Lista for: Annapaola De Cosa, Michele De Gruttola, Salvatore Di Guida, Francesco Fabozzi, Pasquale Noli, Davide.
INSTITUTE FOR CYBER SECURITY April Access Control and Semantic Web Technologies Ravi Sandhu Executive Director and Endowed Chair Institute for Cyber.
PRAGMA 14 – Taichung March High Performance and Grid Computing Group Faculty of Computer Science and Engineering Ho Chi Minh City University.
GL10 – December 8-9, Grey literature in French digital repositories: a survey J. Schöpfel (University of Lille 3) C. Stock (INIST-CNRS)
Financial and Grants Management Institute - March 18-20, Key Concepts for Learn and Serve.
August 27th Availability, Pricing and Affordability of Cardiovascular Medicines Draft report for comments Maaike S.M. van Mourik University.
29 May GNSO Improvements Top Level Plan 29 May 2009 Plan distributed 22 May by Avri.
Sep 3, 2008NVOSS VO Analysis Using Local Utilities Mike Fitzpatrick NOAO.
Sep 3, 2008NVOSS Mobile VO Mike Fitzpatrick NOAO.
A centralized approach to language resources Piek Vossen S&T Forum on Multilingualism, Luxembourg, June 6th 2005.
Professionalisation of Bachelors Preparing Bachelors for the labour market.
Climate Change Community Response Portal CCCRP FMI, SYKE, HUT/ Centre for Urban and Regional Studies SYKE Life+ seminar Maria Holmberg.
Masterclass Introduction to hands-on Exercise Aim of the exercise Find out what happens in proton-proton collisions at the LHC as seen by the ATLAS.
Masterclass Introduction to hands-on Exercise Aim of the exercise Identify electrons, muons, neutrinos in the ATLAS detector Types of Events (particles.
25 April Implementation of Uniform Guidelines for Ethics Review in Sri Lanka Malik Fernando M.B.,Ch.B. (Bristol)
Wyoming Healthcare Commission - March 10, Nurses in Demand: Statement of the Problem Tom Gallagher, Manager Research & Planning Wyoming Department.
10/04/20081 TWG of ESF Committee 10 April 2008 Franck Sébert Head of unit DG EMPL/I/1 Relations with Control Authorities Action plan to strengthen the.
HOW DOES THE PATENT SYSTEM AFFECT PRIZE CONTESTS? Lee Davis Dept. of Innovation and Organizational Economics Copenhagen Business School KEI & UNI-MERIT.
XML-publication in Finnish Labour Force Survey (LFS) ESTP training course on Data Dissemination and Publication of Statistics Madrid, Kalle.
National greenhouse gas inventories and official statistics - Finnish experiences Riitta Pipatti Statistics Finland Conference on Climate Change, Development.
Martin Wolpers & Erik Duval 7 Dezember  Today – LAST LECTURE!  Student presentations  Wrap-up  Oral examens  Feedback  About the course 
DISCO Development and Integration of Speech technology into Courseware for language learning Stevin project partners: CLST, UA, UTN, Polderland Radboud.
How to integrate automatic speech recognition (ASR) into CALL applications Helmer Strik Department of Linguistics Centre for Language and Speech Technology.
Results of R&D: BLaRK for Dutch Helmer Strik Dept. of Linguistics Centre for Language and Speech Technology (CLST) Radboud University Nijmegen, the Netherlands.
Core Competencies Training for Supervisors
CAR-IX Caribbean Internet Exchange Job Witteman
Warschauer, M. (2002). A developmental perspective on technology in language education. TESOL Quarterly, 36(3) ELTAM A Developmental Perspective.
CoReWIKI and Semantic Web A node for cultural heritage standards.
FNV Bondgenoten Henk van der Ploeg Union officer (Representative of all the employees from the Dutch Unions between 2000 and 2006)
Competitive Intelligence – It’s Not Just For Spies! March 10, 2008 Linda Rink President.
© IPC, IPC Initiative Future of Mail by Air; why we started.
02/12/ a tutorial on Markov Chain Monte Carlo (MCMC) Dima Damen Maths Club December 2 nd 2008.
Modular – Flexible – Networked
UK Higher Education library statistics The role of SCONUL.
systems of linear equations
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Empirical and Data-Driven Models of Multimodality Advanced Methods for Multimodal Communication Computational Models of Multimodality Adequate.
Spoken Language Technologies: A review of application areas and research issues Analysis and synthesis of F0 contours Agnieszka Wagner Department of Phonetics,
1/7 INFO60021 Natural Language Processing Harold Somers Professor of Language Engineering.
Designing a Multi-Lingual Corpus Collection System Jonathan Law Naresh Trilok Pace University 04/19/2002 Advisors: Dr. Charles Tappert (Pace University)
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Language Technology 2005/06 Hans Uszkoreit Universität des Saarlandes
1 Human Language Technology and communicative disabilities: Requirements and possibilities for the future Catia Cucchiarini, Dutch Language Union, the.
Machine Translation, Digital Libraries, and the Computing Research Laboratory Indo-US Workshop on Digital Libraries June 23, 2003.
The South African HLT Audit 1 HLT Research Group, CSIR, South Africa 2 Graduate School of Technology Management, University of Pretoria, South Africa 3.
Hamburg, The Basic Language Resources Kit (BLARK) Steven Krauwer Utrecht Institute of Linguistics UiL OTS / ELSNET.
Linguistics & AI1 Linguistics and Artificial Intelligence Linguistics and Artificial Intelligence Frank Van Eynde Center for Computational Linguistics.
Roadmap for Language Resources and Evaluation in a Multilingual Environment Minority Languages in the African Context Justus Roux Centre for Language and.
ENABLER, BLARK, what’s next? Steven Krauwer Utrecht University / ELSNET.
Suléne Pilon & Danie Prinsloo Overview: Teaching and Training in South Africa 25 November 2008;
Sign Language corpora for analysis, processing and evaluation A. Braffort, L. Bolot, E. Chételat-Pelé, A. Choisier, M. Delorme, M. Filhol, J. Segouat,
Dutch HLT Resources: from BLARK to Priority Lists Helmer Strik, Diana Binnenpoorte, Janienke Sturm, Folkert de Vriend, and Catia Cucchiarini* A 2 RT, Dept.
Language Technology I © 2005 Hans Uszkoreit Language Technology I 2005/06 Hans Uszkoreit Universität des Saarlandes and German Research Center for Artificial.
EVikings II WP3: Language Technologies. HLT Human Language Technologies (HLT) play a crucial role in the Information Society For small languages it is.
Introduction to Human Language Technologies Tomaž Erjavec Karl-Franzens-Universität Graz Tomaž Erjavec Lecture 1: Overview
Catia Cucchiarini, Walter Daelemans and Helmer Strik Strengthening the Dutch Language and Speech Technology Infrastructure Catia Cucchiarini, Walter Daelemans.
Towards a roadmap for standardization in language technology Laurent Romary & Nancy Ide Loria-INRIA — Vassar College.
Creating & Testing CLARIN Metadata Components A CLARIN-NL project Folkert de Vriend Meertens Institute, Amsterdam 18/05/2010.
金聲玉振 Taiwan Univ. & Academia Sinica 1 Spoken Dialogue in Information Retrieval Jia-lin Shen Oct. 22, 1998.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Search and Annotation Tool for Oral History INTER-VIEWS Henk van den Heuvel, Centre for Language and Speech Technology (CLST) Radboud University Nijmegen,
INTRODUCTION TO APPLIED LINGUISTICS
COCOSDA/WRITE Roadmap for Language Resources and Evaluation
Artificial Intelligence 2004 Speech & Natural Language Processing
Emre Yılmaz, Henk van den Heuvel and David A. van Leeuwen
Presentation transcript:

Introduction to BLaRKs Helmer Strik Dept. of Linguistics Centre for Language and Speech Technology (CLST) Radboud University Nijmegen, the Netherlands Radboud University Nijmegen

Cape Town, Introduction: Background BLaRK: Basic Language Resources Kit NTU: define the BLaRK for Dutch (more details in next presentation) How to define the Basic Language Resources for a language for a given context? Basic Language Resources for Dutch, in general Basic Language Resources for Dutch, handicapped Basic Language Resources for SA Also for many other languages

Radboud University Nijmegen Cape Town, BLaRK: Basic Language Resources Kit Components: Data: sets of language data and descriptions in machine readable form Modules (or semi-products): the basic software components of HLT applications Applications: classes of applications rather than specific applications or products 2 matrices: 1.Modules x Data 2.Applications x Modules  BLaRK

Radboud University Nijmegen Cape Town, DataApplications Modules LanguageTechnology SpeechTechnology Quantify: 0, 1, or 2 (+’s) Field survey & Expert opinions

Radboud University Nijmegen Cape Town, BLaRK Language technology Modules Robust modular text preprocessing Morphological analysis and morphosyntactic disambiguation Robust syntactic analysis Aspects of semantic analysis (word meaning and reference) Data Monolingual lexicon Annotated corpus of written Dutch Benchmarks for evaluation

Radboud University Nijmegen Cape Town, BLaRK Speech technology Modules Automatic speech recognition Speech synthesis system Tools for annotation of speech corpora Confidence measures and utterance verification Identification (speaker, language, dialect) Data Monolingual speech corpora for specific applications Multilingual speech corpora Multimodal/medial speech corpora Benchmarks for evaluation

Radboud University Nijmegen Cape Town, From BLaRK to priority lists 1.BLaRK: Basic Language Resources Kit 2.Inventory & Evaluation 3.Priority lists BLaRKinventory priority

Radboud University Nijmegen Cape Town, Inventory & Evaluation Inventory: Which components in BLaRK are available? Bought Freely obtainable Reusable Of sufficient quality Evaluation: And of sufficient quality? Checklist approach or formal evaluation

Radboud University Nijmegen Cape Town, Availability Quantify:1-10 Field survey & Expert opinions Modules Data

Radboud University Nijmegen Cape Town, Priority lists The prioritisation was based on the following requirements: The components should currently be unavailable, inaccessible, or of insufficient quality. The components should be relevant for a large number of applications. Developing the components should be possible in the short term.

Radboud University Nijmegen Cape Town, Consensus, broad support Report version 1 Feedback Academia & industry Sent to the Dutch-Flemish HLT field (1000 sites) Workshop 15/11/2001  Report version 2, final version

Radboud University Nijmegen Cape Town, From BLaRK to priority lists 1.BLaRK 2.Inventory & Eval. 3.Priority lists Report 1 Feedback: HLT FieldHLT Field WorkshopWorkshop 1.BLaRK 2.Inventory & Eval. 3.Priority lists Report 2

Radboud University Nijmegen Cape Town, Introduction: Background BLaRK: Basic Language Resources Kit How to define the Basic Language Resources for a language for a given context? Basic Language Resources for Dutch, in general Basic Language Resources for Dutch, handicapped Basic Language Resources for SA Also for many other languages

Radboud University Nijmegen Cape Town, Questions?