ENABLER, BLARK, what’s next? Steven Krauwer Utrecht University / ELSNET.

Overview
ENABLER
BLARK
BLARK Results
Recent developments
CLARIN
Some reflections
MyBLARK
Concluding remarks

ENABLER
EU project, FP5, under Information Society Technologies
bringing together national language resources projects in many EU countries
aimed at providing a cooperative framework to foster cooperation and interoperability
with a strong industrial drive
led by Pisa, and ended (as an EU project) in 2004
... but still existing as a community, in close collaboration with ELSNET

BLARK (1)
Basic Language Resource Kit
Idea (first launched in 1998): definition of the minimal set of language resources needed to do any (precompetitive) R&D and education at all
The definition should in principle be language independent (although specific languages may require specific adaptations)

BLARK (2)
The definition should include both data collections (corpora, lexicons) and modules (taggers, parsers, synthesizers, annotation tools)
It should include both qualitative aspects (e.g. standards) and quantitative aspects (e.g. size)

BLARK (3)
Once the definition is available, it can be used as a common reference point that makes it possible to
– assess the resources situation of a language (how much of the BLARK is available, and what is still missing)
– make priority plans for bringing the resources situation up to date
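
The assessment step described on this slide could be sketched roughly as follows. This is only an illustration: the item names, the size thresholds, and the `BlarkItem` and `assess` helpers are hypothetical and do not come from any actual BLARK definition.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlarkItem:
    """One entry of a (hypothetical) BLARK definition."""
    name: str
    kind: str          # "data" (corpora, lexicons) or "module" (taggers, parsers)
    min_size: int = 0  # quantitative requirement; the unit depends on the item


def assess(definition, available):
    """Compare a BLARK definition with what exists for one language.

    `available` maps an item name to the size that exists for the language.
    Returns (covered names, missing names, fraction of the definition covered).
    """
    covered, missing = [], []
    for item in definition:
        size = available.get(item.name)
        if size is not None and size >= item.min_size:
            covered.append(item.name)
        else:
            missing.append(item.name)
    coverage = len(covered) / len(definition) if definition else 0.0
    return covered, missing, coverage


# Illustrative values only (assumptions, not real requirements or survey data).
definition = [
    BlarkItem("written corpus", "data", min_size=1_000_000),  # tokens
    BlarkItem("lexicon", "data", min_size=50_000),            # entries
    BlarkItem("POS tagger", "module"),
    BlarkItem("parser", "module"),
]
available = {"written corpus": 2_000_000, "lexicon": 10_000, "POS tagger": 1}

covered, missing, coverage = assess(definition, available)
print("covered:", covered)
print("missing:", missing)
print("coverage:", coverage)
```

The list of missing items is what a priority plan would start from; how to rank those items is a policy decision that this sketch leaves open.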

BLARK (4)
Note that the BLARK is necessarily dynamic, as new technological developments will come with new requirements
Note that the BLARK for a language will only work if there is a body that takes responsibility for its implementation and for the maintenance and distribution of the resources created

BLARK Results
First adopted by the Dutch Language Union, resulting in a first 12 million euro implementation programme launched at the end of 2004
Explored and developed for Arabic in the NEMLAR project (CST, ELDA, ELSNET, and others; see the presentation at this conference, O27-G on Thursday)
BLARK concept included in a number of proposals, but without tangible results
Suggestions for a more advanced variant (ELARK) have been put forward by ELDA and others

Recent developments
CLARIN: Common Language Resources and Technology Infrastructure (see the LREC 2006 workshop on May 22)
NOT a project proposal, but rather a proposal for a Research Infrastructure to be included in the European Roadmap for Research Infrastructures

CLARIN (1)
Creation of an open European Language Resources Network with strong service centers and repositories, providing the humanities community at large (i.e. not just the language and speech technology community) with
– knowledge about which language resources and tools exist and how to use them
– access to existing language resources
– coordinated creation of new resources
– access to advanced services for access and adaptation
– bundling of expertise in specific problem areas
– training centers

CLARIN (2)
Three important observations:
CLARIN has no industrial drive
CLARIN aims at addressing all languages in the EU (and associated countries)
One of CLARIN’s objectives is the definition and the coordinated creation of BLARKs for all languages of the EU

Some reflections
Whatever progress has been made (DLU, NEMLAR, ELARK) was mostly inspired by industrial needs
Industrial considerations do not favour smaller languages
Progress of the BLARK since 1998 has been slow
No new funding opportunities in FP6 to get anything done
CLARIN may offer exciting opportunities (if successful), but this will take a lot of time

More reflections
The present (embryonic) BLARK definition may be one or more steps too far for under-resourced languages
So, why not add to the concept the BLARKette, which should represent a very basic entry-level variant of the BLARK, targeting exclusively the research and (especially) education community
Small and simple, should fit on a CD-ROM

And yet more reflections
Nothing funded will happen before well into 2007
Why wait until then, e.g. if and when CLARIN is in place and some formal process has been put into motion to define the BLARK (and the BLARKette)?
Why not start an action to consult the language communities and to arrive at a first proposal for a BLARK and BLARKette definition?

MyBLARK, the proposal
We initiate MyBLARK, aiming at collecting (for each language in the EU)
– a description of the essential components of the BLARK
– and of the BLARKette
We try to distill from this a broadly supported proposal for the definition of both concepts
We offer this as an input to the CLARIN project if it ever happens, or otherwise use it to launch other initiatives

MyBLARK, the process
ELSNET (possibly in collaboration with COCOSDA/WRITE) will send out a simple questionnaire to all known language resources centers, asking for descriptions of BLARK and BLARKette components
ELSNET (maybe with COCOSDA/WRITE) will set up a committee to synthesize the results in the form of recommendations

MyBLARK participants
Language resources centers for languages of the EU and associated countries known to us
Language resources centers in the EU (+associated countries) that send me a message that they are willing to participate

MyBLARK Questionnaire
Language
Type of resource
Usage
Size
Annotation required
Brief description
Available for your language?
– If so: pointer to it
– If not: pointer to similar resource for another language
References
Comments
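
One way a resource center might record a single questionnaire answer in machine-readable form is sketched below. The field names mirror the questionnaire items above; the `QuestionnaireEntry` class, the `problems` consistency check, and the sample values are assumptions made for illustration, not part of the MyBLARK proposal.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class QuestionnaireEntry:
    """One described BLARK or BLARKette component (hypothetical record layout)."""
    language: str
    resource_type: str
    usage: str
    size: str
    annotation_required: str
    description: str
    available: bool                                  # "Available for your language?"
    pointer: Optional[str] = None                    # filled in when available
    similar_resource_pointer: Optional[str] = None   # filled in when not available
    references: List[str] = field(default_factory=list)
    comments: str = ""

    def problems(self) -> List[str]:
        """Return consistency issues between `available` and the pointer fields."""
        issues = []
        if self.available and not self.pointer:
            issues.append("available resource needs a pointer")
        if not self.available and self.pointer:
            issues.append("pointer given for unavailable resource")
        return issues


# Placeholder values for illustration only.
entry = QuestionnaireEntry(
    language="(language name)",
    resource_type="written corpus",
    usage="research and education",
    size="(size)",
    annotation_required="(annotation layers)",
    description="(brief description)",
    available=True,
    pointer="https://example.org/resource",
)
print(entry.problems())
```

Keeping the answers in a fixed record like this would make it easier for a committee to compare responses across languages, although the slides do not prescribe any particular format.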

MyBLARK Schedule
June – August 2006: collection of contacts
September 2006: questionnaires sent out
October 2006: questionnaires in, first analysis and draft definition proposals
November 2006: proposals sent out for feedback
December 2006 – January 2007: collecting feedback
February 2007: final report

Concluding remarks
I have proposed the introduction of a slightly weaker variant of the BLARK, the BLARKette, for under-resourced languages
I have proposed an action entitled MyBLARK to arrive at an initial definition of both the BLARK and the BLARKette
I hope that this will (a) speed up the process, and (b) provide an intermediate coverage level for under-resourced languages