CLARIN: Goals and Structure of the Project Steven Krauwer CLARIN Coordinator Utrecht institute of Linguistics UiL-OTS (NL)

Slides:



Advertisements
Similar presentations
Computational Paradigms in the Humanities – eHumanities and their role and impact in transdisciplinary research Gerhard Budin University of Vienna.
Advertisements

Supporting education and research E-learning tools, standards and systems Sarah Porter Head of Development, JISC.
Steven KrauwerCLARIN-NL Launch CLARIN-EU: Where do we stand? Steven Krauwer Utrecht institute of Linguistics UiL OTS CLARIN-EU Coordinator.
CLARIN: Common Language Resources and Technology Infrastructure for the Social Sciences and Humanities Steven Krauwer Utrecht institute of Linguistics.
Steven KrauwerLREC20081 CLARIN: Common Language Resources and Technology Infrastructure for the Humanities and Social Sciences Kimmo Koskenniemi (University.
Technical Review Group (TRG)Agenda 27/04/06 TRG Remit Membership Operation ICT Strategy ICT Roadmap.
CENTRAL EUROPE PROGRAMME SUCCESS FACTORS FOR PROJECT DEVELOPMENT: focus on activities and partnership JTS CENTRAL EUROPE PROGRAMME.
Networks ∙ Services ∙ People John DYER TF-MSP Video Conference Community Procurement Support Building on the SPOT-ON Proposal Smart Procurement,
Research and Innovation Research and Innovation Research and Innovation Research and Innovation Research Infrastructures and Horizon 2020 The EU Framework.
LKR2004, Tokyo March The European Resources Landscape Steven Krauwer ELSNET / Utrecht University The Netherlands.
European University Presses take the initiative to develop an Open Access model for peer reviewed books in Humanities and Social Sciences.
CLARIN-NL First Call Jan Odijk CLARIN-NL Kick-off Meeting Utrecht, 27 May 2009.
1 CLARIN - NL Language Resources and Technology Infrastructure for the Humanities and the Social Sciences in the Netherlands Jan Odijk LREC May.
CLARIN Common Language Resources and Technology Infrastructure Daan Broeder & Dieter van Uytvanck Max-Planck Institute for Psycholinguistics TF-EMC2 Meeting,
CLARIN-NL Second Open Call Jan Odijk CLARIN-NL Call 2 Info-session Amsterdam, 26 Aug 2010.
The Preparatory Phase Proposal a first draft to be discussed.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
NSD©2014 Bjørn Henrichsen From Fragmentation to a Infrastructural System DASISH Strategic Board Gothenburg, November
Dr. Jūratė Kuprienė Director for innovations and infrastructure development Workshop: Information services for research process , Rīga Research.
Intellectual Property Rights (IPR) Helpdesk A presentation by Daniela Nolte.
AARC Overview Licia Florio, David Groep 21 Jan 2015 presented by David Groep, Nikhef.
1 DG RTD-B ERA: Research Programmes and Capacity Research Infrastructures Unit Maria Theofilatou FP7 Community actions Research Infrastructures of Social.
CLARIN ERIC Progress according to the Strategy Plan Steven Krauwer, Bente Maegaard 1.
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
WP7 - Architecture and implementation plan Objectives o Integrating the legal, governance and financial plans with technological implementation through.
E-SENS Electronic Simple European Networked Services WP2 kick off Berlin, Germany Apr 10th 2013.
EPOS Preparatory phase Torild van Eck (ORFEUS) Call INFRA Deadline: December 3, 2009 Funding: between 3 and 6 MEuro Duration: max 4 year.
ESPON Seminar 15 November 2006 in Espoo, Finland Review of the ESPON 2006 and lessons learned for the ESPON 2013 Programme Thiemo W. Eser, ESPON Managing.
CLARIN Infrastructure Vision (and some real needs) Daan Broeder CLARIN EU/NL Max-Planck Institute for Psycholinguistics.
ENABLER, BLARK, what’s next? Steven Krauwer Utrecht University / ELSNET.
1 CLARIN - NL Language Resources and Technology Infrastructure for the Humanities and the Social Sciences in the Netherlands.
DASISH Final Conference Common Solutions to Common Problems.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Semantic Web services Interoperability for Geospatial decision.
Results of the HPC in Europe Taskforce (HET) e-IRG Workshop Kimmo Koski CSC – The Finnish IT Center for Science April 19 th, 2007.
VIRTUAL INFORMATION AND KNOWLEDGE ENVIRONMENT FRAMEWORK IP-FP
1 Support for New Research Infrastructures in the EU 7 th Framework Programme for Research Elena Righi European Commission SKADS Workshop, Paris, 4 September.
ESTELA Summer Workshop, 26 June 2013 The EU-SOLARIS project.
1 SMEs – a priority for FP6 Barend Verachtert DG Research Unit B3 - Research and SMEs.
CLARIN work packages. Conference Place yyyy-mm-dd
Riga, Apr HLT in the Baltics, 10 years after 1994 Steven Krauwer ELSNET / Utrecht University (NL)
EVA Workshop, 26 March 2003, Florence, Italy1 COINE Cultural Objects In Networked Environments Anthi Baliou University of Macedonia,Library Thessaloniki,
1 Direction scientifique Networks of Excellence objectives  Reinforce or strengthen scientific and technological excellence on a given research topic.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
1 CLARIN - NL What is going on? Jan Odijk Amsterdam 26 Aug 2010.
CREATING GOOD GOVERNANCE STRUCTURE TO STRENGTHEN CONSORTIA: KENYA LIBRARY AND INFORMATION SERVICES CONSORTIUM (KLISC) BY Rosemary Otando Eifl Country Coordinator.
Global Geospatial Information Management (GGIM) A UN-DESA Initiative in collaboration with Cartographic Section, DFS Stefan Schweinfest UNSD.
Security Policy: From EGEE to EGI David Kelsey (STFC-RAL) 21 Sep 2009 EGEE’09, Barcelona david.kelsey at stfc.ac.uk.
© Services GmbH Proposal writing: Part B 2/1/ St. Petersburg, May 18, 2011 Dr. Andrey Girenko
DRIVER Action plan for an International Repository Organisation Dale Peters OAI6 Breakout Session Joining up Repositories 18 June 2009.
SEVESO II transposition and implementation: Possible approaches and lessons learned from member states and new member states SEVESO II transposition and.
TIARA – WP6 Involving Industry in TIARA Lucio Rossi (WPD) CERN.
Thomas Gutberlet HZB User Coordination NMI3-II Neutron scattering and Muon spectroscopy Integrated Initiative WP5 Integrated User Access.
CLARIN EUDAT2020 uptake plan Dieter Van Uytvanck CLARIN ERIC EUDAT User Forum, Rome.
H. PERO, E uropean Commission European policy developments and challenges in the field of Research Infrastructures.
Partnerships Horizon 2020 / Eurostars expert: Dr. Radosław Piesiewicz.
André Hoddevik, Project Director Enlargement of the PEPPOL-consortium 2009.
Meeting of the ESFRI Social Sciences and Humanities Projects, City University London, 27/01/2009 Report to the Legal Workshop, Brussels, 6 th February.
Project KA2-CBHE School-to-Work Transition for Higher education students with disabilities in Serbia, Bosnia & Herzegovina and Montenegro (Trans2Work)
WP1 - Consortium coordination and management
Antonella Fresa Technical Coordinator
Darja Fišer CLARIN ERIC Director of User Involvement
CLARIN ERIC and the science cloud
Common Solutions to Common Problems
Integrating social science data in Europe
VCC 4 General VCC meeting, 2/3 April 2012, Utrecht, The Netherlands
Juan Gonzalez eGovernment & CIP operations
VCC 3 General VCC meeting, 2/3 April 2012, Utrecht, The Netherlands
It’s all about people Data-related training experiences from EUDAT, OpenAIRE, DANS Marjan Grootveld, DANS EDISON workshop, 29 August 2017.
EOSC Secretariat.
Presentation transcript:

CLARIN: Goals and Structure of the Project Steven Krauwer CLARIN Coordinator Utrecht institute of Linguistics UiL-OTS (NL)

Steven KrauwerCLARIN - Riga Overview Problem & Mission Some why-questions Some who-questions Overall plan –Technical dimension –Language dimension –User dimension –Governance and legal dimension What CLARIN is NOT about How we work Funding Structure To conclude

Steven KrauwerCLARIN - Riga The problem Much data in digital archives language based Only known to insiders Archives mostly unconnected Every archive has its own standards for storage and access Normally only simple retrieval of files (text, audio or video documents) Social sciences and humanities researchers are not language or speech technologists They are often not aware of the potential benefits of using language and speech technology Available tools are hard to use for non- specialist

Steven KrauwerCLARIN - Riga The CLARIN Mission What: Create an infrastructure that makes language resources and technology (LRT), available to scholars of all disciplines, especially social sciences and humanities (SSH) How: Unite existing digital archives into a federation of archives with unified web access Provide existing language and speech technology tools as web services operating on language data in archives

Steven KrauwerCLARIN - Riga Why a European infrastructure? too much fragmentation lack of coordination across countries lack of visibility lack of interoperability lack of sustainability expertise exists but not in all countries language independent tools can be shared language dependent tools can often be ported most countries not able to bear the cost

Steven KrauwerCLARIN - Riga Why now? Exponential growth of digital data Increasing maturity of language and speech technology: –high speed –large volumes –new research questions Growing interest at EU level in research infrastructures (RI) RI Roadmap published in 2006 by ESFRI includes 35 accepted proposals for RIs CLARIN is one of them all of them will get funding for a 1-3 year preparatory phase

Steven KrauwerCLARIN - Riga Who we are and where we come from The CLARIN consortium has now 32 partners from 22 EU and associated countries (and more on the waiting list) The CLARIN community has 142 members in 32 countries (Oct 2008) CLARIN is based on 4 earlier initiatives with many participants: –LangWeb –EARL –TELRI –(and later) DAM-LR

Steven KrauwerCLARIN - Riga Who else do we need? Both our membership and our consortium are quite unbalanced: –Speech & multimodality underrepresented –Humanities other than linguistics underrepresented –Social sciences underrepresented –Some countries still missing There is no money to extend the consortium but we have to fill these gaps

Steven KrauwerCLARIN - Riga Overall plan for CLARIN Preparatory phase: Put everything in place Construction phase: Build and populate with tools and resources Exploitation phase: 2016-…. CLARIN in full service Budget: Prep phase –4.1 M€ from EC – ??? from countries Estimated budget until 2020: ca 200 M€

Steven KrauwerCLARIN - Riga dimensional approach in the preparatory phase First 3 years dedicated The technical dimension The language dimension to the design: The user dimension The governance and legal dimension

Steven KrauwerCLARIN - Riga Technical Technical specification of the infrastructure Construction of a prototype Validation on rich variety of –languages (>20) –resources –services Federation of existing archives Based on existing resources, tools Strong focus on interoperability standards Conversion of existing resources Encapsulation of existing tools

Steven KrauwerCLARIN - Riga Languages Cover all languages spoken or studied in participating countries Representational and descriptive standards should be adequate and validated for all languages Same minimal coverage of basic resources and tools for all languages BLARK (Basic Language Resources Toolkit) to be defined and implemented (funds from other sources needed)

Steven KrauwerCLARIN - Riga Language activities Survey of resources and tools, including: –encoding and annotation data –quality indicators taxonomies and ontologies agreeing on common standards Focus on integration of tools interoperability usage scenarios creating missing essential resources validating specifications and prototype

Steven KrauwerCLARIN - Riga User Users are SSH scholars (including linguists, translation experts) Do WE know what they need? Do THEY know what they need? Actions: –analyze past and ongoing SSH projects –user consultation –launch typical example projects to show potential –expertise centers –awareness actions

Steven KrauwerCLARIN - Riga Legal IPR issues aim at open source, but IPR for existing and future non-open resources must be accommodated federation of archives requires authentication, authorization and trust between archives aim at limited number of template license agreements for most common cases respect national legislation address ethical issues

Steven KrauwerCLARIN - Riga Governance and Funding Agree on e.g.: Who is going to pay for the construction and exploitation of the infrastructure How will it be managed How will it be coordinated with national policies Actions: Analyse best practice in funding and management of transnational projects Prepare agreement between (now) 22 countries about long term joint funding of CLARIN

Steven KrauwerCLARIN - Riga What CLARIN is NOT about building the infrastructure – we are just preparing it creating new resources – at this stage we want to use what is there and adapt it if necessary creating new applications – except maybe some essential tools or demonstrators focusing on the big languages – we find all languages equally important strengthening European industry – our target audience are SSH researchers, but we don’t want to exclude anyone

Steven KrauwerCLARIN - Riga How we work (1) Work packages: WP1: Management and coordination WP2: Designing the infrastructure and building the prototype WP3: Humanities overview WP5: Language resources and technology overview WP6: Dissemination WP7: IPR and business models WP8: Construction and exploitation agreement

Steven KrauwerCLARIN - Riga How we work (2) WP8 Org&Legal Framework WP7 IPR, A&A, licensing WP5 LRT Exploration WP2 Infrastructure Prototype WP3 Humanities Projects

Steven KrauwerCLARIN - Riga How we work (3) Most tasks executed in Working Groups WGs consist of project partners & other experts (CLARIN is open!) Some WGs do work (e.g. build prototype), others create consensus Participation by others essential as e.g. standards cannot be imposed by a small group Unfortunately no EC funding available for WG participation – only reward is influence!

Steven KrauwerCLARIN - Riga Funding & what to use it for From EC: 4.1 M€, used for generic, language independent tasks From countries: ??? M€, to be used for preparing CLARIN at the national level in every country: –build and organize local national CLARIN community –support for participation in working groups (e.g. travel) –validation tasks for own language(s) –creation or adaptation of essential resources –pilots and demonstrators & humanities projects –(co-)organisation of local or international events –preparing for future role (expertise centers, repositories)

Steven KrauwerCLARIN - Riga Structure Executive Board, consisting of the 7 WP leaders plus a special representative to liaise with the humanities community (a.o. through the DARIAH sister project) Boards: –Scientific Board –Strategic Coordination Board –International Advisory Board Meetings (virtual or face to face): –Consortium meetings –Member meetings –Working group meetings

Steven KrauwerCLARIN - Riga More info CLARIN Website: CLARIN Office: CLARIN Newsletter: CLARIN Members: