CLARIN: Common Language Resources and Technology Infrastructure for the Humanities and Social Sciences (Steven Krauwer, LREC 2008)

Presentation transcript:

CLARIN: Common Language Resources and Technology Infrastructure for the Humanities and Social Sciences
Steven Krauwer, LREC 2008
- Kimmo Koskenniemi (University of Helsinki)
- Steven Krauwer (Utrecht University)
- Tamas Varadi (Hungarian Academy of Sciences)
- Peter Wittenburg (Max Planck Institute, Nijmegen)
- Martin Wynne (Oxford Text Archive)
- and many other contributors from the CLARIN community

The problem
- Much data in digital archives is language based
- It is only known to insiders
- Archives are mostly unconnected
- Every archive has its own standards for storage and access
- Normally only simple retrieval of files (text, audio or video documents) is possible
- Social sciences and humanities researchers are not language or speech technologists
- They are often not aware of the potential benefits of using language and speech technology
- Available tools are hard to use for non-specialists

The CLARIN Mission
What:
- Create a European infrastructure that makes language resources and technology (LRT) available to scholars of all disciplines, especially social sciences and humanities (SSH)
How:
- Unite existing digital archives into a federation of archives with unified web access
- Provide language and speech technology tools as web services operating on language data in archives

Why a European infrastructure?
- too much fragmentation
- lack of coordination across countries
- lack of visibility
- lack of interoperability
- lack of sustainability
- expertise exists but not in all countries
- language independent tools can be shared
- language dependent tools can often be ported
- most countries not able to bear the cost

Why now?
- Exponential growth of digital data
- Increasing maturity of language and speech technology:
  – high speed & large volumes
  – new research questions
- Growing interest at EU level in
  – digital research infrastructures
  – support for humanities and social sciences
  – resulting in a Special EU Call for Proposals for research infrastructure initiatives in 2006/2007

Who we are and where we come from
- CLARIN is an EU Infrastructure project with 4.1 Meuro funding for a 3-year preparatory phase
- Additional funding is expected from national governments (at this moment at least 5 Meuro)
- The CLARIN consortium now has 32 partners from 22 EU and associated countries (and more on the waiting list)
- The CLARIN community has 112 member organisations in 32 countries (mostly from NLP)
- CLARIN is based on 4 earlier initiatives with many participants: LangWeb, EARL, TELRI and (later) DAM-LR

Overall plan for CLARIN
- Preparatory phase: Put everything in place
- Construction phase: Build and populate with tools and resources
- Exploitation phase: 2016-…
- Overall cost until 2020: 200 Meuro

Planning and design (1)
The technical dimension:
- specifications, prototype, validation
- based on existing archives, resources and services
- strong focus on standards and interoperability
The language dimension:
- all languages equally important
- surveys of what exists and what is missing
- standards, taxonomies, ontologies
- integration of tools
- validation for all participating languages

Planning and design (2)
The user dimension:
- Investigating current practice
- Establishing users' needs
- Usage scenarios
- Pilot projects and demonstrators
- Expertise centers, awareness actions

Planning and design (3)
Legal and IPR issues:
- aim at open source, but IPR for existing and future non-open resources must be accommodated
- federation of archives requires authentication, authorization and trust between archives
- aim at limited number of template license agreements for most common cases
- respect national legislation
- address ethical issues

Planning and design (4)
Agree on e.g.:
- Who is going to pay for the construction and exploitation of the infrastructure?
- How will it be managed?
- How will it be coordinated with national policies?
To be agreed upon by the funding agencies (not the researchers)

What CLARIN is NOT about
- building the infrastructure (we are just preparing it)
- creating new resources (at this stage we want to use what is there and adapt it if necessary)
- creating applications (except maybe some demonstrators)
- focusing on the big languages (we find all languages equally important)
- strengthening European industry (our target audience is SSH researchers, but we don't want to exclude anyone)

Concluding remarks
- The CLARIN infrastructure does not yet exist, and we haven't even started constructing it: we have just started planning the construction
- Why do we feel confident that we can succeed? Three key factors:
  – Moral and financial support from the EC
  – Moral support from (now 26) national governments (financial commitments expected)
  – Massive support from the European LRT community
- More info: www.clarin.eu & Newsletter