Term Extraction with the Steunpunt Nederlandstalige Terminologie (Dutch Terminology Service Centre)
Hennie van der Vliet, Vrije Universiteit Amsterdam


roadmap
- The Steunpunt
  - What is the Steunpunt
  - What does the Steunpunt do
- Term extraction
  - What is it
  - What can it do for you
  - How does it work
  - Demo

Steunpunt Nederlandstalige Terminologie
- CrossLang, Gent
- VU-University
Hennie van der Vliet, Anneleen Schoen
full time equivalent

what is the Steunpunt
- founded by NTU, Dutch Language Union
- Dutch as a language for business, technology, politics, science
Core business: serving the user (motivating, infrastructure, know-how)
Priorities: practical needs
Main problem: unknown, unloved, time-consuming

what is the Steunpunt
- a website: term lists, education, web course, agenda...
- a counter
- a quarterly newsletter
- the TiNT-congress
- workshops

The Dutch TSC mission statement
There is a gap between theoretical terminology and the possibilities, skills and needs of workers in the field. Our aim is to bridge that gap and make terminology management possible, helping users to help themselves.
- Provide tools for terminology management for Dutch
- Develop a methodology for working with these tools efficiently
- Organize workshops etc.

plan of action: tools and infrastructure
Language-specific tools to be developed by the NTU
- term extractor (TermTreffer)
- term management (TermBeheer)
- implemented in a workbench
Presupposes corpus-based terminology
- know-how, basic tools

plan of action: collecting data
Future plans: collecting terminological data (term bases)
- not much available
- hard to collect
- not always reliable
- variation in formats, data structures, etc.
→ the Dutch terminology workbench will open up opportunities

What is in this for you?
Tools for Dutch
- Term extraction: TermTreffer
  - what are terms
  - what can you do with terms
  - how can you find terms
  - demo

What are terms
"Terms as linguistic objects (lexemes) are defined as linguistic expressions of concepts which are typically used within a particular knowledge domain and by particular members of the linguistic community." (Willy Martin)
"Words cost you money, terms bring the money home." (Piek Vossen)

What can you do with terms
- Terms are characteristic of the text
- Terms tell you what the key concepts in the text are (and what the key concepts in the domain are)
- Terms contain the information in the text
- Terms tell you what the text is about

How can you find terms
- The most frequent words are not terms (Zipf's law)
- The most unexpected words and word groups are terms
- Those are words and word groups with a higher frequency than expected
→ for each word:
  - what is the frequency in the text?
  - what is the frequency in a reference corpus?
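
The comparison of text frequency against reference-corpus frequency can be sketched in a few lines of Python. This is only an illustration of the general idea, not how TermTreffer works: the tokenizer, the add-one smoothing, the ratio score, the `min_count` threshold and the names `tokenize` and `term_candidates` are all assumptions made for the example.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (deliberately naive)."""
    return re.findall(r"\w+", text.lower())


def term_candidates(domain_text, reference_text, min_count=2):
    """Rank words by how much more frequent they are than expected.

    'Expected' is the word's relative frequency in the reference corpus;
    'observed' is its relative frequency in the domain text.
    """
    domain = Counter(tokenize(domain_text))
    reference = Counter(tokenize(reference_text))
    n_domain = sum(domain.values())
    n_reference = sum(reference.values())
    vocabulary_size = len(set(domain) | set(reference))

    scores = {}
    for word, count in domain.items():
        if count < min_count:
            continue  # too rare in the text to judge
        observed = count / n_domain
        # Add-one smoothing (an assumption) so unseen words do not divide by zero.
        expected = (reference[word] + 1) / (n_reference + vocabulary_size)
        scores[word] = observed / expected
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy data: a "domain" text about mortgages and a general reference text.
domain_text = "de hypotheek en de rente: de hypotheek heeft een vaste rente"
reference_text = "de man en de vrouw lopen in de stad en de zon schijnt op een huis"

ranked = term_candidates(domain_text, reference_text)
for word, score in ranked:
    print(f"{word}\t{score:.2f}")
```

In this toy example "de" is the most frequent word in the text, but it is just as common in the reference corpus, so it ranks below "hypotheek" and "rente". A real extractor would also need to handle word groups and use a much larger reference corpus.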

Demo