Download presentation
Presentation is loading. Please wait.
Published byAmi Tyler Modified over 9 years ago
1
Term Extraction with the Steunpunt Nederlandstalige Terminologie Dutch Terminology Service Centre Hennie van der Vliet Vrije Universiteit Amsterdam h.d.vander.vliet@vu.nl steunpunt@let.vu.nl
2
roadmap -The Steunpunt What is the Steunpunt What does the Steunpunt do -Term extraction What is it What can it do for you How does it work Demo
3
Steunpunt Nederlandstalige Terminologie -CrossLang, Gent -VU-University Hennie van der Vliet, Anneleen Schoen - 1.0 full time equivalent
4
what is the Steunpunt -founded by NTU, Dutch Language Union -Dutch as a language for business, technology, politics, science Core business Serving the user: motivating, infrastructure, know-how Priorities:practical needs Main problem: unknown, unloved, time consuming
5
what is the Steunpunt -a website www.nedterm.nlwww.nedterm.nl - termlists, education, webcourse, agenda... -a counter steunpunt@vu.nlsteunpunt@vu.nl -a quarterly newsletter -the TiNT-congress -workshops
6
The Dutch TSC mission statement There is a gap between theoretical terminology and the possibilities, skills and needs of workers in the field. Our aim is bridging the gap and making terminology management possible, to help the users to help themselves.
7
The Dutch TSC mission statement There is a gap between theoretical terminology and the possibilities, skills and needs of workers in the field. Our aim is bridging the gap and making terminology management possible, to help the users to help themselves. -Providing tools for terminology management for Dutch -Develop a methodology to work with these tools in an efficient way -Organize workshops etc.
8
plan of action: tools and infrastructure Language specific tools to be developed by the NTU - term extractor (TermTreffer) - term management (TermBeheer) - implemented in a work bench Presupposes corpus based terminology - know-how, basic tools
9
plan of action: collecting data Future plans: Collecting terminological data (term bases) -not much available -hard to collect -not always reliable -variation in formats, data structures, etc. the Dutch terminology workbench will open up opportunities
10
What is in this for you? Tools for Dutch -Term extraction: TermTreffer what are terms what can you do with terms how can you find terms demo
11
What are terms Terms as linguistic objects (lexemes) are defined as linguistic expressions of concepts which are typically used within a particular knowledge domain and by particular members of the linguistic community Willy Martin Words cost you money, terms bring the money home Piek Vossen
12
What can you do with terms Terms are characteristic for the text Terms tell you what the key concepts in the text are (and: what the key concepts in he domain are) Terms contain the information in the text Terms tell you what the text is about
13
How can you find terms The most frequent words are not terms (Zipf’s law) The most unexpected word(group)s are terms Those are words and wordgroups with a higher frequency than expected for each word –what is the frequency in the text? –what is the frequency in a reference corpus?
14
Demo http://www.termtreffer.nl
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.