GEMET, the General Environmental Multilingual Thesaurus: Development, User Perspectives and Plans for a Thesaurus System Part 1: overhead presentation.

Slides:



Advertisements
Similar presentations
Step 1 Start your web browser (Internet Explorer or Firefox). Step 2 Type: in the Address box Step 3 Press Enter on the keyboard.
Advertisements

GEMET human and machine readable interfaces WIKTIONARY Stefan Jensen, EEA, Copenhagen.
European Schoolnet ETB IST European Schools Treasury Browser ‘ETB’
Reportnet Reportnet is a system of integrated IT tools and business processes creating a shared information infrastructure optimised to support European.
CESSDA Question Databank Tender, results and future Maarten Hoogerwerf, CESSDA expert seminar 2009.
Larry Fitzwater and Linda Spencer September 29, 1999 SDC JE-1032.
Sakaibrary in 2.4: User Feedback Guides Development Jon Dunn and Mark Notess Digital Library Program Indiana University.
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
Features and Uses of a Multilingual Full-Text Electronic Theses and Dissertations (ETDs) System Yin Zhang Kent State University Kyiho Lee, Bumjong You.
IAEA International Atomic Energy Agency INIS Collection Search: Introduction and main features INIS Training Seminar 7-11 October 2013, Vienna Domenico.
STARDAT DATA ARCHIVING SUITE European Survey Research Association (ESRA), July 18 – 22, 2011, Lausanne, Switzerland Monika Linne, Evelyn Brislinger, Wolfgang.
PRIME MINISTRY REPUBLIC OF TURKEY TURKISH STATISTICAL INSTITUTE Foreign Relations Department USST Programme Phase I.
IES - Institute for Environment and Sustainability SDIU – Spatial Data Infrastructures Unit Ispra - Italy
1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System How to organize the information and make it accessible and useful ? Li-Yan Yuan.
Matthias Menger, DC-2001, Tokyo / Japan European Environment Agency Management of Environmental Information in the European Information and Observation.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
The Internet. What is the Internet? A community with about 100 million users Available in almost every country about 160,000 people are added each month.
Greenstone Digital Library Usage and Implementation By: Paul Raymond A. Afroilan Network Applications Team Preginet, ASTI-DOST.
 Speed  Cost  Compatibility with existing H/W and other S/W  Ability to import other files  Quality of documentation  Ease of learning and ease of.
The UDK: The Environmental Data Catalog of Germany and Austria Dr. Fred Kruse Coordination Center UDK/GEIN.
Development Principles PHIN advances the use of standard vocabularies by working with Standards Development Organizations to ensure that public health.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
Lecturer: Ghadah Aldehim
The Internet. What is the Internet?  The Internet is a network of networks.  It gives users access to a wide variety of information from millions of.
INFOBALT, October 22, 2004, Vinius IST4Balt project information dissemination using web-based knowledge systems Zigmas Bigelis EU projects consultant Asociation.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
The OpenAIRE Project Open Access Infrastructure for Research in Europe Stefania Biagioni, Donatella Castelli, Paolo Manghi CNR - ISTI GL11 - Library of.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
Data archive in developing countries: preservation and dissemination of microdata as an instrument for better development results Olivier Dupriez Senior.
- is a global system of interconnected computer networks that use the standard internet protocol suite to serve billions of users worldwide. INTERNET.
Validating, Promoting, & Publishing Your Web Site Writing For the Web The Internet Writer’s Handbook 2/e.
SMRs, Users and the Web Kate Fernie HEIRNET. SMRs on the Web Finding your site Search engines Using your site.
LOGOS GROUP From Translation Memory to Authoring Memory November, 2002.
Monitoring public satisfaction through user satisfaction surveys Committee for the Coordination of Statistical Activities Helsinki 6-7 May 2010 Steve.
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
The CERA2 Data Base Data input – Data output Hans Luthardt Model & Data/MPI-M, Hamburg Services and Facilities of DKRZ and Model & Data Hamburg,
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
World Wide Web Library 150 Week 8. The Web The World Wide Web is one part of the Internet. No one controls the web Diverse kinds of services accessed.
APAN AG-WG Bangkok Food and Agriculture Organization of the UN Library and Documentation Systems Division Margherita Sini Slide Sustainable.
Search Tools and Search Engines Searching for Information and common found internet file types.
Econ-GI Frankfurt Book Fair 11-Oct-01 Econ-GI Economic Approaches Unlocking Geographic Information in the Public.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
Shawn Jones INDUS Corporation January 18, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2029.
IA Tools to Inform IA Summit 2003 Madonnalisa G. Chan.
User-Driven Integrated Statistical Solutions: Government for the People by the People Open Forum on Metadata Registries Santa Fe, New Mexico January 20,
IAEA International Atomic Energy Agency INIS Collection Search: Introduction and main features The Role of the International Nuclear Information System.
SDC JE-2031 Linda Spencer U.S. EPA January 19, 2000 Open Forum on Metadata Registries Santa Fe, NM.
The World Wide Web: Information Resource. How a Search Engine works… How Search Works - YouTube
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
Terminology Components for Ecoinformatics Sharing Gail Hodge Consultant to USGS BIO/NBII Information International Associates, Inc. 28 January 2004 science.
Usability Lab 2002 Cascade Kick-Off Meeting User Requirements - Web Site Design Multimedia Interface to Material Databases Flavio Fontana (Ulab)
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Annotation of Multimedia Documents. Approaches to Cooperation and Personalization. Annotation System January 1998
Freedom by design Company Presentation to ICOLC October 5, 2001 Matt Goldner Executive Vice President Fretwell-Downing, Inc.
Web Design Terminology Unit 2 STEM. 1. Accessibility – a web page or site that address the users limitations or disabilities 2. Active server page (ASP)
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
ACSIUS Technologies Pvt. Ltd. Tomorrow’s Success Starts Today!
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Work Package 2 Eva Heckl 1/10/2014 Feasibility study on an internet-based e-platform for women entrepreneurs.
The Web Web Design. 3.2 The Web Focus on Reading Main Ideas A URL is an address that identifies a specific Web page. Web browsers have varying capabilities.
SENSE project – towards a next (second) part Stefan Jensen - head - SEIS and SDI group.
Distributed Control and Measurement via the Internet
JavaScript and Ajax (Internet Background)
CNIT 131 Internet Basics & Beginning HTML
Teacher: Alison Roberts Northern Sydney Institute of TAFE

Accessing OECD Statistics WPFS December 2010
Institute for Environment and Sustainability
The Internet and Electronic mail
Presentation transcript:

GEMET, the General Environmental Multilingual Thesaurus: Development, User Perspectives and Plans for a Thesaurus System Part 1: overhead presentation Bruno Felluga, CNR - Consiglio Nazionale delle Richerche, Rome, Italy Part 2: slide show Stefan Jensen, project leader ETC/CDS, Lower Saxony Ministry of the Environment Hannover, Germany Open Forum on Metadata Registries, Santa Fe, NM January 20, 2000 E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY

Outline GEMET presentation - part 2 “linking terminology and applications” E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY GEMET activities performed by the ETC/CDS - co-ordination of the thesaurus development - GEMET usage for indexing and retrieving environmental metadata - development of application around GEMET - assessing 3rd party user needs to incorporate into future developments

Co-ordination of the development E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY - Encouraging and co-ordinating the translation of the core terminology into 12 languages - Contracting application development around GEMET - implement shared coding lists (value domains) - Promoting the use of GEMET through marketing activities - Distributing GEMET and supplying technical helpdesk

GEMET - usage for indexing metainformation E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY GEMET has been used in the work of EEA to index metadata from the following resources: - The Directory of Information Resources (DIR) - The Reporting Obligation Database (ROD) to do this, 2 applications were developed: - MS-Access based tool for metadata registry (WinCDS) - Webbased JAVA tool for online registration(prototype)

Thesaurus part of WinCDS E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY

JAVA based online registration - the indexing E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY

GEMET - usage for indexing metainformation E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY Directory of Information Resources Total dataset sum:931 Controlled terms in use: 655 of ~5300 Total descriptors sum:4714 (GEMET terms used for indexing) Term ranking: 121 of 655 terms have been used more than 10 times

E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY Terms used more than 60 times

DIR : term ranking

E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY Reporting Obligations Database ROD prototype Questions Datasets total: Controlled terms in use: 323 of 5300 have been used times Top ranking: Term ‚atmospheric emissions‘ has been used times Sources Datasets total: 42 Controlled terms in use: 65 of 5300 have been used 256 times GEMET - usage for indexing metainformation

ROD questions: term ranking

between 10 and 33 ROD sources: term ranking

GEMET - usage to browse and retrieve metadata E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY GEMET is used to browse or retrieve metadata within 5 applications: The ThesShow GEMET browser The WebCDS accessing the DIR via HTML The WebCDS accessing the DIR via JAVA applets The multilingual search service (MSS) The Reporting Obligation Database (ROD)

JAVA based thesaurus browser for WebCDS E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY

The Multilingual Search Service (MSS) E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY Motivation –distributed multilingual document collections of European institutions (European Environment Agency, EEA) –support query formulation in user‘s native language –search and retrieve documents in all understandable languages Approach of EEA‘s Multilingual Search Service (MSS) –thesaurus support for query formulation (domain specific thesaurus required, e.g. GEMET) –translation by making use of multilinguality of theseaurus (GEMET is available in 12 languages) –use translations as input for off-the-shelf Web search engine (e.g., Netscape Compass Server)

Using a term for searching metadata E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY

Search results from websites within the EERC E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY

E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY Questionnaire about GEMET usage Goals: - learn more about current users - get guidance from usage for future development Process: - the current ~200 GEMET users from all over the world have been addressed by - the 2 page (+annexes) questionnaire was made available digitally and as a form on internet - Survey was performed in November and December 1999

E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY 42% translation 33% indexing 22% retrieval 56% translation 22% indexing/retrieval 100% translation Areas and frequency of GEMET usage

% Current usage of languages in GEMET

Evaluation of the thesaurus content %

Usage of the GEMET browser ThesShow %

E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY GEMET - conclusions from the own indexing experience and the questionnaire General guidelines: - The GEMET content should remain stable, minor improvements are justified - There is a need to add new functionalities to the tools to allow the user to customise an own “thesaurus system”

Contact information E UROPEAN T OPIC C ENTRE ON C ATALOGUE OF D ATA S OURCES (ETC/CDS) E UROPEAN E NVIRONMENT A GENCY URL: or