Mind the lexical gap - Eurovoc. Luxembourg, 18-19 November 2010. Automatic Eurovoc indexing of parliamentary documentation. Live demonstration. Victoria Fernández.


Victoria Fernández Mera, Congreso de los Diputados

Joint Research Centre automatic indexing software at the Congress of Deputies

The JRC tool was retrained on more than [...] parliamentary Spanish texts (short abstracts, manually indexed with Eurovoc versions 3 and 3.1).
On 5 June 2005, the European Community and the Congress of Deputies signed a Software License Agreement granting a free-of-charge licence for the software.
It has been the main indexing tool since November [...].
It is available from any computer with a web browser inside the Congress of Deputies.
Access requires a login and an associated password.

How does the system work?

User: a simple computer with a Web browser, working through the Web interface.
Server (Linux Fedora), nogal.congreso.es, with: Perl installed (V [...]), Oracle Client, Apache server, eic server.
Argo database: the system gets texts from it and stores the indexation in it.

From Bruno Pouliquen. Technical documentation, overview of the tool. Global architecture and requirements. (Unpublished information brochure). 17 p.

Argo: the parliamentary activities information system of the Congress of Deputies

An ORACLE database.
The information is organized in text, numerical and data fields.
It gathers information on any and all written communications submitted to the Congress of Deputies.

Types and numerical codes of parliamentary texts

Legislative initiatives:
    Government bills (121)
    Private Members' bills (122, 123, 124)
    Decree-laws (130)
    International treaties (110, 111, 112)
Control of the Executive:
    Granting and withdrawal of confidence:
        Investiture of the Government (80)
        Censure motions (82)
        Question of confidence (81)
    Checking on the Government's performance:
        Interpellations and motions (161, 162, 170, 171, 172, 173)
        Oral and written questions (180, 181, 184)
    Attendances:
        Members of Government (210, 213, 214)
        Other authorities (212, 219)
Government communications, programmes, plans and other reports
Nominations and appointments of high-ranking officials to certain State bodies
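
The code list above can be read as a simple lookup from numerical code to text type. The following Python sketch is only an illustration built from the codes on the slide; the names `TEXT_TYPES`, `CODE_TO_TYPE` and `text_type` are hypothetical and are not part of the Congress software.

```python
# Hypothetical lookup table, transcribed from the codes listed on the slide.
TEXT_TYPES = {
    "Government bills": (121,),
    "Private Members' bills": (122, 123, 124),
    "Decree-laws": (130,),
    "International treaties": (110, 111, 112),
    "Investiture of the Government": (80,),
    "Question of confidence": (81,),
    "Censure motions": (82,),
    "Interpellations and motions": (161, 162, 170, 171, 172, 173),
    "Oral and written questions": (180, 181, 184),
    "Attendances: members of Government": (210, 213, 214),
    "Attendances: other authorities": (212, 219),
}

# Invert the table so that each numerical code points to its text type.
CODE_TO_TYPE = {
    code: name for name, codes in TEXT_TYPES.items() for code in codes
}


def text_type(code: int) -> str:
    """Return the text type for a numerical code, or a fallback label."""
    return CODE_TO_TYPE.get(code, "unknown code")
```

For example, `text_type(121)` returns `"Government bills"`, and a code that is not on the slide returns `"unknown code"`.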

Eurovoc at the Argo database

Main indexing language since 1987.
Eurovoc official edition.
Spanish geographical application.
Short abstracts or titles are indexed.
Descriptors are only assigned to the one document that starts the procedure in the House.
An average of three descriptors is assigned.

Welcome page

Clicking on "Index a Congreso text", we are ready to index.

Document indexing page (type in the numerical code of the texts to index)

Indexation interface

The system always displays all the texts that have not been indexed yet.
Clicking on the box "ready to index", we go to the validation interface.
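
The behaviour described here (list every text of a given code that has no descriptors yet) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the `ParliamentaryText` record, its field names and the `pending_texts` function are hypothetical and do not reflect the actual Argo schema or the JRC software.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ParliamentaryText:
    """Hypothetical stand-in for one Argo record (field names are assumed)."""
    text_id: str
    code: int                      # numerical code of the text type
    abstract: str                  # short abstract or title that gets indexed
    descriptors: List[str] = field(default_factory=list)


def pending_texts(texts: List[ParliamentaryText], code: int) -> List[ParliamentaryText]:
    """Return the texts with the given code that have no descriptors yet."""
    return [t for t in texts if t.code == code and not t.descriptors]
```

Given a list of records, `pending_texts(records, 121)` would return only the Government bills that still await indexing.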

Validation interface

It displays a ranked list of 30 descriptors.
The assigned descriptors are ranked by their score.
Tick the corresponding box to choose the good descriptors.
Clicking on the link below Id, the browser shows all the thesaurus relations of the descriptor.
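
The ranking and validation steps can be sketched as follows. This is an illustration only: how the JRC software computes its scores is not described in the slides, so the scores are taken as given, and the function names `rank_descriptors` and `validate` are hypothetical.

```python
from typing import Dict, List, Set, Tuple


def rank_descriptors(scores: Dict[str, float], limit: int = 30) -> List[Tuple[str, float]]:
    """Sort candidate descriptors by score, highest first, and keep the top `limit`."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked[:limit]


def validate(ranked: List[Tuple[str, float]], ticked: Set[str]) -> List[str]:
    """Keep only the descriptors the human indexer ticked, in ranked order."""
    return [descriptor for descriptor, _score in ranked if descriptor in ticked]
```

The default `limit=30` mirrors the 30 descriptors shown in the interface; the final choice always stays with the indexer, which is what `validate` models.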

Thesaurus relations of a descriptor

Look for new descriptors (in the box "Add a new descriptor", type a Eurovoc descriptor code, if known, or plain text)

Look for new descriptors (the box "search for …. in Eurovoc" lets us look for new descriptors and browse the thesaurus online)

To display geographical descriptors, click on the button "show INE"

Clicking on "some additional administrative tools here" opens a new interface that performs several functions

Clicking on "Add documents", the system is ready to plan text indexation

Planning indexation (this interface summarizes the codes to be indexed)

Conclusions

The software is able to assign keywords from a controlled language.
It achieves a high average of correct descriptors among the first 10 assigned.
It can be retrained continuously to assign new descriptors.
It is a reliable system.
It gives a list of Eurovoc descriptors, which have to be validated by the human indexers.
So, we can define it as a good automatic assignment tool to help and support the indexers' work.
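
The claim about correct descriptors among the first 10 assigned corresponds to what is usually called precision at k. The slides do not say how the figure was measured, so the sketch below is an assumption about one common way to compute it; `precision_at_k` is a hypothetical helper, not part of the JRC tool.

```python
from typing import List, Set


def precision_at_k(ranked: List[str], validated: Set[str], k: int = 10) -> float:
    """Share of the first k ranked descriptors that the indexers accepted."""
    top = ranked[:k]
    if not top:
        return 0.0
    hits = sum(1 for descriptor in top if descriptor in validated)
    return hits / len(top)
```

Averaging this value over many indexed texts would give the kind of "average of correct descriptors among the first 10" that the conclusion refers to.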