RECENT TRENDS IN METADATA GENERATION

Slides:



Advertisements
Similar presentations
E-learning and Libraries WSIS Forum, Geneva,11 May 2010 Tullio Basaglia, CERN Scientific Information Service, Geneva.
Advertisements

DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Chapter 5: Introduction to Information Retrieval
Knowledge Portal: An Innovative Approach to Libraries Presented at NACLIN New Delhi By Sharad Kumar Sonker Department of Lib. & Info. Sci. Babasaheb Bhimrao.
CNRIS CNRIS 2.0 Challenges for a new generation of Research Information Systems.
CZECH STATISTICAL OFFICE | Na padesatem 81, Prague 10 | Jitka Prokop, Czech Statistical Office SMS-QUALITY The project and application.
Information and Business Work
Search Engines and Information Retrieval
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
© Tefko Saracevic, Rutgers University1 digital libraries and human information behavior Tefko Saracevic, Ph.D. School of Communication, Information and.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Search Engines and Information Retrieval Chapter 1.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
LIS510 lecture 3 Thomas Krichel information storage & retrieval this area is now more know as information retrieval when I dealt with it I.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
DL.org All WGs Meetings, Rome, May 2010 Quality Interoperability Approaches, case studies and open issues DL.org Quality Working Group Rome, 28 th.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
P. Schirmbacher Humboldt-Universität zu Berlin The Changing Process of Scholarly Publishing or the Necessity of a New Culture of Electronic.
Task-oriented approach to information handling support within web-based education Lora M. Aroyo 15 November 2001.
Dec 3 rd, 2004STC 6 th Annual Conference API Documentation Trends and Opportunities Rajeev Jain
Company LOGO Digital Infrastructure of RPI Personal Library Qi Pan Digital Infrastructure of RPI Personal Library Qi Pan.
Digital Libraries Lillian N. Cassel Spring A digital library An informal definition of a digital library is a managed collection of information,
ProjFocusedCrawler CS5604 Information Storage and Retrieval, Fall 2012 Virginia Tech December 4, 2012 Mohamed M. G. Farag Mohammed Saquib Khan Prasad Krishnamurthi.
Nikola Tesla Museum Clipping Library Saša Malkov Nenad Mitić Žarko Mijajlović 3 rd SEEDI Int.Conf. Cetinje, Montenegro 14. September 2007.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Towards a Reference Quality Model for Digital Libraries Maristella Agosti Nicola Ferro Edward A. Fox Marcos André Gonçalves Bárbara Lagoeiro Moreira.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.
A System for Automatic Personalized Tracking of Scientific Literature on the Web Tzachi Perlstein Yael Nir.
IR&NLP Coursework P1 Text Analysis Within The Fields Of Information Retrieval and Natural Language Processing By Ben Addley Academic Year 2004.
DELOS Network of Excellence on Digital Libraries Yannis Ioannidis University of Athens, Hellas Digital Libraries: Future Research Directions for a European.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
© NCSR, Frascati, July 18-19, 2002 CROSSMARC big picture Domain-specific Web sites Domain-specific Spidering Domain Ontology XHTML pages WEB Focused Crawling.
Role of Metadata in dissemination of census data Regional Seminar on dissemination and spatial analysis of census data, Nairobi, September, 2010.
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
Information Retrieval in Practice
Towards a framework for architectural design decision support
TRSS Terminology Registry Scoping Study
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Dependency Management
Towards more flexibility in responding to users’ needs
The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
The Systems Engineering Context
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Active Data Management in Space 20m DG
Basic Microsoft Word 2013.
MANAGEMENT INFORMATION SYSTEM MEHTAP PARLAK Industrial Engineering Department, Dokuz Eylul University, Turkey 1.
Version 3 April 21, 2006 Takahiro Yamada (JAXA/ISAS)
Outline Pursue Interoperability: Digital Libraries
Basics of Drupal for Researchers
XML Based Interoperability Components
MSDs and combined metadata reporting
Measuring Data Quality and Compilation of Metadata
The new metadata structure & Country Specific Notes
Introduction to Information Retrieval
Malte Dreyer – Matthias Razum
Metadata The metadata contains
Márton Németh – László Drótos How to catalogue a web archive?
digital libraries and human information behavior
The role of metadata in census data dissemination
Teacher Name: School Name: Language:
Presentation transcript:

RECENT TRENDS IN METADATA GENERATION Milena Dobreva, Nikola Ikonomov IMI-BAS SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 Warm-up question: Do we need this talk? We are all metadata experts! SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 A Metadata Metaphor? We know there are Many standards Too many ad-hoc solutions We even know how to use some of them (or at least which is the right one for our project) BUT we typically do not know How to save time and human effort in creating and editing metadata? SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 The current picture We can not avoid looking for answers to the question of saving time/effort, because We live in the time of data deluge The number of digitally born objects grows rapidly i.e. the demand for metadata and quality grows SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 Metadata in the Digital Library Context: the DELOS project reference model SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 Metadata seems to be part only of the CONTENT, but it influences all core concepts Content is the entry point for all the concepts related to the content that is managed and disseminated by the DL e.g. collections, information space model, metadata, ontologies; User is the root for concepts like roles, communities, profiles, etc., that represent aspects of the DL users; Functionality is the entrance to that part of the model which concerns DL functions; Architecture regards software components, hosting nodes and how these are linked and constrained; Quality groups qualitative parameters characterizing the digital library behavior within a given operational domain; Policy covers all the concepts that are related to established procedures or plans of actions governing the DL, such as collection management, preservation, access rights, etc. SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 Definitions Recall: proportion of relevant documents, which are retrieved out of all relevant documents; Precision: proportion of retrieved and relevant documents; Accuracy: denotes the quantity of retrieved docs which are matching exactly the topic. SEEDI Conference, Cetinje, September 2007

Automatic extraction of metadata A group of NLP methods – text analysis aimed at extraction of specific metadata elements Various elements Measurement: through information retrieval measures (accuracy, recall, precision) SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 Current research SEEDI Conference, Cetinje, September 2007

Current research (cont’d) SEEDI Conference, Cetinje, September 2007

Current research (cont’d) SEEDI Conference, Cetinje, September 2007

SEEDI Conference, Cetinje, September 2007 Conclusions These tools are all used for processing of English texts – the Balkan languages impose more challenges The quality of achieved results is not high enough yet, but this is a field of active work Integration of image and text processing is another direction for future work. SEEDI Conference, Cetinje, September 2007

Thank you for your attention! SEEDI Conference, Cetinje, September 2007