Noriko Kando National Institute of Informatics Presented at: Roadmap for Language Resources and Evaluation In a Multilingual Environment: Organised by.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
Evaluating Search Engine
IR Challenges and Language Modeling. IR Achievements Search engines  Meta-search  Cross-lingual search  Factoid question answering  Filtering Statistical.
A Web of Concepts Dalvi, et al. Presented by Andrew Zitzelberger.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
INFO 624 Week 3 Retrieval System Evaluation
Advance Information Retrieval Topics Hassan Bashiri.
1 MARG-DARSHAK: A Scrapbook on Web Search engines allow the users to enter keywords relating to a topic and retrieve information about internet sites (URLs)
Exercise 1: Bayes Theorem (a). Exercise 1: Bayes Theorem (b) P (b 1 | c plain ) = P (c plain ) P (c plain | b 1 ) * P (b 1 )
WHAT HAVE WE DONE SO FAR?  Weeks 1 – 8 : various components of an information retrieval system  Now – look at various examples of information retrieval.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
 Official Site: facility.org/research/evaluation/clef-ip-10http:// facility.org/research/evaluation/clef-ip-10.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Claudia Marzi Institute for Computational Linguistics, “Antonio Zampolli” – Italian National Research Council University of Pavia – Dept. of Theoretical.
Machine Translation, Digital Libraries, and the Computing Research Laboratory Indo-US Workshop on Digital Libraries June 23, 2003.
Upstream & Downstream Defined Challenges Existing Models Case Studies Conclusion.
Multilingual Information Exchange APAN, Bangkok 27 January 2005
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
Domain-Specific Software Development Terminology: Do We All Speak the Same Language? Arturo Sánchez-Ruíz, University of North Florida, USA Motoshi Saeki,
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Carnegie Mellon School of Computer Science Copyright © 2001, Carnegie Mellon. All Rights Reserved. JAVELIN Project Briefing 1 AQUAINT Phase I Kickoff December.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
Aquenergy Portal Elisabetta Zuanelli, University of Rome “Tor Vergata”, Italy E-Age 2014 Muscat december.
Search Engine Architecture
Research Topics/Areas. Adapting search to Users Advertising and ad targeting Aggregation of Results Community and Context Aware Search Community-based.
Internationalization and the Workplace Jody Richards, CPIM November 14, 2005.
1SDO-based Metadata ModelDC-2004, October A Metadata Model Based on the Concept of Structured Digital Object (SDO) and Its Application in Digital.
Clarity Cross-Lingual Document Retrieval, Categorisation and Navigation Based on Distributed Services
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
KNOWLEDGE FOR LIFE Librarian 101: Subscriber Services Tom Corser, CABI’s Training Manager
KNOWLEDGE FOR LIFE Let the database do the work (CAB Direct platform) CAB Abstracts (via EbscoHost) Tom Corser, Training manager November.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
WEB PAGE CONTENTS VERIFICATION AGAINST TAGS USING DATA MINING TOOL IKNOW VІI scientific and practical seminar with international participation "Economic.
KNOWLEDGE FOR LIFE The power of the datasheet CAB Abstracts (via EbscoHost) Tom Corser, CABI Training manager June 2015.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
KNOWLEDGE FOR LIFE Let the database do the work (CAB Direct platform) CAB Abstracts (via EbscoHost) Tom Corser, Training manager November.
CIW Lesson 6MBSH Mr. Schmidt1.  Define databases and database components  Explain relational database concepts  Define Web search engines and explain.
Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.
AQUAINT AQUAINT Evaluation Overview Ellen M. Voorhees.
Evaluation of Information Retrieval Systems Xiangming Mu.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Genoa – May 23, 2006 LREC workshop From Media Crossing to Media Mining Franciska de Jong University of Twente/TNO ICT
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
WP 2: Ontology & Metadata Models ITD
Mohammad Alqahtani, Dr. Eric Atwell
Dr. LEE Jong-wook Fellowship
Far East: Patents, Trademarks and Designs
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
FAR EAST AND PATENT INFORMATION
Lesson 6: Databases and Web Search Engines
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Search Engine Architecture
Information Science in International Perspective
Q4 Measuring Effectiveness
CSE 635 Multimedia Information Retrieval
Lesson 6: Databases and Web Search Engines
Search Engine Architecture
COCOSDA/WRITE Roadmap for Language Resources and Evaluation
Information Retrieval and Web Design
Presentation transcript:

Noriko Kando National Institute of Informatics Presented at: Roadmap for Language Resources and Evaluation In a Multilingual Environment: Organised by COCOSDA and WRITE Genoa, 28 May 2006 In Conjunction with LREC 2006 Roadmap for Language Resources in the viewpoint from Information Access Technology Evaluation

Issues LR and Information Access Multi-linguality across Culture Emerging Areas: Genres, Opinion, Subjectivity, Community-based Ontology

LR and Information Access Information Access (IA)Technologies (Information Retrieval, Question Answering, Summarization, Text mining, etc) needs better LR: coverage, richness, quality. –Evaluation –Development Ex. AQUAINT (Advanced QA) Program has supported Resources (WordNet, Gazetteer, etc.), Component Modules, and QA systems.

LR and Information Access – cont’d Extrinsic (Task-based) LR Evaluation –So far LR evaluation had placed emphasis on intrinsic evaluation. Eg. Accuracy, consistency, standards, etc. –Extrinsic LR evaluation: How LR improved the effectiveness of the IA technologies? –Good ways to appeal LR’s social importance. Easy to be understood by non-experts and sponsors

LR and Information Access – cont’d LR can be enriched or created through usage in IA systems Ex. Search Engine Query logs Users’ Relevance judgments, click through, etc.

Multi-linguality Axes to characterize CLIA systems –Languages –Type of media –Tasks and users –Success criteria or relevance judgments –Document genres –Layers of CLIR technologies –Information access process [Kando 2002; Gey, Kando & Peters 2002 ]

Layers of Cross-Lingual IA Technologies; pragmatic layer: cultural & social aspects, semantic layer: concept mapping syntactic lager: lexical layer: language identify, indexing symbol layer: character codes physical layer: network [ Kando 1999; 2002; Gey, Kando & Peters 2002]

Multi-linguality in Pragmatic layer Pragmatic layer of CLIA technologies –include issues related to text structure, intra & inter- text relationship –identifying the differences of the viewpoints across the languages or cultures is also critical.

Emerging Areas Esp. Conjunction with WEB, –Heterogeneous Document Genres –Subjectivity, Opinion, Emotion, etc. –Community-based or Domain-specific Ontology –Multi-faceted Ontology –Interactivity –Multi-modal

Summary LR and Information Access Multi-linguality across Culture Emerging Areas: Genres, Opinion, Subjectivity, Community-based Ontology

ThanksMerci Danke schön Gracie Gracias Ta!Tack Köszönöm Kiitos Terima KasihKhap Khun AhsanteTak 謝謝ありがとう ThanksMerci Danke schön Gracie Gracias Ta!Tack Köszönöm Kiitos Terima KasihKhap Khun AhsanteTak 謝謝ありがとう