Co-occurrence and place name disambiguation. GeoCLEF 2006 Simon Overell João Magalhães Stefan Rüger.

Slides:



Advertisements
Similar presentations
A probabilistic model for retrospective news event detection
Advertisements

Towards Methods for the Collective Gathering and Quality Control of Relevance Assessments SIGIR´09, July 2009.
Date : 2013/09/17 Source : SIGIR’13 Authors : Zhu, Xingwei
Search Results Need to be Diverse Mark Sanderson University of Sheffield.
Leveraging Community-built Knowledge For Type Coercion In Question Answering Aditya Kalyanpur, J William Murdock, James Fan and Chris Welty Mehdi AllahyariSpring.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Evaluating Evaluation Measure Stability Authors: Chris Buckley, Ellen M. Voorhees Presenters: Burcu Dal, Esra Akbaş.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Dissemination and Critical Evaluation of Published Research Peg Bottjen, MPA, MT(ASCP)SC.
A Combinatorial Fusion Method for Feature Mining Ye Tian, Gary Weiss, D. Frank Hsu, Qiang Ma Fordham University Presented by Gary Weiss.
1 Lecture 5: Automatic cluster detection Lecture 6: Artificial neural networks Lecture 7: Evaluation of discovered knowledge Brief introduction to lectures.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
An Empirical Study of Instance-Based Ontology Mapping Antoine Isaac, Lourens van der Meij, Stefan Schlobach, Shenghui Wang funded by NWO Vrije.
By ANDREW ZITZELBERGER A Framework for Extraction Ontology Based Information Management.
THIS IS THE TITLE OF MY POSTER. THIS IS THE TITLE OF MY POSTER NAMES, NAMES, & NAMES This is my abstract. This is my abstract. This is my abstract. This.
Presented by Zeehasham Rasheed
Multi-Concept Alignment and Evaluation Shenghui Wang, Antoine Isaac, Lourens van der Meij, Stefan Schlobach Ontology Matching Workshop Oct. 11 th, 2007.
ISP 433/633 Week 6 IR Evaluation. Why Evaluate? Determine if the system is desirable Make comparative assessments.
Dr. Erica Cirillo-McCarthy Assistant Director of Graduate and ADEP Writing The California Lutheran University Writing Center.
Bloom’s Critical Thinking Level 1 Knowledge Exhibits previously learned material by recalling facts, terms, basic concepts, and answers.
How to Critically Review an Article
(TOPIC NAME) Scientific Writing. Importance Scientists communicate their findings to the scientific community by publishing their experimental results.
Today’s presentation Introduction – Morten Analysis – Lars Implementation – Umachanger Demonstration – Nirojan Closure - Stephan.
Unit 1 Accuracy & Precision.  Data (Singular: datum or “a data point”): The information collected in an experiment. Can be numbers (quantitative) or.
A hybrid method for Mining Concepts from text CSCE 566 semester project.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Morpho Challenge competition Evaluations and results Authors Mikko Kurimo Sami Virpioja Ville Turunen Krista Lagus.
Intent Subtopic Mining for Web Search Diversification Aymeric Damien, Min Zhang, Yiqun Liu, Shaoping Ma State Key Laboratory of Intelligent Technology.
Dr Jamal Roudaki Faculty of Commerce Lincoln University New Zealand.
Automatic Syllabus Classification JCDL – Vancouver – 22 June 2007 Edward A. Fox (presenting co-author), Xiaoyan Yu, Manas Tungare, Weiguo Fan, Manuel Perez-Quinones,
Title of Presentation Name Date 2011 Presentation Template.
Question Answering over Implicitly Structured Web Content
Iterative Translation Disambiguation for Cross Language Information Retrieval Christof Monz and Bonnie J. Dorr Institute for Advanced Computer Studies.
Semantic v.s. Positions: Utilizing Balanced Proximity in Language Model Smoothing for Information Retrieval Rui Yan†, ♮, Han Jiang†, ♮, Mirella Lapata‡,
Language Model in Turkish IR Melih Kandemir F. Melih Özbekoğlu Can Şardan Ömer S. Uğurlu.
Warmup 1)Peer edit the procedures you completed for homework. 2) Talk within your group and decide on a procedure you’ll all follow.
Ranking Related Entities Components and Analyses CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Iterative Translation Disambiguation for Cross-Language.
Automating Readers’ Advisory to Make Book Recommendations for K-12 Readers by Alicia Wood.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
FILTERED RANKING FOR BOOTSTRAPPING IN EVENT EXTRACTION Shasha Liao Ralph York University.
INAOE at GeoCLEF 2008: A Ranking Approach based on Sample Documents Esaú Villatoro-Tello Manuel Montes-y-Gómez Luis Villaseñor-Pineda Language Technologies.
Title Authors Introduction Text, text, text, text, text, text Background Information Text, text, text, text, text, text Observations Text, text, text,
Learning in a Pairwise Term-Term Proximity Framework for Information Retrieval Ronan Cummins, Colm O’Riordan Digital Enterprise Research Institute SIGIR.
 Nonfiction text is…  Writing that tells about actual people, events, places, or things.  Sometimes called informational or expository text.
THE WRITING PROFICIENCY. Basic Information  It is an every changing test  Many changes in the past-got rid of narrative  Next year the test will be.
Rule-Based Method for Entity Resolution IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING JANUARY 2015.
People and Families of the Bible Nathan Friedly. Overview Introduction Key Ideas Description and use Deliverables Demonstration Conclusion.
Multi-objective evolutionary generation of mamdani fuzzy rule-based systems based on rule and condition selection International Workshop On Genetic And.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Enhancing Text Clustering by Leveraging Wikipedia Semantics.
ANAPHORA RESOLUTION SYSTEM FOR NATURAL LANGUAGE REQUIREMENTS DOCUMENT IN KOREAN 課程 : 自然語言與應用 課程老師 : 顏國郎 報告者 : 鄭冠瑀.
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
Formal Report Strategies. Types of Formal Reports Informational Presents Info Analytical Presents Info Analyses info and draws conclusions Recommendation.
BSA 375 Week 1 Individual Software Development Presentation Create an annotated or narrated 7- to 10-slide presentation that includes the following: An.
<Student’s name>
TITLE Authors Institution RESULTS INTRODUCTION CONCLUSION AIMS METHODS
إعداد د/زينب عبد الحافظ أستاذ مساعد بقسم الاقتصاد المنزلي
مناهــــج البحث العلمي
Put your name here Name of the Department, School or College
Boldly going where no ROSE has gone before.
Put your name here Name of the Department, School or College
A Framework for Benchmarking Entity-Annotation Systems
Put your name here Name of the Department, School or College
Functions and Relations
Title Introduction: Discussion & Conclusion: Methods & Results:
Question: Think Talk Idea Talk Idea Talk Idea Notes: We Understand.
Insert case title © Bloomsbury Publishing plc
PowerPoint Presentations
Presentation transcript:

Co-occurrence and place name disambiguation. GeoCLEF 2006 Simon Overell João Magalhães Stefan Rüger

Introduction The Problem –Place name disambiguation The Structure of this Talk –Methods of place name disambiguation –Previous work –Our Geographic Information Retrieval System –Results –Conclusions and Future work

Disambiguation Rule-Based Methods –Based on heuristics Data Driven –Require a large annotated corpus Hybrid (Bootstrapping) Methods –Semi-supervised methods requiring only a small annotated corpus

Disambiguation

Previous Work Identifying and grounding descriptions of places* –Demonstrated and evaluated a hierarchical rule based method of place-name disambiguation –Used Wikipedia as a corpus –Proposed using co-occurrence for place-name disambiguation *published at Workshop on Geographic Information Retrieval SIGIR 2006

Rule-based Disambiguation

Co-occurrence model Disambiguate Place tupples based on the likelihood of a co-occurrence as reflected in the generated model

Co-occurrence model

Co-occurrence based Disambiguation

Results Runs –Title, Description and Narrative –Title and Description Mean Average Precision TDNTDWorstQ1MedianQ3Best 19.53%16.49%4%15.64%21.62%24.59%32.23%

Summary Conclusions –The system model is valid –Co-occurrence models are a viable method of place name disambiguation –The accuracy of the model agrees with previous experiments –Further tuning is needed! Future Work –Larger scale co-occurrence model evaluation Larger co-occurrence model Compare multiple methods of applying the model

Co-occurrence and place name disambiguation. Simon Overell João Magalhães Stefan Rüger