Disambiguating Queries for Geographic Information Retrieval Carolyn Hafernik Thesis Proposal May 10, 2006 Computer Science Advisor: Lisa Ballesteros.



Information Retrieval (IR)
What are the goals of an IR system?
What is a relevant document?
How does one determine which documents are relevant?
How are IR systems evaluated?
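
The slide leaves the evaluation question open. One common answer in IR is to compare the documents a system returns against a set of documents judged relevant, using precision, recall, and average precision. The sketch below is only an illustration of those standard measures; the function names and the document IDs are made up for the example and do not come from the proposal.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: document IDs returned by the system (assumed to have no duplicates)
    relevant:  document IDs judged relevant for the query
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


def average_precision(ranked, relevant):
    """Average of the precision values at each rank where a relevant document appears."""
    relevant_set = set(relevant)
    hits = 0
    total = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant_set:
            hits += 1
            total += hits / rank
    return total / len(relevant_set) if relevant_set else 0.0


# Hypothetical ranking and relevance judgments, for illustration only.
ranked = ["d3", "d7", "d1", "d9"]
relevant = {"d3", "d1", "d5"}

print(precision_recall(ranked, relevant))
print(average_precision(ranked, relevant))
```

Averaging `average_precision` over all topics in a test collection gives mean average precision, a single number that makes two system configurations easy to compare.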

Geographic Information Retrieval (GIR)
GIR is an extension of IR
It aims to use geospatial information to help improve retrieval effectiveness
What makes GIR challenging?
  Poor query specification
  Ambiguity of language
  No central repository for geospatial information

Geospatial Information
Map from
Locations
Population statistics
Name variations
Nearby landmarks
How can geospatial information be used to increase retrieval effectiveness given a query?
Example query: “Hiking near the Bay Area”
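
A minimal sketch of how the kinds of geospatial information listed on this slide might be stored and matched against the example query. The `GazetteerEntry` class, its field names, the `find_places` helper, and the sample entry are all assumptions made for illustration; the proposal does not specify a gazetteer design.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class GazetteerEntry:
    """One place record; the fields mirror the slide's list (hypothetical layout)."""
    name: str
    latitude: float                      # location
    longitude: float
    population: Optional[int] = None     # population statistics (left unset here)
    name_variations: List[str] = field(default_factory=list)
    nearby_landmarks: List[str] = field(default_factory=list)


# Illustrative entry only; the coordinates are approximate.
GAZETTEER = [
    GazetteerEntry(
        name="Bay Area",
        latitude=37.8,
        longitude=-122.3,
        name_variations=["San Francisco Bay Area", "SF Bay Area"],
        nearby_landmarks=["San Francisco", "Oakland", "Mount Tamalpais"],
    ),
]


def find_places(query, gazetteer):
    """Return entries whose name or a name variation occurs in the query text."""
    text = query.lower()
    matches = []
    for entry in gazetteer:
        names = [entry.name] + entry.name_variations
        if any(name.lower() in text for name in names):
            matches.append(entry)
    return matches


for place in find_places("Hiking near the Bay Area", GAZETTEER):
    print(place.name, place.nearby_landmarks)
```

Simple substring matching is used only to keep the sketch short; it would not resolve ambiguous place names, which is the harder problem the proposal is concerned with.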

Sample GeoCLEF 2005 Topics
<top>
GC001
C084
Shark Attacks off Australia and California
Documents will report any information relating to shark attacks on humans.
Identify instances where a human was attacked by a shark, including where the attack took place and the circumstances surrounding the attack. Only documents concerning specific attacks are relevant; unconfirmed shark attacks or suspected bites are not relevant.
Shark Attacks
near
Australia
California
</top>
<top>
GC004
C126 -
Actions against the fur industry in Europe and the U.S.A.
Find information on protests or violent acts against the fur industry.
Relevant documents describe measures taken by animal right activists against fur farming and/or fur commerce, e.g. shops selling items in fur. Articles reporting actions taken against people wearing furs are also of importance.
Animal Rights Actions against the fur industry
in
Europe
United States
</top>
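
Each sample topic ends with a thematic concept, a spatial relation, and one or more locations. A minimal sketch of holding those parts separately, so that the geographic component can be processed on its own, might look as follows. The `GeoTopic` class, its field names, and `split_topic` are hypothetical and are not the GeoCLEF field names.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class GeoTopic:
    """Parts of one topic; field names are illustrative assumptions."""
    topic_id: str
    title: str
    concept: str
    spatial_relation: str
    locations: List[str]


def split_topic(topic: GeoTopic) -> Tuple[str, List[Tuple[str, str]]]:
    """Return the thematic concept and a (relation, location) pair per location."""
    geographic = [(topic.spatial_relation, place) for place in topic.locations]
    return topic.concept, geographic


# Values copied from the first sample topic on the slide.
shark_topic = GeoTopic(
    topic_id="GC001",
    title="Shark Attacks off Australia and California",
    concept="Shark Attacks",
    spatial_relation="near",
    locations=["Australia", "California"],
)

concept, geographic = split_topic(shark_topic)
print(concept)
print(geographic)
```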

Previous Work
GeoCLEF 2005
Common approaches
  Places to store information
  Named Entity Recognition
  Query Expansion
  Traditional IR approaches

Hypothesis
My hypothesis is that using geospatial information to expand each query and to re-weight its geospatial components will improve retrieval effectiveness.
Improvement will occur because the expanded query will provide the system with more specific information than the original query contains.
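
The hypothesis can be illustrated with a small sketch of query expansion and re-weighting. Everything here is an assumption made for the example: the `expand_and_weight` function, the toy gazetteer, and the weight values. The proposal does not state how expansion terms are chosen or what weights the geospatial components should receive.

```python
# Toy gazetteer mapping a place name to related place names (illustrative only).
TOY_GAZETTEER = {
    "bay area": ["san francisco", "oakland", "san jose"],
}


def expand_and_weight(concept_terms, location, gazetteer,
                      geo_weight=2.0, expansion_weight=1.0):
    """Return a {term: weight} query.

    concept_terms:    thematic terms from the original query (weight 1.0)
    location:         the place named in the query (weight geo_weight)
    gazetteer:        mapping from a place to related place names
    expansion_weight: weight given to each added place name
    The default weights are arbitrary placeholders, not tuned values.
    """
    weighted = {term: 1.0 for term in concept_terms}
    weighted[location] = geo_weight
    for related in gazetteer.get(location, []):
        # Do not overwrite a term that is already part of the query.
        weighted.setdefault(related, expansion_weight)
    return weighted


query = expand_and_weight(["hiking"], "bay area", TOY_GAZETTEER)
for term, weight in query.items():
    print(term, weight)
```

For the example query about hiking near the Bay Area, the expanded query names specific cities in addition to the region, which is the "more specific information" the hypothesis refers to. Whether geographic terms should be weighted above or below thematic terms is something the planned experiments would have to determine.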

Timeline
Fall Semester
  Build the Gazetteer
  Modify Query Analyzer
  Design Experiments
  Do More Background Reading
  Start writing thesis
January Term
  Run experiments
  Continue writing thesis
Spring Semester
  Analyze results
  Run more experiments (if necessary)
  Finish thesis


Thank you! Questions? Comments?