Report on the Lucene4IR Workshop

Slides:



Advertisements
Similar presentations
Language Technologies Reality and Promise in AKT Yorick Wilks and Fabio Ciravegna Department of Computer Science, University of Sheffield.
Advertisements

Our vision for a pedagogic planning tool Practical assistance for lecturers designing blended learning activities Both literature-based and adaptable in.
Strategic decision making with exploratory search Toby Mostyn CTO Polecat.
Dan Bolser, EMBL-EBI transPLANT portal: Overview and search Versailles, 12th-13th November 2012 trans-National Infrastructure for Plant Genomic Science.
Institute Engagement in AIAA Event Model January 2013.
How to Use LucidWorks Search
Overview of Collaborative Information Retrieval (CIR) at FIRE 2012 Debasis Ganguly, Johannes Leveling, Gareth Jones School of Computing, CNGL, Dublin City.
Web Search - Summer Term 2006 III. Web Search - Introduction (Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.
Learning and Teaching with the UK Census Developing the Collection of Historical and Contemporary Census Data and Materials into a Major Learning and Teaching.
Information Retrieval in Practice
Search Engines and Information Retrieval
Information Retrieval in Practice
Re-ranking Documents Segments To Improve Access To Relevant Content in Information Retrieval Gary Madden Applied Computational Linguistics Dublin City.
Information Retrieval - Organization of the course Jian-Yun Nie 聂建云.
Lucene Brian Nisonger Feb 08,2006. What is it? Doug Cutting’s grandmother’s middle name Doug Cutting’s grandmother’s middle name A open source set of.
Course Module 1: Service-Oriented Programming (SOP)
Gregg Festa Project Manager Teaching Matters Program Overview Unique & Compelling Internet Content & Professional Development for Middle School.
Cardiff University April 6, 2011 Simon Parker.
In Situ Evaluation of Entity Ranking and Opinion Summarization using Kavita Ganesan & ChengXiang Zhai University of Urbana Champaign
International Week 2012, March 19-23, TUT Library Information Literacy developments at Tallinn University of Technology Library Gerda Koidla Deputy Director,
EUROPEAN NETWORK OF UNIVERSITY - ENTERPRISE COOPERATION (EUE-Net) a case of Good Practice in ERASMUS Thematic Networks Dan Grigorescu Professor, University.
WikiQuery.org -- An interactive collaboration interface for creating, storing and sharing effective CNF queries Le Zhao*, Xiaozhong Liu #, Jamie Callan*
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Search Engines and Information Retrieval Chapter 1.
Terrier: TERabyte RetRIevER An Introduction By: Kavita Ganesan (Last Updated April 21 st 2009)
The DSpace Course Module – An introduction to DSpace.
Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, SCAPE Scalable Preservation Environments.
IST 441 Example Projects. Undergrad Project Find a customer – interest in xbox game forum Build a search engine for Xbox game forums etc. Compare two.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
AIAA New Event Model January Why the New Event Model? Our profession is evolving and AIAA must change with it  More emphasis in the industry.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
T 2.3 Follow up to Conferences Leonardo Piccinetti, EFB FORESTA Project Meeting Brasilia, 20 September 2011.
Semantic Technologies & GATE NSWI Jan Dědek.
Monitoring public satisfaction through user satisfaction surveys Committee for the Coordination of Statistical Activities Helsinki 6-7 May 2010 Steve.
Java Portals and Portlets Submitted By: Rashi Chopra CIS 764 Fall 2007 Rashi Chopra.
 hd.jpg hd.jpg Information Retrieval and Interaction.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
CSCI 572: Information Retrieval and Search Engines: Summer 2011 Prof. Chris A. Mattmann.
Learning Management System Training Workshop IIUM, PJ campus 24 – 25 May 2010 Assoc Prof Dr Kamal Basha b. Madarsha, Inst of Education.
Search Solutions 2010 Covent Garden, London 21 October 2010.
Lucene. Lucene A open source set of Java Classses ◦ Search Engine/Document Classifier/Indexer 
IR Homework #1 By J. H. Wang Mar. 16, Programming Exercise #1: Vector Space Retrieval - Indexing Goal: to build an inverted index for a text collection.
Information Retrieval
Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq
1 Evaluating High Accuracy Retrieval Techniques Chirag Shah,W. Bruce Croft Center for Intelligent Information Retrieval Department of Computer Science.
How Can I Use This Method? 2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING HOW.
Certified Trainer Conference Call August 26 th, 2005.
PAIR project progress report Yi-Ting Chou Shui-Lung Chuang Xuanhui Wang.
Information Retrieval and Extraction 2009 Term Project – Modern Web Search Advisor: 陳信希 TA: 蔡銘峰&許名宏.
Working in Open Source Search
Information Retrieval in Practice
Information Retrieval in Practice
Search Engine Architecture
Information Retrieval (in Practice)
Proposal for Term Project
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Detailed search stats from DSpace Solr
Martin Moyle Digital Curation Manager UCL Library Services, UK
Implementation Issues & IR Systems
About the European Local Transport Information Service
שילוב קורסים לפיתוח מיומנויות למידה במכללה להנדסה
Data Mining Chapter 6 Search Engines
physical and electronic libraries
WISER Science Finding quality information on the internet
Overview Goals Components Team / Resources Technology Procedures Q & A
HRT Human Resources Toolkit
Professional Learning Network
Derek Sergeant Leeds University Library
CSCI 572: Information Retrieval and Search Engines: Summer 2010
Presentation transcript:

Report on the Lucene4IR Workshop Charlie Hull - Managing Director 30th November 2015 Search Solutions charlie@flax.co.uk www.flax.co.uk/blog +44 (0) 8700 118334 Twitter: @FlaxSearch

@FlaxSearch What was Lucene4IR? “to bring together researchers and developers to create a set of evaluation resources showing how to use Lucene to perform typical IR operations (i.e. indexing, retrieval, etc.) as well as how to extend, modify and work with Lucene to extract typical statistics, implement typical retrieval models, and to evaluate various TREC tasks.” Funded by the European Science Foundation / ELIAS Network (Grant No. SM 5916) & sponsored by

@FlaxSearch Who & where? Around 30 attendees from academia & industry (Flax, Lucidworks, Bloomberg...) Held on 8th and 9th of September 2016, at the University of Strathclyde in Glasgow

Themes Lucene-based search engines widely used in industry @FlaxSearch Themes Lucene-based search engines widely used in industry But academics usually work with IR-specific tools (Terrier, Lemur, Indri...) Skills shortage in industry Developments in IR are slow to appear in Lucene How do we make it easier to teach Lucene skills?

Sessions Industry Lucene in Industry – Charlie Hull (Flax) @FlaxSearch Sessions Industry Lucene in Industry – Charlie Hull (Flax) Deep Dive into the Lucene Query/Weight/Scorer Java Classes - Jake Mannix (Lucidworks) Learning to Rank – Diego Ceccarelli (Bloomberg) Academia Introduction – Leif Azzopardi (University of Glasgow) Using Lucene for Teaching and Learning IR - Prof. Juan Manual Fernandez Luna (University of Granada) Evaluation and Reproducible Experiments - Sauparna "Rup" Palchowdhury (NIST) Hackathon & Breakouts

@FlaxSearch References & outputs Programme & slides https://sites.google.com/site/lucene4ir Github repository https://github.com/leifos/lucene4ir Simple overview of Lucene Test data sets Indexing, Retrieval & Stats applications using Lucene Also worked on customised indexer process, BM25L, query expansion with synonyms, alternative scoring methods... Paper submitted to ACM SIGIR Forum https://github.com/leifos/lucene4ir/tree/master/sigirforumreport

What next? Continue to build links between industry & academia @FlaxSearch What next? Continue to build links between industry & academia Note Lucidworks offers some reduced student pricing for http://lucenerevolution.org Flax runs Lucene Hackdays via http://www.meetup.com/Apache-Lucene-Solr-London-User-Group/ next is Jan 20th for FullFact Continue to develop the code built during the hackathon Integrate the applications in an IR course Contact Dr. Leif Azzopardi to get involved http://www.dcs.gla.ac.uk/~leif/

Thankyou! Any questions? charlie@flax.co.uk www.flax.co.uk/blog +44 (0) 8700 118334 Twitter: @FlaxSearch