Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lemur Indri Search Engine Yatish Hegde 03/03/2010.

Similar presentations


Presentation on theme: "Lemur Indri Search Engine Yatish Hegde 03/03/2010."— Presentation transcript:

1 Lemur Indri Search Engine Yatish Hegde 03/03/2010

2 Background Open source text search engine Combines language modeling and inference networks Inquery query language API – accesible from C++, Java, C# and PHP. Html, xml, txt, trectext, trecweb, ppt, doc*, ppt*

3 Resources Website: http://lemurproject.orghttp://lemurproject.org Tutorials: http://sourceforge.net/apps/trac/lemur/wiki http://sourceforge.net/apps/trac/lemur/wiki Forum: http://sourceforge.net/projects/lemur/forums http://sourceforge.net/projects/lemur/forums

4 How to get started? Cygwin: http://cygwin.com (include “perl”, “vi editor” and “make” package while installing)http://cygwin.com Lemur Toolkit: http://sourceforge.net/projects/lemur/develo p http://sourceforge.net/projects/lemur/develo p TREC Eval: http://trec.nist.gov/trec_eval/http://trec.nist.gov/trec_eval/

5 Installing Lemur Inside Lemur Directory -./configure make make install Build Index – IndriBuildIndex Run Query - IndriRunQuery

6 Building Index IndriBuildIndex /home/lemur/testindex 1G /home/lemur/testdata/firstCorpus trectext /home/lemur/testdata/secondCorpus trecweb krovetz p

7 Running Query IndriRunQuery Query File 701 oil industry history Stop Word File the Query Options File true /path/to/index 1000

8 Converting Topic File into Query File Topic File Number: 301 International Organized Crime Description: Identify organizations that participate in international criminal activity, the activity, and, if possible, collaborating organizations and the countries involved. Narrative: A relevant document must as a minimum identify the organization and the type of illegal activity (e.g., Columbian cartel exporting cocaine). Vague references to international drug trade without identification of the organization(s) involved would not be relevant.

9 Converting Topic File into Query File Perl Program:./topicToQuery.pl [-t] [-d]./topicToQuery.pl -h

10 TREC Eval make trec_eval -q -c -M1000 official_qrels query_results More Documentation: http://trecvid.nist.gov/trecvid.tools/trec_eval_video/ README http://trecvid.nist.gov/trecvid.tools/trec_eval_video/ README

11 Lemur Search UI User Interface: http://sourceforge.net/apps/trac/lemur/wiki/ The%20Lemur%20CGI%20Application http://sourceforge.net/apps/trac/lemur/wiki/ The%20Lemur%20CGI%20Application How it looks? http://sewell.syr.edu/lemur/lemur.cgi http://sewell.syr.edu/lemur/lemur.cgi

12 Indri Query Langauge #combine( white house) #1(white house) #5(white house) #band(white house) #band(oil fields) #1(white house) 301 #combine( Identify organizations that participate in #max( #1( international criminal activity) international criminal activity ) the activity and if possible collaborating organizations and the countries involved)

13 Contact If you have questions - Yatish Hegde: yhegde@syr.edu

14 Thank You


Download ppt "Lemur Indri Search Engine Yatish Hegde 03/03/2010."

Similar presentations


Ads by Google