Course Overview: An Introduction to Information Retrieval and Applications J. H. Wang Feb. 22, 2012.

Slides:



Advertisements
Similar presentations
Course Overview: An Introduction to Information Retrieval and Applications J. H. Wang Feb. 17, 2014.
Advertisements

Chapter 5: Introduction to Information Retrieval
Modern Information Retrieval Chapter 1: Introduction
Pemrosesan Teks Pendahuluan. Buku referensi [1]Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schutze Introduction to Information.
Web Search and Mining Course Overview 1 Wu-Jun Li Department of Computer Science and Engineering Shanghai Jiao Tong University Lecture 0: Course Overview.
An Introduction to Information Retrieval and Applications J. H. Wang Feb. 19, 2008.
Web Search – Summer Term 2006 I. General Introduction (c) Wolfgang Hürst, Albert-Ludwigs-University.
Modern Information Retrieval Chapter 1: Introduction
Information Retrieval - Organization of the course Jian-Yun Nie 聂建云.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
Introduction to Operating Systems J. H. Wang Sep. 18, 2012.
1 Web Search and Advanced Internet Services 290N Class Introduction Tao Yang, 2014.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Introduction to Information Retrieval Hongning Wang
1 Information Retrieval and Advanced Internet Services 290N Class Introduction Tao Yang, 2015
CS6501 Information Retrieval Course Policy Hongning Wang
CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
Introduction to Information Security J. H. Wang Sep. 15, 2014.
Introduction to Network Security J. H. Wang Feb. 24, 2011.
Information Retrieval CENG 555 Spring Course Web Page Authoritative source of administrivia In-class announcements generally reflected on Web.
Object Oriented Programming (OOP) Design Lecture 1 : Course Overview Bong-Soo Sohn Assistant Professor School of Computer Science and Engineering Chung-Ang.
Introduction to Discrete Mathematics J. H. Wang Sep. 14, 2010.
Introduction to Operating Systems J. H. Wang Sep. 18, 2015.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
Course Overview for Web Computing J. H. Wang Sep. 19, 2011.
Course Overview: An Introduction to Information Retrieval and Applications J. H. Wang Apr. 24, 2013.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Autumn Web Information retrieval (Web IR) Handout #0: Introduction Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Introduction to Information Security J. H. Wang Sep. 10, 2013.
Object Oriented Programming (OOP) Design Lecture 1 : Course Overview Bong-Soo Sohn Associate Professor School of Computer Science and Engineering Chung-Ang.
Object Oriented Programming (FIT-II) J. H. Wang Feb. 20, 2009.
IR Homework #2 By J. H. Wang Mar. 31, Programming Exercise #2: Query Processing and Searching Goal: to search relevant documents for a given query.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
IR Homework #1 By J. H. Wang Mar. 21, Programming Exercise #1: Vector Space Retrieval Goal: to build an inverted index for a text collection, and.
Introduction to Operating Systems J. H. Wang Sep. 15, 2010.
Introduction to Computer Programming (FIT-I pro) J. H. Wang Sep. 17, 2007.
IR Homework #1 By J. H. Wang Mar. 16, Programming Exercise #1: Vector Space Retrieval - Indexing Goal: to build an inverted index for a text collection.
Modern Information Retrieval Presented by Miss Prattana Chanpolto Faculty of Information Technology.
Introduction to Information Security J. H. Wang Sep. 18, 2012.
Course Overview for Compilers J. H. Wang Sep. 14, 2015.
Object Oriented Programming (FIT-II) J. H. Wang Jan. 31, 2008.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval and Web Search Course overview Instructor: Rada Mihalcea.
Information Retrieval
ITIS 4510/5510 Web Mining Spring Overview Class hour 5:00 – 6:15pm, Tuesday & Thursday, Woodward Hall 135 Office hour 3:00 – 5:00pm, Tuesday, Woodward.
Course Overview for Compilers J. H. Wang Sep. 20, 2011.
Introduction to Operating Systems J. H. Wang Sep. 13, 2013.
CSCE 5073 Section 001: Data Mining Spring Overview Class hour 12:30 – 1:45pm, Tuesday & Thur, JBHT 239 Office hour 2:00 – 4:00pm, Tuesday & Thur,
CS798: Information Retrieval Charlie Clarke Information retrieval is concerned with representing, searching, and manipulating.
1 Advanced Database System Design Instructor: Ruoming Jin Fall 2010.
Information Retrieval CIS-462 Dr. Samir Tartir 2013/2014 First Semester.
Course Overview: Linear Algebra
IR Homework #2 By J. H. Wang Apr. 13, Programming Exercise #2: Query Processing and Searching Goal: to search for relevant documents Input: a query.
Term Project Proposal By J. H. Wang Apr. 7, 2017.
Introduction to Operating Systems
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
CS6501 Advanced Topics in Information Retrieval Course Policy
Introduction to Information Security
Proposal for Term Project
Course Overview: An Introduction to Information Retrieval and Applications J. H. Wang Feb. 22, 2017.
CS598CXZ (CS510) Advanced Topics in Information Retrieval (Fall 2016)
Introduction to Operating Systems
INFORMATION RETRIEVAL TECHNIQUES BY DR. ADNAN ABID
Information Retrieval Systems
Information Retrieval and Extraction
CSCE 4143 Section 001: Data Mining Spring 2019.
Information Retrieval CIS-462
Web Search and Advanced Internet Services
ADVANCED TOPICS IN INFORMATION RETRIEVAL AND WEB SEARCH
Presentation transcript:

Course Overview: An Introduction to Information Retrieval and Applications J. H. Wang Feb. 22, 2012

IR, Spring 2012NTUT CSIE2 Instructor & TA Instructor –J. H. Wang ( 王正豪 ) –Assistant Professor, CSIE, NTUT –Office: R1534, Technology Building – –Tel: ext –Office Hour: 9:00-12:00 am, every Tuesday and Wednesday TA –Mr. Liu ( 劉瀚之 ) –R1424, Technology Building

IR, Spring 2012NTUT CSIE3 Course Description Course Web Page – Time: 9:10-12:00am, Thu. Classroom: R1322, Technology Building Textbook: –Christopher D. Manning, Prabhakar Raghavan and Hinrich Schuetze, Introduction to Information Retrieval, Cambridge University Press, Introduction to Information Retrieval Available online International Student Edition, imported by Kai-Fa ( 開發 ) Publishing Prerequisites: –Basic knowledge of data structures and algorithms, linear algebra, and probability theory –Programming experience is *required* for homeworks & projects

IR, Spring 2012NTUT CSIE4 Additional References References: –Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Modern Information Retrieval: The Concepts and Technology behind Search, Addison-Wesley, Modern Information Retrieval: The Concepts and Technology behind Search This is the second edition of their book Modern Information Retrieval in ( 華通 )Modern Information Retrieval –Stefan Buettcher, Charles L.A. Clarke, and Gordon V. Cormack, Information Retrieval: Implementing and Evaluating Search Engines, MIT Press, 2010.Information Retrieval: Implementing and Evaluating Search Engines –Bruce Croft, Donald Metzler, and Trevor Strohman, Search Engines: Information Retrieval in Practice, Addison-Wesley, ( 全華 ) Search Engines: Information Retrieval in Practice

IR, Spring 2012NTUT CSIE5 More Books on IR Gerald Salton, Automatic information organization and retrieval, McGraw-Hill, Gerald Salton and M.J. McGill, Introduction to modern information retrieval, McGraw-Hill, – Two classics, but out-of-print. C. J. van Rijsbergen, Information Retrieval, Butterworths, 1979.Information Retrieval – The classic. More than 40 years old, but still worth reading. K. Sparck Jones, P. Willett, Readings in Information Retrieval, Morgan Kaufmann, 1997.Readings in Information Retrieval – A collection of classical IR papers. (out of print) I.H. Witten, A. Moffat, T.C. Bell. Morgan Kaufmann, Managing Gigabytes, 2nd edition, Managing Gigabytes – The authority on index construction and compression.

IR, Spring 2012NTUT CSIE6 Grading Policy Homework assignments and programming exercises: 40% Mid-term exam: 25% Term project: 35% –Including the proposal and final report

IR, Spring 2012NTUT CSIE7 Programming Exercises and Term Project About 3 programming exercises –Team-based (at most 2 persons per team) –You can either write your own code or reuse existing open source code The term project –Either team-based system development (the same as programming exercises) –Or academic paper presentation Only one person per team allowed –A proposal is required before midterm (Apr. 12, 2012)

IR, Spring 2012NTUT CSIE8 About the Term Project The score you get depends on the difficulty and quality of your project –For system development: System functions and correctness –For academic paper presentation Quality and your presentation of the paper Major methods/experimental results *must* be presented Papers from top conferences are strongly suggested –E.g. SIGIR, WWW, CIKM, WSDM, JCDL, ICMR, … Proposals are *required* for each team, and will counted in the score

IR, Spring 2012NTUT CSIE9 Online Submission Submission instructions –Programs, project proposals, and project reports in electronic files must be submitted to the TA online at: –Before submission: User name: Your student ID Please change your default password at your first login

IR, Spring 2012NTUT CSIE10 What this Course is NOT about This course will NOT tell you –The tips and tricks of using search engines, although power users might have better ideas on how to improve them There’re plenty of books and websites on that… –How to find books in libraries, although it’s somewhat related to the basic IR concepts –How to make money on the Web, although the currently largest search engine did it

IR, Spring 2012NTUT CSIE11 What’s Information Retrieval

IR, Spring 2012NTUT CSIE12 On Wikipedia

IR, Spring 2012NTUT CSIE13 On Google Images

IR, Spring 2012NTUT CSIE14 On Google Video Search

IR, Spring 2012NTUT CSIE15 On Google News (TW)

IR, Spring 2012NTUT CSIE16 On Google News (US)

IR, Spring 2012NTUT CSIE17 On Blogs

IR, Spring 2012NTUT CSIE18 On Google Translate…

IR, Spring 2012NTUT CSIE19 Or More Related Keywords NBA New York Knicks Linsanity …

IR, Spring 2012NTUT CSIE20 What if We Search in Chinese

IR, Spring 2012NTUT CSIE21 And More… 紐約尼克 哈佛 台裔球員 … And other languages… And other search engines… And social websites…

IR, Spring 2012NTUT CSIE22 In Google Trends

IR, Spring 2012NTUT CSIE23 And More…

IR, Spring 2012NTUT CSIE24 And Other Keywords…

IR, Spring 2012NTUT CSIE25 And Other Keywords…

IR, Spring 2012NTUT CSIE26 Palanteer – TW Election

IR, Spring 2012NTUT CSIE27

IR, Spring 2012NTUT CSIE28

IR, Spring 2012NTUT CSIE29 What Is Information Retrieval? “Information retrieval is a field concerned with the structure, analysis, organization, storage, searching, and retrieval of information.” (Salton, 1968)

IR, Spring 2012NTUT CSIE30 Goal Information retrieval (IR): a research field that targets at effectively and efficiently searching information in text and multimedia documents In this course, we will introduce the basic text and query models in IR, retrieval evaluation, indexing and searching, and applications for IR

IR, Spring 2012NTUT CSIE31 A Big Picture

IR, Spring 2012NTUT CSIE32 Inverte d Index User Interface Text Operations Query Expansion Indexing Retrieval Ranking Text query user need user feedback ranked docs retrieved docs Doc representation logical view inverted file Document Collection

IR, Spring 2012NTUT CSIE33 Topics Text IR –Indexing and searching –Query languages and operations Retrieval evaluation Modeling –Boolean model –Vector space model –Probabilistic model Applications for IR –Multimedia IR –Web search –Digital libraries

IR, Spring 2012NTUT CSIE34 Organization of the Textbook Basics in IR (focus) –Inverted indexes for boolean queries (Ch.1-5) –Term weighting and vector space model (Ch. 6-7) –Evaluation in IR (Ch. 8) Advanced Topics –Relevance feedback (Ch. 9) –XML retrieval (Ch. 10) –Probabilistic IR (Ch. 11) –Language models (Ch. 12) Machine learning in IR (useful) –Text classification (Ch ) –Document clustering (Ch ) Web Search –Web crawling and indexes (Ch ) –Link analysis (Ch. 21)

IR, Spring 2012NTUT CSIE35 Pointers to Other Topics Cross-language IR Image, video, and multimedia IR Speech retrieval Music retrieval User interfaces Parallel, distributed, and P2P IR Digital libraries Information science perspective Logic-based approaches to IR Natural language processing techniques

IR, Spring 2012NTUT CSIE36 Tentative Schedule Before midterm –Boolean retrieval (1 wk) –Indexing (2 wks) –Vector space model and evaluation (2 wk) –Relevance feedback (1 wk) –Probabilistic IR (2 wk) After midterm –Text classification (1-2 wk) –Document clustering (1-2 wk) –Web search (2 wks) –Advanced topics: CLIR, IE, … (2 wks) –Term Project Presentation (3 wks)

IR, Spring 2012NTUT CSIE37 Generic Resources Wikipedia page on Information Retrieval: n_retrieval n_retrieval Information Retrieval Resources: csli.stanford.edu/~hinrich/information- retrieval.html csli.stanford.edu/~hinrich/information- retrieval.html

IR, Spring 2012NTUT CSIE38 Academic Resources Journals –ACM TOIS: Transactions on Information Systems –JASIST: Journal of the American Society of Information Sciences –IP&M: Information Processing and Management –IEEE TKDE: Transactions on Knowledge and Data Engineering Conferences –ACM SIGIR: International Conference on Information Retrieval –WWW: World Wide Web Conference –ACM CIKM: Conference on Information Knowledge and Management –JCDL: ACM/IEEE Joint Conference on Digital Libraries –ACM WSDM: International Conference on Web Search and Data Mining –TREC: Text Retrieval Conference

IR, Spring 2012NTUT CSIE39 Thanks for Your Attention!