Web IR: Recent Trends; Future of Web Search

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
A Web of Concepts Dalvi, et al. Presented by Andrew Zitzelberger.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Advance Information Retrieval Topics Hassan Bashiri.
Employing Two Question Answering Systems in TREC 2005 Harabagiu, Moldovan, et al 2005 Language Computer Corporation.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
A Social Help Engine for Online Social Network Mobile Users Tam Vu, Akash Baid WINLAB, Rutgers University May 21,
Information Retrieval in Practice
Databases & Data Warehouses Chapter 3 Database Processing.
CS344: Introduction to Artificial Intelligence Vishal Vachhani M.Tech, CSE Lecture 34-35: CLIR and Ranking in IR.
Artificial Intelligence What’s Possible, What’s Not, How Do We Move Forward? Adam Cheyer Co-Founder, VP Engineering Siri Inc.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
CSC 9010 Spring Paula Matuszek A Brief Overview of Watson.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Going Beyond Simple Question Answering Bahareh Sarrafzadeh CS 886 – Spring 2015.
Tables to Linked Data Zareen Syed, Tim Finin, Varish Mulwad and Anupam Joshi University of Maryland, Baltimore County
Semantic Search: different meanings. Semantic search: different meanings Definition 1: Semantic search as the problem of searching documents beyond the.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
Structured Use of External Knowledge for Event-based Open Domain Question Answering Hui Yang, Tat-Seng Chua, Shuguang Wang, Chun-Keat Koh National University.
Personalized Search Xiao Liu
Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.
Page 1 Alliver™ Page 2 Scenario Users Contents Properties Contexts Tags Users Context Listener Set of contents Service Reasoner GPS Navigator.
Course grading Project: 75% Broken into several incremental deliverables Paper appraisal/evaluation/project tool evaluation in earlier May: 25%
GTRI.ppt-1 NLP Technology Applied to e-discovery Bill Underwood Principal Research Scientist “The Current Status and.
Search Engine Architecture
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
Exploiting Relevance Feedback in Knowledge Graph Search
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Visualization in Text Information Retrieval Ben Houston Exocortex Technologies Zack Jacobson CAC.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Web IR: Recent Trends; Future of Web Search CSC 575 Intelligent Information Retrieval.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
BBY 464 Semantic Information Management (Spring 2016) Semantic Query Languages Yaşar Tonta & Orçun Madran [yasartonta, Hacettepe.
Siri Voice controlled Virtual Assistant Haroon Rashid Mithun Bose 18/25/2014.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Information Retrieval in Practice
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Search Engine Architecture
PRESENTED BY: PEAR A BHUIYAN
Data and Applications Security Developments and Directions
Search Engine Optimisation
Statistical Learning Methods for Natural Language Processing on the Internet 徐丹云.
Information Retrieval and Web Search
Information Retrieval and Web Search
Federated & Meta Search
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
Information Retrieval and Web Search
Lecture 19: Question Answering
Thanks to Bill Arms, Marti Hearst
Semantic Network & Knowledge Graph
CS & CS Capstone Project & Software Development Project
Multimedia Information Retrieval
CSE 635 Multimedia Information Retrieval
How to publish in a format that enhances literature-based discovery?
International Marketing and Output Database Conference 2005
Language and Learning Introduction to Artificial Intelligence COS302
Introduction to Information Retrieval
GOOGLE ALGORITHM UPDATES HISTORY Top major Google updates.
CS246: Information Retrieval
Search Engine Architecture
Web Mining Research: A Survey
Information Retrieval and Web Search
Topic: Semantic Text Mining
Introduction to Search Engines
Journal of Web Semantics 55 (2019)
Introduction Dataset search
14. Question-Answering (QA)
Presentation transcript:

Web IR: Recent Trends; Future of Web Search CSC 575 Intelligent Information Retrieval

What’s been happening now? (Google) “Mobilegeddon” (Apr 21, 2015): “Mobile friendliness” as a major ranking signal “Pigeon” update (July 2014): For U.S. English results, the “Pigeon Update” is a new algorithm to provide more useful, relevant and accurate local search results that are tied more closely to traditional web search ranking signals More use of distance and location in ranking signals “App Indexing” (Android, iOS support May 2015) Search results can take you to an app Why? About half of all searches are now from mobile Making/wanting good changes, but obvious self-interest in trying to keep people using mobile web rather than apps

What’s been happening now? (Google) New search index at Google: “Hummingbird” http://www.forbes.com/sites/roberthof/2013/09/26/google-just-revamped-search-to-handle-your-long-questions/ Answering long, “natural language” questions better Partly to deal with spoken queries on mobile More use of the Google Knowledge Graph Concepts versus words

What’s been happening now? Move to mobile favors a move to speech which favors “natural language information search” Will we move to a time when over half of searches are spoken?

3 approaches to question answering: Knowledge-based approaches (Siri) Build a semantic representation of the query Times, dates, locations, entities, numeric quantities Map from this semantics to query structured data or resources Geospatial databases Ontologies (Wikipedia infoboxes, dbPedia, WordNet, Yago) Restaurant review sources and reservation services Scientific databases Wolfram Alpha

Text-based (mainly factoid) QA Question Processing Detect question type, answer type, focus, relations Formulate queries to send to a search engine Passage Retrieval Retrieve ranked documents Break into suitable passages and re-rank Answer Processing Extract candidate answers (as named entities) Rank candidates using evidence from relations in the text and external sources Do analysis on the fly. Have exact context. Not good for aggregate questions or information integration.

Hybrid approaches (IBM Watson) Build a shallow semantic representation of the query Generate answer candidates using IR methods Augmented with ontologies and semi-structured data Score each candidate using richer knowledge sources Geospatial databases Temporal reasoning Taxonomical classification

What’s been happening now? Google Knowledge Graph Facebook Graph Search Bing’s Satori Things like Wolfram Alpha Common theme: Doing graph search over structured knowledge rather than traditional text search Two Goals: Things not strings Inference not search

Example from Fernando Pereira (Google)

Structured Data Direct Answer

Desired experience: Towards actions (Patrick Pantel, et al Desired experience: Towards actions (Patrick Pantel, et al., Microsoft Research)

Desired experience: Towards actions

Actions vs. Intents

Learning actions from web usage logs Goes off to build probabilistic models that I won’t show….

The Facebook Graph Collection of entities and their relationships Entities (users, pages, photos, etc.) are nodes Relationships (friendship, checkins, tagging, etc.) are edges Nodes and edges have metadata Nodes have a unique id – the fbid

Facebook Graph Search

Facebook Graph Snippet EVENT fbid: 213708728685 type: PAGE name: Breville mission: To design the best … … LIKES FRIEND TAGGED fbid: 586206840 type: USER name: Sriram Sankar … PHOTO

Social search/QA

Facebook Graph Search Uses a weighted context free grammar (WCFG) to represent the Graph Search query language: [start] => [users]                             $1   [users] => my friend                      friends(me) [users] => friends of [users]          friends($1) [users] => {user}                            $1 [start] => [photos]                          $1 [photos] => photos of [users] photos($1)  A terminal symbol can be an entity, e.g., {user}, {city}, {employer}, {group}; it can also be a word/phrase, e.g., friends, live in, work at, members, etc. A parse tree is produced by starting from [start] and expanding the production rules until it reaches terminal symbols. https://www.facebook.com/notes/facebook-engineering/under-the-hood-the-natural-language-interface-of-graph-search/10151432733048920 http://spectrum.ieee.org/telecom/internet/the-making-of-facebooks-graph-search

If outside grammar, you fall back to a Bing search