FindAll: A Local Search Engine for Mobile Phones Aruna Balasubramanian University of Washington.

Slides:



Advertisements
Similar presentations
Lightspeed Filtering Mark Shrimpton Schools Broadband Team EiS.
Advertisements

LeadManager™- Internet Marketing Lead Management Solution May, 2009.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Crawling, Ranking and Indexing. Organizing the Web The Web is big. Really big. –Over 3 billion pages, just in the indexable Web The Web is dynamic Problems:
ECOS: Leveraging Software-Defined Networks to Support Mobile Application Offloading Aaron Gember, Christopher Dragga, Aditya Akella University of Wisconsin-Madison.
Project Title: Deepin Search Member: Wenxu Li & Ziming Zhai CSCI 572 Project.
Chapter 12: Web Usage Mining - An introduction
1 Searching the Web Junghoo Cho UCLA Computer Science.
1 A Framework for Lazy Replication in P2P VoD Bin Cheng 1, Lex Stein 2, Hai Jin 1, Zheng Zhang 2 1 Huazhong University of Science & Technology (HUST) 2.
CS246 Search Engine Bias. Junghoo "John" Cho (UCLA Computer Science)2 Motivation “If you are not indexed by Google, you do not exist on the Web” --- news.com.
1 Internet and Data Management Junghoo “John” Cho UCLA Computer Science.
FACT: A Learning Based Web Query Processing System Hongjun Lu, Yanlei Diao Hong Kong U. of Science & Technology Songting Chen, Zengping Tian Fudan University.
Presented by Zeehasham Rasheed
University of Kansas Data Discovery on the Information Highway Susan Gauch University of Kansas.
Overview of Web Data Mining and Applications Part I
Section 2: Finding and Refinding Jaime Teevan Microsoft Research 1.
Niranjan Balasubramanian Aruna Balasubramanian Arun Venkataramani University of Massachusetts Amherst Energy Consumption in Mobile Phones: A Measurement.
“ The Initiative's focus is to dramatically advance the means to collect,store,and organize information in digital forms,and make it available for searching,retrieval,and.
Databases & Data Warehouses Chapter 3 Database Processing.
Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan.
Information Re-Retrieval Repeat Queries in Yahoo’s Logs Jaime Teevan (MSR), Eytan Adar (UW), Rosie Jones and Mike Potts (Yahoo) Presented by Hugo Zaragoza.
How Search Engines Work. Any ideas? Building an index Dan taylor Flickr Creative Commons.
Google Chrome Your Customized Google Buddy April 2012 John Riley and Denise Tate-Kuhler.
Accelerating Mobile Applications through Flip-Flop Replication
1 Anonshare 2.0 P2P Anonymous Browsing History Share Frank Chiang Terry Go Rui Ma Anita Mathew.
© , Digital Axle, LLC All Rights Reserved Website Optimization, SEO, and Analytics Bruce Carlisle Digital
Aruna Balasubramanian Brian Neil Levine Arun Venkataramani University of Massachusetts, Amherst Enhancing Interactive Web Applications in Hybrid Networks.
Search Engines and Information Retrieval Chapter 1.
Authors: Maryam Kamvar and Shumeet Baluja Date of Publication: August 2007 Name of Speaker: Venkatasomeswara Pawan Addanki.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Master Thesis Defense Jan Fiedler 04/17/98
The Anatomy of a Large-Scale Hypertextual Web Search Engine Presented By: Sibin G. Peter Instructor: Dr. R.M.Verma.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
PERSONALIZED SEARCH Ram Nithin Baalay. Personalized Search? Search Engine: A Vital Need Next level of Intelligent Information Retrieval. Retrieval of.
Tag Data and Personalized Information Retrieval 1.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
The Effect of Collection Organization and Query Locality on IR Performance 2003/07/28 Park,
Energy Consumption in Mobile Phones: A Measurement Study and Implications for Network Applications REF:Balasubramanian, Niranjan, Aruna Balasubramanian,
SWE.org. Not what we wanted to hear … but we needed to listen Some choice findings from our initial member survey “Make the material more user friendly.
The Anatomy of a Large-Scale Hyper textual Web Search Engine S. Brin, L. Page Presenter :- Abhishek Taneja.
Autumn Web Information retrieval (Web IR) Handout #1:Web characteristics Ali Mohammad Zareh Bidoki ECE Department, Yazd University
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Net-Centric Software and Systems I/UCRC A Framework for QoS and Power Management for Mobile Devices in Service Clouds Project Lead: I-Ling Yen, Farokh.
Adish Singla, Microsoft Bing Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
End-to-End Performance Analytics For Mobile Apps Lenin Ravindranath, Jitu Padhye, Ratul Mahajan Microsoft Research 1.
D. Heynderickx DH Consultancy, Leuven, Belgium 22 April 2010EuroPlanet, London, UK.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Enhancing Web Search by Promoting Multiple Search Engine Use Ryen W. W., Matthew R. Mikhail B. (Microsoft Research) Allison P. H (Rice University) SIGIR.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
ASSOCIATIVE BROWSING Evaluating 1 Jinyoung Kim / W. Bruce Croft / David Smith for Personal Information.
1 SearchTogether: An Interface for Collaborative Web Search Florian Eiteljörge June 11, 2013Florian Eiteljörge.
Optimized File Uploads in Mobile Cloud Computing Yash Sheth Vishal Sahu Swapnil Tiwari
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Why Decision Engine Bing Demos Search Interaction model Data-driven Research Problems Q & A.
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
ASSOCIATIVE BROWSING Evaluating 1 Jin Y. Kim / W. Bruce Croft / David Smith by Simulation.
Enhancing Mobile Apps to Use Sensor Hubs without Programmer Effort Haichen Shen, Aruna Balasubramanian, Anthony LaMarca, David Wetherall 1.
Reach people on mobile. Mobile Search Ads Reach people with Mobile Search Ads.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Presented By: Carlton Northern and Jeffrey Shipman The Anatomy of a Large-Scale Hyper-Textural Web Search Engine By Lawrence Page and Sergey Brin (1998)
Presentation by: Rebecca Chambers WebDuck Designs
Simultaneous Support for Finding and Re-Finding
AdWords Sitelinks. Increasing choice and relevancy in your Search ads.
Prepared by Rao Umar Anwar For Detail information Visit my blog:
08/03/14 Energy Consumption in Mobile Phones: A Measurement Study and Implications for Network Applications REF:Balasubramanian, Niranjan, Aruna Balasubramanian,
Building a Database on S3
Monitoring Physical Activities Using Smartphones
Presentation transcript:

FindAll: A Local Search Engine for Mobile Phones Aruna Balasubramanian University of Washington

Co-Authors Information Retrieval Systems Niranjan Balasubramanian, UW Sam Huston, UMass Don Metzler, USC (now Google) David Wetherall, UW

Mobile web search performance poor Order of magnitude slower on cellular networks

Cellular connectivity is poor Pew, April 2012 This work: Can we trade storage for connectivity to improve search performance?

Leveraging re-finding. Searching for a previously viewed page. Mobile: 70% of searches for 50% users. Non-Mobile: 40% to 60% of all searches.

FindAll local search engine Search interface to search any previously viewed page, on any of your device

Is this the same as caching/history? It is a search interface on top of caching: History seldom used Is this same as Google history or chrome sync?

What is a search interface? Uses indexes and retrieval algorithms for effective search – Keyword matching is easy but not effective – Database of search queries miss query changes and non-searched web pages Challenge: Search engines are memory/energy intensive

Talk outline User study – Identifies re-finding behavior FindAll – Design of search engine for phones Evaluation – Results of tradeoffs in practice

Talk outline User study – Identify re-finding behavior FindAll – Design of search engine for phones Evaluation – Results of tradeoffs in practice

IR-approved study Monitored 23 participants for 1 month – Grad and under-grad students Collected logs from user’s mobile/desktop – Visited URL and search query (anonymized) Mark URL re-found if – Page revisited via search query, and unchanged

Examples

Re-finding accounts for 52% of search Cross-device re- finding is 70% >20% of re-finds have different query Lots of opportunities to search locally.

45% re-finding occurs within 50 minutes Time between first visit and subsequent re-finding Need to index when the page is first accessed.

User’s show diverse re-finding patterns Need to adapt to user

User’s re-finding fairly constant This user: Avg re-finding 43%, std deviation 9%

User study summary Lots of opportunities to leverage re-finding Need to index near when page is accessed Need to adapt to users

Talk outline User study – Identifies re-finding behavior FindAll – Design of search engine for phones Evaluation – Results of tradeoffs in practice

Search engine basics Indexer – Map of : Retriever – Use index to rank pages in the order of relevance – A web page is available for search, only if it is indexed

FindAll architecture Storage Partial Indexes Cache

When to index? High availability High index energy Low availability Low index energy

FindAll indexing Maximize availability, such that total energy consumption is no more than default search Expected energy for indexing <= Expected energy if indexing not done (default search) FindAll estimates expectations based on user behavior

Predicting user re-finding probability Online classier: What is the probability of a web page being re-found in the next T minutes. Classifier features 1. base re-finding probability of user? 2. user in a browsing session? 3. web page been re-found recently?

Expected cost of default search E[~I] Cost of network download, if a web page is re- found but not indexed

Expected cost of indexing E[I] Sum of – Cost of indexing current block of web pages – A penalty estimated based on future indexing Build a simple linear model by indexing different block sizes

Prototype on Android Adapt Galago search engine for phones – Implement partial indexing and merging Implement online energy cost estimator – Train classifier when mobile is charging – Make an indexing decision every 5 mins

Talk outline User study – Identify re-finding behavior FindAll – Design of search engine for phones Evaluation – Results of tradeoffs in practice

Evaluation goals Benefits and Costs Latency, Availability, 3G data usage Energy, Storage Alternate approaches Keyword, Database Alternate indexing strategies Cloud index, Always index, Fixed index Results based on prototype and user traces

Evaluation goals Benefits and Costs Latency, Availability, 3G data usage Energy, Storage Alternate approaches Keyword, Database Alternate indexing strategies Cloud index, Always index, Fixed index

FindAll reduced 3G data usage

FindAll improves web page latency

FindAll does not increase energy

Availability under limited connectivity 43% (Under a random 50% connectivity model)

FindAll indexing important for energy benefits

Conclusions FindAll makes a win-win tradeoff for search – Decrease latency and increase availability, with reduced energy and bandwidth Future directions Search primitive: Integrating re-finding with other mobile apps Context-based re-finding: Adding sensor cues to pages

Questions? Contact:

Other results Static Indexing strategies – Increase energy by up to 50% compared to default search for low re-find users – Decreases availability by up to 39% for high re-find users Storage requirement less than 1.7GB per month