The Case for Browser Provenance Daniel W. Margo and Margo Seltzer Harvard School of Engineering and Applied Sciences.

Slides:



Advertisements
Similar presentations
RSS: A Primer for Advocates and Managers
Advertisements

Drybridge Consulting Party Identification Directory Installing the Microsoft Research Service IDEAlliance and Drybridge Consulting – collaborating to deliver.
Protecting Browser State from Web Privacy Attacks Collin Jackson, Andrew Bortz, Dan Boneh, John Mitchell Stanford University.
1 Random Sampling from a Search Engines Index Ziv Bar-Yossef Maxim Gurevich Department of Electrical Engineering Technion.
Access Part I Accessing Health Information Through the Internet.
Getting Maximum Value From Your Cengage Learning Products Chris Houghton February 25, 2013.
Internet Search Lecture # 3.
Introduction to Online Ordering. Log in to OK Choice The web address for OK Choice is Enter your User.
Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
For Details Visit : or For any Help Contact the Librarian EBSCOhost 2.0.
HyperText Markup Language (HTML). Introduction to HTML Hyper Text Markup Language HTML Example The structure of an HTML document Agenda.
October 4, 2013 Wilmington High School. What is Evernote? Simply stated, Evernote is a web service and software application that can be used to help us.
PROVENANCE FOR THE CLOUD (USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES(FAST `10)) Kiran-Kumar Muniswamy-Reddy, Peter Macko, and Margo Seltzer Harvard.
Introducing… EBSCOhost 2.0 A redesigned EBSCOhost Coming in July 2008.
Browser Comparisons Internet Explorer 8 & 9, Chrome 11 and Firefox 4 Security, Privacy, Add-ons & Convenience.
Google Chrome & Search C Chapter 18. Objectives 1.Use Google Chrome to navigate the Word Wide Web. 2.Manage bookmarks for web pages. 3.Perform basic keyword.
© Network Ltd 2006 Social Bookmarking in the Work Place Network Ltd Ryan Butler David Gould
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
University of Kansas Department of Electrical Engineering and Computer Science Dr. Susan Gauch April 2005 I T T C Dr. Susan Gauch Personalized Search Based.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
The Internet & Web Browsers Business Webpage Design Kelly Seale.
Browser Comparisons - Convenience Internet Explorer 8 & 9, Chrome 11 and Firefox 4 Searching, Convenience & Add-ons.
Google Chrome Your Customized Google Buddy April 2012 John Riley and Denise Tate-Kuhler.
Google Confidential and Proprietary Chrome 101 & Max Stacy Behmer Google Certified Teacher & Google Education Trainer
Searching Provenance Shankar Pasupathy, Network Appliance PASS Workshop, Harvard October 2005.
1 Session 1: Introduction to HTML Spring Today’s Agenda Cover useful terminology for today’s session HTML, browsers, servers, etc. HTML Tags Get.
Audio and Video Chris McConnell Department of Radio-TV-Film November 30, 2006.
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
GoogleDictionary Paul Nepywoda Alla Rozovskaya. Goal Develop a tool for English that, given a word, will illustrate its usage.
MIS 424 Professor Sandvig. Overview  Why Analytics?  Two major approaches:  Server logs  Google Analytics.
Searching the Web Know what you are doing. Which computer type? Are you using a PC or a Mac? This is a Mac or Apple computer.
Mozilla Firefox is a web browser descended from the Mozilla Application Suite and managed by Mozilla CorporationCorporation. Firefox had 21.73% of the.
Personalized Search Xiao Liu
L/O/G/O I phone Safari Application Name : Malak Saleh Al-Qahtani ID :
Presentation by Heather C. Ware. What is Personal Information Management (PIM) Personal Information Management (PIM) refers to both the practice and the.
Google Tools Preview Party.  Because they are free  High quality applications and services  Constantly enhancing existing features and adding new applications.
Using HTML Textual and Structural Data for Web Image Search Cheng Thao, Ethan Munson, Jim Dabrowski, Nikolas D. Bohne University of Wisconsin-Milwaukee.
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Introduction / Context.
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
EMu Interface and the Web Clear identification of web fields for users and administrators Visual identifier of the web presentations in EMu, ie Collection.
MODULE 3 Internet Basics © Paradigm Publishing, Inc.1.
1 Computer Forensics Dr. Randy M. Kaplan. 2 Browser Forensics.
My eClass, Destiny Catalog, and Follett Shelf e-books
October 2015 LCCU Meeting We’ll answers members’ questions: –How do you use OneNote to get organized?
Brought to you as a public service by: Ian O’Donnell DISASTER RISK REDUCTION SOCIAL TAGGING SITE:
Solving the hard problems of User Experience
Quick Launch. Google Drive 30 GB Cloud Space Document.
The World Wide Web: Information Resource. How a Search Engine works… How Search Works - YouTube
1 Some Guidelines for Good Research Dr Leow Wee Kheng Dept. of Computer Science.
Automated Access to Statistical Facts via Statline4 Web Services Olav ten Bosch Statistics Netherlands UN-ECE conference, Bratislava April.
Contextual Text Cube Model and Aggregation Operator for Text OLAP
Event-Based Model for Reconciling Digital Entities Ahmet Fatih Mustacoglu Ahmet E. Topcu Aurel Cami Geoffrey C. Fox Indiana University Computer Science.
INTERNET VOCAB. WEB BROWSER An app for finding info on the web.
Internet Searching the World Wide Web. The Internet and the World Wide Web The Internet is a worldwide collection of networks that allows people to communicate.
What Is Firefox? __________ is a Web ___________ that you use to search for and view Web pages, save pages for use in the future, and maintain a list.
The Internet & Web Browsers Business Webpage Design Created by Kelly Seale Adapted by Jill Einerson.
Dr. Cyrus ED 217 Jacob Camacho.  A user friendly tool.  Can be accessed at campus.uogdistance.com  Easily accessible, useful in posting blogs, assignments,
How to fix Netflix Signing In Issues? For More Details Visit Our Website
Web Browser presentation Name/ Hassan AL-Abdulmohsen
Neighborhood - based Tag Prediction
Enter PNE Name, Paperless Nation Engineer
Introduction to Smart Search
Left Click to view the next slide.
Skills Development Program
Magnet & /facet Zheng Liang
All About the Internet.
Allena Keller DSI 101 Section 2
Internet Vocabulary Beth Felton McKelvey.
Who is Using your webSite?
Presentation transcript:

The Case for Browser Provenance Daniel W. Margo and Margo Seltzer Harvard School of Engineering and Applied Sciences

Overview Problem: Browser Data Management Solution: Provenance for Web Browsers Use Cases Details and Challenges Implementation

The Modern Browser: A Super-Application Originally a distributed document reader. But now most documents are distributed. And the definition of “document” has changed: –Webmail –YouTube –Google Apps It is difficult for users to manage all this data. –e.g., recall a specific web page.

Browser Data Management (I) A “little big data” problem… –My history: ~25k objects in ~2 months. –Tractable for computers, but not for users. Traditional solution: Bookmarks. –Requires users to tag their data in advance… –…and to manage the bookmarks. Advanced solutions: –History Search (Google Chrome’s “New Tab” page) –Autocompletion (form history, saved passwords)

Browser Data Management (II) Firefox 3’s “Smart Location Bar” from Most solutions powered by history and usage statistics. “History and usage statistics” = provenance.

Traditional Browser History

Web Graphs (Firefox 3 Places)

Browser Provenance

Use Case: Contextual History Search Most history search is textual Edges imply contextual relationships. –E.g. “rosebud”  “Citizen Kane”. 2-phase contextual search (Shah et. al): –Perform a textual history search. –Then, push the weight of results to neighbors. Similar to modern web search… –And good for the same reasons.

Use Case: Personalizing Web Search Context is created by the user. –So a gardener relates “rosebud”  “flower”. –Frustrating if Google returns “Citizen Kane”. Browser could clarify context to search engine! –Naïve: Just insert “flower” into “rosebud” searches. –If engine had a better interface, we could do better. Personalization with privacy. –Browser knows more about user than cookies can. –No need to give third parties raw personal data.

Use Case: Time-Contextual History Search Current histories can’t recreate prior state. –e.g., “were these two pages open simultaneously?” Time relationships… –Are natural: “rosebud, and I think I was also looking at gardening tools around that time.” –Narrow the search space a great deal. Related Work: –Gyllstrom and Soules’ “SeeTrieve” –Dumals et. al’s “Stuff I’ve Seen”

Use Case: Download Lineage Need to know where data comes from. –For source attribution, finding updates, etc. URL is not always sufficient. –“This image came from…ImageShack!” This is exactly what provenance is for! –Just query ancestors!

Conclusion Browsers record many statistics. These statistics are provenance records. Provenance techniques can improve: –History search, via context. –Web search, via personalization. –Data management, via lineage. Some details in the paper. Excruciating details in future work.