Ronny Kohavi, Microsoft Slides available at

Slides:



Advertisements
Similar presentations
WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Advertisements

User Research Findings. 1 Overview Background Study goals Methodology Participants Findings Recommendations.
Google Chrome & Search C Chapter 18. Objectives 1.Use Google Chrome to navigate the Word Wide Web. 2.Manage bookmarks for web pages. 3.Perform basic keyword.
Updates to USDA LINC Available August 18, 2008.
Resources. Overview Problem Report WebCT Faculty & Student Support Searching.
HTML Form Processing Learning Web Design – Chapter 9, pp Squirrel Book – Chapter 11, pp
Swami NatarajanJune 17, 2015 RIT Software Engineering Reliability Engineering.
SE 450 Software Processes & Product Metrics Reliability Engineering.
CS246 Search Engine Bias. Junghoo "John" Cho (UCLA Computer Science)2 Motivation “If you are not indexed by Google, you do not exist on the Web” --- news.com.
CM143 - Web Week 2 Basic HTML. Links and Image Tags.
Dawn Pedersen Art Institute. Introduction All your hard design work will suffer in anonymity if people can't find your site. The most common way people.
“ The Initiative's focus is to dramatically advance the means to collect,store,and organize information in digital forms,and make it available for searching,retrieval,and.
BROWSERS & BROWSING What, Which & Why. WHAT IS A BROWSER? Once you have an Internet connection, some programs access the internet automatically to operate.
HTML and Designing Web Pages. u At its creation, the web was all about –Web pages were clumsily assembled –Web sites were accumulations of hyperlinked.
ASP.NET AJAX. Content ASP.NET AJAX Ajax Control Toolkit Muzaffer DOĞAN - Anadolu University2.
Basic Web Design. Technology is a tool  FIRST, understand how people actually interact with each other and with the information in their lives, in all.
HTML 5 and Content Strategy for the Web By: Christina Fregosi ISC 496.
Browser Wars and the Politics of Search Engines
Build a Free Website1 Build A Website For Free 2 ND Edition By Mark Bell.
Ronny Kohavi with Alex Deng, Brian Frasca, Roger Longbotham, Toby Walker, Ya Xu Slides available at
Five Challenging Problems for A/B/n Tests Slides at (Follow-on talk to KDD 2015 keynote on Online Controlled Experiments: Lessons.
Programming in HTML.  Programming Language  Used to design/create web pages  Hyper Text Markup Language  Markup Language  Series of Markup tags 
Universiti Utara Malaysia Chapter 3 Introduction to ASP.NET 3.5.
Objective Understand concepts used to web-based digital media. Course Weight : 5%
New Features in Release 12.1 (March 25, 2012). Release 11.3 New Features –History Tab will be removed (Document Search replaces History Tab) –Document.
Extending HTML CPSC 120 Principles of Computer Science April 9, 2012.
The Road to Online Marketing. A Magic Voyage Begins!!!
Lecture Note 1: Getting Started With ASP.  Introduction to ASP  Introduction to ASP An ASP file can contain text, HTML tags and scripts. Scripts in.
Improving Cloaking Detection Using Search Query Popularity and Monetizability Kumar Chellapilla and David M Chickering Live Labs, Microsoft.
Don't Let Third Parties Slow You Down Arvind Jain, Michael Kleber Google.
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
A/B testing aka split testing, bucket testing, and multivariant testing.
1 3 Computing System Fundamentals 3.4 Networked Computer Systems.
AJAX 10 Most Common Mistakes. 1. Not giving immediate visual cues for clicking widgets. If something I'm clicking on is triggering Ajax actions, you have.
Creating Web Documents catch-up JavaScript slide show tools redirection.
MODULE 3 Internet Basics © Paradigm Publishing, Inc.1.
Browser Wars (Click on the logo to see the performance)
Challenging Problems in Online Controlled Experiments Slides at Ron Kohavi, Distinguished Engineer,
INT222 - Internet Fundamentals Shi, Yue (Sunny) Office: T2095 SENECA COLLEGE.
ASP. ASP is a powerful tool for making dynamic and interactive Web pages An ASP file can contain text, HTML tags and scripts. Scripts in an ASP file are.
?. When designing, you should analyze your target audience. Design you webpage around your target audience Target Audience.
Measuring the value of search trails in web logs Presentation by Maksym Taran & Scott Breyfogle Research by Ryen White & Jeff Huang.
G042 - Lecture 09 Commencing Task A Mr C Johnston ICT Teacher
Thực hiện: D3 GVLT: BROWERS. Browser Compatibility I Check the compatibility II Tools III.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
 AJAX technology  Rich User Experience  Characteristics  Real live examples  JavaScript and AJAX  Web application workflow model – synchronous vs.
Concurrency and Performance Based on slides by Henri Casanova.
The Internet What is the Internet? The Internet is a lot of computers over the whole world connected together so that they can share information. It.
ONBASE: HOW TO UPLOAD DOCUMENTATION EFFICIENTLY April 2, 2015.
Ronny Kohavi, Distinguished Engineer, General Manager, Analysis and Experimentation, Microsoft Joint work with Thomas Crook, Brian Frasca, and Roger Longbotham,
GOOD WEB HOSTING FOR YOUR BUSINESS. Web Host o Business, nowadays, use the Internet to tap potential customers and stay in touch with existing clients.
Ronny Kohavi, Microsoft Joint work with Alex Deng, Brian Frasca, Roger Longbotham, Toby Walker, Ya Xu Based on KDD 2012 talk, available at
Page Redirection When you click a URL to reach to a page X but internally you are directed to another page Y that simply happens because of page re- direction.
What is LinkScanner In AVG Antivirus AVG antivirus software 2017 has came up with several new and latest features for their users to make their.
Section 10.1 Define scripting
Pitfalls in Online Controlled Experiments Slides at
JavaScript and Ajax (Ajax Tutorial)
Pitfalls in Online Controlled Experiments Slides at
Not a Language but a series of techniques
Lesson 4: Web Browsing.
Seven Pitfalls to Avoid when Running Controlled Experiments on the Web
Understanding Search Engines
Tips to Researching on the Internet
PCard Supporting Documentation: OnBase tips, tricks and best practices
Release 3.0 User Interface and Highlights
CSE 154 Lecture 22: AJAX.
Lesson 4: Web Browsing.
Screaming Frog, MOZ and More: What You Need to Know for Good SEO
Designing Experimentation Metrics
Cognos Analytics v For Report Viewers
Presentation transcript:

Ronny Kohavi, Microsoft Slides available at

2 Wrap-up section based on KDD 2012 paper, co-authored by Ronny Kohavi, Alex Deng, Brian Frasca, Roger Longbotham, Toby Walker, and Ya Xu How does one determine the OEC for a search engine? What are some of the most surprising results we faced, and how did we resolve them

3 An OEC is the Overall Evaluation Criterion It is a metric (or set of metrics) that guides the org as to whether A is better than B in an A/B test In prior work, we emphasized long-term focus and thinking about customer lifetime value, but operationalizing it is hard Search engines (Bing, Google) are evaluated on query share (distinct queries) and revenue as long-term goals Puzzle A ranking bug in an experiment resulted in very poor search results Distinct queries went up over 10%, and revenue went up over 30% What metrics should be in the OEC for a search engine?

4

5 A piece of code was added, such that when a user clicked on a search result, additional JavaScript was executed (a session-cookie was updated with the destination) before navigating to the destination page This slowed down the user experience slightly, so we expected a slightly negative experiment. Results showed that users were clicking more! Why?

6 User clicks (and form submits) are instrumented and form the basis for many metrics Instrumentation is typically done by having the web browser request a web beacon (1x1 pixel image) Classical tradeoff here Waiting for the beacon to return slows the action (typically navigating away) Making the call asynchronous is known to cause click-loss, as the browsers can kill the request (classical browser optimization because the result can’t possibly matter for the new page) Small delays, on-mouse-down, or redirect are used

7 Click-loss varies dramatically by browser Chrome, Firefox, Safari are aggressive at terminating such reqeuests. Safari’s click loss > 50%. IE respects image requests for backward compatibility reasons White paper available on this issue herehere Other cases where this impacts experiments Opening link in new tab/window will overestimate the click delta Because the main window remains open, browsers can’t optimize and kill the beacon request, so there is less click-loss Using HTML5 to update components of the page instead of refreshing the whole page has the overestimation problem

8 Primacy effect occurs when you change the navigation on a web site Experienced users may be less efficient until they get used to the new navigation Control has a short-term advantage Novelty effect happens when a new design is introduced Users investigate the new feature, click everywhere, and introduce a “novelty” bias that dies quickly if the feature is not truly useful Treatments have a short-term advantage

9 Given the high failure rate of ideas, new experiments are followed closely to determine if new idea is a winner Multiple graphs of effect look like this Negative on day 1: -0.55% Less negative on day 2: -0.38% Less negative on day 3: -0.21% Less negative on day 4: -0.13% The experimenter extrapolates linearly and says: primacy effect. This will be positive in a couple of days, right? Wrong! This is expected

10

11 The longer graph This was an A/A test, so the true effect is 0

12

13

14 Experiment is run, results are surprising. (This by itself is fine, as our intuition is poor.) Rerun the experiment, and the effects disappear Reason: bucket system recycles users, and the prior experiment had carryover effects These can last for months! Must run A/A tests, or re-randomize

15 OEC: evaluate long-term goals through short-term metrics The difference between theory and practice is greater in practice than in theory Instrumentation issues (e.g., click-tracking) must be understood Carryover effects impact “bucket systems” used by Bing, Google, and Yahoo require rehashing and A/A tests Experimentation insight: Effect trends are expected Longer experiments do not increase power for some metrics. Fortunately, we have a lot of users