Experiences with Big Data and Learning What’s Worth Keeping Roger Liew, CTO Orbitz Worldwide Wolfram Data Summit 2011.

Slides:



Advertisements
Similar presentations
Configuration management
Advertisements

A Cost-Based Comparison of Traditional vs. Online Stores Characteristics of Electronic Markets.
1. Who is Db-Line 16 years of videogames industry Db-Line has been founded in The company grew quickly and positioned itself as a strong point of.
EXMBAQ 2011 N1 GRUPO E. EXMBAQ 2011 N1 GRUPO E | 2 Just a digital shop …
Marketing Management 11 May New Product Development.
Akamai networks,48000 servers and 70 countries in the world.
A Fast Growing Market. Interesting New Players Lyzasoft.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Chapter 12: Web Usage Mining - An introduction
4 Lecture Electronic Business and Electronic Commerce.
Wrapup Amol Deshpande CMSC424. “Inventing the Future” Wednesday at 3:30pm 1115 CSIC Exam.
MP3 / MD740 Strategy & Information Systems Oct. 13, 2004 Databases & the Data Asset, Types of Information Systems, Artificial Intelligence.
Like any machine, a website is the sum of its parts. Web analytics are the tools used to transform ordinary sites into high performance money making machines.
1 Introduction to OBIEE: Learning to Access, Navigate, and Find Data in the SWIFT Data Warehouse Lesson 1: SWIFT Data Warehouse Overview This course, Introduction.
Copyright © 2015 Criteo State of Mobile Commerce Apps and cross-device lead mobile business Q
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
Machine Learning at Orbitz Robert Lancaster and Jonathan Seidman Strata 2011 February 02 | 2011.
Digital Experience Cloud Overview. Agenda Market Overview Product Overview Future Plans.
Selling Hospitality & Tourism
Road to the Cloud The Economics of Cloud Computing.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
E-Tourism Lecture 7. E-Tourism is used to refer to e-business in the field of travel and tourism, the use of ICT to enable tourism providers destinations.
Systems analysis and design, 6th edition Dennis, wixom, and roth
Social Networks Facebook Quick Facts: Average time spent per visit on Facebook is 20 min 46 seconds vs. 56 second average on other websites. 80%
Distribution & Sales Systems 14 th January Distribution & Online Marketing Tools.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Electronic Commerce Semester 2 Term 2 Lecture 24.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Data Warehousing at Acxiom Paul Montrose Data Warehousing at Acxiom Paul Montrose.
Display & Remarketing What You Need to Know. PROPRIETARY AND CONFIDENTIAL / COPYRIGHT © 2013 BE FOUND ONLINE, LLC 2 WHAT IS DISPLAY?
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Data, Insights and Actions Anil Batra Blog:
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
MANAGEMENT INFORMATION SYSTEM Prepared By/-Dr N.K.Ojha.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Brayden Wendt Bus 111. History  Founded 2002  Launched 2003  Reid Hoffman.
MAIN BOOKS 1. DATA WAREHOUSING IN THE REAL WORLD : Sam Anshory & Dennis Murray, Pearson 2. DATA MINING CONCEPTS AND TECHNIQUES : Jiawei Han & Micheline.
Event Websites, Part II: Setting Goals and Measuring Conversions John Curtis, Quotient Stephen Nold, Advon Technologies Ian Strain-Seymour, Apogee Search.
Cost9a 1 Software Estimating Technology: A Survey Richard Stutzke Crosstalk, May96 text pp
OpenField Consolidates Stadium Data, Provides CRM and Analysis Functions for an Intelligent, End-to-End Solution COMPANY PROFILE : OPENFIELD Founded by.
Recommendation Algorithms for E-Commerce. Introduction Millions of products are sold over the web. Choosing among so many options is proving challenging.
Kautilya Ecommerce Efficiency Measurer
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Information Systems in Organizations Managing the business: decision-making Growing the business: knowledge management, R&D, and social business.
Web Analytics and Reporting Michal Neuwirth Product Manager – Kentico Software.
FINDING THE EFFICIENT FRONTIER OF DIGITAL MARKETING NAVIS LEADERSHIP CONFERENCE 2016.
Course : Study of Digital Convergence. Name : Srijana Acharya. Student ID : Date : 11/28/2014. Big Data Analytics and the Telco : How Telcos.
This is a free Course Available on Hadoop-Skills.com.
Sitecues Metrics David Young (617) Discussion Document.
E-marketing plan Organic search Results Visitors: 1.85mi + 24% Revenue: £2.3mi +23% Conclusion Largest e-marketing channel Use of MOZ helps improve.
Data mining in web applications
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
Fan Engagement Solution
Data Platform Modernization
Software Estimating Technology: A Survey
Published Date: 14th October 2013
IBM Tivoli Web Site Analyzer Training Document
EMarketing: The Essential Guide to Marketing in a Digital World Data Analytics What you’ll learn How to set solid objectives, goals and measurements.
CME Medical Conferences | Medical Meetings | eMedEvents
E-commerce 2017 business. technology. society.
Data Platform Modernization
INEOGroup presents:.
Xerox Summary Marketers are faced with enormous challenges when designing products and product launches in multinational settings, especially in diverse.
COIT 20253: Business Intelligence Using Big Data
Data Warehouse and OLAP Technology
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
Canadian Apartment Properties Real Estate Investment Trust
Presentation transcript:

Experiences with Big Data and Learning What’s Worth Keeping Roger Liew, CTO Orbitz Worldwide Wolfram Data Summit 2011

page 2 Launched in 2001 Over 160 million bookings 7 th Largest seller of travel in the world

page 3 How did Orbitz end up with big data?

Optimizing Hotel Sorting page 4

The Data We Collected Did Not Meet Our Research Needs Top clicked links Top paths through site Top referrers Top bounce pages Path conversion Browser type OS version Geographic distribution … page 5 Transactions Aggregate Consumer behavior

We Have Two Distinct Databases Serving Different Constituents page 6 Transactional Data (e.g. bookings) Data Warehouse Aggregated Non- Transactional Data (e.g. searches) Data Warehouse

page 7 Hadoop technology and economics allow us to do things we couldn’t do before Thousand $ per TB

Our New Infrastructure Allows Us to Record and Process User Activity at the Individual Level page 8 Transactional Data (e.g. bookings) Data Warehouse Detailed Non- Transactional Data (What Each User Sees and Clicks) Hadoop

page 9 Transactions Detailed events

Recommendations For You page 10

Orbitz World page 11

page 12 Advance hotel booking days

Safari Users Seem to be Interested in More Expensive Hotels page 13

Historical Prices From a Given Origin

Simple Price Predictions

page 16 Our goal is a unified view of the data, allowing us to use the power of our existing tools for reporting and analysis

We Can Begin to solve business questions Answer how customers move between marketing channels Identify transactions attributed to a marketing channel that are not truly incremental Anticipate customer needs 17

We’re Dealing With a Few Challenges Most of these efforts are driven by technology It’s batch oriented and simplistically controlled The challenge now is to unlock the value in this data by making it more available to the rest of the organization What is worth keeping page 18

The pace of data collection has outpaced growth curve of storage and processing page 19

What’s Worth Keeping What’s Hot? Raw data / source data Intentional data What’s Not System metrics (Most) Application logs Incidental data page 20

Keep These Things in Mind When Dealing with Big Data Create a universal key Always keep source data Data Scientists like SQL Operationalize your bleeding edge infrastructure It’s difficult to share data with third parties Challenge assumptions and older lessons learned Fast evolving technology space that’s challenging to keep up with page 21