© Copyright 1998-2000, Blue Martini Software. San Mateo California, USA 1 1 Integrating E-Commerce and Data Mining: Architecture and Challenges Llew Mason.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

Web Usage Mining Web Usage Mining (Clickstream Analysis) Mark Levene (Follow the links to learn more!)
09/04/2015Unit 2 (b) Back-Office processes Unit 2 Assessment Criteria (b) 10 marks.
Cingular Case Studies September 24, © 2007 BearingPoint, Inc.IM – Content, Portals and IntegrationCommunications and Utilities AT&T B2B Wireless.
Kurt Ghekiere eng. Sr Consultant & CTO Ris Technics Web analytics for smarter decisions by abs creative group.
Chapter 12: Web Usage Mining - An introduction
Introduction to Web Analytics Web analytics is the measurement, collection, analysis and reporting of internet data for purposes of understanding and optimizing.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Components and Architecture CS 543 – Data Warehousing.
Web Usage Mining - W hat, W hy, ho W Presented by:Roopa Datla Jinguang Liu.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
© Copyright , Blue Martini Software. San Mateo California, USA Personalization Panel Ronny Kohavi, Director, Data Mining Blue Martini Software.
The Medusa Proxy A Tool For Exploring User- Perceived Web Performance Mimika Koletsou and Geoffrey M. Voelker University of California, San Diego Proceeding.
Data Mining – Intro.
Insight on Google Analytics Features - Suresh. K.
The Importance Of Transactions In The World Of Analytics Doug Aoyama Director, Product Marketing.
Integrating CRM On Demand with the E-Business Suite to Supercharge your Sales Team Presented by: Tom Connolly, Jason Lieberman Company: BizTech Session.
Content Management and Process Automation Presented by Mark Chambers SE Regional Manager Document Imaging Solutions, Inc.
“If you build it, they will come.”. Virtual Business  There is much more that goes into a virtual business than just building the web site.  You will.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
WEB ANALYTICS Prof Sunil Wattal. Business questions How are people finding your website? What pages are the customers most interested in? Is your website.
Prof. Vishnuprasad Nagadevara Indian Institute of Management Bangalore
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Managing Data Interoperability with FME Tony Kent Applications Engineer IMGS.
An OLAM Framework for Web Usage Mining and Business Intelligence Reporting Xiaohua (Tony) Hu Drexel University Philadelphia, PA,
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
LAYING OUT THE FOUNDATIONS. OUTLINE Analyze the project from a technical point of view Analyze and choose the architecture for your application Decide.
1.Knowledge management 2.Online analytical processing 3. 4.Supply chain management 5.Data mining Which of the following is not a major application.
Interoperability Scenario Producing summary versions of compound multimedia historical documents.
Copyright © 2009 Pearson Education, Inc. Slide 6-1 Chapter 6 E-commerce Marketing Concepts.
Strategies for improving Web site performance Google Webmaster Tools + Google Analytics Marshall Breeding Director for Innovative Technologies and Research.
MSF Requirements Envisioning Phase Planning Phase.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
April, 2008 Better Together! Integrated GP & CRM AN INDEPENDENT MEMBER OF BAKER TILLY INTERNATIONAL 505 AFFILIATE OFFICES WORLDWIDE.
material assembled from the web pages at
©2010 John Wiley and Sons Chapter 12 Research Methods in Human-Computer Interaction Chapter 12- Automated Data Collection.
Using SAS® Information Map Studio
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Sustainability: Web Site Statistics Marieke Napier UKOLN University of Bath Bath, BA2 7AY UKOLN is supported by: URL
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
Log files presented to : Sir Adnan presented by: SHAH RUKH.
1 Challenges in Scaling E-Business Sites  Menascé and Almeida. All Rights Reserved. Daniel A. Menascé Department of Computer Science George Mason.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
Carey Probst Technical Director Technology Business Unit - OLAP Oracle Corporation.
BUSINESS ANALYTICS AND DATA VISUALIZATION
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
SOA-25: Data Distribution Solutions Using DataXtend ® Semantic Integrator for Sonic ™ ESB Users Jim Barton Solution Architect.
Sule Ozmen-CRM CRM Customer Relationship Management Şule Özmen Week 6 Digital Economy Customers Became Number One They are empowered customers.
NCR Confidential NCR RETAIL ONLINE Ecommerce Made Simple 1.
Internet Architecture and Governance
Technology for E-commerce Helena Ahonen-Myka. In this part... n search tools n metadata n personalization n collaborative filtering n data mining.
See it. Click it. Get it. INTERACTIVE VIDEO “Where Psychology Meets Technology” THE FUTURE IS NOW.
Chapter6: E-Commerce Web Sites HNDIT11062 – Web Development 1.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Web Analytics and Reporting Michal Neuwirth Product Manager – Kentico Software.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Understand The Use Of Technologies In Fashion Merchandising And Marketing FM 3.02.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Data Mining & OLAP What is Data Mining? Data Mining is the set of activities used to find new, hidden, or unexpected patterns in data.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Adobe Digital Marketing
Strategies for improving Web site performance
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Chapter 12: Automated data collection methods
Automating Profitable Growth™
Technical Capabilities
Data Warehousing Concepts
Presentation transcript:

© Copyright , Blue Martini Software. San Mateo California, USA 1 1 Integrating E-Commerce and Data Mining: Architecture and Challenges Llew Mason Joint work with Suhail Ansari, Ron Kohavi, Zijian Zheng Blue Martini Software WEB-KDD Workshop August, 2000

© Copyright , Blue Martini Software. San Mateo California, USA 2 2 Outline ò E-Commerce: A Killer Domain ò Integrated Architecture ò Data Collection ò Analysis ò Challenges ò Summary

© Copyright , Blue Martini Software. San Mateo California, USA 3 3 Killer Domain E-Commerce ò Data records are plentiful ò Electronic collection provides reliable data ò Enables closed-loop analysis ò Insight can easily be turned into action ò Success can be directly measured e.g., Return on investment (ROI)

© Copyright , Blue Martini Software. San Mateo California, USA 4 4 Business Data Definition Customer Interaction Analysis Integrated Architecture Stage Data Deploy Results Build Data Warehouse

© Copyright , Blue Martini Software. San Mateo California, USA 5 5 Business Data Definition Customer Interaction Analysis Integrated Architecture Stage Data Deploy Results Build Data Warehouse Business facing Products, content Attributes Shared meta-data

© Copyright , Blue Martini Software. San Mateo California, USA 6 6 Business Data Definition Customer Interaction Analysis Integrated Architecture Stage Data Deploy Results Build Data Warehouse Build store Test before production Transform for efficiency Zero down-time

© Copyright , Blue Martini Software. San Mateo California, USA 7 7 Business Data Definition Customer Interaction Analysis Integrated Architecture Stage Data Deploy Results Build Data Warehouse Customer facing Multiple Touchpoints Integrated Data Collection

© Copyright , Blue Martini Software. San Mateo California, USA 8 8 Business Data Definition Customer Interaction Analysis Integrated Architecture Stage Data Deploy Results Build Data Warehouse Build warehouse Automated using meta-data Reduces pre-processing Transform for analysis

© Copyright , Blue Martini Software. San Mateo California, USA 9 9 Business Data Definition Customer Interaction Analysis Integrated Architecture Stage Data Deploy Results Build Data Warehouse Analysis Data transformations Exploration Modeling

© Copyright , Blue Martini Software. San Mateo California, USA 10 Business Data Definition Customer Interaction Analysis Integrated Architecture Stage Data Deploy Results Build Data Warehouse Close the loop Transfer scores, models Personalize

© Copyright , Blue Martini Software. San Mateo California, USA 11 Clickstream Logging ò Web server logs ò Logs every HTTP request - filtering required ò Stateless - must identify users and sessions ò Captures URLs - must map to content ò Can’t understand dynamic content ò Packet sniffers ò Streaming data - must parse to understand content ò Can’t understand encrypted data (SSL) ò Solution : Application server logging

© Copyright , Blue Martini Software. San Mateo California, USA 12 Beyond Clickstream Logging ò Business Event Logging Consider several requests as one logical event ò Add or remove from shopping cart ò Initiate or finalize checkout ò Search ò Register ò Personalization rule evaluation ò Provides business insight ò Difficult to log outside of application server

© Copyright , Blue Martini Software. San Mateo California, USA 13 Aggregation ò Data occurs at multiple granularities Customers Sessions Requests Finer Granularity ò Many interesting attributes need to be aggregated for analysis Customers Orders Cities

© Copyright , Blue Martini Software. San Mateo California, USA 14 Aggregation ò Interesting customer attributes ò What wallet share did each customer spend on books? ò How much is each female customer’s average order amount above the mean value for female customers? ò What is the total amount of each customer’s five most recent purchases over $30? ò What is the frequency of each customer’s purchases? ò How long ago was each customer’s last purchase?

© Copyright , Blue Martini Software. San Mateo California, USA 15 Hierarchies Products ClothingBooks MensWomens 2 Product ID 1$12TF Quantity Price Clothing/Mens Clothing/Womens ò E-Commerce data contains many hierarchies ò How can we use them in analysis? F Books

© Copyright , Blue Martini Software. San Mateo California, USA 16 Analytical Tools ò Reporting ò Who are the top referrers by sales generated? ò What are the top abandoned products? ò What are the conversion rates for each product? ò OLAP ò How do sales vary over time in each geographic region? ò Modeling Algorithms ò What characterizes visitors that do not buy? ò What characterizes customers that prefer promotions? ò Which are the potential cross-sells and up-sells? ò Visualization

© Copyright , Blue Martini Software. San Mateo California, USA 17 E-Commerce Challenges ò Make data mining comprehensible ò Support multiple granularity levels ò Utilize hierarchies ò Support date and time types effectively ò Support external events and changing data ò Identify bots and crawlers ò Handle large amounts of data

© Copyright , Blue Martini Software. San Mateo California, USA 18 Summary ò Integrated E-Commerce and data mining enables effective closed-loop analysis ò Application server logging provides integrated data collection and reduces pre-processing ò Powerful data transformations and a broad suite of analysis techniques are needed ò There are many challenges ahead