Modeling Inter-Login Time on the Internet By Rahul Telang.

Slides:



Advertisements
Similar presentations
The Simple Linear Regression Model Specification and Estimation Hill et al Chs 3 and 4.
Advertisements

Regression and correlation methods
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Objectives 10.1 Simple linear regression
A.M. Alonso, C. García-Martos, J. Rodríguez, M. J. Sánchez Seasonal dynamic factor model and bootstrap inference: Application to electricity market forecasting.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Correlation and regression Dr. Ghada Abo-Zaid
Simple Regression. Major Questions Given an economic model involving a relationship between two economic variables, how do we go about specifying the.
REGRESI SEDERHANA BAGIAN 2
The General Linear Model. The Simple Linear Model Linear Regression.
Inference1 Data Analysis Inferential Statistics Research Methods Gail Johnson.
Econ 140 Lecture 151 Multiple Regression Applications Lecture 15.
Chapter 12 Simple Linear Regression
Combining the strengths of UMIST and The Victoria University of Manchester An analysis of the relationship between time spent on active leisure and educational.
Lecture 8 Relationships between Scale variables: Regression Analysis
Chapter 13 Multiple Regression
The Simple Linear Regression Model: Specification and Estimation
Linear Regression.
Class 17: Tuesday, Nov. 9 Another example of interpreting multiple regression coefficients Steps in multiple regression analysis and example analysis Omitted.
Chapter 10 Simple Regression.
Chapter 3 Simple Regression. What is in this Chapter? This chapter starts with a linear regression model with one explanatory variable, and states the.
Correlational Designs
Chapter 12 Section 1 Inference for Linear Regression.
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
3.1 Ch. 3 Simple Linear Regression 1.To estimate relationships among economic variables, such as y = f(x) or c = f(i) 2.To test hypotheses about these.
Chapter 11 Simple Regression
Hypothesis Testing in Linear Regression Analysis
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
2-1 MGMG 522 : Session #2 Learning to Use Regression Analysis & The Classical Model (Ch. 3 & 4)
Random Sampling, Point Estimation and Maximum Likelihood.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
Chapter 3: Introductory Linear Regression
Modelling Charitable Donations: A Latent Class Panel Approach Sarah Brown (Sheffield) William Greene (New York) Mark Harris (Monash) Karl Taylor (Sheffield)
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 18 Inference for Counts.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
1 G Lect 7M Statistical power for regression Statistical interaction G Multiple Regression Week 7 (Monday)
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Managerial Economics Demand Estimation & Forecasting.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Class 4 Ordinary Least Squares CERAM February-March-April 2008 Lionel Nesta Observatoire Français des Conjonctures Economiques
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
1 Overview of Experimental Design. 2 3 Examples of Experimental Designs.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 21 The Simple Regression Model.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
MARKET APPRAISAL. Steps in Market Appraisal Situational Analysis and Specification of Objectives Collection of Secondary Information Conduct of Market.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
2/16/2016 Subject Name: Digital Switching Systems Subject Code:10EC82 Prepared By: Aparna.P, Farha Kowser Department: Electronics and Communication Date:
Chapter 9 Estimation using a single sample. What is statistics? -is the science which deals with 1.Collection of data 2.Presentation of data 3.Analysis.
Introduction. We want to see if there is any relationship between the results on exams and the amount of hours used for studies. Person ABCDEFGHIJ Hours/
Multiple Regression Reference: Chapter 18 of Statistics for Management and Economics, 7 th Edition, Gerald Keller. 1.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
CHAPTER 7 Linear Correlation & Regression Methods
The Simple Linear Regression Model: Specification and Estimation
CHAPTER 29: Multiple Regression*
A New Product Growth Model for Consumer Durables
The Simple Linear Regression Model
Discrete Event Simulation - 4
Simple Linear Regression
BEC 30325: MANAGERIAL ECONOMICS
Presentation transcript:

Modeling Inter-Login Time on the Internet By Rahul Telang

2 Road Map Motivation Previous Literature Model development Data Results Conclusion and Future Direction

Rahul Telang3 Motivation Understanding Consumer behavior on the Internet is interesting from the point of view of - Social scientists E-tailers and Electronic commerce managers as well as advertisers.

Rahul Telang4 Previous Literature Homenet Field Trial (Kraut et. al. 1996) Information and communication (Kraut et. al. 1999) Internet and Web use in USA (Hoffman, Kalsbeek, Novak 1996) All the models trying to predict web use (time spent) through linear regression model.

Rahul Telang5 Login Interval (12 hour) Freq.

Rahul Telang6 Previous Literature Use of poisson model is very common in economic literature (Grilichez 1984; Cameron and Trivedi 1986). Exponential distribution extensively used in marketing literature too to model inter- purchase or inter-shopping time (Ehrenberg 1959, 1972; Morrison, Schmittlein 1981, 1988; Zufryden 1977, 78; Gupta 1991)

Rahul Telang7 Model Start with simple exponential model – We can get the likelihood for n consumers as – Where population is homogeneous and Sum of n i inter-login times for user i

Rahul Telang8 Model In reality we expect users to be different. We can capture this variation by letting λ vary across users as gamma distribution with parameter r and α. So likelihood for any user is – The log likelihood for N users is then

Rahul Telang9 Model Till now, Model is static with no explanatory variables. We can include some time varying covariates (Gupta 1991) by using non homogenous poisson process Variables of Interest : H1: Time spent on the Web: (consumption theory?) More time spent at time t on the web might delay the next login time. H2: Experience : As time progresses, inter login time increases. Demographics : Not time varying but different across users.

Rahul Telang10 Model Login rate is a function of explanatory variables – Where λ 0 is the base login rate. [Interpretation of β – as X increases, λ increases for positive beta and hence inter-login time reduces] This leads to log likelihood for N consumers as

Rahul Telang11 Model Heterogeneity again can be incorporated by varying λ as gamma distributed. The likelihood in that case is –

Rahul Telang12 Data Data comes from the Homenet study being conducted in CMU. It captures the navigation pattern of users for long period of time. Since the data comes from the server, it is very accurate and unbiased. Total sample of 143 users for 1 year from May 15 th 1998 to 15 th May First 40 weeks of data used for calibration, rest 12 weeks as hold-out sample to test the predictive power of model.

Rahul Telang13 Data Session : to be able to correctly measure the IL time, we follow the rule that until there is 30 minutes delay, the new session does not begin. We delete those users, who do not have at least 10 observations. Demographic information collected from questionnaire. (We include gender, generation, education, race, marital status, and household income in our model) Total sample consists of 121 users for 40 week, leading to distinct data points.

Rahul Telang14 Results Exponential Exp/Gamma λ0.3321(120.4) r 2.12 (17.6) α7.17 (20.54) LL(n=14503)

Rahul Telang15 Results ExponentialExp/Gamma λ0.149 (10.86) r1.41 (15.27) α 6.45 (12.2) Time spent (-4.35)-0.07 (-3.5) Sessions0.05 (82.3)-0.05 (-28.7) Gender0.11 (9.92)0.23 (2.17) Generation0.15 (17.13)0.20 (2.5) Education-0.06 (-34.9)-0.1 (-5.7) Race0.14 (16.5)0.10 (1.27) Marital Status 0.09 (24.7)0.14 (4.26) Income0.013 (8.65)0.11 (7.7) LL (n=14451)

Rahul Telang16 Conclusion and Future work Both the hypothesis are verified. Demographics are very significant. Comparison with Linear regression shows a much better fit. We need to test the results with the hold-out sample.