An Analysis of WoW Players’ Game Hours Matt Ross, Christian Ebinger, Anthony Morgan.

Slides:



Advertisements
Similar presentations
Exponential Distribution
Advertisements

Authors: J.A. Hausman, M. Kinnucan, and D. McFadden Presented by: Jared Hayden.
Hawawini & VialletChapter 7© 2007 Thomson South-Western Chapter 7 ALTERNATIVES TO THE NET PRESENT VALUE RULE.
Battle of Botcraft: Fighting Bots in Online Games with Human Observational Proofs Steven Gianvecchio, Zhenyu Wu, Mengjun Xie, and Haining Wang.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
What is Forecasting? A forecast is an estimate of what is likely to happen in the future. Forecasts are concerned with determining what the future will.
Chapter 6: Correlational Research Examine whether variables are related to one another (whether they vary together). Correlation coefficient: statistic.
Estimation of the Number of Relevant Images in Infinite Databases Presented by: Xiaoling Wang Supervisor: Prof. Clement Leung.
Stochastic Processes Dr. Nur Aini Masruroh. Stochastic process X(t) is the state of the process (measurable characteristic of interest) at time t the.
W. Feng, “A Long-term Study of a Popular MMORPG", NetGames 2007, Sept , A Long-term Study of a Popular MMORPG Wu-chang Feng Debanjan Saha David.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Simple Linear Regression
Location Clustering Peter Kamm Marcel Flores Peter Kamm Marcel Flores.
On the Constancy of Internet Path Properties Yin Zhang, Nick Duffield AT&T Labs Vern Paxson, Scott Shenker ACIRI Internet Measurement Workshop 2001 Presented.
Issues with Measurement-based characterization of on- line games Prasad.
A Hierarchical Characterization of a Live Streaming Media Workload E. Veloso, V. Almeida W. Meira, A. Bestavros, S. Jin Proceedings of Internet Measurement.
1 Introduction to Macroeconomics Chapter 20 © 2006 Thomson/South-Western.
A Hierarchical Characterization of a Live Streaming Media Workload IEEE/ACM Trans. Networking, Feb Eveline Veloso, Virg í lio Almeida, Wagner Meira,
Understanding Churn in Peer-to-Peer Networks Daniel Stutzbach – University of Oregon Reza Rejaie – University of Oregon Internet Measurement Conference.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
1 Measurement-based Characterization of a Collection of On-line Games Chris Chambers Wu-chang Feng Portland State University Sambit Sahu Debanjan Saha.
1. 2 Which Costs and Benefits to Measure? Controllability: Cost or benefit that changes because of the decision  Measured relative to status quo Relevance:
BCOR 1020 Business Statistics Lecture 11 – February 21, 2008.
RESEARCH METHODS IN EDUCATIONAL PSYCHOLOGY
1 Efficient Management of Data Center Resources for Massively Multiplayer Online Games V. Nae, A. Iosup, S. Podlipnig, R. Prodan, D. Epema, T. Fahringer,
1 CHAPTER M4 Cost Behavior © 2007 Pearson Custom Publishing.
1 Measurement-based Characterization of a Collection of On-line Games Chris Chambers Wu-chang Feng Portland State University Sambit Sahu Debanjan Saha.
6 am 11 am 5 pm Fig. 5: Population density estimates using the aggregated Markov chains. Colour scale represents people per km. Population Activity Estimation.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Class Meeting #11 Data Analysis. Types of Statistics Descriptive Statistics used to describe things, frequently groups of people.  Central Tendency 
Chapter 9 Uniprocessor Scheduling Spring, 2011 School of Computer Science & Engineering Chung-Ang University.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Data Analysis and Forecasting Project – Interim Report Delivered to the DJJ January 2008 Jennifer Lewis Priestley, Ph.D. Shan Muthersbaugh, MS Candidate.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Chapter One: Measurement 11.1 Measurements 11.2 Time and Distance 11.3 Converting Measurements 11.4 Working with Measurements.
Lecture on Correlation and Regression Analyses. REVIEW - Variable A variable is a characteristic that changes or varies over time or different individuals.
Examining Relationships in Quantitative Research
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 18 Inference for Counts.
1 The Quest for the Optimal Experiment RecSys
Time Series Analysis and Forecasting
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Time series Model assessment. Tourist arrivals to NZ Period is quarterly.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
1 …continued… Part III. Performing the Research 3 Initial Research 4 Research Approaches 5 Hypotheses 6 Data Collection 7 Data Analysis.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
AP Statistics Section 11.1 B More on Significance Tests.
Disk Failures Eli Alshan. Agenda Articles survey – Failure Trends in a Large Disk Drive Population – Article review – Conclusions – Criticism – Disk failure.
Time Series Analysis and Forecasting. Introduction to Time Series Analysis A time-series is a set of observations on a quantitative variable collected.
Pin-Yun Tarng / An Analysis of WoW Players’ Game Hours Network and Systems Laboratory nslab.ee.ntu.edu.tw IEEE/IFIP DSN 2008 Network and Systems Laboratory.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Patch Scheduling for On-line Games Chris Chambers Wu-chang Feng Portland State University.
 Recall your experience when you take an elevator.  Think about usually how long it takes for the elevator to arrive.  Most likely, the experience.
Methodology: How Social Psychologists Do Research
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 14 l Time Series: Understanding Changes over Time.
© 2015 McGraw-Hill Education. All rights reserved. Chapter 17 Queueing Theory.
MODELING AND SIMULATION CS 313 Simulation Examples 1.
1 Outline 1.Count data 2.Properties of the multinomial experiment 3.Testing the null hypothesis 4.Examples.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 13 Time.
SECTION 1 TEST OF A SINGLE PROPORTION
Department of Telecommunications NetGames 2011Ottawa, October 2011 MMORPG Player Behavior Model based on Player Action Categories Mirko Suznjevic, Ivana.
Determining How Costs Behave
Linear Regression.
Modeling and Simulation CS 313
CHAPTER 26: Inference for Regression
Process Capability.
Measurement-based Characterization of a Collection of On-line Games
Chapter 5: Sampling Distributions
Presentation transcript:

An Analysis of WoW Players’ Game Hours Matt Ross, Christian Ebinger, Anthony Morgan

What problem is the paper trying to solve? Predicting how long players will stay once they join a game by predicting players online gamers’ subscription time (length of time since he/she first joined the game to the time of his/her last login).

Why is it important? Predict gamers’ gaming hours and unsubscription decisions. With this companies can predict future revenue generated from game subscriptions and can also predict what are the main usage time of the servers.

Previous Research – Rocky MUD Medieval Fantasy MMOG (massive multiplayer online game) developed in 1993 Players advance their character, their way No arbitrary classes or levels, and unlimited power Built-in/live quests, hundreds of mini-quests Advanced tactical interface Massively unique, destinies and fighting styles

Previous Research – Rocky MUD cont. Measured by four variables: – inter-arrival times – avatars’ transition between different regions – region stay times – session lengths Inter-arrival times – time between the arrival of one gamer and the arrival of the next gamer Avatars’ transition between different regions – movement from one regional location to another Region stay times – time a gamer spent in a particular regional location Session lengths – time a gamer played in one sitting

Terms Defined Inter-arrival times of game sessions follow an exponential distribution Transition of avatars between different regions modeled by a first-order Markov chain Region stay time best modeled by Pearson distribution Session length described by Pareto distribution Exponential distribution - a process in which events occur continuously and independently at a constant average rate. First-order Markov chain - the next step only depends on the current state of the system, and not additionally on the state of the system at previous steps. Pearson distribution – is a family of continuous probability distributions. Pareto distribution - the Pareto principle or the "80-20 rule" says that 20% of the population controls 80% of the wealth; (20% of players have longest 80% session times)

Behavior Study of Counter-Strike Two issues: – users’ satisfaction with a game – predictability of the game server’s work load Found that it is extremely difficult to satisfy users Users have short attention spans, session times are usually < 1 hour Number of users on different servers follows a power-law distribution Server workloads exhibits predictable patterns in terms of day and week scales, but the predictability diminishes with larger time scales.

World of Warcraft traces Conjectured that at least four types of information are required to establish a prediction model: – server’s population changes over time – arrival rate and session duration of players – spatial distribution of avatars in the virtual world – movements of avatars over time (how many distinct regions the avatars visit and how long they stay in a region) Number of players fluctuated in a diurnal pattern; 5x increase in the number of players between 4am and 6pm. Session times appeared to follow a power-law distribution where approximately 50% of the gamers remain online for 10 minutes or less. Number of players versus the rank of each zone, from the most populated to the least populated, exhibited a power-law relationship.

Power-law distribution Session times followed a power-law distribution where approximately 50% of the gamers remain online for 10 minutes or less. Gamers online Session time 10mins 1 hour5 hours 7043 gamers

How does this paper solve the problem? Collected their traces by using the who command with different races, professions, and levels. Automated process that happened every 10 minutes with a character that remained connected at all times (ran for 2 years). Monitored 664 days, accounts were observed. Only 7,043 remained active for more than 30 days (Indicates that most accounts were never used after the free trial period expired) Only going to focus only on the 7043 accounts who subscriptions periods are longer than 30 days. Analyze Four Categories: – Subscription Time – Consecutive Game Play – Daily Activities – When Do Gamers Play

Subscription Time: Used the Kaplan-Meier estimator which takes account of the censored status (started playing before and after measurements) of each subscription periods, to estimate the distribution of players’ subscription times. – Kaplan-Meier estimator’s output is called the survival function (reduces to the cumulative distribution function if none of the subscription periods are censored). 50% of users will subscribe for longer than 500 days.

Consecutive Game Play: Consider the distribution of consecutive game play days in order to understand the extent of addiction of WoW gamers. ON period defined as a group of consecutive days during which a player joins the game everyday. OFF period as the interval between two ON periods. OFF periods slightly longer than ON periods on average, but the difference is insignificant. Probabilistically around 80% of gamers’ ON OFF periods are shorter than 5 days. 3% of OFF Periods are longer than 1 month 1% longer than 3 months. Forces another look at sessions (ON PERIODS) and vacations (OFF Periods longer than 30 days) [Chart b] Vacations generally longer then sessions difference is not significant. 50% of sessions are longer than 60 days. Less than 20% of vacations are longer than 180 days, only 20% of people returned to the game after a vacation that long. 20% of the seasons are shorter than 10 days.

When Do Gamers Play?: Occurred during the night, day weekday or weekend. Thought game play would be higher on weekend then on weekday. Slightly true but difference is not significant. Obvious difference between the number of gamers during night hours and morning hours. – Rapid increase around 6:00pm, start to play right after work. Peek at 10:00pm to midday, lowest from 5am to 7am. Number of gamers increase between 6am – 10 pm.

Predictability Analysis: short term Summary of players’ short-term behavior: – Average session time – Average daily session count – Average daily playtime Possible correlation with players’ long-tem behavior: – Average length of ON periods – the Average season length – the overall subscription time Fig. 5 shows the plots of the correlations between the three short-term behavioral factors and the three long-term behavioral factors.

Predictability Analysis: short term We observe that the lengths of the average ON periods are moderately correlated with all the short term behavioral factors, and the average daily play time has the strongest predictability. – Fig. 5(c) shows that, if players’ average daily game time is shorter than 1 hour, then their average ON periods will probably be less than 2 days. – On the other hand, the average daily playtime of highly addicted players can be as high as 10 hours, and they may play the game for more than 20 days without interruption. However, it is clear that the average length of seasons and the overall subscription time do not correlate with all the short-term behavioral factors. Since this indicates that players’ interests may change significantly over time, we cannot simply use an overall average of players’ short-term behavior to predict their long-term game play behavior. Instead monitor the evolution of players’ game hours over time and keep track of their interest in the game in order to accurately predict when unsubscription will occur.

Figure 5

Predictability Analysis: Long Term Examine whether players’ game play behavior in one time period will be carried over to the following period. As shown in Fig. 6, five types of time periods are considered: – session – day – Week – ON period – season. Not surprisingly, the overall playtime between consecutive weeks exhibits the strongest autocorrelations among all the time scales we consider. Session time and daily playtime are also strongly auto-correlated; however, the magnitude is not as strong as that of weekly playtime.

Figure 6

Authors Solution Short Term: – Prediction is feasible Long Term: – Much more difficult – Players’ interest in the game may increase or decrease over time.