1 Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup Alexandru Iosup Parallel and Distributed Systems Group Delft.

Slides:



Advertisements
Similar presentations
1 Cloud Futures Workshop 2010 – Cloud Computing Support for Massively Social Gaming Alexandru Iosup Parallel and Distributed Systems Group Delft University.
Advertisements

SLA-Oriented Resource Provisioning for Cloud Computing
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
The Who, What, Why and How of High Performance Computing Applications in the Cloud Soheila Abrishami 1.
Towards Autonomic Adaptive Scaling of General Purpose Virtual Worlds Deploying a large-scale OpenSim grid using OpenStack cloud infrastructure and Chef.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Ken Birman. Massive data centers We’ve discussed the emergence of massive data centers associated with web applications and cloud computing Generally.
1 NetGames 2010 – CAMEO: Continuous Analytics for Massively Multiplayer Online Games CAMEO : Enabling Social Networks for Massively Multiplayer Online.
1 Google Workshop at TU Delft, 2010 – Online Games and Clouds Cloudifying Games: Rain for the Thirsty Alexandru Iosup Parallel and Distributed Systems.
1 A Performance Study of Grid Workflow Engines Alexandru Iosup and Dick Epema PDS Group Delft University of Technology The Netherlands Corina Stratan Parallel.
1 Trace-Based Characteristics of Grid Workflows Alexandru Iosup and Dick Epema PDS Group Delft University of Technology The Netherlands Simon Ostermann,
1 ASCI, 2010 – Analysis of BBO Fans Social Networks Analysis of BBO Fans, an Online Social Gaming Community Alexandru Iosup Parallel and Distributed Systems.
Cloud Computing (101).
July 13, “How are Real Grids Used?” The Analysis of Four Grid Traces and Its Implications IEEE Grid 2006 Alexandru Iosup, Catalin Dumitrescu, and.
Euro-Par 2008, Las Palmas, 27 August DGSim : Comparing Grid Resource Management Architectures Through Trace-Based Simulation Alexandru Iosup, Ozan.
1 Efficient Management of Data Center Resources for Massively Multiplayer Online Games V. Nae, A. Iosup, S. Podlipnig, R. Prodan, D. Epema, T. Fahringer,
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
SOFTWARE AS A SERVICE PLATFORM AS A SERVICE INFRASTRUCTURE AS A SERVICE.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
1 TI1100A – Internet-Based Applications – A. Iosup Alexandru Iosup Parallel and Distributed Systems Group Delft University of Technology Our team: Undergrad.
Course Announcements Tomorrow, Jan 11, at 09:45, Lecture Hall H: “GPU Programming: Tips and Tricks” Ana Varbanescu Exam-related questions: contact Hai.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Advisor: Professor.
August 28, Performance Analysis of Cloud Computing Services for Many-Tasks Scientific Computing Berkeley, CA, USA Alexandru Iosup, Nezih Yigitbasi,
August 29, Our team: Undergrad Nassos Antoniou, Thomas de Ruiter, Ruben Verboon, … Grad Siqi Shen, Nezih Yigitbasi, Ozan Sonmez Staff Henk Sips,
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
COnvergence of fixed and Mobile BrOadband access/aggregation networks Work programme topic: ICT Future Networks Type of project: Large scale integrating.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
: Massivizing Online Games using Cloud Computing Alexandru Iosup Parallel and Distributed Systems Group, Delft University of Technology, The Netherlands.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
1 EIT ICT Labs Workshop at TU Delft, May 2011 – Cloud Computing Parallel and Distributed Systems Group Delft University of Technology The Netherlands Our.
University of Zagreb MMVE 2012 workshop1 Towards Reinterpretation of Interaction Complexity for Load Prediction in Cloud-based MMORPGs Mirko Sužnjević,
LDBC-Benchmarking Graph-Processing Platforms: A Vision Benchmarking Graph-Processing Platforms: A Vision (A SPEC Research Group Process) Delft University.
1 TUD-PDS A Periodic Portfolio Scheduler for Scientific Computing in the Data Center Kefeng Deng, Ruben Verboon, Kaijun Ren, and Alexandru Iosup Parallel.
1 Cloud Computing Research at TU Delft – A. Iosup Alexandru Iosup Parallel and Distributed Systems Group Delft University of Technology The Netherlands.
May 25, Our team: Undergrad Tim Hegeman, Stefan Hugtenburg, Jesse Donkevliet … Grad Siqi Shen, Guo Yong, Nezih Yigitbasi Staff Henk Sips, Dick Epema,
1 EuroPar 2009 – POGGI: Puzzle-Based Online Games on Grid Infrastructures POGGI: Puzzle-Based Online Games on Grid Infrastructures Alexandru Iosup Parallel.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
Copyright © 2011, Cost-Efficient Hosting and Load Balancing of Massively Multiplayer Online Games Nae, V.; Prodan, R.; Fahringer, T.; Grid Computing.
1 Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup Alexandru Iosup Parallel and Distributed Systems Group Delft University.
CCGrid 2014 Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression Tekin Bicer, Jian Yin and Gagan Agrawal Ohio State.
October 18, Our team: Undergrad Anand Sawant, Ruben Verboon, Gargi Prasad, Arnoud Bakker, Nassos Antoniou, Thomas de Ruiter, … Grad Siqi Shen, Nezih.
1 Challenge the future KOALA-C: A Task Allocator for Integrated Multicluster and Multicloud Environments Presenter: Lipu Fei Authors: Lipu Fei, Bogdan.
1 ROIA 2009 – CAMEO: Continuous Analytics for Massively Multiplayer Online Games CAMEO: Continuous Analytics for Massively Multiplayer Online Games Alexandru.
October 27, Our team: Undergrad Nassos Antoniou, Thomas de Ruiter, Ruben Verboon, … Grad Siqi Shen, Nezih Yigitbasi, Ozan Sonmez Staff Henk Sips,
VENI – Massively Multiplayer Online Games: Near Zero-Cost and Near-Infinite Capability 1 Massively Multiplayer Online Games Near-Zero Cost and Near-Infinite.
How AWS Pricing Works Jinesh Varia Technology Evangelist.
1 LSAP, 2011 – Analysis of Online and Face-to-Face Bridge Communities An Analysis of Social Networks Analysis in Online and Face-to-Face Bridge Communities.
Active Sampling for Accelerated Learning of Performance Models Piyush Shivam, Shivnath Babu, Jeff Chase Duke University.
November 29, Our team: Undergrad Thomas de Ruiter, Anand Sawant, Ruben Verboon, … Grad Siqi Shen, Guo Yong, Nezih Yigitbasi Staff Henk Sips, Dick.
: Massivizing Online Games using Cloud Computing Alexandru Iosup Parallel and Distributed Systems Group, Delft University of Technology, The Netherlands.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
1 Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup Alexandru Iosup Parallel and Distributed Systems Group Delft.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
Cloud Computing from a Developer’s Perspective Shlomo Swidler CTO & Founder mydrifts.com 25 January 2009.
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Alexandru Iosup Parallel and Distributed Systems Group Delft University of Technology The Netherlands Cloud Computing : Open Research Questions.
Cloud Benchmarking, Tools, and Challenges
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Introduction to Cloud Computing
Vlad Nae, Radu Prodan, Thomas Fahringer Institute of Computer Science
Cloud Performance Evaluation at TU Delft (2008—)
Mihai Neacşu, BSc. Prof.dr.eng. Alexandru Iosup Ir. Laurens Versluis
MagnaData: Scheduling Complex Workflows with Non-Functional Requirements in Datacenters Laurens Versluis Massivizing Computer research.
Presentation transcript:

1 Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup Alexandru Iosup Parallel and Distributed Systems Group Delft University of Technology Our team: Undergrad Adrian Lascateu, Alexandru Dimitriu (UPB, Romania), …, Grad Siqi Shen, Yong Guo, Ruud vd Bovenkamp (TU Delft, the Netherlands), … Staff Otto Visser, Dick Epema, Henk Sips, Fernando Kuipers (TU Delft), Thomas Fahringer, Radu Prodan (U. Innsbruck), Nicolae Tapus, Vlad Posea (UPB), … Massivizing Online Games: Distributed Computing Challenges and High Quality Time NICTA, Sydney, Australia, Jul 2012

Why Social Gaming? Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 2

What is This Talk About? Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 3

4 Online Gaming used to be multimedia, is now DC Online Gaming used to be networking, is now all DC Online Gaming used to be v-worlds, is now many apps Online Gaming used to be art, may now be computing Massivizing Social Gaming = Rich Challenge (of Clouds)

(TU) Delft – the Netherlands – Europe pop.: 100,000 pop:16.5 M founded 13 th century pop: 100,000 founded 1842 pop: 13,000

The Parallel and Distributed Systems Group at TU Delft August 31, Johan Pouwelse P2P systems Henk Sips multicore P2P systems Dick Epema grids/clouds e-Science P2P systems Alexandru Iosup online gaming grids/clouds P2P systems Home page Publications see PDS publication database at publications.st.ewi.tudelft.nlpublications.st.ewi.tudelft.nl

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 7 What’s in a name? MSG, MMOG, MMO, … 1.Virtual World Sim Explore, do, learn, socialize, compete + 2.Game Data Player stats and relationships, others + 3.Game Content Graphics, maps, puzzles, quests, culture Massively Social Gaming = (online) games with massive numbers of players (100K+), for which social interaction helps the gaming experience Over 250,000,000 active players

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 8 MSGs are a Popular, Growing Market 25,000,000+ subscribed players (from 250,000,000+ active) Over 10,000 MSGs in operation Subscription market size $7.5B+/year, Zynga $600M+/year Sources: MMOGChart, own research.Sources: ESA, MPAA, RIAA.

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 9 Bungie, Computing then Serving 1.4PB/yr. Halo 3 is one of the many successful games Halo 3 players get, in 1.4PB Detailed player profiles Detailed usage stats Ranking CERN produces ~15PB/year (10x larger) (Not) faster than the speed of light, the Higgs boson (?)

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 10 World of Warcraft, a Traditional HPC User (since 2003) 10 data centers 13,250 server blades, 75,000+ cores 1.3PB storage 68 sysadmins (1/1,000 cores)

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 11 Zynga, an Amazon WS User Sources: CNN, Zynga. Source: InsideSocialGames.com “Zynga made more than $600M in 2010 from selling in-game virtual goods.” S. Greengard, CACM, Apr 2011 Selling in-game virtual goods: “Zynga made est. $270M in 2009 from.”

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 12 Agenda 1.What’s in a Name? 2.Three Current Challenges 1.Platform Scalability Challenge 2.Gaming Analytics Challenge 3.Content Generation Challenge 3.The Next Five Years 4.Conclusion

Research Challenge: V-World Platform for MMOGs Scaling quickly to millions of players - 1M in 4 days, 10M in 2 months - Up-front and operational costs - Performance, Scalability, & Cost

15 november Rich Research Opportunities: How to Build the Cloud-Based Core? Understand the application Deployment models Scheduling Performance engineering Reliability engineering Scalability and Elasticity Cloud inter-operation (federation) Etc. (including Usability, Security, Utility Models, and Programming VENI Grant 2011

15 Computational Model for the Server Single sequential loop [Source: Nae, Iosup, and Prodan, ACM SC 2008] 3 steps in each loop: 1.Game-world state update 2.Entity interaction computation (dominant for MSGs) 3.Entity state updates Load generated by (2) non-deterministic  human factor

16 Model for Entity Interaction Computation Player to Player/Player to Environment interaction Low interaction: O(n), e.g. RTS Medium interaction: O(n∙log(n)) – RPG quest maps High interaction: O(n 2 ) – RPG war maps, FPS Very High interaction: O(n 2 ∙ log(n))/O(n 3 ), unit- target matching, team path-finding, maxflow alg., …

17 Online games hosting model Generic Online Games (non-MM) Static: dedicated isolated single servers MMOGs Static: dedicated clusters - using parallelization techniques Problems with these approaches 1.Large amount of over-provisioning 2.Non-efficient coverage of the world for the provided service [Source: Nae, Iosup, and Prodan, ACM SC 2008]

Cloud Computing VENI Massivizing Online Games using Cloud Computing

19 Using data centers for dynamic resource allocation Main advantages: 1. Significantly lower over-provisioning 2. Efficient coverage of the world is possible Proposed hosting model: dynamic Massive join Massive leave [Source: Nae, Iosup, and Prodan, ACM SC 2008]

20 MSG Ecosystem: Model Game operators Past player activity/business model  Predicted load  requests Data centers Local time-space renting policy  offers Time-Space renting policy, e.g., 1 node-hour Resource allocation: central request-offer matching Rules for ranking request-offer match: 1. The offer size and type vs. the request 2. The geographical proximity offer-request 3. The finer grained resources (quantity & time) [Source: Nae, Iosup, and Prodan, ACM SC 2008]

21 Experimental Setup [1/3] Discrete-Event Simulator Input Traces from RuneScape, a real top-5 MMOG 7 countries, 3 continents More than 130 game worlds Consisting of Geographical location Number of clients Over 10,000 samples at 2 min. interval, 2 weeks Output (for every time-step) Resource allocation decisions Resource allocation Performance metrics [Source: Nae, Iosup, and Prodan, ACM SC 2008]

22 Experimental Setup [3/3] Performance Metrics Resource over-provisioning [%] The wasted resources vs. optimal provisioning at each simulation time step for all utilized machines (cumulative) Resource under-provisioning [%] The amount of resources needed but not allocated, for all machines (computed individually) Significant under-provisioning events (count) Count of events: resource under-provisioning is >1%, for a period of 2 minutes  people leave [Source: Nae, Iosup, and Prodan, ACM SC 2008]

23 Resource Provisioning and Allocation Static vs. Dynamic Provisioning 250% 25% [Source: Nae, Iosup, and Prodan, ACM SC 2008] Over-provisioning (%)

Also Studied Via real game measurements Interactivity model (short-term msmt.) Effects of underperforming platform (long-term msmt.) Via prototype implementation Match model-reality [TPDS’11] Via simulation Impact of virtualization [NetGames’11][IJAMC’11] Economic models [under submission] 24

October 5, Remaining Challenge in Perf. Eng.: To the Real IaaS Cloud “The path to abundance” On-demand capacity Cheap for short-term tasks Great for web apps (EIP, web crawl, DB ops, I/O) “The killer cyclone” Not so great performance for scientific applications (compute- or data-intensive) Not so stable performance Tropical Cyclone Nargis (NASA, ISSS, 04/29/08) VS October 5, Iosup et al., Performance Analysis of Cloud Computing Services for Many Tasks Scientific Computing, IEEE TPDS, 2011, Iosup et al., On the Performance Variability of Production Cloud Services, CCGrid 2011, pds.twi.tudelft.nl/reports/2010/PDS pdfpds.twi.tudelft.nl/reports/2010/PDS pdf

Remaining Challenge in Scheduling: Provisioning and Allocation Policies October 5, Where? When? How many? Time Load ProvisioningAllocation From where? Which type? etc. When? etc. Q: How to select policies? (Source: A. Antoniou, MSc Defense, TU Delft, 2012.)

Remaining Challenge in Deployment: Zynga zCloud: Hybrid Self-Hosted/EC2 After Zynga had large scale More efficient self-hosted servers Run at high utilization Use EC2 for unexpected demand October 5, (Sources: and

Research Challenge: Continuous Analytics for MMOGs Analyzing the behavior of millions of players, on-time - Data mining, data access rights, cost v. accuracy, … - Reduce upfront costs - Low response time & Scalable - Large-scale Graph Processing

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup Sample Analytics Results Analysis of Meta-Gaming Network “When you play a number of games, not as ends unto themselves but as parts of a larger game, you are participating in a metagame.” (Dr. Richard Garfield, 2000) XFire: since 2008 (3+ years), 500K of 20M players * A. Iosup, POGGI: Puzzle-Based Online Games on Grid Infrastructures EuroPar 2009 (Best Paper Award) S. Shen, and A. Iosup, The XFire Online Meta-Gaming Network: Observation and High-Level Analysis, MMVE 2011 PhD

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup Research Challenge: Content Generation for MMOGs Generating content on time for millions of players - Player-customized: Balanced, Diverse, Fresh - Up-front and operational costs - Response time, Scalability, & Cost

31 (Procedural) Game Content (Generation) Game Bits Texture, Sound, Vegetation, Buildings, Behavior, Fire/Water/Stone/Clouds Game Space Height Maps, Bodies of Water, Placement Maps, … Game Systems Eco, Road Nets, Urban Envs, … Game Scenarios Puzzle, Quest/Story, … Game Design Rules, Mechanics, … Hendricks, Meijer, vd Velden, Iosup, Procedural Content Generation for Games: A Survey, ACM TOMCCAP, 2012 Derived Content NewsGen, Storification

32 Puzzle-Specific Considerations Generating Player-Customized Content Puzzle difficulty Solution size Solution alternatives Variation of moves Skill moves Player ability Keep population statistics and generate enough content for most likely cases Match player ability with puzzle difficulty Take into account puzzle freshness 4 21

33 Agenda 1.What’s in a Name? 2.Three Current Challenges 3.The Next Five Years 1.Cloudification 2.Mobile Social Gaming 3.Social Everything! 4.Content, Content, Content 4.Conclusion

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 34 Cloudification: PaaS for MSGs (Platform Challenge) Build MSG platform that uses (mostly) cloud resources Close to players No upfront costs, no maintenance Compute platforms: multi-cores, GPUs, clusters, all-in-one! Performance guarantees Hybrid deployment model Code for various compute platforms—platform profiling Load prediction miscalculation costs real money What are the services? Vendor lock-in? My data

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 35 Mobile Social Gaming and the SuperServer (Platform Challenge) Support MSGs on mobile devices Mobiles everywhere (2bn+ users) Gaming industry for mobiles is new Growing Market SuperServer to generate content for low-capability devices? Battery for 3D/Networked games? Where is my server? (Ad-hoc mobile gaming networks?) Security, cheat-prevention

36 Social Everything! Social Network=undirected graph, relationship=edge Community=sub-graph, density of edges between its nodes higher than density of edges outside sub-graph (Analytics Challenge) Improve gaming experience Ranking / Rating Matchmaking / Recommendations Play Style/Tutoring Self-Organizing Gaming Communities Player Behavior

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 37 Content, Content, Content (Content Challenge) Produce and distribute content for 1BN people Game Analytics  Game statistic Crowdsourcing Storification Auto-generated game content Adaptive game content Content distribution/ Streaming content

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 38 Summary Current The Future Happy players Happy cloud operators Million-user, multi-bn market V-World, Content, Analytics Massivizing Online Gaming Upfront payment Cost and scalability problems Makes players Our Vision DC/Cloud has to help Economy of scale with Ongoing Work Content: POGGI Framework Platform: Analytics: CAMEO Framework Publications Gaming and Clouds 2008: ACM SC 2009: ROIA, CCGrid, NetGames, EuroPar (Best Paper Award), … 2010: IEEE TPDS, Elsevier CCPE 2011: Book Chapter CAMEO, IEEE TPDS, IJAMC 2012: IPDPS, CCGrid, … Graduation (Forecast) 2012—14: 3PhD, 6Msc, 6BSc

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 39 Thank you for your attention! Questions? Suggestions? Observations? Alexandru Iosup (or google “iosup”) Parallel and Distributed Systems Group Delft University of Technology More Info: Do not hesitate to contact me…

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 40

41 Impact on Game Experience Responsive game Unresponsive game October 5, 2015 [Source: Nae, Iosup, and Prodan, ACM SC 2008 and IEEE TPDS 2011]

42 Experimental Setup [2/3] Environment 1 game operator 17 data centers 11 data center time-space renting policies [Source: Nae, Iosup, and Prodan, ACM SC 2008]

43 Impact of Load Prediction Accuracy Q: How does the prediction accuracy impact resource provisioning? A: Good prediction matters. [Source: Nae, Iosup, and Prodan, ACM SC 2008] Under-provisioning events (count)

44 Impact of Interaction Compute-Intensiveness Q: How are different MMOG types handled under dynamic resource provisioning? (Interaction models Low ~ O(n), Medium, High, Very High ~ O(n 3 ) ) A: Over-provisioning, Under-provisioning worse with increase in interaction compute-intensiveness [Source: Nae, Iosup, and Prodan, ACM SC 2008] Over-/Under-provisioning (%)

Latency Tolerance: From None to High Q: What is the impact of latency tolerance on hosting? 45 A: (left) (mid) (right) very sensitive sensitive non-sensitive very costly costly cheap

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup Sample Analytics Results Skill Level Distribution in RuneScape Runescape: 135M active accounts, 7M active (2008) High-scoring players: 1.8M (2007) / 3.5M (2010) (largest MMOG msmt.) Player skill: distribution changes over time * A. Iosup, POGGI: Puzzle-Based Online Games on Grid Infrastructures EuroPar 2009 (Best Paper Award) A. Iosup, A. Lascateu, N. Tapus, CAMEO: Enabling Social Networks for Massively Multiplayer Online Games through Continuous Analytics and Cloud Computing, ACM NetGames 2010.

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 47 Bridge Base Online (BBO): 1M+ players, top free site Dataset: 100K players 9K group Social relationships from bridge pairing Large (~10K) online social groups can coordinate Identified player behavior community builder, community member, random player, faithful Sample Analytics Results Activity and Social Network Interaction group-socnet Coordinated large-scale social group M. Balint, V. Posea, A. Dimitriu, and A. Iosup, An Analysis of Social Gaming Networks in Online and Face to Face Bridge Communities, LSAP 2011.

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 48 Resource Provisioning and Allocation Compound Metrics Trade-off Utility-Cost still needs investigation Performance and Cost are not both improved by the policies we have studied Villegas, Antoniou, Sadjadi, Iosup. An Analysis of Provisioning and Allocation Policies for Infrastructure- as-a-Service Clouds, CCGrid, 2012.

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 49 (Variable) Blackbox Performance Engineering Performance Evaluation of Four Commercial Clouds Amazon EC2, GoGrid, Elastic Hosts, Mosso Resource acquisition Single- and Multi-Instance benchmarking Low compute and networking performance 1 Performance variability over time 2 1- Iosup et al., Performance Analysis of Cloud Computing Services for Many Tasks Scientific Computing, IEEE TPDS, 2011, Iosup et al., On the Performance Variability of Production Cloud Services, CCGrid 2011, pds.twi.tudelft.nl/reports/2010/PDS pdfpds.twi.tudelft.nl/reports/2010/PDS pdf

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 50 Multi-Resource Provisioning/Release Time for multi-resource increases with number of resources Iosup et al., Performance Analysis of Cloud Computing Services for Many Tasks Scientific Computing, (IEEE TPDS 2011). Q1

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 51 GAE Dataset: Run Service Fibonacci [ms]: Time it takes to calculate the 27 th Fibonacci number Highly variable performance until September Last three months have stable performance (low IQR and range) 51 Q2 Iosup, Yigitbasi, Epema. On the Performance Variability of Production Cloud Services, (IEEE CCgrid 2011).

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 52 Online Scheduling + Optimization ExPERT O. Agmon Ben-Yehuda, A. Schuster, A. Sharov, M. Silberstein, and A. Iosup, ExPERT: Pareto-Efficient Task Replication on Grids and a Cloud, IPDPS'12.

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 53 Performance Metrics Makespan very similar Very different job slowdown Q3 Villegas, Antoniou, Sadjadi, Iosup. An Analysis of Provisioning and Allocation Policies for Infrastructure- as-a-Service Clouds, (submitted). PDS Tech.Rep

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 54 Cost Metrics Very different results between actual and charged Cloud charging function an important selection criterion All policies better than Startup in actual cost Policies much better/worse than Startup in charged cost Charged CostActual Cost Q3 Villegas, Antoniou, Sadjadi, Iosup. An Analysis of Provisioning and Allocation Policies for Infrastructure- as-a-Service Clouds, (submitted). PDS Tech.Rep

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 55 Single Resource Provisioning/Release Time depends on instance type Boot time non-negligible Iosup et al., Performance Analysis of Cloud Computing Services for Many Tasks Scientific Computing, (IEEE TPDS 2011). Q1

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 56 CPU Performance of Single Resource ECU definition: “a 1.1 GHz 2007 Opteron” ~ 4 flops per cycle at full pipeline, which means at peak performance one ECU equals 4.4 gigaflops per second (GFLOPS) Real performance GFLOPS = ~1/4..1/7 theoretical peak Iosup et al., Performance Analysis of Cloud Computing Services for Many Tasks Scientific Computing, (IEEE TPDS 2011). Q1

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 57 HPLinpack Performance (Parallel) Low efficiency for parallel compute-intensive applications Low performance vs cluster computing and supercomputing Iosup et al., Performance Analysis of Cloud Computing Services for Many Tasks Scientific Computing, (IEEE TPDS 2011). Q1

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 58 Performance Stability (Variability) High performance stability for the best-performing instances Iosup et al., Performance Analysis of Cloud Computing Services for Many Tasks Scientific Computing, (IEEE TPDS 2011). Q1

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 59 All services exhibit time patterns in performance EC2: periods of special behavior SDB and S3: daily, monthly and yearly patterns SQS and FPS: periods of special behavior October 5, AWS Dataset (4/4): Summary Q2 Iosup, Yigitbasi, Epema. On the Performance Variability of Production Cloud Services, (IEEE CCgrid 2011).

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 60 October 5, Read Latency [s]: Time it takes to read a “User Group” Yearly pattern from January to August The last four months of the year exhibit much lower IQR and range More stable performance for the last five months Probably due to software/infrastructure upgrades GAE Dataset (2/4): Datastore Q2 Iosup, Yigitbasi, Epema. On the Performance Variability of Production Cloud Services, (IEEE CCgrid 2011).

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 61 October 5, PUT [ms]: Time it takes to put 1 MB of data in memcache. Median performance per month has an increasing trend over the first 10 months The last three months of the year exhibit stable performance GAE Dataset (3/4): Memcache Q2 Iosup, Yigitbasi, Epema. On the Performance Variability of Production Cloud Services, (IEEE CCgrid 2011).

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 62 All services exhibit time patterns Run Service: daily patterns and periods of special behavior Datastore: yearly patterns and periods of special behavior Memcache: monthly patterns and periods of special behavior URL Fetch: daily and weekly patterns, and periods of special behavior October 5, GAE Dataset (4/4): Summary Q2 Iosup, Yigitbasi, Epema. On the Performance Variability of Production Cloud Services, (IEEE CCgrid 2011).

Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup Social Everything! Social Network=undirected graph, relationship=edge Community=sub-graph, density of edges between its nodes higher than density of edges outside sub-graph (Analytics Challenge) Improve gaming experience Ranking / Rating Matchmaking / Recommendations Play Style/Tutoring Self-Organizing Gaming Communities Player Behavior

The Game Trace Archive (upcoming) Share gaming traces and best-practices on using them Support simulations and real-world experiments Massivizing Social Games: Distributed Computing Challenges and High Quality Time – A. Iosup 64 NamePeriodSize (GB) Node (M) Edge (M) Category KGS2002/ / Chess Game FICS1997/ / Chess Game BBO2009/ / Card Game XFire2008/ / OMGN Dota League2006/ / RTS DotaLicious2010/ / RTS Dota Garena2009/ / RTS WoWAH2006/ /1030.1N/AMMORPG

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 65 The CAMEO Framework 1.Address community needs Can analyze skill level, experience points, rank Can assess community size dynamically 2.Using on-demand technology: Cloud Comp. Dynamic cloud resource allocation, Elastic IP 3.Data management and storage: Cloud Comp. Crawl + Store data in the cloud (best performance) 4.Performance, scalability, robustness: Cloud Comp. A. Iosup, CAMEO: Continuous Analytics for Massively Multiplayer Online Games on Cloud Resources. ROIA, Euro-Par 2009 Workshops, LNCS 6043, (2010)

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 66 The POGGI Framework Focus on game content generation on grids Use existing middleware Control MMOG-specific workload demands and variability (soft guarantees for low response time by pre-generating content) … but do not forget lessons on system design Add components for capacity planning and process monitoring

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 67 Continuous Analytics for MMOGs MMOG Data = raw and derivative information from the virtual world (millions of users) Continuous Analytics for MMOGs = Analysis of MMOG data s.t. important events are not lost Data collection Data storage Data analysis Data presentation … at MMOG rate and scale

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 68 Continuous Analysis for MMOGs Main Uses By and For Gamers 1.Support player communities 2.Understand play patterns (decide future investments) 3.Prevent and detect cheating or disastrous game exploits (think MMOG economy reset) 4.Broadcasting of gaming events 5.Data for advertisement companies (new revenue stream for MMOGs)

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 69 Other Uses for MMOG Data Biology Disease spread models Social Sciences The emergence and performance of ad hoc groups in contemporary society Emergent behavior in complex systems Psychology Games as coping mechanism (minorities) Games as cure (agoraphobia) Economy Contemporary economic behavior

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 70 The CAMEO Framework [ROIA09] Continuous MSG Analytics on the Cloud Use own resources for continuous or predicted load Use cloud (on-demand, paid-for, guaranteed) resources for sparse or excess load Users (peers) may also provide service (future) A. Iosup, CAMEO: Continuous Analytics for Massively Multiplayer Online Games on Cloud Resources. ROIA, Euro-Par 2009 Workshops, LNCS 6043, pp Springer, Heidelberg (2010)

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 71 CAMEO: Analytics Capabilities 1.Various pieces of information Skill level, experience points, rank 2.Single and Multi-snapshot analysis 3.Analysis functions already implemented Ranking by one or more pieces of information Community statistical properties for a piece of information Identification of Top-K players in single/multi-snapshot Evolution of (Top-)K players Evolution of average community skill Identification of players with special skill combos

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 72 CAMEO: Cloud Resource Management Steady AnalyticsDynamic Analytics Burst Snapshot = dataset for a set of players More machines = more snapshots per time unit Periodic Unexpected

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 73 CAMEO: Exploiting Cloud Features Machines close(r) to server Traffic dominated by small packets (latency) Elastic IP to avoid traffic bans (legalese: acting on behalf of real people) A. Iosup, A. Lascateu, N. Tapus, CAMEO: Enabling Social Networks for Massively Multiplayer Online Games through Continuous Analytics and Cloud Computing, ACM NetGames 2010.

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 74 Cost of Continuous RuneScape Analytics Put a price on MMOG analytics (here, $425/month, or less than $ /user/month) Trade-off accuracy vs. cost, runtime is constant

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 75 Performance Results: Why Choosing the Cloud Matters Location of machines influences MMOG analytics performance (data acquisition)

76 Player-Customized Content Skill Level Distribution in RuneScape RuneScape: 135M+ open accounts (world record) Dataset: 3M players (largest measurement, to date) 1,817,211 over level 100 Max skill 2,280 Number of mid- and high-level players is significant New Content Generation Challenge High Level Mid Level A. Iosup, A. Lascateu, N. Tapus, CAMEO: Enabling Social Networks for Massively Multiplayer Online Games through Continuous Analytics and Cloud Computing, ACM NetGames 2010.

Massivizing Social Games: High Performance Computing and High Quality Time – A. Iosup 77 The POGGI Content Generation Framework Only the puzzle concept, and the instance generation and solving algorithms, are produced at development time * A. Iosup, POGGI: Puzzle-Based Online Games on Grid Infrastructures, EuroPar 2009 (Best Paper Award)

October 5, Bags-of-Tasks (BoTs) Grid Workload Components BoT = set of jobs… …that start at most Δs after the first job Time [units] Parameter Sweep App. = BoT with same binary Workflows (WFs) WF = set of jobs with precedence (think Direct Acyclic Graph)

October 5, Workflow Engines: Performance vs. Resource Consumption Karajan performs better than DAGMan, but runs quickly out of resources. !!!!!!!!!!!!!!!!!!!!!!!! KarajanDAGMan C. Stratan, A. Iosup, D. Epema: A performance study of grid workflow engines. GRID 2008: 25-32