Review Data IMGD 2905.

Slides:



Advertisements
Similar presentations
© 2004 Prentice-Hall, Inc.Chap 5-1 Basic Business Statistics (9 th Edition) Chapter 5 Some Important Discrete Probability Distributions.
Advertisements

© 2003 Prentice-Hall, Inc.Chap 5-1 Basic Business Statistics (9 th Edition) Chapter 5 Some Important Discrete Probability Distributions.
© 2003 Prentice-Hall, Inc.Chap 5-1 Business Statistics: A First Course (3 rd Edition) Chapter 5 Probability Distributions.
© 2002 Prentice-Hall, Inc.Chap 5-1 Basic Business Statistics (8 th Edition) Chapter 5 Some Important Discrete Probability Distributions.
Modeling Process Quality
Statistics.
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
A Review of Probability and Statistics
BCOR 1020 Business Statistics Lecture 15 – March 6, 2008.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Visualizing Events Contingency Tables Tree Diagrams Ace Not Ace Total Red Black Total
Probability and Statistics Review
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter 19 Data Analysis Overview
Discrete Probability Distributions
Statistics 800: Quantitative Business Analysis for Decision Making Measures of Locations and Variability.
Review What you have learned in QA 128 Business Statistics I.
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
 Deviation is a measure of difference for interval and ratio variables between the observed value and the mean.  The sign of deviation (positive or.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Statistical Review We will be working with two types of probability distributions: Discrete distributions –If the random variable of interest can take.
Random Sampling, Point Estimation and Maximum Likelihood.
PTP 560 Research Methods Week 8 Thomas Ruediger, PT.
OPIM 5103 Descriptive Statistics Random Sampling Intro to Probability and Discrete Distributions Jan Stallaert Professor of OPIM.
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
MTH3003 PJJ SEM I 2015/2016.  ASSIGNMENT :25% Assignment 1 (10%) Assignment 2 (15%)  Mid exam :30% Part A (Objective) Part B (Subjective)  Final Exam:
Chapter 2 Describing Data.
OPIM 5103-Lecture #3 Jose M. Cruz Assistant Professor.
 A probability function is a function which assigns probabilities to the values of a random variable.  Individual probability values may be denoted by.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
Determination of Sample Size: A Review of Statistical Theory
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Introduction to Statistics Santosh Kumar Director (iCISA)
ENGR 610 Applied Statistics Fall Week 2 Marshall University CITE Jack Smith.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
 I can identify the shape of a data distribution using statistics or charts.  I can make inferences about the population from the shape of a sample.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
BA 275 Quantitative Business Methods
Chap 5-1 Chapter 5 Discrete Random Variables and Probability Distributions Statistics for Business and Economics 6 th Edition.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
Statistics © 2012 Project Lead The Way, Inc.Principles of Engineering.
Chapter 18 Data Analysis Overview Yandell – Econ 216 Chap 18-1.
IMGD 2905 Probability Chapters 4 & 5.
Introduction to Probability - III John Rundle Econophysics PHYS 250
Descriptive Statistics
MECH 373 Instrumentation and Measurements
Data Analysis.
Statistics Principles of Engineering © 2012 Project Lead The Way, Inc.
Statistics Statistics Principles of EngineeringTM
Business Statistics Topic 4
Module 6: Descriptive Statistics
Statistics in Applied Science and Technology
Statistics Statistics Principles of EngineeringTM
Central tendency and spread
Statistics Statistics- Inferential Statistics Descriptive Statistics
Statistics: The Interpretation of Data
Statistics Principles of Engineering © 2012 Project Lead The Way, Inc.
Chapter 6: Becoming Acquainted with Statistical Concepts
Review for Exam 1 Ch 1-5 Ch 1-3 Descriptive Statistics
Chapter Nine: Using Statistics to Answer Questions
OAD30763 Statistics in Business and Economics
Introductory Statistics
Review IMGD 2905.
Presentation transcript:

Review Data IMGD 2905

What are two main sources for data for game analytics?

What are two main sources for data for game analytics? Quantitative – instrumented game Qualitative – subjective evaluation

What steps are in the game analytics pipeline?

What steps are in the game analytics pipeline? Game (instrumented) Data (collected from players) Extracted data (e.g., from scripts) Analysis Statistics, Charts, Tests Dissemination Report Talk

What is population versus sample?

What is population versus sample? Population – all members of group pertaining to study Sample – part of population selected for analysis

What is probability sampling?

What is probability sampling? Probability sampling - sampling considering likelihood of selection Consider likelihood as part of population

What is a Pareto chart? When used?

What is a Pareto chart? When used? Bar chart, arranged most to least frequent Line showing cumulative percent Helps identify most common https://goo.gl/S7qDTJ

When should you not use pie chart?

When should you not use pie chart? When too many slices http://cdn.arstechnica.net/FeaturesByVersion.png

When should you not use pie chart? (Often) when comparing pies

Which Measure of Central Tendency to Use? Why?

What are Quartiles?

What are Quartiles?

Describe how to Compute Variance

Describe how to Compute Variance Compute mean Compute how far each sample value is from mean. Square this. Add these up. Divide by number of samples.

Describe what Standard Deviation is in Words

Describe what Standard Deviation is in Words “The ‘average’ of how far each sample point is from the mean”

Empirical Rule 1000 data points Mean of 50 Standard deviation of 10 How many points are between 20-80? How many points are between 40-60? Between 40-60?

Empirical Rule 1000 data points Mean of 50 Standard deviation of 10 How many points are between 20-80? Nearly all (99.7%), so only about 3 outside How many points are between 40-60? About 700 (68%) Between 40-60? About 950 (95%)

Rank the Following High to Low in Susceptibility to Outliers Measure of Variation Most to Least Semi-interquartile Range Range Coefficient of Variation

Rank the Following High to Low in Susceptibility to Outliers Measure of Variation Most to Least Semi-interquartile Range Range Coefficient of Variation Range Coefficient of Variation Semi-interquartile Range

Probability In probability, what is an exhaustive set of events?

Probability In probability, what is an exhaustive set of events? A set of all possible outcomes of an experiment or observation e.g., coin: events {heads, tails} e.g., picking champion in LoL: events {Darius, Leona, Fizz, …} (all possible Champions listed)

Broadly, What are 3 Ways to Assign Probabilities?

Broadly, What are 3 Ways to Assign Probabilities? Classical (theory) Empirical (by measurement/observation) Subjective (hunch – sometimes guided by a bit of theory)

Probability Draw 2 cards. What is the probability of drawing 2 Jacks?

Probability Draw 2 cards. What is the probability of drawing 2 Jacks? P(2J) = P(J) x P(J | J) = 2/5 x 1/4 = 1/10

Probability Draw 3 cards. What is the probability of not drawing at least one King?

Probability Draw 3 cards. What is the probability of not drawing at least one King? P(K’) x P(K’ | K’) x P(K’ | K’K’) = 3/5 x 2/4 x 1/3 = 6/60 = 1/10

What are the characteristics of an experiment with a binomial distribution of outcomes?

What are the characteristics of an experiment with a binomial distribution of outcomes? Experiment consists of n independent, identical trials Each trial results in only success or failure Random variable of interest (X) is number of S’s in n trials http://www.vassarstats.net/textbook/f0603.gif

What are the characteristics of an experiment with a Poisson distribution of outcomes?

What are the characteristics of an experiment with a Poisson distribution of outcomes? Interval (e.g., time) with units Probability of event same for all interval unit Number of events in one unit independent of others Events occur singly (not simultaneously) Phrase people use is “random arrivals”

What is the Standard Normal Distribution?

What is the Standard Normal Distribution? Mean μ = 0 Std dev σ = 1

Sampling Error What is the sampling error?

Sampling Error What is the sampling error? Error from estimating population parameters from sample statistics The size of the error is based on what two main factors?

Sampling Error What is the sampling error? Error from estimating population parameters from sample statistics The size of the error is based on what two main factors? Population variance Number of samples

Confidence Intervals What is a confidence interval? Give an example

Confidence Intervals What is a confidence interval? Give an example Range of values with specific certainty that population parameter is within 95% confidence interval for time to complete a level in Super Mario: [1.25 minutes, 1.75 minutes] What is the size of confidence interval based on?

Confidence Intervals What is a confidence interval? Give an example Range of values with specific certainty that population parameter is within 95% confidence interval for time to complete a level in Super Mario: [1.25 minutes, 1.75 minutes] What is the size of confidence interval based on? Confidence (1-) Standard error (number of samples) standard deviation