Basic statistics Usman Roshan.

Slides:



Advertisements
Similar presentations
Probability Distribution
Advertisements

Lecture Discrete Probability. 5.3 Bayes’ Theorem We have seen that the following holds: We can write one conditional probability in terms of the.
AP Statistics Chapter 7 – Random Variables. Random Variables Random Variable – A variable whose value is a numerical outcome of a random phenomenon. Discrete.
Chapter 12 Probability © 2008 Pearson Addison-Wesley. All rights reserved.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Statistics 1: Introduction to Probability and Statistics Section 3-3.
Review of Basic Probability and Statistics
Descriptive statistics Experiment  Data  Sample Statistics Sample mean Sample variance Normalize sample variance by N-1 Standard deviation goes as square-root.
1 Chap 5 Sums of Random Variables and Long-Term Averages Many problems involve the counting of number of occurrences of events, computation of arithmetic.
Class notes for ISE 201 San Jose State University
SUMS OF RANDOM VARIABLES Changfei Chen. Sums of Random Variables Let be a sequence of random variables, and let be their sum:
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
The moment generating function of random variable X is given by Moment generating function.
 The Law of Large Numbers – Read the preface to Chapter 7 on page 388 and be prepared to summarize the Law of Large Numbers.
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Standard error of estimate & Confidence interval.
Prof. SankarReview of Random Process1 Probability Sample Space (S) –Collection of all possible outcomes of a random experiment Sample Point –Each outcome.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Discrete Random Variables Chapter 4.
Ch2: Probability Theory Some basics Definition of Probability Characteristics of Probability Distributions Descriptive statistics.
Day 2 Review Chapters 5 – 7 Probability, Random Variables, Sampling Distributions.
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 26.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
Introduction to Behavioral Statistics Probability, The Binomial Distribution and the Normal Curve.
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
X = 2*Bin(300,1/2) – 300 E[X] = 0 Y = 2*Bin(30,1/2) – 30 E[Y] = 0.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 5 Discrete Random Variables.
COMP 170 L2 L17: Random Variables and Expectation Page 1.
King Saud University Women Students
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Math b (Discrete) Random Variables, Binomial Distribution.
Stats Probability Theory Summary. The sample Space, S The sample space, S, for a random phenomena is the set of all possible outcomes.
Expectation. Let X denote a discrete random variable with probability function p(x) (probability density function f(x) if X is continuous) then the expected.
Week 121 Law of Large Numbers Toss a coin n times. Suppose X i ’s are Bernoulli random variables with p = ½ and E(X i ) = ½. The proportion of heads is.
By Satyadhar Joshi. Content  Probability Spaces  Bernoulli's Trial  Random Variables a. Expectation variance and standard deviation b. The Normal Distribution.
1 Probability and Statistical Inference (9th Edition) Chapter 5 (Part 2/2) Distributions of Functions of Random Variables November 25, 2015.
6.2 BINOMIAL PROBABILITIES.  Features  Fixed number of trials (n)  Trials are independent and repeated under identical conditions  Each trial has.
Probability and Moment Approximations using Limit Theorems.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
Chapter 6 Large Random Samples Weiqi Luo ( 骆伟祺 ) School of Data & Computer Science Sun Yat-Sen University :
Chebyshev’s Inequality Markov’s Inequality Proposition 2.1.
Background for Machine Learning (I) Usman Roshan.
Conditional Probability 423/what-is-your-favorite-data-analysis-cartoon 1.
Applied statistics Usman Roshan.
Sampling and Sampling Distributions
Applied statistics Usman Roshan.
Basic statistics Usman Roshan.
Probability for Machine Learning
Business Statistics Topic 4
Appendix A: Probability Theory
Discrete Random Variables
Parameter, Statistic and Random Samples
Probability Review for Financial Engineers
The Binomial Distribution
Probability Key Questions
Day 13 AGENDA: DG minutes.
Chebychev, Hoffding, Chernoff
CHAPTER 15 SUMMARY Chapter Specifics
Day 12 AGENDA: DG minutes Work time --- use this time to work on practice problems from previous lessons.
6.3 Sampling Distributions
9. Limit Theorems.
AP Statistics Chapter 16 Notes.
Bernoulli Trials Two Possible Outcomes Trials are independent.
Chap 11 Sums of Independent Random Variables and Limit Theorem Ghahramani 3rd edition 2019/5/16.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Chapter 11 Probability.
Presentation transcript:

Basic statistics Usman Roshan

Basic probability and stats Random variable Probability of an event Coin toss example Independent random variables Mean and variance of a random variable Correlation between random variables Probability distributions Central limit theorem

Random variable A variable normally takes on different values Random variable has values with different probabilities Coin toss example Dice example Probabilities must sum to 1

Probability of event Sample space: set of total possible outcomes Event space: set of outcomes of interest Probability of an event is (size of event space)/(size of sample space) Counting: how many ways to pick k unique items from a set of n items? Probability and counting Bernoulli trials Coin tossing example R function: rbinom

Basic stats Independent events: coin toss example Expected value of a random variable – example of Bernoulli and Binomal Variance of a random variable Correlation coefficient (same as Pearson correlation coefficient) Formulas: Covariance(X,Y) = E((X-μX)(Y-μY)) Correlation(X,Y)= Covariance(X,Y)/σXσY Pearson correlation

Probability distributions Binomial distribution sum of Bernoulli trials converges to Gaussian as number of trials approaches infinity R functions Reference card rbinom Repeated executions of rbinom Lists for loops sum, length Gaussian distribution Chi-square distribution

Limit theorems Law of large numbers: empirical mean converges to true mean as we do more trials (follows from Chebyshev’s and Markov’s inequalities)

Law of large numbers Markov’s inequality Chebyshev’s inequality Law of large numbers: sample mean of n i.i.d. random variables Xi converges to true one in probability Can be proved by applying Chebyshev’s inequality

Limit theorems Central limit theorem: average of sampling distribution converges to a normal distribution as we do more trials. Specifically, it is normally distributed with mean equal to the true mean μ and standard deviation equal to σ/sqrt(n) where n is number of trials and σ is true standard deviation How is this useful? Consider modeling the mean height of NJ residents. Can we assume it is normally distributed due to Central Limit Theorem?