Short Resume of Statistical Terms Fall 2013 By Yaohang Li, Ph.D.

Slides:



Advertisements
Similar presentations
Random Variables ECE460 Spring, 2012.
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chapter 5 Discrete Random Variables and Probability Distributions
Review of Basic Probability and Statistics
Chapter 4 Discrete Random Variables and Probability Distributions
Probability Densities
Chapter 6 Introduction to Sampling Distributions
Chapter 6 Continuous Random Variables and Probability Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Estimation of parameters. Maximum likelihood What has happened was most likely.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Probability By Zhichun Li.
Probability and Statistics Review
CHAPTER 6 Statistical Analysis of Experimental Data
Probability Distributions Random Variables: Finite and Continuous Distribution Functions Expected value April 3 – 10, 2003.
3-1 Introduction Experiment Random Random experiment.
Continuous Random Variables and Probability Distributions
Inferences About Process Quality
Discrete Random Variables and Probability Distributions
Lecture II-2: Probability Review
Modern Navigation Thomas Herring
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Review of Probability.
Review of Probability.
1 Chapter 12 Introduction to Statistics A random variable is one in which the exact behavior cannot be predicted, but which may be described in terms of.
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
STAT 552 PROBABILITY AND STATISTICS II
Chapter 5 Sampling Distributions
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Random variables Petter Mostad Repetition Sample space, set theory, events, probability Conditional probability, Bayes theorem, independence,
AP Statistics Chapter 9 Notes.
TELECOMMUNICATIONS Dr. Hugh Blanton ENTC 4307/ENTC 5307.
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
Random Sampling, Point Estimation and Maximum Likelihood.
General Principle of Monte Carlo Fall 2013 By Yaohang Li, Ph.D.
Dr. Gary Blau, Sean HanMonday, Aug 13, 2007 Statistical Design of Experiments SECTION I Probability Theory Review.
Ex St 801 Statistical Methods Probability and Distributions.
Theory of Probability Statistics for Business and Economics.
Modular 11 Ch 7.1 to 7.2 Part I. Ch 7.1 Uniform and Normal Distribution Recall: Discrete random variable probability distribution For a continued random.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Discrete Probability Distributions. Random Variable Random variable is a variable whose value is subject to variations due to chance. A random variable.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
1 Since everything is a reflection of our minds, everything can be changed by our minds.
Dr. Ahmed Abdelwahab Introduction for EE420. Probability Theory Probability theory is rooted in phenomena that can be modeled by an experiment with an.
Basic Concepts of Probability CEE 431/ESS465. Basic Concepts of Probability Sample spaces and events Venn diagram  A Sample space,  Event, A.
Stats Probability Theory Summary. The sample Space, S The sample space, S, for a random phenomena is the set of all possible outcomes.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Mean, Variance, Moments and.
B AD 6243: Applied Univariate Statistics Data Distributions and Sampling Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Continuous Random Variables and Probability Distributions
1 Probability: Introduction Definitions,Definitions, Laws of ProbabilityLaws of Probability Random VariablesRandom Variables DistributionsDistributions.
Chapter 20 Statistical Considerations Lecture Slides The McGraw-Hill Companies © 2012.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Chapter 2: Probability. Section 2.1: Basic Ideas Definition: An experiment is a process that results in an outcome that cannot be predicted in advance.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Unit 4 Review. Starter Write the characteristics of the binomial setting. What is the difference between the binomial setting and the geometric setting?
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Random Variables By: 1.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
MECH 373 Instrumentation and Measurements
IEE 380 Review.
Probability.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Presentation transcript:

Short Resume of Statistical Terms Fall 2013 By Yaohang Li, Ph.D.

Review Last Class –Introduction to Monte Carlo This Class –Important Statistics Terms Random Events –Independence of Random Events –Axioms on Random Events Random Variables –Independence of Random Variables CDF PDF Expectation –Characteristics of Expectation Moments of a Distribution –rth moment –rth central moment Mean Variance Standard Deviation Covariance –Characteristics of covariance Review of Statistics and Probability Terms Important Distribution Central Limit Theorem Estimand and Estimator Next Class –Monte Carlo for Integration

Random Events and Probability Random Event –An event which has a chance of happening Probability –A numerical measure of that chance –Lying between 0 and 1, both inclusive Terminology –P(A) The probability that an event A occurs –P(A+B+…) The probability that at least one of the events A, B, … occurs –P(AB…) The probability that all the events A, B, … occur –P(A|B) The probability that the event A occurs when it known that the event B occurs Conditional probability of A given B

Axioms in Probability P(A+B+…)  P(A)+P(B)+… –If only one of the events A, B, … can occur, they are called exclusive. The equality holds –If at least one of the events A, B, … must occur, they are called exhaustive. P(A+B+…)=1 P(AB)=P(A|B)P(B) –If P(A|B)=P(A), A and B are independent The chance of A occurring is uninfluenced by the occurrence of B

Random Variables and Distributions Random variable (  ) –A number to characterize a set of exclusive and exhaustive events Cumulative Distribution Function (CDF) –F(y)=P(   y) –The probability that the event which occurs has a value  not exceeding a prescribed y –F(+  )=1 and F(-  )=1 –F(y) is a non-decreasing function of y

Expectation If g(  ) is a function of , the expectation (or mean value) of g is denoted and defined by –Stieltjes integral –The integral is taken over all values of y Explanation –Continuous random events F(y) is continuous and f(y) is a derivative –Discrete random events F(y) is a step function and f i is the step of height at the points of y i Probability Density Function (pdf) –f(y) and y i are the probability density functions

More on Expectation The statistical physicist uses another notation for expectation –Suppose p i is the probability density function How about if g(x) is a constant function?

Linear Combination of the Expectation Values

Multi-dimensional Distribution Multi-dimensional Random Variable –Represented used a vector  Multi-dimensional CDF –F(y)=P(   y)   y means that each coordinate of  is not greater than the corresponding coordinate of y Expectation –Continuous multidimensional events where

Independence of Random Variables Consider a set of exhaustive and exclusive events, each characterized by a pair of numbers  and , for which F(y,z) is the distribution. G(y) is an CDF for  and H(z) is an CDF for . –F(y,z) = P(   y,   z) –G(y) = P(   y) –H(z) = P(   z) If it so happens that –F(y,z)=G(y)H(z) for all y and z –the random variables  and  are called independent

Characteristics of Expectations Hold regardless whether or not the random variables  i are independent or not Hold only  i are mutual independent

Moments of Distribution rth moment of a distribution –E(  r ) Principle moment –  = E(  ) rth central moment –  r = E{(  -  ) r } Most important moments –  = E(  ), known as the mean of  Measure of location of a random variable –  2, known as the variance of  (usually used abbreviation of “var”) Measure of dispersion about the mean –standard deviation –coefficients of variation  / 

Covariance Definition of covariance (usually abbreviation of cov) –If  and  are random variables with means  and v, respectively, the quantity E{(  -  )(  -v)} is called the covariance of  and  –If  and  are independent, the covariance is 0 Why? –Also, cov( ,  )=var(  ) Why?

Important Formula of Covariance

Correlation Coefficient Definition –Always between +1 and -1 –If  =0, they are not correlated –If  <0, they are negatively correlated –If  >0, they are positively correlated

Important Distributions Uniform Distribution Exponential Distribution Binomial Distribution Poison Distribution Normal Distribution

Uniform Distribution Uniform Distribution (Rectangle Distribution) –A distribution has constant probability –Mean? –Variance?

Exponential Distribution –mean 1/ –variance 1/ 2

Binomial Distribution –Discrete probability distribution Pp(n|N) of obtaining exactly n successes out of N Bernoulli trials –Each Bernoulli trial is true with probability p and false with probability q=1-p = =

Poisson Distribution –The limit of the Binomial Distribution –Mean is v –Variance is v

Normal Distribution Normal Distribution (Gaussian Distribution) –Bell curve –De Moivre developed the normal distribution as an approximation to the binomial distribution

Normal Distribution in Data Analysis 68.26% of the data will be found within one SD either side of the mean (±1SD)  95.44% of the data will be found within two SD either side of the mean(±2SD)  99.74% of the data will be found within three SD either side of the mean (±3SD)

Central Limit Theorem –The sum of n independent random variables has an approximately normal distribution when n is large Random variables conform to arbitrary distribution

Central Limit Theorem in Practice In practice –n = 10 is reasonably large number –n = 25 is rather large (effective infinite)

Estimation Monte Carlo Computation –Goal: estimating the unknown numerical value of some parameter of some distribution The parameter is called an estimand Sample The available data (may consist of a number of observed random variables) The number of observations in the sample is called the sample size Estimand –mean (  1+  2+…+  n)/n –weighted average (w1  1+w2  2+…+wn  n)/(w1+w2+…+wn) May be a better estimator Connection between the sample and the estimand –The estimand is a parameter of the distribution of the random variables constituting the sample

Sampling Distribution Parent Distribution –We can represent the sample by a vector  with coordinates  1,  2,  3,…,  n –The distribution of  1,  2,  3,…,  n is called the Parent Distribution –To estimate the estimand  (a parameter of the Parent Distribution), we use some function t(  ) t is an estimator Sampling Distribution –  is a random variable, so is t(  ) if we repeated the experiment, we should expect to get a different value of  –Since  varies from experiment, t(  ) has a distribution, called sampling distribution –If t(  ) is to be close to , then the sampling distribution ought to be closely concentrated around 

Measuring Sampling Distribution The bias of t –The difference between  and the average value of t(  ) –  =E{t(  )-  } –t is an unbiased estimator if  =0 The sampling variance of t –  2 t =var{t(  )}=E{[t(  )-Et(  )] 2 }=E{[t-  -  ] 2 } If  and  2 t are small, t is a good estimator

Important Estimators Mean of the parent distribution –standard error Variance of the parent distribution –standard error

Efficiency Goal of Monte Carlo Work –Obtain a respectably small standard error in the final result –More random samples can lead to better accuracy Not very rewarding –Variance Reduction Method

Summary Important Statistics Terms –Random Events Independence of Random Events Axioms on Random Events –Random Variables Independence of Random Variables –CDF –PDF –Expectation Characteristics of Expectation –Moments of a Distribution rth moment rth central moment –Mean –Variance –Standard Deviation –Covariance Characteristics of covariance –Correlation Coefficient

Summary (Cont.) Important Distributions –Uniform Distribution –Exponential Distribution –Binomial Distribution –Poison Distribution –Normal Distribution Estimation –Sample –Estimand –Parent Distribution –Sampling Distribution –Estimator Important estimators –Buffon’s Needle

What I want you to do? Review Slides Review basic probability/statistics concepts Work on your Assignment 1