DATA Exploration: Statistics (One Variable) 1.Basic EXCELL/MATLAB functions for data exploration 2.Measures of central tendency, Distributions 1.Mean 2.Median.

Slides:



Advertisements
Similar presentations
University of Minnesota-Duluth, Econ-2030 (Dr. Tadesse) Inferential Statistics.
Advertisements

Yaochen Kuo KAINAN University . SLIDES . BY.
1 1 Slide Continuous Probability Distributions Chapter 6 BA 201.
1 1 Slide © 2001 South-Western College Publishing/Thomson Learning Anderson Sweeney Williams Anderson Sweeney Williams Slides Prepared by JOHN LOUCKS QUANTITATIVE.
Chapter 6 Continuous Probability Distributions
Continuous Probability Distributions
Business and Finance College Principles of Statistics Eng. Heba Hamad 2008.
Sampling and Sampling Distribution
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Slides by JOHN LOUCKS St. Edward’s University.
Review of Basic Statistics. Parameters and Statistics Parameters are characteristics of populations, and are knowable only by taking a census. Statistics.
Sampling and Sampling Distributions Simple Random Sampling Point Estimation Sampling Distribution.
1 1 Slide © 2001 South-Western/Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
Exploring Marketing Research William G. Zikmund
Continuous Probability Distributions A continuous random variable can assume any value in an interval on the real line or in a collection of intervals.
Chapter 3, Part B Continuous Probability Distributions
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter 7 Sampling and Sampling Distributions n Simple Random Sampling n Point Estimation n Introduction to Sampling Distributions n Sampling Distribution.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2001 South-Western/Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
Sampling The sampling errors are: for sample mean
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2016 Cengage Learning. All Rights Reserved. A population is the set of all the elements of interest. A population is the set of all the elements.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 7, Part A Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #5 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution.
1 1 Slide Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
IT College Introduction to Computer Statistical Packages Eng. Heba Hamad 2009.
1 Chapter 6 Continuous Probability Distributions.
Determination of Sample Size: A Review of Statistical Theory
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 6 Continuous Probability Distributions n Uniform Probability Distribution n Normal.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Continuous Probability Distributions. A continuous random variable can assume any value in an interval on the real line or in a collection of intervals.
Business Statistics (BUSA 3101). Dr.Lari H. Arjomand Continus Probability.
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Fundamentals of Business Statistics chapter7 Sampling and Sampling Distributions 上海金融学院统计系 Statistics Dept., Shanghai Finance University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide Chapter 2 Continuous Probability Distributions Continuous Probability Distributions.
St. Edward’s University
Chapter 7 (b) – Point Estimation and Sampling Distributions
St. Edward’s University
Sampling and Sampling Distribution
St. Edward’s University
Chapter 5, Part A [SBE 7/e, Ch. 7] Sampling and Sampling Distributions
Chapter 6 Continuous Probability Distributions
Econ 3790: Business and Economics Statistics
Chapter 7 Sampling and Sampling Distributions
Chapter 6 Continuous Probability Distributions
Presentation transcript:

DATA Exploration: Statistics (One Variable) 1.Basic EXCELL/MATLAB functions for data exploration 2.Measures of central tendency, Distributions 1.Mean 2.Median 3.Mode 3.Measures of spread 1.Range 2.Variance 4.Simple Sampling 5.Example of Sampling by using EXCELL

2 1. Working with Data in Excel: Arithmetic

3 Use “Insert” then “Function” then “All” or “Statistical” to find an alphabetical list of functions 1. Summary Statistics in EXCELL (One Variable)

4 1. Summary Statistics in EXCELL Average

5 1. Summary Statistics in EXCELL (Median)

6 1. Summary Statistics in EXCELL (Standard Deviation)

7 1. Summary Statistics in EXCELL (Rand & RandBetween)

8 1. Summary Statistics in EXCELL (Sort )

FunctionDescription maxMaximum value meanAverage or mean value medianMedian value minSmallest value modeMost frequent value stdStandard deviation varVariance, which measures the spread or dispersion of the values 1. Summary Statistics in MATLAB

2. Distributions Continuous Probability Distributions  Uniform Probability Distribution  Normal Probability Distribution  Exponential Probability Distribution f ( x ) x x Uniform x Normal x x Exponential

Uniform Probability Distribution where: a = smallest value the variable can assume b = largest value the variable can assume b = largest value the variable can assume f ( x ) = 1/( b – a ) for a < x < b f ( x ) = 1/( b – a ) for a < x < b = 0 elsewhere = 0 elsewhere f ( x ) = 1/( b – a ) for a < x < b f ( x ) = 1/( b – a ) for a < x < b = 0 elsewhere = 0 elsewhere n A random variable is uniformly distributed whenever the probability is proportional to the interval’s length. n The uniform probability density function is:

Var( x ) = ( b - a ) 2 /12 E( x ) = ( a + b )/2 Uniform Probability Distribution n Expected Value of x n Variance of x

The highest point on the normal curve is at the The highest point on the normal curve is at the mean, which is also the median and mode. mean, which is also the median and mode. The highest point on the normal curve is at the The highest point on the normal curve is at the mean, which is also the median and mode. mean, which is also the median and mode. Normal Probability Distribution n Characteristics x

Normal Probability Distribution n Characteristics The mean can be any numerical value: negative, The mean can be any numerical value: negative, zero, or positive. zero, or positive. The mean can be any numerical value: negative, The mean can be any numerical value: negative, zero, or positive. zero, or positive. x

3. Normal Probability Distribution n Characteristics  = 15  = 25 The standard deviation determines the width of the curve: larger values result in wider, flatter curves. The standard deviation determines the width of the curve: larger values result in wider, flatter curves. x

n Converting to the Standard Normal Distribution Standard Normal Probability Distribution We can think of z as a measure of the number of standard deviations x is from .

3. Normal Probability Distribution n Characteristics x  – 3   – 1   – 2   + 1   + 2   + 3  68.26% 95.44% 99.72%

4. Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to Sampling Distributions Point Estimation Point Estimation Simple Random Sampling Simple Random Sampling Other Sampling Methods Other Sampling Methods Sampling Distribution of Sampling Distribution of

4. Simple Random Sampling:  Finite populations are often defined by lists such as:  Organization membership roster  Credit card account numbers  Inventory product numbers n n A simple random sample of size n from a finite population of size N is a sample selected such that each possible sample of size n has the same probability of being selected.

s is the point estimator of the population standard s is the point estimator of the population standard deviation . deviation . s is the point estimator of the population standard s is the point estimator of the population standard deviation . deviation . In point estimation we use the data from the sample In point estimation we use the data from the sample to compute a value of a sample statistic that serves to compute a value of a sample statistic that serves as an estimate of a population parameter. as an estimate of a population parameter. In point estimation we use the data from the sample In point estimation we use the data from the sample to compute a value of a sample statistic that serves to compute a value of a sample statistic that serves as an estimate of a population parameter. as an estimate of a population parameter. 4. Point Estimation We refer to as the point estimator of the population We refer to as the point estimator of the population mean . mean . We refer to as the point estimator of the population We refer to as the point estimator of the population mean . mean . is the point estimator of the population proportion p. is the point estimator of the population proportion p.

 Process of Statistical Inference The value of is used to make inferences about the value of . The sample data provide a value for the sample mean. A simple random sample of n elements is selected from the population. Population with mean  = ? Sampling Distribution of

4. Simple Random Sampling The applicants were numbered, from 1 to 900, as their applications arrived. She decides a sample of 30 applicants will be used. Furthermore, the Director of Admissions must obtain estimates of the population parameters of interest for a meeting taking place in a few hours. Now suppose that the necessary data on the current year’s applicants were not yet entered in the college’s database. The population parameters of interest are the SAT scores and the percentage of students planning to live in dorms.

 Taking a Sample of 30 Applicants Excel’s RAND function generates Excel’s RAND function generates random numbers between 0 and 1 random numbers between 0 and 1 Excel’s RAND function generates Excel’s RAND function generates random numbers between 0 and 1 random numbers between 0 and 1 4. Simple Random Sampling: Step 1: Assign a random number to each of the 900 applicants. applicants. Step 2: Select the 30 applicants corresponding to the 30 smallest random numbers. 30 smallest random numbers.

4. Using Excel to Select a Simple Random Sample n Excel Formula Worksheet

4. Using Excel to Select a Simple Random Sample n Excel Value Worksheet

n Put Random Numbers in Ascending Order 4. Using Excel to Select a Simple Random Sample Step 4 When the Sort dialog box appears: Choose Random Numbers in the Choose Random Numbers in the Sort by text box Sort by text box Choose Ascending Choose Ascending Click OK Click OK Step 3 Choose the Sort option Step 2 Select the Data menu Step 1 Select cells A2:A901

Using Excel to Select a Simple Random Sample n Excel Value Worksheet (Sorted)

as Point Estimator of  as Point Estimator of  n as Point Estimator of p Point Estimation Note: Different random numbers would have identified a different sample which would have resulted in different point estimates. s as Point Estimator of  s as Point Estimator of 

PopulationParameterPointEstimatorPointEstimateParameterValue  = Population mean SAT score SAT score  = Population std. deviation for deviation for SAT score SAT score 80 s = Sample std. s = Sample std. deviation for deviation for SAT score SAT score75.2 p = Population pro- portion wanting portion wanting campus housing campus housing Summary of Point Estimates Obtained from a Simple Random Sample = Sample mean = Sample mean SAT score SAT score = Sample pro- = Sample pro- portion wanting portion wanting campus housing campus housing

Other Sampling Methods n Stratified Random Sampling n Cluster Sampling n Systematic Sampling n Convenience Sampling n Judgment Sampling