Introduction to Statistics Santosh Kumar Director (iCISA)

Slides:



Advertisements
Similar presentations
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Advertisements

Introduction to Summary Statistics
Descriptive Statistics
Measures of Dispersion or Measures of Variability
Statistics.
Descriptive (Univariate) Statistics Percentages (frequencies) Ratios and Rates Measures of Central Tendency Measures of Variability Descriptive statistics.
Descriptive Statistics, The Normal Distribution,
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
Descriptive Statistics
Edpsy 511 Homework 1: Due 2/6.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Measures of Dispersion
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Measures of Central Tendency 3.1. ● Analyzing populations versus analyzing samples ● For populations  We know all of the data  Descriptive measures.
1 Measures of Central Tendency Greg C Elvers, Ph.D.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Objectives 1.2 Describing distributions with numbers
Statistics and Research methods Wiskunde voor HMI Betsy van Dijk.
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
Measures of Central Tendency or Measures of Location or Measures of Averages.
1 Review Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central Measures (mean,
Descriptive statistics Describing data with numbers: measures of variability.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
Lecture 3 Describing Data Using Numerical Measures.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Review Sections Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Measures of Central Tendency And Spread Understand the terms mean, median, mode, range, standard deviation.
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
INVESTIGATION 1.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
LIS 570 Summarising and presenting data - Univariate analysis.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
Numerical descriptions of distributions
CHAPTER 2: Basic Summary Statistics
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Numerical descriptions of distributions
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Measures of Central Tendency
Descriptive Statistics (Part 2)
Description of Data (Summary and Variability measures)
Summary descriptive statistics: means and standard deviations:
Central Tendency.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Descriptive Statistics: Describing Data
Mean, Median, Mode The Mean is the simple average of the data values. Most appropriate for symmetric data. The Median is the middle value. It’s best.
CHAPTER 2: Basic Summary Statistics
Numerical Descriptive Measures
Presentation transcript:

Introduction to Statistics Santosh Kumar Director (iCISA)

2 Describe the Sample  Descriptive Statistics Measures of Central Tendency Measures of Variability Other Descriptive Measures  Beware that poor samples may provide a distorted view of the population  In general, larger samples a more representative of the population.

3 Measures of Central Tendency  Measures the “center” of the data  Examples Mean Median Mode  The choice of which to use, depends…  It is okay to report more than one.

4 Mean  The “average”.  If the data are made up of n observations x 1, x 2,…, x n, the mean is given by the sum of the observations divided by the number of observations. For example, if the data are x 1 =1, x 2 =2, x 3 =3, then the mean is (1+2+3)/3=2.  We can calculate the sample mean from the data. It is often denoted as

5 Mean  The population mean is usually unknown (although we try to make inferences about this).  The sample mean is an unbiased estimator of the population mean.

6 Median  The “middle observation” according to its rank in the data.  The median is: The observation with rank (n+1)/2 if n is odd The average of observations with rank n/2 and (n+2)/2 if n is even

7 Median  Example: If a very senior officer is present in the room and we desire assess the income level of all the officers present  The median is more robust than the mean to extreme observations. If data are skewed to the right, then the mean > median (in general) If data are skewed to the left, then mean < median (in general) If data are symmetric, then mean=median

8 Mode  The value that occurs most  Good for ordinal or nominal data in which there are a limited number of categories  Not very useful for continuous data  Only measure of central tendency for qualitative data.

9 Measures of Variability  Measure the “spread” in the data  Some important measures Variance Standard Deviation Range Interquartile Range

10 Variance  The sample variance (s 2 ) may be calculated from the data. It is the average of the square deviations of the observations from the mean.  The population variance is often denoted by S 2. This is usually unknown.

11 Variance  The deviations are squared because we are only interested in the size of the deviation rather than the direction (larger or smaller than the mean).  Note  Why?

12 Variance  The reason that we divide by n-1 instead of n has to do with the number of “information units” in the variance. After estimating the sample mean, there are only n-1 observations that are a priori unknown (degrees of freedom).  This also makes s 2 an unbiased estimator of S 2.

13 Standard Deviation  Square root of the variance s = sqrt(s 2 ) = sample SD  Calculate from the data (see formula for s 2) S = sqrt(S 2 ) = population SD  Usually unknown  Expressed in the same units as the mean (instead of squared units like the variance)  But, s is not an unbiased estimator of S.

14 Range  Maximum - Minimum  Very sensitive to extreme observations (outliers)

15 Interquartile Range  Q3-Q1  More robust than the range to extreme observations

16 Other Descriptive Measures  Minimum and Maximum Very sensitive to extreme observations  Sample size (n)  Percentiles Examples:  Median = 50 th percentile  Q1, Q3

17 Small Samples  For very small samples (e.g., <5 observations), summary statistics are not meaningful. Simply list the data.

18 Probability(P)  It measures the likelihood that an event will occur

19 Probability Distributions  A probability distribution describes the behavior of the character of interest (called variable) It identifies possible values of the variable and provides information about the probability that these values (or ranges of values) will occur.  Important probability distributions to be considered here are Binomial, Poisson and Normal distribution