Statistics for Education Research Statistics for Education Research Lecture 1 Scales/Graph/Central Tendency Instructor: Dr. Tung-hsien He

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Descriptive Statistics
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Statistics.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
Statistics for the Social Sciences
Quantitative Data Analysis Social Research Methods 2109 & 6507 Spring, 2006 March
03/05/2003 Week #2 江支弘 Measuring Center or Average 量度中心或平均 Stemplot: Mean: 平均數 arithmetic average of observations Median: 中位數 middle value of... (in increasing.
PSY 307 – Statistics for the Behavioral Sciences
Descriptive Statistics
Analysis of Research Data
Introduction to Educational Statistics
Edpsy 511 Homework 1: Due 2/6.
Ch 3 Central Tendency 中央集中趨勢測量.
Data observation and Descriptive Statistics
2 Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo,
Summarizing Scores With Measures of Central Tendency
Statistics for Linguistics Students Michaelmas 2004 Week 1 Bettina Braun.
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Chapter 3 Statistical Concepts.
EPE/EDP 557 Key Concepts / Terms –Empirical vs. Normative Questions Empirical Questions Normative Questions –Statistics Descriptive Statistics Inferential.
Psychometrics.
Part II Sigma Freud & Descriptive Statistics
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Types of data and how to present them 47:269: Research Methods I Dr. Leonard March 31, :269: Research Methods I Dr. Leonard March 31, 2010.
Chapters 1 & 2 Displaying Order; Central Tendency & Variability Thurs. Aug 21, 2014.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics And related matters. Two families of statistics Descriptive statistics – procedures for summarizing, organizing, graphing, and,
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Chapter 2 Characterizing Your Data Set Allan Edwards: “Before you analyze your data, graph your data.
Chapter 2 Describing Data.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
Describing Data Lesson 3. Psychology & Statistics n Goals of Psychology l Describe, predict, influence behavior & cognitive processes n Role of statistics.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Agenda Descriptive Statistics Measures of Spread - Variability.
Practice Page 65 –2.1 Positive Skew Note Slides online.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
I. Introduction to Data and Statistics A. Basic terms and concepts Data set - variable - observation - data value.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
1 Day 1 Quantitative Methods for Investment Management by Binam Ghimire.
Chapter 2 Describing and Presenting a Distribution of Scores.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
Doc.RNDr.Iveta Bedáňová, Ph.D.
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Summarizing Scores With Measures of Central Tendency
Descriptive Statistics
An Introduction to Statistics
Descriptive Statistics
Introduction to Statistics
Basic Statistical Terms
Numerical Descriptive Measures
Presentation transcript:

Statistics for Education Research Statistics for Education Research Lecture 1 Scales/Graph/Central Tendency Instructor: Dr. Tung-hsien He

Types of Data: Types of Data: 1. Nominal [ 名詞性 ] Scales 2. Ordinal ( 序數 ) Scales 3. Interval ( 區間 ) Scales 4. Ratio [ 比例 ] Scales Population vs. Samples Population vs. Samples Types of Graphs Types of Graphs Distribution of Frequency Distribution of Frequency 1. Shape 2. Kurtosis [ 峭度 ]/Skewness [ 偏度 ]

Types of Scores in Distribution of Frequency Types of Scores in Distribution of Frequency 1. Percentiles ( 百分位數 ]: 2. Percentile Rank (PR]: [ 百分等級 ) Measures of Central Tendency Measures of Central Tendency 1. Mode [ 眾數 ] : 2. Median [ 中位數 ]: 3. Mean [ 平均數 ]:

Measure of Variation Measure of Variation 1. Variation [ 變異數 ] 2. Standard Deviation: SD [ 標準差 ] 3. Standard Scores [ 標準分數 ]: e.g., t scores; z scores

Nominal [ 名詞性 ] Scales: Demographic Variables Nominal [ 名詞性 ] Scales: Demographic Variables 1. Objects are classified based on defined characteristics (e.g.: genders); 2. No logical ordering among categories; 3. Cases or categories are mutually exclusive. Ordinal ( 序數 ) Scales: Ordinal ( 序數 ) Scales: 1. Objects are classified and given a logical order among categories (e.g.: letter grades; 特優、優 ); 2. Cases or categorizes are mutually exclusive;

3.Differences in categories are not in any equal unit (e.g.: the difference between Grade A & B may/may not be the same as that between Grade B & C). Interval (Equal Unit) ( 區間 ) Scales: Interval (Equal Unit) ( 區間 ) Scales: 1. Objects are classified and given a logical order (Temperatures measured by thermometers); 2. Differences between levels of categories are in identical units; 3.Point 0 is only a point on the scale rather than its starting point;

4. Categories are mutually exclusive. Ratio [ 比例 ] Scales: Ratio [ 比例 ] Scales: 1. The highest level in the hierarchy of measurement; 2. Objects are classified and given a logical order; 3. Differences between levels of categories are in identical units; 4. Point 0 reflects an absence of the characteristic (a true zero point);

5. Data show the proportional amounts of the characteristic. Differences in Interval & Ratio Data: Differences in Interval & Ratio Data:E.g.: 1. Weight scales give ratio data because 25KG is twice lighter than 50 KG. 0 kg means “0”. 2. Thermometers give interval data because 25F is not twice cooler than 50F. 0 degree does not mean “no temperature”.

Population: All members in a specific group Population: All members in a specific group Sample: A subset of the population Sample: A subset of the population Parameter: Characteristic, for instance, mean of a population Parameter: Characteristic, for instance, mean of a population Statistic: Characteristic, for instance, means of a sample Statistic: Characteristic, for instance, means of a sample Notation Systems: Notation Systems: 1. Greek for Population measure; Roman for Sample 2.  for population mean; sample means = x Why sample means rather than population means? Why sample means rather than population means?

Descriptive vs. Inferential Statistics Descriptive vs. Inferential Statistics 1. Descriptive Statistics: Classify/Summarize Numerical data 2. Inferential Statistics: Make generalizations about a population by studying a sample drawn from this population

Bar Graphs ( 長條圖 ): Bar Graphs ( 長條圖 ): Relation between two variables (X axis: Independent Variable; Y axis: Dependent) measured by nominal scales (p. 36) Scattergrams ( 分散圖 ) (Scatterplot): Scattergrams ( 分散圖 ) (Scatterplot): Relation between two quantitative variables (p. 38)

Frequency Distribution Graphs ( 次數分佈圖 ) (Three-Quarter-High Rule) Frequency Distribution Graphs ( 次數分佈圖 ) (Three-Quarter-High Rule) 1. Histograms ( 長條統計圖 ) : Using exact limits & Bars are not separated (p. 39) 2. Frequency Polygons ( 次數分佈多邊圖 ): Using midpoint (p. 40)

3. Ogive ( 頻率曲線圖 ) (cumulative frequency polygon) (p. 42):

1. Shapes of Frequency Distributions (p. 44) Uniform (Rectangular) Distribution: Uniform (Rectangular) Distribution: Positively Skewed Distribution (Skewed to the right) Positively Skewed Distribution (Skewed to the right) Negatively Skewed Distribution: Negatively Skewed Distribution: Normal Distribution (Mesokurtic): Normal Distribution (Mesokurtic): Leptokurtic Distribution [ 高狹峰 ] -> A non-normal distribution with more outliers [ 極端值 ]. than a normal distribution Leptokurtic Distribution [ 高狹峰 ] -> A non-normal distribution with more outliers [ 極端值 ]. than a normal distribution

Platykurtic Distribution: [ 低濶峰分配 A non- normal distribution with fewer outliers than a normal distribution [ 極端值 ]. Platykurtic Distribution: [ 低濶峰分配 ] -> A non- normal distribution with fewer outliers than a normal distribution [ 極端值 ]. Bimodal Distribution ( 雙峰分佈 ): Bimodal Distribution ( 雙峰分佈 ): If many outliers are clustered at the two extreme ends of a distribution, it will form a bimodal distribution. If many outliers are clustered at the two extreme ends of a distribution, it will form a bimodal distribution.

2. If there are too many outliers, the distribution will not be normalized. In other words, many stat techniques may not yield trustworthy data. 3. How to locate outliers? Just look at the frequency graphs to find the “lone wolves”. 4. Degrees of Peakedness: A computed value that shows the degrees of homogeneity of a set of data; the higher the score, the more homogeneous the data are. Kurtosis ( 峭度 ) & Skewness ( 偏度 ) are used to measure “Normality”.

Kurtosis & Skewness Kurtosis & Skewness 1. Expected Values: 1~-1 Normal Distribution =0 is perfect score for normality 2. Kurtosis >0: Leptokurtic Distribution 3. Kurtosis <0: Platykurtic Distribution 4. Skewness >0: Positively skewed 5. Skewness <0: Negatively Skewed 6. Some researchers use 3~-3 to indicate normal distributions

To describe a distribution is to indicate its shape, average score (mean score), and variation. Graphs determine shapes, means indicate central tendency, and variations indicates how widely scores are spread. To describe a distribution is to indicate its shape, average score (mean score), and variation. Graphs determine shapes, means indicate central tendency, and variations indicates how widely scores are spread. Percentiles ( 百分位數 ]: Percentiles ( 百分位數 ]: 1. the point (scores) in the distribution at or below which a give percentage of scores is found; 2. For example, 75 th percentile of 180 scores means the scores at or below which 75 percent (3/4) of the entire scores will fall.

Percentile Rank (Different from Percentile: 某一分 數之百分等級值 ; PR): Percentile Rank (Different from Percentile: 某一分 數之百分等級值 ; PR): 1. the percentage of scores less than or equal to that score 2. PR63 means the percentage of scores in the distribution that falls at or below a score of 63: [ 分 數為 63 分之 PR 值是... ] Example: 基測 250 分之 PR 值為 98 -> PR250=98 Example: 基測 250 分之 PR 值為 98 -> PR250=98

2. At the center of a normal distribution, percentile scores would underestimate actual differences: 基測 PR 值為 50 之分數為 200 分,基測 PR 值為 51 之 分數可能為 210 分,實際量尺分數差距為 10 分, 但在 PR 值上只差距 1 分。 3. Percentiles can not be used to determine differences in ranks: The difference in two together percentiles does not represent the difference in two interval scores -> 基 測 PR 值為 50 之分數為 200 分,基測 PR 值為 51 之分 數為 210 分,在 PR 值上只差距 1 分,但是實際量尺 分數差距為 10 分。

Measures of Central Tendency: Measures of Central Tendency: 1. Mode [ 眾數 ] : a. the most frequent score in a distribution b. Simplest index of central tendency 2. Median [ 中位數 ]: a. the 50 th percentile b. if the number of scores is odd, choose the middle one; if the number of scores is even, compute the mean of the two central scores

3. Mean: X bar: x a. Arithmetic Average b. Formula: See p. 60 c. Properties: (See Table 3.4, p. 61) (1) the sum of deviation of all scores from the mean is 0 (2) the sum of squares of the deviation from the mean is smaller than that of any other value in the distribution

Comparisons of Measures of Central Tendency Comparisons of Measures of Central Tendency 1. Mode  Nominal Data: 出現頻率最多數 Median & Mode  Ordinal Data All Three  Interval or Ratio 2. Mode/Median vs. Sample Sizes Rule of Thumb: the bigger, the better!

Measures of Variation: Lengths of intervals that indicate the spread of scores in a distribution Measures of Variation: Lengths of intervals that indicate the spread of scores in a distribution 1. Mean Deviation ( 變異數 ] : (1) Variance for Population:  2 = SS/N =  (x i ) 2 /N (2) Variance for Sample: s 2 ( 標準差 ]= SS/N-1 =  (x i ) 2 /N-1

2. Standard Deviation ( 標準差 ]: (square root of the variance): Formula on p. 74 (1) the square root of the variance (2) the mean of the distances of all scores to the mean. (3) the greater the value is, the great variability of scores around the mean is

Standard Scores: Standard Scores: 1. z score: Formula p Meanings: to indicate how many units of SD which a raw score falls either above or below from the mean. 3. Properties: (1) Shape of standard scores is identical to that of raw scores; (2) Mean of z-score distribution is 0; (3) Values of variance & SD are T score: 10 z + 50 (Why t score?)