Introduction to statistics I Sophia King Rm. P24 HWB


Using statistics in Psychology
• Carrying out psychological research means collecting data. Statistics are a way of making use of these data.
  - Descriptive statistics: used to describe characteristics of our sample. Statistics describe samples.
  - Inferential statistics: used to generalise from our sample to our population. Parameters describe populations.
  - Any samples used should therefore be representative of the target population.

Descriptive Statistics
• Statistical procedures used to summarise, organise, and simplify data. This process should be carried out in a way that reflects the overall findings.
  - Raw data is made more manageable
  - Raw data is presented in a logical form
  - Patterns can be seen from organised data
  - Frequency tables
  - Graphical techniques
  - Measures of central tendency
  - Measures of spread (variability)

Plotting Data: describing spread of data
• A researcher is investigating short-term memory capacity. The number of symbols remembered is recorded for 20 participants: 4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6
• We can describe our data by using a Frequency Distribution. This can be presented as a table or a graph. It always presents:
  - The set of categories that make up the original measurement scale
  - The frequency of each score/category
• Three important characteristics: shape, central tendency, and variability

Frequency Distribution Tables
• Highest score is placed at the top
• All observed scores are listed
• Gives information about distribution, variability, and centrality
  - X = score value
  - f = frequency
  - fX = total value associated with frequency
  - Σf = N
  - ΣX = ΣfX

Frequency Table Additions
• Frequency tables can display more detailed information about the distribution
  - Percentages and proportions
  - p = fraction of the total group associated with each score (relative frequency)
  - p = f/N
  - As a percentage: p(100) = 100(f/N)
• What does this tell us about this distribution of scores?
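The columns described on the last two slides (X, f, fX, p and %) can be sketched for the memory scores as follows. This is a minimal illustration only; the function name `frequency_table` is a hypothetical helper and not part of the lecture.

```python
from collections import Counter

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]


def frequency_table(data):
    """Return one row per observed score: (X, f, fX, p, percent), highest X first."""
    counts = Counter(data)
    n = len(data)
    rows = []
    for x in sorted(counts, reverse=True):
        f = counts[x]
        rows.append((x, f, f * x, f / n, 100 * f / n))
    return rows


table = frequency_table(scores)

print(f"{'X':>3} {'f':>3} {'fX':>4} {'p':>6} {'%':>6}")
for x, f, fx, p, pct in table:
    print(f"{x:>3} {f:>3} {fx:>4} {p:>6.2f} {pct:>6.1f}")

# Column totals: the sum of f equals N, and the sum of fX equals the sum of X.
total_f = sum(row[1] for row in table)
total_fx = sum(row[2] for row in table)
print("sum f =", total_f, " sum fX =", total_fx)
```

The two totals give a quick check on a hand-built table: the f column should add up to the number of participants, and the fX column to the sum of the raw scores.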

Grouped Frequency Distribution Tables
• Sometimes the spread of data is too wide to list every score
• Grouped tables present scores as class intervals
  - About 10 intervals
  - The interval width should be a simple round number (2, 5, 10, etc.), and all intervals should have the same width
  - The bottom score of each interval should be a multiple of the width
• Class intervals represent a continuous variable X
  - E.g. 51 is bounded by real limits of 50.5 and 51.5
  - If X is 8 and f is 3, it does not mean all three have exactly the same score: they all fell somewhere between 7.5 and 8.5
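As a rough sketch of these rules, the memory scores can be grouped into class intervals. The width of 2 is chosen purely for illustration (this data set is small enough not to need grouping), the scores are assumed to be whole numbers, and `grouped_frequency_table` is a hypothetical helper name.

```python
from collections import Counter

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]


def grouped_frequency_table(data, width):
    """Group whole-number scores into equal-width class intervals.

    Each row is (low, high, lower_real_limit, upper_real_limit, f),
    with the highest interval first.
    """
    # The lowest interval starts at a multiple of the width.
    bottom = (min(data) // width) * width
    counts = Counter((x - bottom) // width for x in data)
    top_index = (max(data) - bottom) // width
    rows = []
    for i in range(top_index, -1, -1):
        low = bottom + i * width
        high = low + width - 1
        # Real limits extend half a unit either side of the interval.
        rows.append((low, high, low - 0.5, high + 0.5, counts.get(i, 0)))
    return rows


grouped = grouped_frequency_table(scores, width=2)
for low, high, lower_limit, upper_limit, f in grouped:
    print(f"{low:>2}-{high:<2}  real limits {lower_limit}-{upper_limit}  f = {f}")
```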

Percentiles and Percentile Ranks
• X values = raw scores, without context
• Percentile rank = the percentage of the sample with scores at or below the particular value
• This can be represented by a cumulative frequency (cf) column
• Cumulative percentage obtained by: c% = (cf/N)(100)
• This gives information about relative position in the data distribution
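The cumulative frequency and cumulative percentage columns can be sketched for the memory scores as below. The helper name `cumulative_table` is an assumption made for illustration.

```python
from collections import Counter

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]


def cumulative_table(data):
    """Return rows (X, f, cf, c%) from the lowest score upwards."""
    counts = Counter(data)
    n = len(data)
    rows = []
    cf = 0
    for x in sorted(counts):
        cf += counts[x]  # running total of scores at or below X
        rows.append((x, counts[x], cf, 100 * cf / n))
    return rows


cumulative = cumulative_table(scores)
for x, f, cf, c_pct in cumulative:
    print(f"X = {x:>2}  f = {f}  cf = {cf:>2}  c% = {c_pct:.0f}%")

# Percentile rank of a score of 6: the percentage of scores at or below 6.
rank_of_6 = {x: c_pct for x, _, _, c_pct in cumulative}[6]
print("Percentile rank of X = 6:", rank_of_6)
```

Accumulating from the lowest score upwards is what makes the c% column read directly as a percentile rank.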

Representing data as graphs
• A Frequency Distribution Graph presents all the information available in a Frequency Table (it can also be fitted to a grouped frequency table)
• Uses histograms
  - Bar width corresponds to the real limits of the intervals
  - Histograms can be modified to include blocks representing individual scores

Frequency Distribution Polygons
• Show the same information with lines: they trace the 'shape' of the distribution
• Both histograms and polygons represent continuous data
• For non-numerical data, the frequency distribution can be represented by bar graphs
  - Bar graphs have spaces between adjacent bars to represent distinct categories

Frequencies of Populations and Samples
• Population: all the individuals of interest to the study
• Sample: the particular group of participants you are testing, selected from the population
• Although it is possible to have graphs of population distributions, unlike graphs of sample distributions, exact frequencies are not normally possible. However, you can:
  - Display graphs of relative frequencies (categorical data)
  - Use smooth curves to indicate relative frequencies (interval or ratio data)

Frequency Distribution: the Normal Distribution
• Bell-shaped: a specific shape that can be defined by an equation
• Symmetrical around the midpoint, where the greatest frequency of scores occurs
• The tails of the perfect curve are asymptotic: they never quite meet the horizontal axis
• A normal distribution is an assumption of parametric testing

Frequency Distribution: Different Distribution shapes

Measures of Central Tendency
• A way of summarising the data using a single value that is in some way representative of the entire data set
  - It is not always possible to follow the same procedure in producing a central representative value: this changes with the shape of the distribution
• Mode
  - Most frequent value
  - Does not take into account exact scores
  - Unaffected by extreme scores
  - Not useful when there are several values that occur equally often in a set

Measures of Central Tendency
• Median
  - The value that falls exactly at the midpoint of a ranked distribution
  - Does not take into account exact scores
  - Unaffected by extreme scores
  - In a small set it can be unrepresentative
• Mean (arithmetic average)
  - Sample mean: M = ΣX/n
  - Population mean: μ = ΣX/N
  - Takes into account all values
  - Easily distorted by extreme values

Measures of Central Tendency
• For our set of memory scores: 4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6
• Mode = 6; Median = 6; Mean = 6.35
• The mean is the preferred measure of central tendency, except when there are:
  - Extreme scores or skewed distributions
  - Non-interval data
  - Discrete variables
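The three values quoted for the memory scores can be checked with Python's standard `statistics` module. This is offered only as a quick verification sketch, not as the method used in the lecture.

```python
import statistics

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]

mode = statistics.mode(scores)      # most frequent value
median = statistics.median(scores)  # midpoint of the ranked scores
mean = statistics.mean(scores)      # sum of X divided by n

print("Mode   =", mode)
print("Median =", median)
print("Mean   =", round(mean, 2))
```

With an even number of scores, `statistics.median` averages the two middle values; here both are 6, so the median is 6.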

Central Tendencies and Distribution Shape

Describing Variability
• Describes, as an exact quantitative measure, how spread out or clustered together the scores are
• Variability is usually defined in terms of distance
  - How far apart scores are from each other
  - How far apart scores are from the mean
  - How representative a score is of the data set as a whole

Describing Variability: the Range
• Simplest and most obvious way of describing variability
  - Range = highest score - lowest score
  - The range only takes into account the two extreme scores and ignores any values in between
  - To counter this, the distribution is divided into quarters (quartiles): Q1 = 25%, Q2 = 50%, Q3 = 75%
  - The interquartile range: the distance spanned by the middle two quarters of the distribution (Q3 - Q1)
  - The semi-interquartile range: one half of the interquartile range
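A short sketch of these measures for the memory scores follows. Quartiles can be computed under several conventions that give slightly different values for small samples. The one used here, the default "exclusive" method of `statistics.quantiles`, is an assumption and may not match the convention in the course textbook.

```python
import statistics

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]

# Range: distance between the two extreme scores.
data_range = max(scores) - min(scores)

# Quartiles: cut points dividing the ranked scores into four quarters.
# Assumption: Python's default ("exclusive") interpolation method.
q1, q2, q3 = statistics.quantiles(scores, n=4)

iqr = q3 - q1       # interquartile range
semi_iqr = iqr / 2  # semi-interquartile range

print("Range =", data_range)
print("Q1, Q2, Q3 =", q1, q2, q3)
print("Interquartile range =", iqr)
print("Semi-interquartile range =", semi_iqr)
```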

Describing Variability: Deviation
• A more sophisticated measure of variability is one that shows how scores cluster around the mean
  - Deviation is the distance of a score from the mean: X - μ, e.g. 10 - 6.35 = 3.65, 3 - 6.35 = -3.35
  - A measure representative of the variability of all the scores would be the mean of the deviation scores: Σ(X - μ)/n (add all the deviations and divide by n)
  - However, the deviation scores always add up to zero (as the mean serves as the balance point for the scores), so this measure is always zero
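The balance-point property can be demonstrated on the memory scores. This is a minimal sketch; the comparison with zero uses a small tolerance because of floating-point rounding.

```python
import math

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]

mean = sum(scores) / len(scores)

# Deviation of each score from the mean (positive above, negative below).
deviations = [x - mean for x in scores]

print("Deviation for X = 10:", round(10 - mean, 2))
print("Deviation for X = 3: ", round(3 - mean, 2))

# The positive and negative deviations cancel out exactly.
total_deviation = sum(deviations)
print("Sum of deviations is zero:", math.isclose(total_deviation, 0.0, abs_tol=1e-9))
```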

Describing Variability: Variance
• To remove the +/- signs we simply square each deviation before finding the average. This is called the variance: Σ(X - μ)²/n = 106.55/20 = 5.33
• The numerator is referred to as the Sum of Squares (SS), as it is the sum of the squared deviations around the mean

Describing Variability: Population Variance
• Population variance is designated by σ²: σ² = Σ(X - μ)²/N = SS/N
• Sample variance is designated by s²
  - Samples are less variable than populations: they therefore give biased estimates of population variability
  - Degrees of freedom (df): the number of independent (free to vary) scores. In a sample, the sample mean must be known before the variance can be calculated, therefore the final score is dependent on the earlier scores: df = n - 1
  - s² = Σ(X - M)²/(n - 1) = SS/(n - 1) = 106.55/19 = 5.61
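The sum of squares and both variance formulas can be worked through for the memory scores as below. This is an illustrative sketch: the variable names are assumptions, and the `statistics` functions are included only as a cross-check on the hand calculation.

```python
import statistics

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]

n = len(scores)
mean = sum(scores) / n

# Sum of Squares: the squared deviations from the mean, added together.
ss = sum((x - mean) ** 2 for x in scores)

population_variance = ss / n    # divide by N when the scores are the whole population
sample_variance = ss / (n - 1)  # divide by df = n - 1 when the scores are a sample

print("SS =", round(ss, 2))
print("Population variance =", round(population_variance, 2))
print("Sample variance     =", round(sample_variance, 2))

# Cross-check against the standard library.
print(round(statistics.pvariance(scores), 2), round(statistics.variance(scores), 2))
```

Dividing by n - 1 rather than n makes the sample variance slightly larger, which compensates for samples tending to be less variable than the populations they come from.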

Describing Variability: the Standard Deviation
• Variance is a measure based on squared distances
• To return to the original units of measurement, we take the square root of the variance, which gives us the standard deviation
• Population (σ) and sample (s) standard deviation:
  - σ = √(Σ(X - μ)²/N)
  - s = √(Σ(X - M)²/(n - 1))
• So for our memory score example we simply take the square root of the sample variance: s = √5.61 = 2.37
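Both standard deviations for the memory scores can be obtained from the standard library. This sketch assumes the 20 scores are treated first as a complete population and then as a sample.

```python
import statistics

# Memory scores from the slides (20 participants).
scores = [4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6]

sigma = statistics.pstdev(scores)  # population SD: square root of SS / N
s = statistics.stdev(scores)       # sample SD: square root of SS / (n - 1)

print("Population SD =", round(sigma, 2))
print("Sample SD     =", round(s, 2))
```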

Describing Variability
• The standard deviation is the most common measure of variability, but the others can be used. A good measure of variability must:
  - Be stable and reliable: not greatly affected by little details in the data
    - Extreme scores
    - Multiple sampling from the same population
    - Open-ended distributions
• Both the variance and the SD are related to other statistical techniques

Descriptive statistics
• A researcher is investigating short-term memory capacity. The number of symbols remembered is recorded for 20 participants: 4, 6, 3, 7, 5, 7, 8, 4, 5, 10, 10, 6, 8, 9, 3, 5, 6, 4, 11, 6
• What statistics can we display about this data, and what do they mean?
  - Frequency table: shows how often different scores occur
  - Frequency graph: gives information about the shape of the distribution
  - Measures of central tendency and variability

Descriptive statistics

References and Further Reading
• Gravetter & Wallnau
  - Chapter 2
  - Chapter 3
  - Chapter 4