Statistical Analysis - Chapter 2 “Organizing and Analyzing Data” Fashion Institute of Technology Dr. Roderick Graham.

Slides:



Advertisements
Similar presentations
Brought to you by Tutorial Support Services The Math Center.
Advertisements

Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Processing and Fundamental Data Analysis CHAPTER fourteen.
Chapter 2 Central Tendency & Variability. Measures of Central Tendency The Mean Sum of all the scores divided by the number of scores Mean of 7,8,8,7,3,1,6,9,3,8.
Lesson Fourteen Interpreting Scores. Contents Five Questions about Test Scores 1. The general pattern of the set of scores  How do scores run or what.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
MEASURES OF CENTRAL TENDENCY & DISPERSION Research Methods.
Measures of Central Tendency
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
1 Business Math Chapter 7: Business Statistics. Cleaves/Hobbs: Business Math, 7e Copyright 2005 by Pearson Education, Inc. Upper Saddle River, NJ
Part II Sigma Freud & Descriptive Statistics
CHAPTER 3 : DESCRIPTIVE STATISTIC : NUMERICAL MEASURES (STATISTICS)
© 2006 Baylor University EGR 1301 Slide 1 Lecture 18 Statistics Approximate Running Time - 30 minutes Distance Learning / Online Instructional Presentation.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter 9 Statistics Section 9.1 Frequency Distributions; Measures of Central Tendency.
CHAPTER 1 Basic Statistics Statistics in Engineering
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
Statistics Recording the results from our studies.
Descriptive Statistics Descriptive Statistics describe a set of data.
Statistics 1 Measures of central tendency and measures of spread.
BUS250 Seminar 4. Mean: the arithmetic average of a set of data or sum of the values divided by the number of values. Median: the middle value of a data.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
8.3 Measures of Dispersion  In this section, you will study measures of variability of data. In addition to being able to find measures of central tendency.
Nature of Science Science Nature of Science Scientific methods Formulation of a hypothesis Formulation of a hypothesis Survey literature/Archives.
Descriptive Statistics: Numerical Methods
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Descriptive Statistics Descriptive Statistics describe a set of data.
The Central Tendency is the center of the distribution of a data set. You can think of this value as where the middle of a distribution lies. Measure.
Psychology’s Statistics. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
DATA ANALYSIS n Measures of Central Tendency F MEAN F MODE F MEDIAN.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
Part II  igma Freud & Descriptive Statistics Chapter 2 Means to an End: Computing and Understanding Averages.
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
Sociology 5811: Lecture 3: Measures of Central Tendency and Dispersion Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Copyright © 2012 Pearson Education, Inc. All rights reserved Chapter 9 Statistics.
FARAH ADIBAH ADNAN ENGINEERING MATHEMATICS INSTITUTE (IMK) C HAPTER 1 B ASIC S TATISTICS.
Welcome to MM570 Applies Statistics for Psychology Unit 2 Seminar Dr. Bob Lockwood.
RESEARCH & DATA ANALYSIS
Measure of Central Tendency and Spread of Data Notes.
LIS 570 Summarising and presenting data - Univariate analysis.
Math 310 Section 8.1 & 8.2 Statistics. Centers and Spread A goal in statistics is to determine how data is centered and spread. There are many different.
Introduction to statistics I Sophia King Rm. P24 HWB
7.3 Measures of Central Tendency and Dispersion. Mean – the arithmetic average is the sum of all values in the data set divided by the number of values.
Standard Deviation. Two classes took a recent quiz. There were 10 students in each class, and each class had an average score of 81.5.
Graphing. Data Tables are a neat and organized way to collect data Graphs allow you to easily show the data in a way that people can draw conclusions.
Descriptive Statistics Research Writing Aiden Yeh, PhD.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Averages and Variability
On stats Descriptive statistics reduce data sets to allow for easier interpretation. Statistics allow use to look at average scores. For instance,
Chapter 2 Describing and Presenting a Distribution of Scores.
Data Analysis. Statistics - a powerful tool for analyzing data 1. Descriptive Statistics - provide an overview of the attributes of a data set. These.
Measures of Central Tendency, Variance and Percentage.
Y12 Research Methods. Extraneous Variables (EV’s) These are variables that might affect the DV if the experiment is not well controlled. Starter: A study.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
Making Sense of Statistics: A Conceptual Overview Sixth Edition PowerPoints by Pamela Pitman Brown, PhD, CPG Fred Pyrczak Pyrczak Publishing.
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores.
An Introduction to Statistics
Statistics in Forensics
Prof. Eric A. Suess Chapter 3
Exploratory Data Analysis
Different Types of Data
Theme 4 Describing Variables Numerically
11.1 Measures of Center and Variation
Lecture 4 Psyc 300A.
Central Tendency & Variability
The Mean Variance Standard Deviation and Z-Scores
Presentation transcript:

Statistical Analysis - Chapter 2 “Organizing and Analyzing Data” Fashion Institute of Technology Dr. Roderick Graham

Showing Data Graphically  When we collect a sample, we initially want to get a picture of how the data “looks”.  We can show our “stakeholders” easily what the patterns in the data are  What do we mean by “stakeholders”?  Three of the ways to show data are Histograms, Frequency Polygons, and Circle Graphs

Showing Data Graphically  Look at the listing of numbers on p.17  This is called “ungrouped” data  Sometimes it is better to “group” data into categories…this makes it easier to represent data graphically (p.18)

Histograms  Let’s look at the move from “ungrouped data” to the construction of a histogram in your textbook…(pp. 17 – 18) 1. Start with a survey of numbers…or “ungrouped data” 2. Decide on the categories you want to use and “group” the numbers into the categories that fit it 3. Now the data has been changed from a series of ages, to GROUPS of ages 4. We can compute statistics for both grouped and ungrouped data

Histograms  Let’s figure out this Histogram (taken from actual data I am using)…  1 = 18 – 24  2 = 25 – 34  3 = 35 – 44  4 = 45 – 54  5 = 55 – 64  6 – 65+ How many people are between ages 45 and 54?

Frequency Polygon (Line Graph)  This is a line graph representing the shape of a histogram  Usually when you have “too many bars” (categories) you may want to use line graph  This can be used to show trends easier than a histogram.

Circle Graph  These graphs are used to show what percentage (proportion) of a sample is doing what.  Your textbook goes into some detail about how to create circle graphs with a protractor…lucky for us we have Excel!  Below is an example from the CDC showing the percentages of how people have become infected with HIV…

Key Points  It is up to you (researcher) to decide what graph is most important for presenting your data. For me… 1. If am showing a small amount of categories, I use a histogram 2. If I am showing trends through time, or a large number of categories, I use a line graph 3. If I want to show percentages, I use a circle graph (this always the best way to show percentages)

Our first “statistics”  Remember that statistics are values that we compute from our sample of data that we have collected. We will learn two basic and important types of statistics:  Measures of Central Tendency – What are the middle values for our data?  Measures of Dispersion or Spread – How much diverse is our data…or how widely scattered is our data?  You can compute these statistics for both grouped and ungrouped data

Measures of Central Tendency (ungrouped)  What if we had collected data about one measure, and we wanted to know what the middle value was for this measure?  Ex. What is the middle value, in age, for those who listen to Lady Gaga?  Ex. How many times do young Hispanic women report shopping at H&M?  Knowing this middle, or central, value is important for describing our data.  There are three measures of central tendency…

Measures of Central Tendency (ungrouped)  Mean (p.24)  This is the mathematical average of a set of numbers  Median (p.26)  This is the middle value of a set of data that has been arranged from lowest to highest  Mode (p. 27)  The value that occurs the most in a set of data  We can use income as a good way of discussing these three measures. Imagine that we wanted to know the average incomes for FIT students. Imagine that we took a random sample of incomes for FIT students. …

Measures of Central Tendency (ungrouped)  The sample gives these values:  5000, 6000, 30000, , 15000, 6000, 17000, 13000, 12000, 11000, 8000, 6000, 15000, 6000,  The Mean  This is the average….  Sum of values =  Total N = 15  Mean = 18100

Measures of Central Tendency (ungrouped)  The sample gives these values:  5000, 6000, 30000, , 15000, 6000, 17000, 13000, 12000, 11000, 8000, 6000, 15000, 6000,  The Median  This is the middle values:  5000, 6000, 6000, 6000, 6000, 8000, 11000, 11500, 12000, 13000, 15000, 15000, 17000, 30000,  The median here is  In cases where there are two middle values, we average the two.

Measures of Central Tendency (ungrouped)  The sample gives these values:  5000, 6000, 30000, , 15000, 6000, 17000, 13000, 12000, 11000, 8000, 6000, 15000, 6000,  The Mode  This is the most numerous value:  5000, 6000, 6000, 6000, 6000, 8000, 11000, 11500, 12000, 13000, 15000, 15000, 17000, 30000,  The Mode here is  Sometimes there is no mode…or even two modes!

Measures of Central Tendency (ungrouped)  So given these values… 5000, 6000, 6000, 6000, 6000, 8000, 11000, 11500, 12000, 13000, 15000, 15000, 17000, 30000,  …what is the best measure of central tendency for this random sample of FIT students?  Mean?  Median?  Mode?

Measures of Dispersion or Spread (ungrouped)  Range (p.29)  The highest value minus the lowest value….  From our last example, the range would be: – 5000 =  Standard Deviation (p.29 – 35)  This is the average distance your values have from the mean score.  Best shown through example…

Measures of Dispersion or Spread (ungrouped) Standard Deviation  Let’s return to our FIT random sample… 5000, 6000, 6000, 6000, 6000, 8000, 11000, 11500, 12000, 13000, 15000, 15000, 17000, 30000,  Follow the steps on the right while we(I) calculate the standard deviation as a class on the board 1. Calculate the mean…which is Find the distance that each value has from the mean 3. Square the distance 4. Add up these distances and divide by the sample size – 1 (at this point, this number is called the variance). 5. Then we get the square root of this number

Standard Deviation XMean (x-bar)X – x-bar(X – x-bar) E E E E E E E E E E E E E E E4

Standard Deviation  We sum (x – x-bar) 2, and get the square root of this sum. This is the standard deviation. What is the square root of the sum?  Appx. 26,219  Right now, this number means very little…but in the following chapters we will gain a better understanding of the standard deviation

Measures of Central Tendency and Dispersion (Grouped Data)  Remember that grouped data is a collection of data that has been placed into categories…  Thus we need to calculate the mean and standard deviation differently, but the idea is the same.  P. 36 – 39 show the formulas for these measures.

Calculating the Mean for Grouped Data  Let’s say we conducted a random sample of FIT students, and asked them their GPA. We decided to group GPA into categories. Here is the data below:  So…what is the mean? Look at pages 36 – 38 and I will wait for someone to tell me how to go about answering this question? GPA CategoryNumber of Students 3.5 – – – Below 2.011

Calculating the Mean for Grouped Data X = the average of the categories f = number of students So can someone answer this question on the board (with help from classmates)? GPA CategoryNumber of Students 3.5 – – – Below GPA Category XNumber of Students (f) 3.5 – – – Below 2.0 (0 – 1.9).9511

Calculating the Standard Deviation of Grouped Data  Now let’s calculate the standard deviation for this same set of data…  Who can do this one on the board? GPA CategoryNumber of Students 3.5 – – – Below 2.011

Writing Research Reports (pp. 48 – 50)  Background Statement (5 pts)  I will give you data…use your imagination  Why was the study performed (why was the data collected)?  Design and Procedures of the Study (10 pts)  How did you conduct the study  How was the study internally valid/externally valid These two sections are not the most important…simply use your imagination to complete these two sections

Writing Research Reports (pp. 48 – 50)  Results (55 pts.)  The most important section.  For this first report, this is where you present your data graphically, show measures of dispersion, and central tendency  Analysis and Discussion (10 pts.)  What is interesting to you about the results?  Conclusions and Recommendations (20 pts.)  (this section you will not do for your report…this is where you present your results and analysis to the class. The class can ask you questions, so be on point!)

END