10a. Univariate Analysis Part 1 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,

Slides:



Advertisements
Similar presentations
Computer Programming (TKK-2144) 13/14 Semester 1 Instructor: Rama Oktavian Office Hr.: T.12-14, Th
Advertisements

Central Tendency Mean – the average value of a data set. Add all the items in a data set then divide by the number of items in the data set.
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
Introduction to Data Analysis
Bios 101 Lecture 4: Descriptive Statistics Shankar Viswanathan, DrPH. Division of Biostatistics Department of Epidemiology and Population Health Albert.
QUANTITATIVE DATA ANALYSIS
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Statistics: An Introduction Alan Monroe: Chapter 6.
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Biostatistics Unit 2 Descriptive Biostatistics 1.
Quantifying Data.
Measures of Central Tendency 3.1. ● Analyzing populations versus analyzing samples ● For populations  We know all of the data  Descriptive measures.
11. Multivariate Analysis CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science, IUPUI.
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Measures of Central Tendency
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Today: Central Tendency & Dispersion
Describing Data: Numerical
Chapter 3 Descriptive Measures
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Statistics and Research methods Wiskunde voor HMI Betsy van Dijk.
BIOSTAT - 2 The final averages for the last 200 students who took this course are Are you worried?
Measures of Central Tendency or Measures of Location or Measures of Averages.
Data Handbook Chapter 4 & 5. Data A series of readings that represents a natural population parameter A series of readings that represents a natural population.
Chapter Eleven A Primer for Descriptive Statistics.
CSCI N207: Data Analysis Using Spreadsheets Copyright ©2005  Department of Computer & Information Science Univariate Data Analysis.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Measures of Central Tendency And Spread Understand the terms mean, median, mode, range, standard deviation.
10b. Univariate Analysis Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Psychology’s Statistics. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
According to researchers, the average American guy is 31 years old, 5 feet 10 inches, 172 pounds, works 6.1 hours daily, and sleeps 7.7 hours. These numbers.
INVESTIGATION 1.
Thinking Mathematically
Determination of Sample Size: A Review of Statistical Theory
Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Measures of Central Tendency: The Mean, Median, and Mode
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
Introduction to Statistics Santosh Kumar Director (iCISA)
Central Tendency & Dispersion
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Chapter Eight: Using Statistics to Answer Questions.
Economics 173 Business Statistics Lectures 1 Fall, 2001 Professor J. Petry.
Introduction to Statistics Measures of Central Tendency and Dispersion.
1 Outline 1. Why do we need statistics? 2. Descriptive statistics 3. Inferential statistics 4. Measurement scales 5. Frequency distributions 6. Z scores.
LIS 570 Summarising and presenting data - Univariate analysis.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Dr. Engr. Sami ur Rahman Data Analysis Lecture 2: Basic Concepts of Statistical Methods.
Why do we analyze data?  To determine the extent to which the hypothesized relationship does or does not exist.  You need to find both the central tendency.
Central Tendency Mean – the average value of a data set. Add all the items in a data set then divide by the number of items in the data set.
Descriptive Statistics(Summary and Variability measures)
9d. Pie Charts CSCI N207 Data Analysis Using Spreadsheet Department of Computer and Information Science, IUPUI Lingma Acheson
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Chapter 6 Becoming Acquainted With Statistical Concepts.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Becoming Acquainted With Statistical Concepts
Data Analysis.
Measures of Central Tendency
Univariate Analysis/Descriptive Statistics
Chapter 14 Quantitative Data Analysis
Central Tendency.
Descriptive Statistics
Decimal Applications : Mean, Median, and Mode
Basic Biostatistics Measures of central tendency and dispersion
Measures of Central Tendency
Presentation transcript:

10a. Univariate Analysis Part 1 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science, IUPUI

Univariate Analysis Deal with single variable, one data field. Apply some calculation to describe the data in the field, e.g. central tendency, location, dispersion, etc. Is also called descriptive statistics. First learn the concepts, then use Excel as a tool. Page 2

The Nature of Measurement Measurement is a process of assigning a number or a value to observations according to some established set of rules. The numbers with a quantitative bases are amendable to mathematical analysis. Need to decide on the scale of measurement. –E.g. Age: shall we use categories such as “<10”, “10 – 20”, “21 – 30”, etc., or shall we use the actual value? Rankings : shall we use 1, 2, 3, 4, or A, B, C, D? Salary: shall we use 1000 as the unit so the value are 21, 48, etc., or the actual value such as 21000, 48000, etc.? –Based on the purpose for collecting the values, what type of analysis to be performed, etc.

The Nature of Measurement The measurement process is influenced by many factors and environmental conditions. E.g. –Experimental error –Instrumental error –Incompleteness Need to consider the validity and accuracy of measurement.

Validity of Measurement What is an appropriate and meaningful way to measure a given property? E.g. Measure the area of a rectangular table. –What tool to use – ruler, tape? –What system to use – metric system or the British system? –How to come up with the area – further calculation is needed (indirect measurement)? Measurement of social and behavioral sciences are mostly indirect. E.g. What is a good way to measure how rich a family is? Are drivers over 65 involved in more fatal accidents than drivers below 17? Page 5

Accuracy of Measurement The quality of measurement. Inaccuracy may be caused by –systematic error, e.g. a weighing scale always reads a certain number of pounds low. –Incompleteness, e.g. small sample size. –Lower precision than what’s required, e.g. need a result in millimeters, but use a ruler with only centimeters. Physical measurement is more straightforward than social science. Page 6

Descriptive Statistics Methodology to observe, describe or summarize your data. –Central tendency Mean Median Mode –Dispersion Min/Max Range Variance Standard Deviation Distribution Univariate analysis, summarize data in one data field Page 7

The Mean (Average) Sum of all values divided by the number of values in the data set. One measure of central location in the data set. Mean = Mean =( )/20 = 68.6 Excel function: AVERAGE() Page 8

The Mean (Average) The data may or may not be symmetrical around its average value. Mean itself does not tell what your data looks like. Page 9

The Median The middle value in a sorted data set. Half the values are greater and half are less than the median. Another measure of central location in the data set. Odd number of items: find the middle number. E.g. (1, 2, 4, 7, 8, 9, 9) Median: 7 Even number of items: find the middle two and get the average of the two. E.g. (45, 49, 50, 53, 60, 62, 63, 65, 66, 67, 69, 71, 73, 74, 74, 78, 81, 85, 87, 100) Median: 68 Excel function: MEDIAN() Page 10

The Mode Most frequently occurring value. Another measure of central location in the data set. E.g. (45, 49, 50, 53, 60, 62, 63, 65, 66, 67, 69, 71, 73, 74, 74, 78, 81, 85, 87, 100) Mode: 74 Generally not all that meaningful unless a larger percentage of the values are the same number. Excel function: MODE() Page 11