Published by Corey Peters. Modified over 8 years ago.

24 Nov 2007, Data Management
Data Summarization and Exploratory Data Analysis
Objective: Describe or examine data sets in terms of key characteristics.

Key Characteristics
- Majority and minority
- Distribution or pattern
- Typical or most frequent values
- Variability
- Highest and lowest values
- Unusual or extreme values

Methods for Data Summarization
- Graphical method
- Numerical method

Graphical Method
- Bar chart
- Pie chart
- Histogram
- Stem-and-leaf plot
- Box plot
- Scatter plot
- Time sequence plot

Numerical Method
- Frequency or relative frequency
- Measures of central tendency
- Measures of spread or variability

Measures of Central Tendency
- Mean
- Median
- Mode
- Trimmed mean
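
The four measures can be compared on a single sample. The sketch below is only an illustration: the `sample` values are hypothetical (they do not come from the slides), and `trimmed_mean` is a helper written for this example that drops the same fraction of values from each end before averaging. It suggests how one extreme value pulls the mean upward while the median, mode, and trimmed mean stay near the bulk of the data.

```python
import statistics


def trimmed_mean(values, proportion):
    """Mean after dropping `proportion` of the sorted values from each end."""
    if not 0 <= proportion < 0.5:
        raise ValueError("proportion must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(len(ordered) * proportion)  # number of values cut from each end
    return statistics.mean(ordered[k:len(ordered) - k])


# Hypothetical sample, not from the slides; 40 is a deliberate extreme value.
sample = [2, 3, 3, 4, 5, 5, 5, 6, 7, 40]

mean = statistics.mean(sample)        # sensitive to the extreme value
median = statistics.median(sample)    # middle of the sorted values
mode = statistics.mode(sample)        # most frequent value
trimmed = trimmed_mean(sample, 0.10)  # drops one value from each end

print(f"mean={mean} median={median} mode={mode} 10% trimmed mean={trimmed}")
```

With this sample the mean is 8, while the median (5.0), mode (5), and 10% trimmed mean (4.75) are all close to 5.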

Measures of Spread or Variation
- Variance or standard deviation
- Range
- Interquartile range
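
A minimal sketch of these measures using Python's standard `statistics` module follows. The `sample` values are hypothetical and chosen only so the arithmetic is easy to check by hand. Quartile conventions differ between textbooks and software; this example assumes the default "exclusive" method of `statistics.quantiles`, so other tools may report a slightly different interquartile range.

```python
import statistics

# Hypothetical sample, not from the slides.
sample = [2, 4, 4, 4, 5, 5, 7, 9]

pop_variance = statistics.pvariance(sample)    # divides by n
sample_variance = statistics.variance(sample)  # divides by n - 1
sample_sd = statistics.stdev(sample)           # square root of the sample variance

value_range = max(sample) - min(sample)

# Quartiles under the module's default ("exclusive") convention.
q1, q2, q3 = statistics.quantiles(sample, n=4)
iqr = q3 - q1

print(f"population variance = {pop_variance}")
print(f"sample variance     = {sample_variance:.3f}")
print(f"sample std. dev.    = {sample_sd:.3f}")
print(f"range               = {value_range}")
print(f"interquartile range = {iqr}")
```

The range uses only the two most extreme observations, whereas the interquartile range describes the middle half of the data and is therefore less affected by unusual values.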

Types of Data
- Qualitative data
- Quantitative data

Qualitative Data
Definition: Measurements that classify a population or sample unit into one of a group of categories.
Examples:
- Sex, occupation, educational level, race, nationality
- Political party affiliation
- Defective status of manufactured items

Quantitative Data
Definition: Measurements that are recorded on a naturally occurring numerical scale.
Examples:
- Weight, height, age, blood pressure, pulse rate
- Temperature

Counted Data
Definition: The total number of sample or population units that have a specific characteristic or attribute.
Examples:
- Total number of male students in the 2603 213 class
- Total number of car accidents at an intersection during the week
- Total number of households in Thailand with children under age 15
- Total number of HIV-infected samples among 500 blood samples

Case Study
Aphasia is the impairment or loss of the faculty of using or understanding spoken or written language. Researchers have identified three types of aphasia: Broca's, conduction, and anomic. The objective of the study is to determine whether one type of aphasia occurs more often than any other and, if so, how often. A sample of 22 adult aphasiacs was selected, and the aphasia type of each was diagnosed. The data are as follows:
[Data table of the 22 diagnoses not reproduced in this transcript.]

Procedure
- Frequency counts and percentage
- Bar chart
- Good representative
- Inference
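
The first two steps of this procedure, frequency counts with percentages and a bar chart, can be sketched as follows. The `diagnoses` list is a hypothetical placeholder, not the study's 22 observations, which are not reproduced in this transcript. The bar chart is drawn as plain text so the example needs no plotting library.

```python
from collections import Counter

# Hypothetical diagnoses for illustration only; NOT the study's data.
diagnoses = [
    "Broca's", "anomic", "anomic", "conduction", "anomic",
    "Broca's", "anomic", "conduction", "anomic", "Broca's",
]

counts = Counter(diagnoses)    # frequency of each category
total = sum(counts.values())   # sample size

# One row per category, most frequent first: count, percentage, text bar.
for category, count in counts.most_common():
    percent = 100 * count / total
    bar = "#" * count
    print(f"{category:<12}{count:>3}{percent:>7.1f}%  {bar}")
```

The same counts would feed the inference step, for example a test of whether the three types are equally likely, which is outside the scope of this sketch.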

Case Study
The Environmental Protection Agency (EPA) performs extensive tests on all new car models to determine their mileage ratings. The following 100 measurements represent the results of such tests on a certain new car model (miles per gallon).
[Table of the 100 mileage measurements not reproduced in this transcript.]

Procedure
- Descriptive statistics
- Stem-and-leaf plot and box plot
- Typical value and variation
- Extreme values

20.5  20.7  20.8  21.0  21.0
21.4  21.5  22.0  22.1  22.5
22.6  22.6  22.7  22.7  22.9
22.9  23.1  23.3  23.4  23.5
23.6  23.6  23.6  23.9  24.1
24.3  24.5  24.5  24.8  24.8
24.9  24.9  25.1  25.1  25.2
25.6  25.8  25.9  26.1  26.7
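
As an illustration of a stem-and-leaf plot and the numbers behind a box plot, the sketch below uses the 40 values listed on this slide. Two things are assumptions rather than part of the slides: the quartiles follow the default "exclusive" convention of `statistics.quantiles`, and extreme values are flagged with the common 1.5 × IQR rule. The helper `stem_and_leaf` is written for this example.

```python
import statistics
from collections import defaultdict

# The 40 values listed on the slide.
values = [
    20.5, 20.7, 20.8, 21.0, 21.0, 21.4, 21.5, 22.0, 22.1, 22.5,
    22.6, 22.6, 22.7, 22.7, 22.9, 22.9, 23.1, 23.3, 23.4, 23.5,
    23.6, 23.6, 23.6, 23.9, 24.1, 24.3, 24.5, 24.5, 24.8, 24.8,
    24.9, 24.9, 25.1, 25.1, 25.2, 25.6, 25.8, 25.9, 26.1, 26.7,
]


def stem_and_leaf(data):
    """Map each integer part (stem) to the list of first decimals (leaves)."""
    rows = defaultdict(list)
    for value in sorted(data):
        stem, leaf = divmod(round(value * 10), 10)
        rows[stem].append(leaf)
    return dict(rows)


plot = stem_and_leaf(values)
for stem, leaves in plot.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")

# Summary behind a box plot (quartile convention is an assumption).
q1, median, q3 = statistics.quantiles(values, n=4)
iqr = q3 - q1

# Assumed 1.5 * IQR rule for flagging extreme values.
low_fence = q1 - 1.5 * iqr
high_fence = q3 + 1.5 * iqr
extremes = [v for v in values if v < low_fence or v > high_fence]

print(f"min={min(values)} Q1={q1:.3f} median={median:.2f} "
      f"Q3={q3:.3f} max={max(values)}")
print(f"values outside the fences: {extremes}")
```

Under these assumptions the median is 23.55 and no value falls outside the fences, so these 40 values contain no extreme values by that rule.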

Color: Gold, Gray, Black, Red, Green, Blue, White
Cars: 13001530501505020190

Application in Survey Data
- Structure of sample data
- Good representative
- Investigation of each variable in each cluster
- Frequency distribution
- Typical value and variation
- Extreme values
- Non-weighted tabulation