Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 2: Organizing Data STP 226: Elements of Statistics Jenifer Boshes Arizona State University.

Similar presentations


Presentation on theme: "Chapter 2: Organizing Data STP 226: Elements of Statistics Jenifer Boshes Arizona State University."— Presentation transcript:

1 Chapter 2: Organizing Data STP 226: Elements of Statistics Jenifer Boshes Arizona State University

2 2.1: Variables and Data

3 Variable HeightAge Number of siblings Sex Marital status Blood Type A variable is a characteristic that varies from one person or thing to another. Example 1:

4 Qualitative Variable A qualitative variable is a non-numerically valued variable. (categorical variable) Example 2:

5 Quantitative Variable A quantitative variable is a numerically valued variable. Example 3:

6 Types of Quantitative Variables A discrete variable is a quantitative variable whose possible values form a finite (or countably infinite) set of numbers. A continuous variable is a quantitative variable whose possible values form some interval of numbers. A discrete variable is a quantitative variable whose possible values form a finite (or countably infinite) set of numbers. A continuous variable is a quantitative variable whose possible values form some interval of numbers.

7 Classify the following as qualitative, discrete, or continuous: HeightAge Number of siblings Place of birth Number of credit hours Eye color Ounces of coffee drank per day Number of times visited the Grand Canyon Example 4:

8 Variables

9 Data Data: Information obtained by observing values of a variable. Data is classified as qualitative data, quantitative data, discrete or continuous data depending on how it was obtained. Determine whether the following examples of data are quantitative or qualitative. You ask a sample of students how many hours of sleep they get. A census is taken of number of cars in a household in a city. First graders are asked about their favorite color. Example 5:

10 2.2: Grouping Data

11 Why do we group data? To simplify large/complicated data sets. To further organize data. To study a particular variable of interest.

12 Example 1 - Cholesterol: The total cholesterol level for 30 twenty year old males is given. Construct a grouped- data table for the data. Use a class width of 20 and a first cutpoint of 160. 180186200210190210 230200198240220200 210160180196200250 200210250260170190 210180180190200260

13 Grouped-Data Table ClassesFrequency Relative Frequency Midpoint Be sure to include the following columns: 180186200210190210 230200198240220200 210160180196200250 200210250260170190 210180180190200260

14 Example 1 - Cholesterol: Cholesterol Level Frequency Relative Frequency Midpoint 20.067170

15 Guidelines for Grouping Quantitative Data (1)The number of classes should be small enough to effectively describe the data, but large enough to display the relevant characteristics. (Usually 5-20.) (2)Each observation must belong to one, and only one, class. (3)Whenever possible, all classes should have the same width.

16 Terminology in Grouping Data Classes: Categories for grouping data. Frequency: The number of observations that fall into a class. Frequency distribution: A table that provides all classes and their frequencies. Relative frequency: The ratio of the frequency of a class to the total number of observations.

17 Lower cutpoint: The smallest value in a class. Upper cutpoint: The smallest value that could go into the next higher class. Midpoint: The middle of a class; found by taking the average the upper and lower cutpoints. Width: The difference between the upper and lower cutpoints. Terminology in Grouping Data

18 Example 2 – Maple Trees: The heights of 10 maple trees were recorded as follows. Determine the frequency and relative- frequency distributions for these data. (Use classes of size 5.) 6465728082 6580656570

19 2.3: Graphs and Charts

20 Graphical Displays for Quantitative Data Frequency histogram: A graph that displays the classes on the horizontal axis and the frequencies on the vertical axis. Relative frequency histogram: A graph that displays the classes on the horizontal axis and the relative frequencies on the vertical axis. Bar graph: Similar to a relative frequency histogram, but the bars do not touch. This is used for qualitative data.

21 Example 1 - Cholesterol: Cholesterol Level Frequency Relative Frequency Midpoint 20.067170 100.333190 110.366210 20.067230 30.1250 20.067270 301.000 Construct a histogram for the cholesterol data, showing both frequencies and relative frequencies.

22 Example 1-Cholesterol:

23 Dotplots Dotplots are particularly useful for showing the relative positions of the data in a data set or for comparing two or more data sets. They are most useful for a small data set with a moderate range in values. To construct a dotplot: Draw a horizontal axis. Draw a horizontal axis. Record each data point by placing a dot over the appropriate value. Record each data point by placing a dot over the appropriate value.

24 Example 2 -Cholesterol: Construct a dotplot for the data. 180186200210190210 230200198240220200 210160180196200250 200210250260170190 210180180190200260

25 Stem-and-Leaf Diagram (1)Select the leading digit(s) from the data and list in a vertical column. (STEM) (2)Write the final digit of each number to the right of the appropriate leading digit. (LEAVES)

26 Stem-and-Leaf Diagram 1. Select the leading digit(s) from the data and list in a vertical column. (STEM) 2. Write the final digit of each number to the right of the appropriate leading digit. (LEAVES) 2022181020 3032225030 2426283032 2018242220 A group of women who had just given birth were asked how many pre- natal visits they had made to a doctor. The following information was recorded. Example 3a – Pre-natal Visits:

27 Stem-and-Leaf Diagrams 2022181020 3032225030 2426283032 2018242220 Example 3a – Pre-natal Visits: The stem-and-leaf diagram with two lines per stem would be: 10188 2000022244 268 300022 3 4 4 50 5

28 Example 4 - Cholesterol 180186200210190210 230200198240220200 210160180196200250 200210250260170190 210180180190200260 Create a stem-and-leaf plot for the cholesterol data using the first two digits as the stem. 1617 18 19 20 21 22 23 24 25 26

29 Example 4 - Cholesterol Compare this stem-and-leaf to the histogram for the same data. What similarities do we see? 160 170 1800006 1900068 20000000 2100000 220 230 240 2500 2600

30 Example 4 - Cholesterol Compare this stem-and-leaf to the histogram for the same data. What similarities do we see? (Note: histogram grouped by classes of size 20, so not exactly the same.) 160 170 1800006 1900068 20000000 2100000 220 230 240 2500 2600

31 Graphing Qualitative Data A pie chart is a circle divided into wedge- shaped pieces that are proportional to the relative frequencies. A bar graph is like a histogram, but the bars do not touch.

32 Example 5 – Blood Type: A sample of 105 blood donors at a clinic can be described as follows: TypeFrequency A47 B22 AB20 O16 Total105 Type, x Frequency Relative Frequency A470.448 B220.210 AB200.190 O160.152 1051

33 2.4: Distribution Shapes

34

35 Distribution Shapes

36

37 Group Assignment/Practice Here are the ages for the CEOs of the 30 top-ranked small companies in America from Forbes. Using this data, produce the following: (a) A grouped-data table using classes of size five and starting with age 35. (b) Construct a histogram for the data. (c) A dotplot. (d) A stem-and-leaf diagram with one line per stem. (e) A stem-and-leaf diagram with two lines per stem. 593847536069445056634048536141444955624355616153484855624348

38 Bibliography Some of the textbook images embedded in the slides were taken from: Elementary Statistics, Sixth Edition; by Weiss; Addison Wesley Publishing Company Copyright © 2005, Pearson Education, Inc.


Download ppt "Chapter 2: Organizing Data STP 226: Elements of Statistics Jenifer Boshes Arizona State University."

Similar presentations


Ads by Google