Download presentation
Presentation is loading. Please wait.
Published byLoreen Page Modified over 8 years ago
1
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Day 2 Lecture Review of Descriptive Statistics
2
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-2 Agenda Procedures Review MyCourses, Smam320 website, WebAssign, Groups Lecture: Introduction to univariate data analysis
3
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-3 Learning “ Learning is defined as stabilizing, through repeated use, certain appropriate and desirable synapses in the brain.“ (Leamnson, 2000, p. 5)
4
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-4 Active Learning Knowing and learning are communal acts. They require a continual cycle of discussion, disagreement, and consensus over what has been and what it all means.” (Palmer)
5
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-5 The Quiet Signal In order to use active learning, we have to have a quiet signal. I will raise my hand when it is time to stop group work and resume all class activity.
6
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-6 The Quiet Signal Complete your sentence. Raise your hand. Alert classmates that its time to focus up front again.
7
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-7 Interview A interviews B At signal, reverse roles and B interviews A. (same number of minutes and same questions.) A B C
8
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-8 Interview Questions Name, etc. How comfortable are you with Minitab? How do you learn best? * * * * * * * Extra Time? * * * * * * * * What are your greatest fears about this class?
9
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-9 In Every Statistical Study: Questions are posed Characteristics are observed
10
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-10 Major Branches of Statistics Descriptive Statistics Organizing, Summarizing Information Graphical techniques (know which graph goes with which type of variable: qualitative, quantitative; and which are univariate techniques and which are bivariate) Numerical techniques (know which descriptive summary measure is appropriate for summarizing qualitative data and which for quantitative data.) Inferential Statistics Estimation Decision making
11
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-11 Descriptive Statistics Methods for summarizing data Summaries usually consist of graphs and numerical summaries of the data Summarize the raw data Present the data in a useful format
12
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-12 Inference Methods of making decisions or predictions about a populations based on sample information. Most of this class (320) is about Inference.
13
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-13 Parameter and Statistic A parameter is a numerical summary of the population A statistic is a numerical summary of a sample taken from the population
14
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-14
15
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-15 Randomness Simple Random Sampling: each subject in the population has the same chance of being included in that sample Randomness is crucial to experimentation
16
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-16 Variability Measurements vary from person to person Measurements vary from sample to sample Variability is the name of the game in Data Analysis!
17
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-17 Characteristics are Variables A Variable is any characteristic that is recorded for subjects in the study. Note that our text uses the word characteristic for parameter.
18
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-18 Variation in Data The terminology variable highlights the fact that data values vary.
19
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-19 Data values are called observations Each observation can be: Quantitative Categorical
20
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-20 Categorical Data - Sample Proportion
21
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-21 Describing the Center of a Data Set with the arithmetic mean The population mean is denoted by µ, is the average of all x values in the entire population. The sample median is obtained by first ordering the n observations from smallest to largest (with any repeated values included, so that every sample observation appears in the ordered list). Then
22
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-22 Mean The sum of the observations divided by the number of observations
23
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-23 Median The midpoint of the observations when they are ordered from the smallest to the largest (or from the largest to the smallest)
24
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-24 Comparing the Sample Mean & Sample Median when a distribution is skewed positively, the mean is larger than the median, when a distribution is skewed negatively, the mean is smaller then the median, and when a distribution is symmetric, the mean and the median are equal.
25
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-25 Illustrated Distribution Shapes Unimodal BimodalMultimodal Skew negatively Symmetric Skew positively
26
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-26 Measuring Spread: Standard Deviation Creates a measure of variation by summarizing the deviations of each observation from the mean and calculating an adjusted average of these deviations
27
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-27 Empirical Rule For bell-shaped data sets: Approximately 68% of the observations fall within 1 standard deviation of the mean Approximately 95% of the observations fall within 2 standard deviations of the mean Approximately 100% of the observations fall within 3 standard deviations of the mean
28
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-28 Z Scores We can measure any observation on a variable in deviations from its mean and measure these deviations in units of the standard deviation. The resulting value is the Z-score. Computing the z score is often referred to as standardization and the z score is called a standardized score.
29
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-29 Z Scores The z score is how many standard deviations the observation is from the mean. A positive z score indicates the observation is above the mean A negative z score indicates the observation is below the mean.
30
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-30 A multivariate data set consists of measurements or observations on each of two or more variables. The data we collected from the gas use survey represents multivariate data. When we looked at a scatterplot diagram of gas use and distance of residence and distinguished between commuters and those who live on-campus, that was multivariate data analysis. Multivariate Data
31
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-31 Bivariate Data A bivariate data set consists of measurements or observations on each of two variables. When we produced comparative descriptive statistics and comparative box-plots, those were examples of bivariate data analysis where one of the variables was a quantitative variable and the other was a qualitative variable.
32
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-32 nt al Examples with one Quantitative Variable and one Qualitative.
33
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-33 Comparative Descriptive Statistics Variable year N Mean StDev Median How much do you 1st 6 8.00 6.78 6.50 2nd 10 18.70 11.89 17.50 3 rd 6 3.67 4.32 2.50 4th 2 10.000 0.000000000 10.000
34
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-34 Example: Food Type and Pesticide Status What proportion of all sampled items contain pesticide residuals? Pesticides: Food Type: Yes No Organic 29 98 Conventional19485 7086
35
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-35 Contingency Table The Food Type and Pesticide Status Table is called a contingency table A contingency table: Displays 2 categorical variables The rows list the categories of 1 variable The columns list the categories of the other variable Entries in the table are frequencies
36
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-36 Scatterplots A scatterplot is a plot of pairs of observed values (both quantitative) of two different variables. When one of the variables is considered to be a response variable (y) and the other an explanatory variable (x). Use for y, the variable for which we want to understand the variation. The x will be the variable that we think might help us explain the variation in y.
37
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-37 Multivariate Data Analysis
38
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-38 Association Positive Association - Two variables are positively associated when above-average values of one tend to accompany above-average values of the other and below-average values tend similarly to occur together. (i.e., Generally speaking, the y values tend to increase as the x values increase.) Negative Association - Two variables are negatively associated when above-average values of one accompany below-average values of the other, and vice versa. (i.e., Generally speaking, the y values tend to decrease as the x values increase.)
39
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-39 The Pearson Correlation Coefficient A measure of the strength of the linear relationship between the two variables is called the Pearson correlation coefficient. The value of r does not depend on the unit of measurement for each variable. The value of r does not depend on which of the two variables is labeled x. The value of r is between –1 and +1.
40
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-40 Some Correlation Pictures
41
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-41 Regression Analysis Regression involves “fitting a line” through the scatterplot of the data to represent the paired observations (x,y). The relationship y = a + bx is the equation of a straight line. The value b, called the slope of the line, is the amount by which y increases when x increase by 1 unit. The value of a, called the intercept (or sometimes the vertical intercept) of the line, is the height of the line above the value x = 0.
42
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-42 Variation The Total Variation in the dependent variable can be split into two parts That which is EXPLAINED by the independent or explanatory variable and That which is NOT EXPLAINED by the independent variable. Least squares Regression is a method for getting an equation for a line through the data that produces a line that minimizes the sum of the unexplained variation for all the observations in the data set.
43
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-43 Using the Regression Equation How much do you spend per week on gas Gas Spending = 9.32 + 0.422 (Distance of residence) y=9.32 + 0.422 x We can use this to predict how much someone spends, if we know how far they live from RIT. We just plug that value in for x and solve for y. The regression equation is
44
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-44 How Good Is the Line? The coefficient of determination, denoted by r 2, gives the proportion of variation in y that can be attributed to an approximate linear relationship between x and y. The r 2 is between 0 and +1. The higher the value, the greater the % of variation explained by the regression equation. In our example, we were not able to explain much of the variation in the amount spent on gasoline.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.