Presentation is loading. Please wait.

Presentation is loading. Please wait.

Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Day 2 Lecture Review of Descriptive Statistics.

Similar presentations


Presentation on theme: "Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Day 2 Lecture Review of Descriptive Statistics."— Presentation transcript:

1 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Day 2 Lecture Review of Descriptive Statistics

2 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-2 Agenda Procedures Review  MyCourses,  Smam320 website,  WebAssign,  Groups Lecture: Introduction to univariate data analysis

3 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-3 Learning “ Learning is defined as stabilizing, through repeated use, certain appropriate and desirable synapses in the brain.“ (Leamnson, 2000, p. 5)

4 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-4 Active Learning Knowing and learning are communal acts. They require a continual cycle of discussion, disagreement, and consensus over what has been and what it all means.” (Palmer)

5 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-5 The Quiet Signal In order to use active learning, we have to have a quiet signal. I will raise my hand when it is time to stop group work and resume all class activity.

6 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-6 The Quiet Signal Complete your sentence. Raise your hand. Alert classmates that its time to focus up front again.

7 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-7 Interview A interviews B At signal, reverse roles and B interviews A. (same number of minutes and same questions.) A B C

8 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-8 Interview Questions Name, etc. How comfortable are you with  Minitab? How do you learn best? * * * * * * * Extra Time? * * * * * * * * What are your greatest fears about this class?

9 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-9 In Every Statistical Study: Questions are posed Characteristics are observed

10 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-10 Major Branches of Statistics Descriptive Statistics  Organizing, Summarizing Information  Graphical techniques (know which graph goes with which type of variable: qualitative, quantitative; and which are univariate techniques and which are bivariate)  Numerical techniques (know which descriptive summary measure is appropriate for summarizing qualitative data and which for quantitative data.) Inferential Statistics  Estimation  Decision making

11 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-11 Descriptive Statistics Methods for summarizing data Summaries usually consist of graphs and numerical summaries of the data Summarize the raw data Present the data in a useful format

12 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-12 Inference Methods of making decisions or predictions about a populations based on sample information. Most of this class (320) is about Inference.

13 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-13 Parameter and Statistic A parameter is a numerical summary of the population A statistic is a numerical summary of a sample taken from the population

14 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-14

15 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-15 Randomness Simple Random Sampling: each subject in the population has the same chance of being included in that sample Randomness is crucial to experimentation

16 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-16 Variability Measurements vary from person to person Measurements vary from sample to sample Variability is the name of the game in Data Analysis!

17 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-17 Characteristics are Variables A Variable is any characteristic that is recorded for subjects in the study. Note that our text uses the word characteristic for parameter.

18 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-18 Variation in Data The terminology variable highlights the fact that data values vary.

19 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-19 Data values are called observations Each observation can be:  Quantitative  Categorical

20 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-20 Categorical Data - Sample Proportion

21 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-21 Describing the Center of a Data Set with the arithmetic mean The population mean is denoted by µ, is the average of all x values in the entire population. The sample median is obtained by first ordering the n observations from smallest to largest (with any repeated values included, so that every sample observation appears in the ordered list). Then

22 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-22 Mean The sum of the observations divided by the number of observations

23 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-23 Median The midpoint of the observations when they are ordered from the smallest to the largest (or from the largest to the smallest)

24 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-24 Comparing the Sample Mean & Sample Median when a distribution is skewed positively, the mean is larger than the median, when a distribution is skewed negatively, the mean is smaller then the median, and when a distribution is symmetric, the mean and the median are equal.

25 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-25 Illustrated Distribution Shapes Unimodal BimodalMultimodal Skew negatively Symmetric Skew positively

26 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-26 Measuring Spread: Standard Deviation Creates a measure of variation by summarizing the deviations of each observation from the mean and calculating an adjusted average of these deviations

27 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-27 Empirical Rule For bell-shaped data sets: Approximately 68% of the observations fall within 1 standard deviation of the mean Approximately 95% of the observations fall within 2 standard deviations of the mean Approximately 100% of the observations fall within 3 standard deviations of the mean

28 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-28 Z Scores We can measure any observation on a variable in deviations from its mean and measure these deviations in units of the standard deviation. The resulting value is the Z-score. Computing the z score is often referred to as standardization and the z score is called a standardized score.

29 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-29 Z Scores The z score is how many standard deviations the observation is from the mean. A positive z score indicates the observation is above the mean A negative z score indicates the observation is below the mean.

30 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-30 A multivariate data set consists of measurements or observations on each of two or more variables. The data we collected from the gas use survey represents multivariate data. When we looked at a scatterplot diagram of gas use and distance of residence and distinguished between commuters and those who live on-campus, that was multivariate data analysis. Multivariate Data

31 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-31 Bivariate Data A bivariate data set consists of measurements or observations on each of two variables. When we produced comparative descriptive statistics and comparative box-plots, those were examples of bivariate data analysis where one of the variables was a quantitative variable and the other was a qualitative variable.

32 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-32 nt al Examples with one Quantitative Variable and one Qualitative.

33 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-33 Comparative Descriptive Statistics Variable year N Mean StDev Median How much do you 1st 6 8.00 6.78 6.50 2nd 10 18.70 11.89 17.50 3 rd 6 3.67 4.32 2.50 4th 2 10.000 0.000000000 10.000

34 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-34 Example: Food Type and Pesticide Status What proportion of all sampled items contain pesticide residuals? Pesticides: Food Type: Yes No Organic 29 98 Conventional19485 7086

35 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-35 Contingency Table The Food Type and Pesticide Status Table is called a contingency table A contingency table:  Displays 2 categorical variables  The rows list the categories of 1 variable  The columns list the categories of the other variable  Entries in the table are frequencies

36 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-36 Scatterplots A scatterplot is a plot of pairs of observed values (both quantitative) of two different variables. When one of the variables is considered to be a response variable (y) and the other an explanatory variable (x). Use for y, the variable for which we want to understand the variation. The x will be the variable that we think might help us explain the variation in y.

37 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-37 Multivariate Data Analysis

38 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-38 Association Positive Association - Two variables are positively associated when above-average values of one tend to accompany above-average values of the other and below-average values tend similarly to occur together. (i.e., Generally speaking, the y values tend to increase as the x values increase.) Negative Association - Two variables are negatively associated when above-average values of one accompany below-average values of the other, and vice versa. (i.e., Generally speaking, the y values tend to decrease as the x values increase.)

39 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-39 The Pearson Correlation Coefficient A measure of the strength of the linear relationship between the two variables is called the Pearson correlation coefficient. The value of r does not depend on the unit of measurement for each variable. The value of r does not depend on which of the two variables is labeled x. The value of r is between –1 and +1.

40 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-40 Some Correlation Pictures

41 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-41 Regression Analysis Regression involves “fitting a line” through the scatterplot of the data to represent the paired observations (x,y). The relationship y = a + bx is the equation of a straight line. The value b, called the slope of the line, is the amount by which y increases when x increase by 1 unit. The value of a, called the intercept (or sometimes the vertical intercept) of the line, is the height of the line above the value x = 0.

42 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-42 Variation The Total Variation in the dependent variable can be split into two parts That which is EXPLAINED by the independent or explanatory variable and That which is NOT EXPLAINED by the independent variable. Least squares Regression is a method for getting an equation for a line through the data that produces a line that minimizes the sum of the unexplained variation for all the observations in the data set.

43 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-43 Using the Regression Equation How much do you spend per week on gas Gas Spending = 9.32 + 0.422 (Distance of residence) y=9.32 + 0.422 x We can use this to predict how much someone spends, if we know how far they live from RIT. We just plug that value in for x and solve for y. The regression equation is

44 Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 7-44 How Good Is the Line? The coefficient of determination, denoted by r 2, gives the proportion of variation in y that can be attributed to an approximate linear relationship between x and y. The r 2 is between 0 and +1. The higher the value, the greater the % of variation explained by the regression equation. In our example, we were not able to explain much of the variation in the amount spent on gasoline.


Download ppt "Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Day 2 Lecture Review of Descriptive Statistics."

Similar presentations


Ads by Google