BCOR 1020 Business Statistics

Slides:



Advertisements
Similar presentations
Created by Tom Wegleitner, Centreville, Virginia
Advertisements

Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 1 An Introduction to Business Statistics.
Statistics for Managers Using Microsoft® Excel 5th Edition
Elementary Statistics MOREHEAD STATE UNIVERSITY
Chapter 1 A First Look at Statistics and Data Collection.
Measurement in Survey Research Developing Questionnaire Items with Respect to Content and Analysis.
Chapter 1 The Where, Why, and How of Data Collection
Chapter 1 The Where, Why, and How of Data Collection
Chapter 1 The Where, Why, and How of Data Collection
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
BCOR 1020 Business Statistics Lecture 15 – March 6, 2008.
Introduction to Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 1-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 1 The Where, Why, and How of Data Collection
Chapter One An Introduction to Business Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 1 Sampling and Data.
Chapter 1 The Where, Why, and How of Data Collection
David Kilgour Statistics David Kilgour Statistics.
PowerPoint Presentation Package to Accompany:
Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population vs. Sample.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 1-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Prepared by Lloyd R. Jaisingh
Chapter 1: Introduction to Statistics
Probability & Statistics
Chapter 1 Introduction and Data Collection
© The McGraw-Hill Companies, Inc., by Marc M. Triola & Mario F. Triola SLIDES PREPARED BY LLOYD R. JAISINGH MOREHEAD STATE UNIVERSITY MOREHEAD.
What is Statistics Chapter 1 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
STATISTICS is about how to COLLECT, ORGANIZE,
Statistics: Basic Concepts. Overview Survey objective: – Collect data from a smaller part of a larger group to learn something about the larger group.
Introduction to Statistics What is Statistics? : Statistics is the sciences of conducting studies to collect, organize, summarize, analyze, and draw conclusions.
Introduction to Probability and Statistics Consultation time: Ms. Chong.
ECON 3790 Statistics for Business and Economics
Chapter 2 Data.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Introduction Biostatistics Analysis: Lecture 1 Definitions and Data Collection.
Ch.1 INTRODUCTION TO STATISTICS Prepared by: M.S Nurzaman, MIDEc. ( deden )‏ (021) /
Basic Business Statistics
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
2-1. Data Collection Data Vocabulary Data Vocabulary Level of Measurement Level of Measurement Time Series and Cross-sectional Data Time Series and Cross-sectional.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 1-3 Types of Data.
Data Collection Data Collection Definitions Level of Measurement Time Series and Cross-sectional Data Sampling Concepts Sampling Methods Data Sources Survey.
General Business 704 Data Analysis for Managers Introduction The Course, Data, and Excel.
Chapter Eleven Sampling: Design and Procedures Copyright © 2010 Pearson Education, Inc
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. Chap 1-1 A Course In Business Statistics 4 th Edition Chapter 1 The Where, Why, and How.
Chap 1-1 Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population.
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
1 What is Data? l An attribute is a property or characteristic of an object l Examples: eye color of a person, temperature, etc. l Attribute is also known.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Biostatistics Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Do Now  47 TCNJ students were asked to complete a survey on campus clubs and activities. 87% of the students surveyed participate in campus clubs and.
2 NURS/HSCI 597 NURSING RESEARCH & DATA ANALYSIS GEORGE MASON UNIVERSITY.
Elementary Statistics
Learning Objectives : After completing this lesson, you should be able to: Describe key data collection methods Know key definitions: Population vs. Sample.
Measurements Statistics
Chapter 6 Introductory Statistics and Data
statistics Specific number
Chapter 1 The Where, Why, and How of Data Collection
Chapter 1 The Where, Why, and How of Data Collection
Chapter 6 Introductory Statistics and Data
The Where, Why, and How of Data Collection
Business Statistics For Contemporary Decision Making 9th Edition
Chapter 1 The Where, Why, and How of Data Collection
Presentation transcript:

BCOR 1020 Business Statistics Lecture 2 – January 17, 2008

Overview Chapter 2 – Data Collection… Key Definitions Level of Measurement Time Series vs. Cross-Sectional Data Sampling Concepts Sampling Methods Data Sources Survey Research

Chapter 2 – Key Definitions Data is the plural form of the Latin datum (a “given” fact). In scientific research, data arise from experiments whose results are recorded systematically. In business, data usually arise from accounting transactions or management processes. Important decisions may depend on data. We will refer to Data as plural and data set as a particular collection of data as a whole. Observation – each data value. Subject (or individual) – an item for study (e.g., an employee in your company). Variable – a characteristic about the subject or individual (e.g., employee’s income).

Chapter 2 – Key Definitions Three types of data sets: Data Set Variables Typical Tasks Univariate One Histograms, descriptive statistics, frequency tallies, estimation, hypothesis testing Bivariate Two Scatter plots, correlations, simple regression Multivariate More than two Multiple regression, data mining, econometric modeling

Chapter 2 – Key Definitions Consider the multivariate data set with 5 variables 8 subjects 5 x 8 = 40 observations Multivariate data sets can become quite large.

Chapter 2 – Key Definitions A data set may have a mixture of data types. Types of Data Attribute (qualitative) Numerical (quantitative) Verbal Label X = economics (your major) Coded X = 3 (i.e., economics) Discrete X = 2 (your siblings) Continuous X = 3.15 (your GPA)

Chapter 2 – Key Definitions Attribute Data: Also called categorical, nominal or qualitative data. Values are described by words rather than numbers. For example, - Automobile style (e.g., X = full, midsize, compact, subcompact). - Mutual fund (e.g., X = load, no-load). Data Coding: Coding refers to using numbers to represent categories to facilitate statistical analysis. (We need numerical representations to perform the math.) Coding an attribute as a number does not make the data numerical. For example, 1 = Bachelor’s, 2 = Master’s, 3 = Doctorate Rankings may exist (consider the above example)

Chapter 2 – Key Definitions Binary Data – An Important Class of Attribute Data: A binary variable has only two values, 1 = presence, 0 = absence of a characteristic of interest (codes themselves are arbitrary). For example, 1 = employed, 0 = not employed 1 = married, 0 = not married 1 = male, 0 = female 1 = female, 0 = male The coding itself has no numerical value so binary variables are attribute data.

Chapter 2 – Key Definitions Numerical Data – Numerical or quantitative data arise from counting, measurement or some kind of mathematical operation. For example, Number of auto insurance claims filed in March. Length of time between customer arrivals on a webpage. Ratio of profit to sales for last quarter. Can be broken down into two types – discrete or continuous data.

Chapter 2 – Key Definitions Discrete Data – A numerical variable with a countable number of values that can be represented by an integer (no fractional values). For example, Number of Medicaid patients (e.g., X = 2). Number of takeoffs at O’Hare (e.g., X = 37). Continuous Data – A numerical variable that can have any value within an interval (e.g., length, weight, time, sales, price/earnings ratios). Any continuous interval contains infinitely many possible values (e.g., 426 < X < 428).

Clickers What type of data (attribute, discrete numerical, or continuous numerical) is your GPA? A = attribute B = discrete numerical C = continuous numerical

Clickers What type of data (attribute, discrete numerical, or continuous numerical) is the occupation of a mortgage applicant? A = attribute B = discrete numerical C = continuous numerical

Chapter 2 – Level of Measurement There are four levels of measurement for data: Level of Measurement Characteristics Example Nominal Categories only Eye color (blue, brown, green, hazel) Ordinal Rank has meaning Bond ratings (Aaa, Aab, C, D, F, etc.) Interval Distance has meaning Temperature (57o Celsius) Ratio Meaningful zero exists Accounts payable ($21.7 million)

Chapter 2 – Level of Measurement Nominal Measurement – Nominal data merely identify a category. Nominal data are qualitative, attribute, categorical or classification data (e.g., Apple, Compaq, Dell, HP). Nominal data are usually coded numerically, codes are arbitrary (e.g., 1 = Apple, 2 = Compaq, 3 = Dell, 4 = HP). Only mathematical operations are counting (e.g., frequencies) and simple statistics. Ordinal Measurement – Ordinal data codes can be ranked (e.g., 1 = Frequently, 2 = Sometimes, 3 = Rarely, 4 = Never). Distance between codes is not meaningful (e.g., distance between 1 and 2, or between 2 and 3, or between 3 and 4 lacks meaning). Many useful statistical tests exist for ordinal data. Especially useful in social science, marketing and human resource research.

Chapter 2 – Level of Measurement Interval Measurement – Data can not only be ranked, but also have meaningful intervals between scale points (e.g., difference between 60F and 70F is same as difference between 20F and 30F). Since intervals between numbers represent distances, mathematical operations can be performed (e.g., average). Zero point of interval scales is arbitrary, so ratios are not meaningful (e.g., 60F is not twice as warm as 30F).

Chapter 2 – Level of Measurement Likert Scales – A special case of interval (?) data frequently used in survey research. The coarseness of a Likert scale refers to the number of scale points (typically 5 or 7). A neutral midpoint (“Neither Agree Nor Disagree”) is allowed if an odd number of scale points is used or omitted to force the respondent to “lean” one way or the other. Likert data are coded numerically (e.g., 1 to 5) but any equally spaced values will work. Careful choice of verbal anchors results in measurable intervals (e.g., the distance from 1 to 2 is “the same” as the interval, say, from 3 to 4). Ratios are not meaningful (e.g., here 4 is not twice 2). Many statistical calculations can be performed (e.g., averages, correlations, etc.).

Chapter 2 – Level of Measurement Likert Scales – Example Two variants of Likert scales… How would you rate your marketing instructor? (check one)  Terrible  Poor  Adequate  Good  Excellent How would you rate your marketing instructor? (check one) Very Bad  Very Good

Chapter 2 – Level of Measurement Ratio Measurement – Ratio data have all properties of nominal, ordinal and interval data types and also possess a meaningful zero (absence of quantity being measured). Because of this zero point, ratios of data values are meaningful (e.g., $20 million profit is twice as much as $10 million). Zero does not have to be observable in the data, it is an absolute reference point. Changing Data by Recoding – In order to simplify data or when exact data magnitude is of little interest, ratio data can be recoded downward into ordinal or nominal measurements (but not conversely). For example, recode systolic blood pressure as “normal” (under 130), “elevated” (130 to 140), or “high” (over 140). The above recoded data are ordinal (ranking is preserved) but intervals are unequal and some information is lost.

Chapter 2 – Level of Measurement Use the following procedure to recognize data types: Question If “Yes” Q1. Is there a meaningful zero point? Ratio data (all statistical operations are allowed) Q2. Are intervals between scale points meaningful? Interval data (common statistics allowed, e.g., means and standard deviations) Q3. Do scale points represent rankings? Ordinal data (restricted to certain types of nonparametric statistical tests) Q4. Are there discrete categories? Nominal data (only counting allowed, e.g. finding the mode)

Clickers Which type of data (nominal, ordinal, interval, or ratio) is the pay of employees in the Wal-Mart store in Hutchinson Kansas? A = nominal B = ordinal C = interval D = ratio

Chapter 2 – Time Series & Cross-Sectional Data Time Series Data – Each observation in the sample represents a different equally spaced point in time (e.g., years, months, days). Periodicity may be annual, quarterly, monthly, weekly, daily, hourly, etc. We are interested in trends and patterns over time (e.g., annual growth in consumer debit card use from 1999 to 2006). Cross-sectional Data – Each observation represents a different individual unit (e.g., person) at the same point in time (e.g., monthly VISA balances). We are interested in variation among observations or in relationships. We can combine the two data types to get pooled cross-sectional and time series data.

Clickers Which type of data (time series or cross- sectional) is Mexico’s GDP for the last 10 quarters? A = time series B = cross-sectional

Chapter 2 – Sampling Concepts Sample or Census? A sample involves looking only at some items selected from the population. A census is an examination of all items in a defined population. Why collect a sample rather than complete a census?

Chapter 2 – Sampling Concepts Situations Where A Sample May Be Preferred: Infinite Population – Destructive Testing Timely Results – Cost Accuracy – Sensitive Information Situations Where A Census May Be Preferred: Small Population – Large Sample Size Database Exists – Legal Requirements

Chapter 2 – Sampling Concepts Parameters and Statistics: Parameters – Any measurement that describes an entire population. Usually, the parameter value is unknown since we rarely can observe the entire population. Parameters are often (but not always) represented by Greek letters. Statistics – Any measurement computed from a sample. Usually, the statistic is regarded as an estimate of a population parameter. Sample statistics are often (but not always) represented by Roman letters. Sample (Size = n): Statistics are computed and estimate parameters e.g., = sample mean, S = sample std. dev. Population (Size = N): Characterized by Parameters e.g., m = pop. Mean, s = pop. Std. dev.

Chapter 2 – Sampling Concepts Population – The population must be carefully specified and the sample must be drawn scientifically so that the sample is representative. Target Population: The target population is the population we are interested in (e.g., U.S. gasoline prices). The sampling frame is the group from which we take the sample (e.g., 115,000 stations). The frame should not differ from the target population. Finite or Infinite? A population is finite if it has a definite size, even if its size is unknown. A population is infinite if it is of arbitrarily large size. Rule of Thumb: A population may be treated as infinite when N is at least 20 times n (i.e., when N/n > 20).

Chapter 2 – Sampling Methods Probability Samples: Simple Random Sample – Use random numbers to select items from a list (e.g., VISA cardholders). Systematic Sample – Select every kth item from a list or sequence (e.g., restaurant customers). Stratified Sample – Select randomly within defined strata (e.g., by age, occupation, gender). Cluster Sample – Like stratified sampling except strata are geographical areas (e.g., zip codes). Nonprobability Samples: Judgment Sample – Use expert knowledge to choose “typical” items (e.g., which employees to interview). Convenience Sample – Use a sample that happens to be available (e.g., ask co-worker opinions at lunch).

Chapter 2 – Sampling Methods Simple Random Sample: Every item in the population of N items has the same chance of being chosen in the sample of n items. Clicker Example…. On your clicker, select either A, B, C, or D “at random”. Collect and look at the results. Discuss whether these results look random.

Chapter 2 – Sampling Methods With or Without Replacement: If we allow duplicates when sampling, then we are sampling with replacement. Duplicates are unlikely when n is much smaller than N. If we do not allow duplicates when sampling, then we are sampling without replacement. Sample Size: Sample size depends on the inherent variability of the quantity being measured and on the desired precision of the estimate.

Clickers Suppose you want to know the ages of moviegoers who attend Juno. What kind of sample is it if you survey the first 20 people who emerge from the theater? A = simple random sample B = systematic sample C = stratified sample D = convenience sample

Chapter 2 – Data Sources Useful Data Sources: Type of Data Examples U.S. general data Statistical Abstract of the U.S. U.S. economic data Economic Report of the President Almanacs World Almanac, Time Almanac Periodicals Economist, Business Week, Fortune Indexes New York Times, Wall Street Journal Databases CompuStat, Citibase, U.S. Census World data CIA World Factbook Web Google, Yahoo, msn In many instances, you will collect data specific to a specific question of interest through research or survey.

Chapter 2 – Survey Research Basic Steps of Survey Research: Step 1: State the goals of the research Step 2: Develop the budget (time, money, staff) Step 3: Create a research design (target population, frame, sample size) Step 4: Choose a survey type and method of administration Step 5: Design a data collection instrument (questionnaire) Step 6: Pretest the survey instrument and revise as needed Step 7: Administer the survey (follow up if needed) Step 8: Code the data and analyze it

Chapter 2 – Survey Research Survey Types (You should acquaint yourselves with the characteristics of each – Table 2.12). Mail Telephone Interviews Web Direct Observation

Chapter 2 – Survey Research Survey Guidelines: Plan – What is the purpose of the survey? Consider staff expertise, needed skills, degree of precision, budget. Design – Invest time and money in designing the survey. Use books and references to avoid unnecessary errors. Quality – Take care in preparing a quality survey so that people will take you seriously. Pilot Test – Pretest on friends or co-workers to make sure the survey is clear. Buy-in – Improve response rates by stating the purpose of the survey, offering a token of appreciation or paving the way with endorsements. Expertise – Work with a consultant early on.