STATISTICS Silvia Cagnone Department of Statistics University of Bologna MIREES A cademic year 2008/2009.

Slides:



Advertisements
Similar presentations
STATISTICAL METHODS I STATISTICAL METHODS I Prof. Elizabeth M. Role, Ph. D. Prof. Elizabeth M. Role, Ph. D.
Advertisements

CHAPTER 2 ORGANIZING AND GRAPHING DATA Prem Mann, Introductory Statistics, 8/E Copyright © 2013 John Wiley & Sons. All rights reserved.
Note 3 of 5E Statistics with Economics and Business Applications Chapter 2 Describing Sets of Data Descriptive Statistics - Tables and Graphs.
4. FREQUENCY DISTRIBUTION
2- 1 Chapter Two McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 8/E Copyright © 2013 John Wiley & Sons. All rights reserved.
CHAPTER 1: INTRODUCTION.
CHAPTER 2 ORGANIZING AND GRAPHING DATA. Opening Example.
QMS 6351 Statistics and Research Methods Chapter 2 Descriptive Statistics: Tabular and Graphical Methods Prof. Vera Adamchik.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Introduction to Educational Statistics
Statistics Data.
Chapter 2: Organizing Data STP 226: Elements of Statistics Jenifer Boshes Arizona State University.
2.1 Summarizing Qualitative Data  A graphic display can reveal at a glance the main characteristics of a data set.  Three types of graphs used to display.
SECTION 12-1 Visual Displays of Data Slide
Welcome to Data Analysis and Interpretation
Stat 1080 “Elementary Probability and Statistics” By Dr. AFRAH BOSSLY
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-1 Visual Displays of Data.
12.1 – Visual Displays of Data In statistics: A population includes all of the items of interest. A sample includes some of the items in the population.
Descriptive Statistics
Thinking Mathematically
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
2- 1 Chapter Two McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
2- 1 Chapter Two McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Data Presentation.
BUSINESS STATISTICS BQT 173
BUSINESS STATISTICS BQT 173. CHAPTER 1 : DATA & STATISTICS.
1 Course review, syllabus, etc. Chapter 1 – Introduction Chapter 2 – Graphical Techniques Quantitative Business Methods A First Course
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Lectures one and two What is Statistics? Lecturer: Dr. Madgerie Jameson
Basic Statistics  Statistics in Engineering  Collecting Engineering Data  Data Summary and Presentation  Probability Distributions - Discrete Probability.
ORGANIZING AND GRAPHING DATA
STATISTICS. Statistics * Statistics is the area of science that deals with collection, organization, analysis, and interpretation of data. * A collection.
Probability & Statistics
Subbulakshmi Murugappan H/P:
McGraw-Hill/ Irwin © The McGraw-Hill Companies, Inc., 2003 All Rights Reserved. 2-1 Chapter Two Describing Data: Frequency Distributions and Graphic Presentation.
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Prem Mann, Introductory Statistics, 6/E Copyright  2007 John Wiley & Sons. All rights reserved. SSF1063: Statistics for Social Sciences LU2: Graphing.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Copyright 2011 by W. H. Freeman and Company. All rights reserved.1 Introductory Statistics: A Problem-Solving Approach by Stephen Kokoska Chapter 2 Tables.
Stat 101Dr SaMeH1 Statistics (Stat 101) Associate Professor of Environmental Eng. Civil Engineering Department Engineering College Almajma’ah University.
CHAPTER 1 Basic Statistics Statistics in Engineering
ISTANBUL STOCK EXCHANGE (BIST) FELL 6 POINTS IN AVERAGE TODAY THE UNITED STATES DOLLAR (USD) APPRECIATED BY 4 PERCENT LAST WEEK AGAINST TURKISH LIRA (TRL).
Chapter 0: Why Study Statistics? Chapter 1: An Introduction to Statistics and Statistical Inference 1
Organizing and Graphing Qualitative Data
Biostatistics Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
 2012 Pearson Education, Inc. Slide Chapter 12 Statistics.
Stat 101Dr SaMeH1 Statistics (Stat 101) Associate Professor of Environmental Eng. Civil Engineering Department Engineering College Almajma’ah University.
ORGANIZING AND GRAPHING DATA
Chapter 12 Statistics 2012 Pearson Education, Inc.
Chapter 12 Statistics.
CHAPTER 2 : DESCRIPTIVE STATISTICS: TABULAR & GRAPHICAL PRESENTATION
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
Chapter 2: Methods for Describing Data Sets
ORGANIZING AND GRAPHING DATA
8.DATA DESCRIPTIVE.
ORGANIZING AND GRAPHING DATA
CHAPTER 5 Basic Statistics
Chapter 5 STATISTICS (PART 1).
Introduction Chapter 1.
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 8/E Copyright © 2013 John Wiley & Sons. All rights reserved.
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
THE STAGES FOR STATISTICAL THINKING ARE:
ORGANIZING AND GRAPHING DATA
Introduction to Probability and Statistics Thirteenth Edition
THE STAGES FOR STATISTICAL THINKING ARE:
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 8/E Copyright © 2013 John Wiley & Sons. All rights reserved.
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
Frequency Distribution and Graphs
Presentation transcript:

STATISTICS Silvia Cagnone Department of Statistics University of Bologna MIREES A cademic year 2008/2009

Readings  Mann P., “Introductory Statistics”, 6 th edition, John Wiley & Sons, INC., Chapters 1, 2, 3, 4.4, 4.5, 4.6, 11.3, , 13.1, , 13.4, 13.6  HyperStat Online Textbook (  Handouts and exercises download from Office hours by Exam Project work and oral exam

Aim of the course Basic concepts of the statistical method for the analysis and the interpretation of economic and social data. a. Univariate statistical analysis (one character) b. Bivariate statistical analysis (two characters) c. Brief introduction to the statistical inference Exercises in laboratory software Excel

INTRODUCTION TO STATISTICS STATISTICS

What is statistics? Two meanings: 1. Common usage Numerical facts (e.g the age of a student, the income of a family, the starting salary of a typical college graduate, etc.). 2. Field or discipline of study Statistics is a group of methods used to collect, analyze, present, and interpret data and to make decisions Statistics is a group of methods used to collect, analyze, present, and interpret data and to make decisions.

What is statistics? Statistics has two aspects: 1.theoretical or mathematical statistics deals with the development, derivation and proof of statistical theorems, formulas, rules and laws; 2.applied statistics involves the applications of those theorems, formulas, rules and laws to solve real-world problems.

Why statistics? We use statistics when we need methods for We use statistics when we need methods for extracting information from observed or collected data extracting information from observed or collected data to obtain a deeper understanding from numbers about to obtain a deeper understanding from numbers about the situations they represent. Even professional statisticians have trouble understanding a data set (= a collection of numerical information) by merely looking at it. Even professional statisticians have trouble understanding a data set (= a collection of numerical information) by merely looking at it. Statistics and data analysis provide methods that can help in the understanding of nearly every field of human experience. Statistics and data analysis provide methods that can help in the understanding of nearly every field of human experience.

Why statistics? What we can say about this data set by merely looking at it???? Data set of individuals of Cambridge. Information on Sex, Height, Weight, Age and Smoking status.

Types of statistics  Descriptive statistics: consists of methods for organizing, displaying, and describing data by using tables, graphs, and summary measures.  Inferential Statistics: consists of methods that use sample results to help make decisions or predictions about a population. Population (target): the collection of all elements of interest Sample: the selection of a few elements from this population Probability acts as a link between descriptive and inferential statistics.

Descriptive statistics: example A sample of 30 employees from large companies was selected, and these employees were asked how stressful their jobs were. The responses of these employees are recorded next where Very represents Very stressful, Somewhat means Somewhat stressful, and None stands for Not stressful at all.

Inferential statistics: example 1. We may make some decisions about the political views of a college and university students based on political views of 1000 students selected from a few colleges and universities. 2. We may want to find the starting salary of a typical college graduate. To do so, we may select 2000 recent college graduates, find their starting salaries, and make a decision based on this information.

DESCRIPTIVE STATISTICS

Basic terms 1. ELEMENT or MEMBER or UNIT Specific subject or object (for example, a person, firm, item, state, or country) about which the information is collected 2. VARIABLE Characteristic under study that assumes different values for different elements. In contrast to a variable, the value of a CONSTANT is fixed. 3. OBSERVATION or MEASUREMENT The value of a variable for an element.

Basic terms: example Company 2001 Sales (millions of dollars) Wal-Mart Stores IBM General Motors Dell Computer Procter & Gamble JC Penney Home Depot 217,79985,866177,26031,16839,26232,00453,553 Element/ Member/Unit Variable Observation/ Measurement Data set: 2001 Sales of Seven U.S. Companies

Types of variables 1. QUANTITATIVE VARIABLE A variable that can be measured numerically. The data collected on a quantitative variable are called quantitative data. Examples: Incomes, heights, prices of homes, etc. DISCRETE VARIABLE DISCRETE VARIABLE A variable whose values are countable. A discrete variable can assume only certain values with no intermediate values. (e.g. nr students in a class, nr components of a family, births in Forlì in 2007, etc.) CONTINUOUS VARIABLE CONTINUOUS VARIABLE A variable that can assume any numerical value over a certain interval or intervals. (e.g. age, weight, height, time to get to the school, etc.) A variable that can assume any numerical value over a certain interval or intervals. (e.g. age, weight, height, time to get to the school, etc.)

Types of variables 2. QUALITATIVE or CATEGORICAL VARIABLE A variable that cannot assume a numerical value but can be classified into two or more nonnumeric categories. The data collected on such a variable are called qualitative data. A variable that cannot assume a numerical value but can be classified into two or more nonnumeric categories. The data collected on such a variable are called qualitative data. Examples: hair color, gender, etc.

Summation notation The summation operator   is a mathematical notation used to denote the sum of values. For example, suppose a sample consists of 5 books and their prices are $25, $60, $37, $53 and $16. If we denote the variable price of a book by X, we have: x 1 (price of the first book) = $25; x 2 (price of the second book) = $60; … x 5 (price of the fifth book) = $16; Now, suppose we want to add the prices of all five books: x 1 + x 2 + x 3 + x 4 + x 5 = = $191 or, briefly,  x i = x 1 + x 2 + x 3 + x 4 + x 5 = = $191

Summation notation Example  Annual salaries (in thousands of dollars) of four workers are 75, 42, 125, and 61. Find a) a) ∑x b) b) (∑x)² c) c) ∑x² Solution a) a) ∑x = x 1 + x 2 + x 3 + x 4 = = 303 b) b) (∑x)² = ( )² =(303)² = c) c) ∑x² = (75)² + (42)² + (125)² + (61)² = , =

ORGANIZING DATA

Raw data When data are collected, the information obtained from each member of a population or a sample is recoded in the sequence in which it becomes available. This sequence of data is called raw data. RAW DATA Data recorded in the sequence in which they are collected. These data are also called ungrouped data, because they contain information on each member of a sample or population individually.

Raw data: Examples Ages of 50 students Gender of 50 students MFMMFMFMMF FFFMFFFFMF MMMMFMMMMF MFMMMMFMMM FFMMMFFMMM MFFMFMFFMF FMMMFFMMMF FMFMMFMFMM FMMFMFMMFM FFMMMFFMMM

Organizing and Graphing Qualitative Data  FREQUENCY DISTRIBUTION  A frequency distribution for qualitative data lists all categories and the number of elements that belong to each of the categories. Example Frequency Variable Category

Frequency distribution How can we obtain a frequency distribution? Example A sample of 30 employees from large companies was selected, and these employees were asked how stressful their jobs were. The responses of these employees are recorded next where Very represents Very stressful, Somewhat means Somewhat stressful, and None stands for Not stressful at all. Raw data

Frequency distribution Stress on Job Tally Tally Frequency (n i ) VerySomewhatNone ||||| ||||| ||||| ||||| |||| ||||| | Sum =  n i = 30

Relative frequency and percentage distributions Relative frequency of a category: it is obtained by dividing the frequency of the category by the sum of all frequencies Percentage of a category: it is obtained by multiplying the relative frequency of the category by 100 Percentage = (Relative frequency) * 100 = f i * 100

Relative frequency and percentage distributions:example Stress on Job ( )Frequency ( n i ) Very Somewhat None  n i = 30 = n Stress on Job ( ) Relative Frequency (f i )Percentage Very Somewhat None 10/30 = /30 =.467 6/30 = (100) = (100) = (100) = 20.0  f i = 1.00 Sum = 100

Graphical presentation of qualitative data A graphic display can reveal at a glance the main characteristics of a data set. The bar graph and the pie chart are two types of graphs used to display qualitative data.

Bar graph A graph made of bars whose heights represent the frequencies of respective categories. Stress on JobFrequency ( n i ) Very Somewhat None  n i = 30 = N The categories are on the horizontal axis and all these categories are represented by intervals of the same width. We mark the frequencies on the vertical axis and their heights represent the frequency of the corresponding category. We leave a small gap between adjacent bars.

Bar graph for percentages The bar graphs for relative frequency and percentage distributions can be drawn simply by marking the relative frequencies or percentages, instead of the frequencies, on the vertical axis. Stress on Job Percentage Very Somewhat None

Pie charts A circle divided into portions that represent the relative frequencies or percentages of a population or a sample belonging to different categories.

Organizing and Graphing Quantitative Data Often for quantitative data with a large number of different values, it is appropriated to prepare a frequency distribution based on classes Often for quantitative data with a large number of different values, it is appropriated to prepare a frequency distribution based on classes. Example: frequency distribution Weekly earning of 120 employees of a large company Weekly earning of 120 employees of a large company Weekly Earnings (dollars) Number of Employees 400 -| | | | | | Variable Third class Frequency column Frequency of the third class Lower limit of the sixth class Upper limit of the sixth class

Relative frequency and percentage distribution Weekly Earnings (dollars) Number of Employees n Relative frequency f Percentage 400 -| | | | | | /120 = /120 = /120 = /120 = /120 = /120 = *100 = *100 = *100 = *100 = *100 = *100 = 5.0  n = 120  f = 1 100

Cumulative frequency A cumulative frequency is the total number of values that fall below a certain value. To obtain the cumulative frequency of a class, we add the frequency of that class to the frequencies of all preceding classes. Weekly Earnings (dollars) Number of Employees n Cumulative frequency 400 -| | | | | | = = = = = 120  n = 120

Graphical presentation of quantitative data Quantitative data can be displayed mainly in a histogram. Actually, we can also draw a pie chart to display the percentage distribution for a quantitative data set. The procedure to construct a pie chart is similar to the one for qualitative data.

Histogram A graph in which classes are marked on the horizontal axis and the frequencies, relative frequencies, or percentages are marked on the vertical axis. The frequencies, relative frequencies, or percentages are represented by the heights of the bars. In a histogram, the bars are drawn adjacent to each other, to underline the continuity of the quantitative data.

Example histogram Weekly Earnings (dollars) Number of Employees Percentage 400 -| | | | | |