Statistics made easy Basic principles. Statistics made easy Assume we are interested in getting some information about the average height of people in.

Slides:



Advertisements
Similar presentations
Statistical Sampling.
Advertisements

Exam One Review Quiz Psy302 Quantitative Methods.
10-3 Inferences.
Covariance and Correlation: Estimator/Sample Statistic: Population Parameter: Covariance and correlation measure linear association between two variables,
Chapter 7 Introduction to Sampling Distributions
1 Analysis of Variance This technique is designed to test the null hypothesis that three or more group means are equal.
PSY 307 – Statistics for the Behavioral Sciences
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
The Simple Regression Model
Measures of Variability or Dispersion
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
BHS Methods in Behavioral Sciences I
Transforms What does the word transform mean?. Transforms What does the word transform mean? –Changing something into another thing.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
Measurement, Quantification and Analysis Some Basic Principles.
Correlation and Regression Analysis
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 6 Sampling and Sampling.
Quiz 2 Measures of central tendency Measures of variability.
Sampling Distributions Chapter 9 First, a word from our textbook A statistic is a numerical value computed from a sample. EX. Mean, median, mode, etc.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Correlation.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
Standard Deviation!. Let’s say we randomly select 9 men and 9 women and ask their GPAs and get these data: MENWOMEN
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
1 Theoretical Physics Experimental Physics Equipment, Observation Gambling: Cards, Dice Fast PCs Random- number generators Monte- Carlo methods Experimental.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Mean Variance Standard Deviation
Chapter 3 Basic Statistics Section 2.2: Measures of Variability.
Measures of Variability Objective: Students should know what a variance and standard deviation are and for what type of data they typically used.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Hypothesis of Association: Correlation
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
1 The  2 test Sections 19.1 and 19.2 of Howell This section actually includes 2 totally separate tests goodness-of-fit test contingency table analysis.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
6.3 THE CENTRAL LIMIT THEOREM. DISTRIBUTION OF SAMPLE MEANS  A sampling distribution of sample means is a distribution using the means computed from.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Numerical Measures of Variability
3 common measures of dispersion or variability Range Range Variance Variance Standard Deviation Standard Deviation.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
ECON 338/ENVR 305 CLICKER QUESTIONS Statistics – Question Set #8 (from Chapter 10)
© Buddy Freeman, 2015 Let X and Y be two normally distributed random variables satisfying the equality of variance assumption both ways. For clarity let.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
1 ECE310 – Lecture 22 Random Signal Analysis 04/25/01.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Practice Page 128 –#6.7 –#6.8 Practice Page 128 –#6.7 =.0668 = test scores are normally distributed –#6.8 a =.0832 b =.2912 c =.4778.
Descriptive Statistics
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
Introduction Dispersion 1 Central Tendency alone does not explain the observations fully as it does reveal the degree of spread or variability of individual.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
1. According to ______ the larger the sample, the closer the sample mean is to the population mean. (p. 251) Murphy’s law the law of large numbers the.
Univariate Statistics
Sampling Distributions
Statistical Process Control
Chapter 3D Chapter 3, part D Fall 2000.
Chapter 6: Becoming Acquainted with Statistical Concepts
Presentation transcript:

Statistics made easy Basic principles

Statistics made easy Assume we are interested in getting some information about the average height of people in this room Assume we are interested in getting some information about the average height of people in this room One basic information would be provided by the mean : adding individual heights and divide it by the number of people One basic information would be provided by the mean : adding individual heights and divide it by the number of people The mean gives us some idea about the average height, but how much does it really say? The mean gives us some idea about the average height, but how much does it really say?

Mean Which information can be derived by the mean of the class? Which information can be derived by the mean of the class? Can we generalize this information? Can we generalize this information? What sort of generalization can we really made? What sort of generalization can we really made? This group is not truly a random sample. This group is not truly a random sample. It is predominant male but there is a percentage of women. It is predominant male but there is a percentage of women. It is multi-cultural. It is multi-cultural.

How statistics can go wrong Statistics are often used in “pop science”. Statistics are often used in “pop science”. The idea is that we select a representative sample of a population, perform some tests and then convert the results into numbers and perform statistics, i.e. calculating the mean. The idea is that we select a representative sample of a population, perform some tests and then convert the results into numbers and perform statistics, i.e. calculating the mean. How can we be sure that the sample does represent the behavior of the population? How can we be sure that the sample does represent the behavior of the population? How do we interpret the mean? How do we interpret the mean?

Second order Statistics Even if the sample is representative of the population, is the mean (first order statistics) enough to provide information? Even if the sample is representative of the population, is the mean (first order statistics) enough to provide information? Think of i.e. certain countries where there is a social polarization, i.e. the rich are very rich, the poor are very poor and there is no middle class. Think of i.e. certain countries where there is a social polarization, i.e. the rich are very rich, the poor are very poor and there is no middle class. This is why we introduce the meaning of variance. This is why we introduce the meaning of variance.

Variance and Standard Deviation Variance: Variance: When variance is computed over a sample: denominator N-1 When variance is computed over a sample: denominator N-1 Standard Deviation: the square root of variance Standard Deviation: the square root of variance

The so-called “normal” distribution

Correlation Second order statistics Second order statistics Assume that we have two observed variables. Correlation reveals the degree of linear relationship between them. Assume that we have two observed variables. Correlation reveals the degree of linear relationship between them. The correlation coefficient may take on any value between plus and minus one. The correlation coefficient may take on any value between plus and minus one.

Matlab functions Mean : mean(x), x: a vector or matrix Mean : mean(x), x: a vector or matrix Standard deviation: std(x) Standard deviation: std(x) Correlation coefficients: corrcoef(x,y), where x and y two column vectors representing the observations Correlation coefficients: corrcoef(x,y), where x and y two column vectors representing the observations

Matlab examples (1) X=[0:10]; mean(X) X=[0:10]; mean(X) ans = 5 std(X) std(X) ans = Y=5*X; corrcoef(x,y) Y=5*X; corrcoef(x,y) ans =

Matlab examples (2) Y=10*x; corrcoef(X,Y) Y=10*x; corrcoef(X,Y) ans = Y=10*(x.^5); corrcoef(X,Y) Y=10*(x.^5); corrcoef(X,Y) ans =

Matlab examples (3) Y=10*(X.^10); corrcoef(X,Y) Y=10*(X.^10); corrcoef(X,Y) ans = Y=-10*X; corrcoef(X,Y) Y=-10*X; corrcoef(X,Y) ans =