Lecture 1 Fadzly, N Normality and data transformation + Non-parametric tests.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

Exam One Review Quiz Psy302 Quantitative Methods.
Copyright © Allyn & Bacon (2007) Statistical Analysis of Data Graziano and Raulin Research Methods: Chapter 5 This multimedia product and its contents.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Statistics. The usual course of events for conducting scientific work “The Scientific Method” Reformulate or extend hypothesis Develop a Working Hypothesis.
Statistics for the Social Sciences
Some Basic Concepts Schaum's Outline of Elements of Statistics I: Descriptive Statistics & Probability Chuck Tappert and Allen Stix School of Computer.
Final Review Session.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Data Transformation Data conversion Changing the original form of the data to a new format More appropriate data analysis New.
Business Statistics - QBM117 Statistical inference for regression.
Central Tendency and Variability
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Today: Central Tendency & Dispersion
Introduction to Statistical Methods By Tom Methven Digital slides and tools available at:
AM Recitation 2/10/11.
PPA 501 – A NALYTICAL M ETHODS IN A DMINISTRATION Lecture 3b – Fundamentals of Quantitative Research.
Statistics for the Behavioral Sciences Second Edition Chapter 4: Central Tendency and Variability iClicker Questions Copyright © 2012 by Worth Publishers.
Psychometrics.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Biostatistics: Measures of Central Tendency and Variance in Medical Laboratory Settings Module 5 1.
The Gaussian (Normal) Distribution: More Details & Some Applications.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Measures of Variability Objective: Students should know what a variance and standard deviation are and for what type of data they typically used.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Yes….. I expect you to read it! Maybe more than once!
Unit 7 Section : Confidence Intervals for the Mean (σ is unknown)  When the population standard deviation is unknown and our sample is less than.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
1 Review Sections Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
Measures of Central Tendency: The Mean, Median, and Mode
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
Chapter 11 Lecture Research Techniques: For the Health Sciences Fifth Edition Analyzing and Interpreting Data: Descriptive Analysis R. Eric Heidel, PhD.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Introduction to statistics I Sophia King Rm. P24 HWB
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
 Assumptions are an essential part of statistics and the process of building and testing models.  There are many different assumptions across the range.
Environmental Modeling Basic Testing Methods - Statistics II.
Chapter Eleven Sample Size Determination Chapter Eleven.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
 Normal Curves  The family of normal curves  The rule of  The Central Limit Theorem  Confidence Intervals  Around a Mean  Around a Proportion.
Review Day 2 May 4 th Probability Events are independent if the outcome of one event does not influence the outcome of any other event Events are.
Construction Engineering 221 Probability and statistics Normal Distribution.
The symmetry statistic
Chapter 3 INTERVAL ESTIMATES
Cross Tabulation with Chi Square
Advanced Quantitative Techniques
Statistical Methods Michael J. Watts
Different Types of Data
Populations.
Lecture Nine - Twelve Tests of Significance.
Chapter 3 INTERVAL ESTIMATES
Statistical Methods Michael J. Watts
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Frequency and Distribution
Introduction to Summary Statistics
Being Descriptive Without Being Mean
Physics 114: Exam 2 Review Material from Weeks 7-11
Theme 5 Standard Deviations and Distributions
Quantitative Methods in HPELS HPELS 6210
CHAPTER 7 Sampling Distributions
CHAPTER 7 Sampling Distributions
Sampling Distributions (§ )
CHAPTER 7 Sampling Distributions
Presentation transcript:

Lecture 1 Fadzly, N Normality and data transformation + Non-parametric tests

Are our data “normal”? Many parametric statistical tests assumes that our gathered data comes from a normally distributed population However in reality..that is not the case

Are our data “normal”? Usually, one have to sample laboriously, extensively, and have a large quantity of data - remember the Central Limit Theorem Most data will be distribution free (applicable for non- parametric techniques) Counts data for example in theory can never be normal Certain nominal data – for example age class or sex, are usually multimodal

How to check for normality? The Hard Way Gaussian equation Goodness-of –Fit test Kolgomorov-Smirnov Shapiro-Wilk The easy way Simply plot out the data/histogram….see whether they look normal General rule of thumb-calculate the mean and standard s deviation, if 70 % of the observation fall within the interval of

example xf = 8.6; s =1.5 is 7.1 to 10.1, which contains 7/10 observation-sample normal. So the sample is normal.

My general rules: If I don’t mention anything, assume its normal If I mention that its not normal…then its not normal If I mention that its not normal, but I want it normalised…then normalise the data! (Data transformation) However, I sometime test/ trick the students…watch out for keywords like median, average, ordinal

Data transformation Why do we need to transform data? To satisfy the distributional parameter requirements for certain types of parametric tests (which requires all to be normally distributed) Easier to be explained visually

Option 1 : Square root The most easiest method Every non-negative real number x has a unique non-negative square root, called the principal square root Appropriate when the variance of a sample of count data is about equal to the mean, or similar to the Poisson distribution X is transformed into If there are zero counts, replace observation by

Option 2: Logarithmic transformation Appropriate when the variance of a sample of a count data is larger than the mean. x is transformed into log x

Option 3: The Arcsinh Appropriate when there are observations of zero in the count data. Arcsinh/Arcshine Using scientific calculator, set the mode to radians (rad). Enter the value, press inv key, followed by hyp then sin key. Or on certain casio models- press shift and press sin (on the display should show sin -1 The answer will be in degree 0 or radians.

Data transformation Is it compulsory? Depends on the researcher. The decision to use parametric or non-parametric tests. Data transformation simply refers to change a not normal curve into a bell shape normal curve Caution- data is lost through transformation! Do not try multi-level transformation.