Lecture 9 Chapter 5. Non-Normal Populations. 5.1 Introduction  Throughout the course ( in Chapters 2, 3 and 4) we have focussed on data which we can.

Slides:



Advertisements
Similar presentations
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 16 l Nonparametrics: Testing with Ordinal Data or Nonnormal Distributions.
Advertisements

Chapter 12 Probability © 2008 Pearson Addison-Wesley. All rights reserved.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
CHAPTER 13: Binomial Distributions
Chapter 13: Inference for Distributions of Categorical Data
Topic 6: Introduction to Hypothesis Testing
1 Binomial Probability Distribution Here we study a special discrete PD (PD will stand for Probability Distribution) known as the Binomial PD.
1 Binomial Probability Distribution Here we study a special discrete PD (PD will stand for Probability Distribution) known as the Binomial PD.
Chapter 5: Probability Concepts
Point and Confidence Interval Estimation of a Population Proportion, p
Chapter 4 Probability Distributions
MARE 250 Dr. Jason Turner Hypothesis Testing III.
PSY 1950 Nonparametric Statistics November 24, 2008.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Parameters and Statistics Probabilities The Binomial Probability Test.
Class notes for ISE 201 San Jose State University
Lecture Slides Elementary Statistics Twelfth Edition
Section 9.3 Testing a Proportion p. 2 Focus Points Identify the components needed for testing a proportion. Compute the sample test statistic. Find the.
Slide 1 Statistics Workshop Tutorial 7 Discrete Random Variables Binomial Distributions.
5-3 Binomial Probability Distributions
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 14: Non-parametric tests Marshall University Genomics.
Non-parametric statistics
Sample Size Determination Ziad Taib March 7, 2014.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Nonparametric or Distribution-free Tests
Copyright © 2012 by Nelson Education Limited. Chapter 8 Hypothesis Testing II: The Two-Sample Case 8-1.
Education 793 Class Notes T-tests 29 October 2003.
More About Significance Tests
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8 Continuous.
Inference for a Single Population Proportion (p).
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Slide 1 Copyright © 2004 Pearson Education, Inc..
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Section 6.2: How Can We Find Probabilities When Each Observation Has Two Possible Outcomes? 1.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Ordinally Scale Variables
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 5-2 Random Variables.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Nonparametric Tests IPS Chapter 15 © 2009 W.H. Freeman and Company.
Section 8-5 Testing a Claim about a Mean: σ Not Known.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
4.2 Binomial Distributions
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
AP Statistics Section 11.1 B More on Significance Tests.
Chapter 13: Inferences about Comparing Two Populations Lecture 8b Date: 15 th November 2015 Instructor: Naveen Abedin.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Binomial Probability. Features of a Binomial Experiment 1. There are a fixed number of trials. We denote this number by the letter n.
Binomial Distribution and Applications. Binomial Probability Distribution A binomial random variable X is defined to the number of “successes” in n independent.
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions 5-4 Mean, Variance and Standard Deviation.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
 Kolmogor-Smirnov test  Mann-Whitney U test  Wilcoxon test  Kruskal-Wallis  Friedman test  Cochran Q test.
Inferential Statistics Assoc. Prof. Dr. Şehnaz Şahinkarakaş.
Chapter 10: The t Test For Two Independent Samples.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions.
Inference for a Single Population Proportion (p)
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
CHAPTER 6 Random Variables
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Copyright © Cengage Learning. All rights reserved.
Chapter 13: Inference for Distributions of Categorical Data
Chapter 5: Sampling Distributions
Chapter 11 Probability.
Presentation transcript:

Lecture 9 Chapter 5. Non-Normal Populations

5.1 Introduction  Throughout the course ( in Chapters 2, 3 and 4) we have focussed on data which we can assume comes from the Normal distribution.  However, some experiments give results that cannot sensibly be modelled by a Normal distribution.  In some cases this is because the distribution just has a different shape. Other times, the data are of a completely different type, e.g. categories rather than numbers.

5.2 Non-parametric methods We first consider the situation where our data are continuous but may not be Normally distributed, and in fact we do not know what distribution might be appropriate. In these cases, the methods that we have studied so far in this course, t-tests, ANOVA etc. are not appropriate, and must be replaced by tests which do not assume the Normal distribution, or indeed any other distribution. In these cases, the methods that we have studied so far in this course, t-tests, ANOVA etc. are not appropriate, and must be replaced by tests which do not assume the Normal distribution, or indeed any other distribution.

Methods which do not assume the data come from any distribution are called distribution- free, or non-parametric. Example The speech of two groups of speech-impaired children is assessed following two different programmes of treatment: Group A: Active Speech Therapy Group B: Conversation Sessions

The following data are scores on a scale in which higher values represent greater difficulty in speaking. Group A Group B Let’s look at histograms…

If we could assume these data to be normally distributed, we could use a two-sample t-test. However, this assumption is difficult to justify here. So, we use the appropriate non-parametric test for comparing two independent samples, which is called the Mann-Whitney test. The details of how to do the calculations for this are not necessary here. We omit them and go straight to the implementation in Minitab using the command: Stat>Nonparametrics>Mann-Whitney... For our data, we get p = This is significant at the 5% level, so we have evidence for a difference in the two treatments. Active speech therapy appears to be more effective than conversation sessions.

5.3 When the Data are counts: a) The Binomial Distribution We now consider a different kind of data altogether. Instead of numbers measured on a continuous scale, we consider situations where our data are counts of different kinds. In this section we consider what happens when we count the successes from a number of trials. The Binomial distribution is used to model the number of successes in a series of n independent trials, where each trial results in either a ‘success’ or a ‘failure’.

Let’s first see how this works. Example A drug is known to be 80% effective, i.e. the probability that each person with the disease will be cured is 0.8. Suppose four people with the disease are given the drug. What is the probability distribution for the number of people cured?

Notation Let X = number of people cured. Let s denote a success. Let f denote a failure. Consider a typical outcome of the experiment for the four people, e.g. that the first two are cured, and the second two are not. We would write this outcome: s s f f. Since each person is cured (or not cured) independently, we can calculate the probability of this outcome as Pr (s s f f) = Pr(s) x Pr(s) x Pr(f) x Pr(f) = 0.8 x 0.8 x 0.2 x 0.2 =

We could do similar calculations for all of the possible outcomes: We could do similar calculations for all of the possible outcomes: OutcomeProbability OutcomeProbability 1. ssss0.8 x 0.8 x 0.8 x 0.8 = ssss0.8 x 0.8 x 0.8 x 0.8 = sssfetc. 2. sssfetc. 3. ssfs 3. ssfs 4. ssff0.8 x 0.8 x 0.2 x 0.2 = ssff0.8 x 0.8 x 0.2 x 0.2 = sfssetc. 5. sfssetc. 6. sfsf0.8 x 0.2 x 0.8 x 0.2 = sfsf0.8 x 0.2 x 0.8 x 0.2 = sffs0.8 x 0.2 x 0.2 x 0.8 = sffs0.8 x 0.2 x 0.2 x 0.8 = sfffetc. 8. sfffetc. 9. fsss 9. fsss 10. fssf0.2 x 0.8 x 0.8 x 0.2 = fsfs0.2 x 0.8 x 0.2 x 0.8 = fsffetc. 13. ffss0.2 x 0.2 x 0.8 x0.8 = ffsfetc. 15. fffs 16. ffff0.2 x 0.2 x 0.2 x 0.2 =

Now suppose we want to know the probability that exactly two of the four patients are cured (not necessarily the first two), i.e. Pr (X=2). We can obtain this probability by adding up the probabilities for all of the outcomes in the table for which X=2. There are six of these, i.e. outcomes 4, 6, 7, 10, 11 and 13. Each of these outcomes has probability So: Pr (X = 2) = 6 x =

We can do similar calculations to obtain: Pr (X = 4) = Pr (X = 3) = Pr (X = 1) = Pr (X = 0) = In practice we get Minitab to do the calculations.