By James Miller et.all. Presented by Siv Hilde Houmb 1 November 2002

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Hypothesis testing Another judgment method of sampling data.
Hypothesis Testing Steps in Hypothesis Testing:
Statistical Issues in Research Planning and Evaluation
Statistical Decision Making
Inferential Statistics & Hypothesis Testing
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 11: Power.
Today Concepts underlying inferential statistics
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
Richard M. Jacobs, OSA, Ph.D.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Choosing Statistical Procedures
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Chapter Nine: Evaluating Results from Samples Review of concepts of testing a null hypothesis. Test statistic and its null distribution Type I and Type.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Overview of Statistical Hypothesis Testing: The z-Test
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Hypothesis testing is used to make decisions concerning the value of a parameter.
Hypothesis Testing.
Estimation and Hypothesis Testing Now the real fun begins.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
Chapter 8 Introduction to Hypothesis Testing
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 Lecture note 4 Hypothesis Testing Significant Difference ©
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Statistical Power The power of a test is the probability of detecting a difference or relationship if such a difference or relationship really exists.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Academic Research Academic Research Dr Kishor Bhanushali M
Education 793 Class Notes Decisions, Error and Power Presentation 8.
Math 4030 – 9a Introduction to Hypothesis Testing
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Introduction to hypothesis testing Hypothesis testing is about making decisions Is a hypothesis true or false? Ex. Are women paid less, on average, than.
© Copyright McGraw-Hill 2004
Formulating the Hypothesis null hypothesis 4 The null hypothesis is a statement about the population value that will be tested. null hypothesis 4 The null.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Course Overview Collecting Data Exploring Data Probability Intro. Inference Comparing Variables Relationships between Variables Means/Variances Proportions.
Testing Differences in Means (t-tests) Dr. Richard Jackson © Mercer University 2005 All Rights Reserved.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Math 4030 – 9b Introduction to Hypothesis Testing
Understanding Results
Hypothesis Tests: One Sample
CONCEPTS OF HYPOTHESIS TESTING
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter 3 Probability Sampling Theory Hypothesis Testing.
What are their purposes? What kinds?
More About Tests Notes from
Power and Sample Size I HAVE THE POWER!!! Boulder 2006 Benjamin Neale.
Testing Hypotheses I Lesson 9.
Power Problems.
Inference as Decision Section 10.4.
Statistical Power.
Rest of lecture 4 (Chapter 5: pg ) Statistical Inferences
Presentation transcript:

By James Miller et.all. Presented by Siv Hilde Houmb 1 November 2002 Statistical Power and its subcomponents –missing an misunderstood concepts in Software Engineering Empirical Research By James Miller et.all. Presented by Siv Hilde Houmb 1 November 2002

Outline Statistical Power Analysis Statistical significance testing Direction/Non-direction of statistical test Parametric and Non-parametric tests Calculating Statistical Power Normative and judgment effect estimation approach

Introduction The authors informal review of the software engineering empirical literature failed to find many articles which report the statistical power of the described experiment. Why do we need it -> The power to identify false hypothesis

Statistical Power Analysis Statistical power analysis is a method of increasing the probability that an effect is found in the empirical study Reject or accept a null hypothesis (denoted H0) H1 : alternative hypothesis Statistical power is defined as the probability that a statistical test will correctly reject a false null hypothesis. Eks. A power level of 0.4 means that if an experiment is run ten times, an existing effect will be discovered only four times out of ten.

Statistical Power Analysis cont. High power means that if an effect exists there is a high probability that it will be found. And if an effect does not exists, you have a solid statistical argument for accepting the null hypothesis.

Statistical significance testing Concerns with controlling the “fate-luck” Should have a power level of 0.8

Direction/Non-direction of statistical test One tailed and two tailed tests Direction – one tailed Eks: phenomenon exists if A is larger than B Non-drection – two tailed Eks: phenomenon exists if A and B differs

Parametric and Non-parametric tests A parametric statistical test requires the estimation of one or more population parameters Example: an estimate of the difference in the average between the first and the second populations A non-parametric test does not involve estimation of a specific parameter Example: provides you with an estimate of P[X>Y], probability that a randomly selected patient from your first population has a larger value than a randomly selected patient from the second population

Parametric and Non-parametric tests Advantage of non-parametric test Do not require the sample population to be normally distributes Disadvantage Non-parametric test do not have the same statistical power as parametric test do Non-parametric test should only be used when substantial non-normality of the sample is believed to exist or when one wishes to be particularly conservative on the side of Type I errors.

Calculating Statistical Power The significance criterion ( ): the chosen risk of committing a Type I error, that is the probability of incorrectly rejecting the null hypothesis (H0), when performing significance testing. The sample size (N): number of subjects (as large as possible) The effect size ( ): the degree to which the phenomenon under study is present in the population.

Calculating Statistical Power cont. Significance criterion:  Sample size: N Effect size:  For comparison of two means, the most request used

The significance criterion Type I error: the probability of incorrectly rejecting the null hypothesis (H0),  Type II error: the probability of incorrectly accepting the null hypothesis (H0),  Probability of correctly rejecting the null hypothesis: 1-  Relationship between  and  (Type I and Type II error) is / = x, which means that a false rejection of H0 is x times more serious than erroneously accepting it.

The sample size N Given the effect size and the significance criterion as constant, the power level of the test is directly dependent upon the sample size.. As N increases, the probability of error decreases, thus greater the precision and the higher the chance of rejecting the false null hypothesis.

Effect size  The effect size is the degree to which the phenomenon under study is present in the population. The larger the effect size the greater the degree a phenomenon is likely to be detected and the null hypothesis to be rejected.

Evaluating effect size Normative: rely on other related empirical studies or the establishment of an empirical norm for subject of experiment Normative approach are used when conducting a replicating study in ESE since this is a fairly young research field Expert judgement: rely on the experts providing the estimate Guesswork

Expert Judgment The judgmental approach to the estimation and evaluation of effect sizes can simply be regarded as a consensus opinion of experts within the field of experimentation The difficulty of this task within SE is that the experts will often not fully understand the concepts of significance and effect size and hence their opinion may only address these concepts in a relatively indirect manner

Expert Judgment cont. Experts do also have problems with providing quantitative opinion and one then need to transfer the qualitative opinion into a quantitative value Opinion can be collected using Formal structured interviews Formal questionnaires survey