1 Measurement Theory Ch 3 in Kan Steve Chenoweth, RHIT.

Slides:



Advertisements
Similar presentations
Conceptualization and Measurement
Advertisements

Reliability and Validity
1 Reliability in Scales Reliability is a question of consistency do we get the same numbers on repeated measurements? Low reliability: reaction time High.
 A description of the ways a research will observe and measure a variable, so called because it specifies the operations that will be taken into account.
Measurement Reliability and Validity
1 Defect Removal Effectiveness Kan ch 6 Steve Chenoweth, RHIT Left – Some defects are easier to remove than others. This is the cruise ship Costa Concordia,
Validity In our last class, we began to discuss some of the ways in which we can assess the quality of our measurements. We discussed the concept of reliability.
G. Alonso, D. Kossmann Systems Group
HUDM4122 Probability and Statistical Inference March 30, 2015.
Reliability, the Properties of Random Errors, and Composite Scores.
1 Exponential Distribution and Reliability Growth Models Kan Ch 8 Steve Chenoweth, RHIT Right: Wait – I always thought “exponential growth” was like this!
Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules.
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
Research Hypotheses and Multiple Regression Kinds of multiple regression questions Ways of forming reduced models Comparing “nested” models Comparing “non-nested”
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Measurement Fundamentals
Variables cont. Psych 231: Research Methods in Psychology.
Swami NatarajanJuly 12, 2015 RIT Software Engineering Measurement Fundamentals.
Bio (“life”) + logy (“study of”) Scientific study of life (pg. 4)
1 Software Quality Metrics Ch 4 in Kan Steve Chenoweth, RHIT What do you measure?
1 Quality Management Models Kan Ch 9 Steve Chenoweth, RHIT Right – To keep in mind – For Kan, this is part of Total Quality Management.
Determining the Size of
Kan Ch 7 Steve Chenoweth, RHIT
Business Research Method Measurement, Scaling, Reliability, Validity
1 KAN’S INTRO AND OVERVIEW MODELS Ch1 & 2 in his book Steve Chenoweth, CSSE.
Hypothesis Testing:.
Significance Testing 10/15/2013. Readings Chapter 3 Proposing Explanations, Framing Hypotheses, and Making Comparisons (Pollock) (pp ) Chapter 5.
Variation, Validity, & Variables Lesson 3. Research Methods & Statistics n Integral relationship l Must consider both during planning n Research Methods.
Measurement in Exercise and Sport Psychology Research EPHE 348.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Technical Adequacy Session One Part Three.
Evidence Based Medicine
Association between 2 variables
DAY 2: THE SCIENTIFIC METHOD Lakki Chandrasekaran August 19,
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Mr. Boucher. 1 – What is science A – a methodology of thinking B – a way of researching the physical world C – a body of knowledge 2 – Only with all three.
Fundamentals of Measurement Theory. Measurement  Measurement is crucial to the progress of all sciences.  Scientific progress is made through observations.
Normal Distr Practice Major League baseball attendance in 2011 averaged 30,000 with a standard deviation of 6,000. i. What percentage of teams had between.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Reliability, the Properties of Random Errors, and Composite Scores Week 7, Psych R. Chris Fraley
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Chapter 2: Behavioral Variability and Research Variability and Research 1. Behavioral science involves the study of variability in behavior how and why.
Bell Ringer Using female = 0 and male = 1, calculate the average maleness in this classroom.
 The point estimators of population parameters ( and in our case) are random variables and they follow a normal distribution. Their expected values are.
Digression - Hypotheses Many research designs involve statistical tests – involve accepting or rejecting a hypothesis Null (statistical) hypotheses assume.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Cmpe 589 Spring Measurement Theory Front-End –Design –Design Review and Inspection –Code –Code Inspections –Debug and Develop Test Cases  Integration.
Measurement Theory in Marketing Research. Measurement What is measurement?  Assignment of numerals to objects to represent quantities of attributes Don’t.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
MAT 1000 Mathematics in Today's World. Last Time 1.Collecting data with experiments 2.Practical problems with experiments.
Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.
Statistics. Descriptive Statistics Organize & summarize data (ex: central tendency & variability.
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
Chapter 6 - Standardized Measurement and Assessment
Outline Variables – definition  Physical dimensions  Abstract dimensions Systematic vs. random variables Scales of measurement Reliability of measurement.
Chapter 13 Understanding research results: statistical inference.
Lesson 3 Measurement and Scaling. Case: “What is performance?” brandesign.co.za.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
Chemistry. What is Chemistry? Chemistry is the "scientific study of matter, its properties, and interactions with other matter and with energy".
Data measurement, probability and Spearman’s Rho
REGRESSION G&W p
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Review: What influences confidence intervals?
Significance Tests: The Basics
Non-Experimental designs: Correlational & Quasi-experimental designs
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

1 Measurement Theory Ch 3 in Kan Steve Chenoweth, RHIT

2 What quality is about You have to measure quality some way, to “know” if you have it! If customer “happiness” can’t be anticipated by something you can measure, then you’re just guessing about uncertainties. Engineering is all about the “controlling” of processes – how you do things. In this case, how you achieve quality.

3 Quality vs Qualities Quality is usually taken to be some degree of goodness, like how much a feature is there, how well it works, how fast it runs, etc. Qualities are usually seen as these dimensions that you can measure or judge. Almost inevitably, the more you can control the qualities you want to measure, the more reliably or accurately you can measure them. Like “experiments.” So,

4 We want to run QA like science! To begin with, you need to understand how to measure things carefully. Thus, Kan’s Ch 3. E.g., we try to do sequences of: – Observing things we can measure, then – Generalizing from those measurements, – Based on some careful method. So, we gather data to test propositions about our products.

5 What stands in the way of science? We don’t know how to control a lot of variables in real production processes like software. – We don’t do the same thing each time. – Just because measuring “errors per KLOC” worked well last time, doesn’t mean it will next time. – Thus, in software, we rarely rely on single measurements to decide things like, “Ship it.”

6 When is best chance for measurement? Doing things like what you already know what to do! The whole development process, if you just did several products a lot like this one! It’s worth, at the start, deciding – What’s the same, vs – What’s different.

7 We do apply common sense But, we also test that! E.g., “The higher the percentage of the designs and code that are inspected, the lower the defect rate at the later phase of formal machine testing.” If you have guts, you can actually test if this is true or not!

8 You have to be careful using believed but untested common sense!

9 We do hypothesis testing…

10 Kan’s levels of measurement Nominal scale – classify things! How about that new guy on the right?

11 Ordinal scale Roughly, something like 1,2,3,4,5,6,7,… Mathematical properties, such as: If A > B and B > C, then A > C.

12 Interval and Ratio scales Interval: Product A’s defect rate is 5 KLOC, Product B’s defect rate is 3.5 KLOC. Then Product A’s defect rate is 1.5 KLOC greater than Product B’s defect rate. Ratio: We can also say that Product A’s defect rate is 1.43 x Prodct B’s defect rate.

13 Higher level vs lower level Higher level measurements scales possess the properties of lower level ones. Can’t do the reverse – If we have: – Excellent – Good – Average – Worse than average – Poor How much better is Good than Poor? Who knows?

14 Games to play with the numbers Ratio Proportion Percentage

15 Percentage, cntd

16 More games Rate Six Sigma

17 Reliability and Validity Not the same thing E.g., IQ tests – What do they really measure? They all prove that they are “reliable” – – If you take it again, you’re likely to get close to the same score. And they are “validated” against each other – – If you take a different test, you’re likely to get close to the same score. So?

18 Validity = “Does it measure what we intend it to measure?” E.g., Do IQ tests predict how well students will do in school? – Angela Lee Duckworth, of U Penn, says a better key to success, even in school, is “grit”. – Other people cite motivation, “competence,” “multiple intelligences,” and other factors. – IQ tests predict best how well people do taking similar tests.

19 How about for us? What measures accurately if a product will be a success? Or, even “not crash”?

20 Reliability vs validity pictured Measurement errors – – Systematic – Random

21 Reliability and Validity give confidence! Reliability helps us know if our process is sound. Correlation is used to judge validity. Really hard to conclude “causality.” – If my alarm clock goes off at 7 AM every morning, – And yours goes off at 7:01 AM every morning, – Did my alarm clock make yours go off?