Simulations: Basic Procedure

Presentation transcript:

1 Simulations: Basic Procedure
1. Generate a true probability distribution, and a set of base conditionals (premises) with associated lower probability bounds.
2. Determine which derived conditionals (conclusions) each system is able to derive from the given base conditionals (including lower probability bounds for these conclusions).
3. Assign scores to each system by comparing the lower probability bounds for the derived conditionals with the true probabilities.

2 Language Restriction
We adopted a simple language with four binary variables: a, b, c, and d. We only considered conditionals with conjunctive antecedents and consequents (and no repetition of atoms), yielding 464 conditionals:
64 of the form $\pm x \Rightarrow \pm y \wedge \pm z \wedge \pm w$
96 of the form $\pm x \Rightarrow \pm y \wedge \pm z$
48 of the form $\pm x \Rightarrow \pm y$
96 of the form $\pm x \wedge \pm y \Rightarrow \pm z \wedge \pm w$
96 of the form $\pm x \wedge \pm y \Rightarrow \pm z$
64 of the form $\pm x \wedge \pm y \wedge \pm z \Rightarrow \pm w$
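The count of 464 can be checked by brute-force enumeration. The following is a minimal sketch, not the authors' code: the representation of a literal as an `(atom, polarity)` pair and the function names are assumptions made for illustration.

```python
from itertools import combinations, product

ATOMS = "abcd"

def literal_sets(atoms):
    """All sign assignments for `atoms`; a literal is (atom, polarity),
    where polarity False means the atom is negated."""
    return [tuple(zip(atoms, signs))
            for signs in product((True, False), repeat=len(atoms))]

def enumerate_conditionals(atoms=ATOMS):
    """Return (antecedent, consequent) pairs of conjunctions sharing no atom."""
    conditionals = []
    for k in range(1, len(atoms)):                     # antecedent size
        for ante_atoms in combinations(atoms, k):
            rest = [x for x in atoms if x not in ante_atoms]
            for m in range(1, len(rest) + 1):          # consequent size
                for cons_atoms in combinations(rest, m):
                    for ante in literal_sets(ante_atoms):
                        for cons in literal_sets(cons_atoms):
                            conditionals.append((ante, cons))
    return conditionals

conditionals = enumerate_conditionals()

# Tally by (antecedent size, consequent size).
by_shape = {}
for ante, cons in conditionals:
    shape = (len(ante), len(cons))
    by_shape[shape] = by_shape.get(shape, 0) + 1

print(len(conditionals), by_shape)
```

Grouping by antecedent and consequent size reproduces the six counts listed on the slide.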

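3 The probability distributions… are fixed by randomly setting the following fifteen values: $P(a)$, $P(b \mid a)$, $P(b \mid \neg a)$, $P(c \mid a \wedge b)$, $P(c \mid a \wedge \neg b)$, $P(c \mid \neg a \wedge b)$, $P(c \mid \neg a \wedge \neg b)$, $P(d \mid a \wedge b \wedge c)$, $P(d \mid a \wedge b \wedge \neg c)$, $P(d \mid a \wedge \neg b \wedge c)$, $P(d \mid a \wedge \neg b \wedge \neg c)$, $P(d \mid \neg a \wedge b \wedge c)$, $P(d \mid \neg a \wedge b \wedge \neg c)$, $P(d \mid \neg a \wedge \neg b \wedge c)$, and $P(d \mid \neg a \wedge \neg b \wedge \neg c)$.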
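A chain of conditional probabilities like the one above determines the full joint distribution over the 16 truth assignments. The sketch below illustrates this under one assumption not stated on the slide: each value is drawn uniformly from [0, 1]. The function name `random_joint` is hypothetical.

```python
import random
from itertools import product

def random_joint(rng):
    """Build a joint distribution over (a, b, c, d) by the chain rule.

    Assumption of this sketch: each conditional probability is drawn
    uniformly from [0, 1]. The slides only say "randomly".
    """
    # p_true[prefix] is the probability that the next variable is true,
    # given the truth values in `prefix`. There is one draw per distinct
    # conditioning context: 1 + 2 + 4 + 8 of them.
    p_true = {prefix: rng.random()
              for n in range(4)
              for prefix in product((True, False), repeat=n)}
    joint = {}
    for world in product((True, False), repeat=4):
        prob = 1.0
        for i, value in enumerate(world):
            p = p_true[world[:i]]
            prob *= p if value else 1.0 - p
        joint[world] = prob
    return joint

joint = random_joint(random.Random(0))
print(len(joint), sum(joint.values()))
```

Because every factor is a valid conditional probability, the 16 products sum to 1 up to floating-point rounding, with no separate normalization step.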

4 Errors We say that a system has made an error when the lower probability bound for a derived conditional exceeds the true probability.

5 Scoring I The advantage-compared-to-guessing score for derived conditionals: $\mathrm{Score}_{AvG}(C \Rightarrow_r D, P) = 1/3 - |r - P(D \mid C)|$.
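As a minimal sketch, the AvG score can be written as a one-line function. The name `score_avg` and the argument names are hypothetical; `true_prob` stands for the true conditional probability of D given C. One plausible reading of the constant 1/3, not confirmed by the slides, is that it is the expected absolute difference between two independent uniform draws from [0, 1].

```python
def score_avg(r, true_prob):
    """Advantage-compared-to-guessing: 1/3 minus the absolute error of bound r."""
    return 1.0 / 3.0 - abs(r - true_prob)

# A bound equal to the true probability earns the maximum score of 1/3.
print(score_avg(0.5, 0.5))

# A safe but very loose bound (r far below the truth) still scores negatively.
print(score_avg(0.1, 0.9))
```

The second call shows how this measure can penalize a bound that is not an error: the bound 0.1 does not exceed the true probability 0.9, yet it lies more than 1/3 away from it.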

6 Scoring II The price-is-right score for derived conditionals: $\mathrm{Score}_{PIR}(C \Rightarrow_r D, P) = r$ if $r \le P(D \mid C)$, and $= 0$ otherwise.

7 Scoring III The subtle-price-is-right score for derived conditionals: $\mathrm{Score}_{sPIR}(C \Rightarrow_r D, P) = r$ if $r \le P(D \mid C)$, and $= P(D \mid C) - r$ otherwise.
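The two price-is-right scores differ only in how they treat an error, that is, a bound that exceeds the true probability. A minimal sketch follows; the function and argument names are hypothetical.

```python
def is_error(r, true_prob):
    """An error occurs when the lower bound r exceeds the true probability."""
    return r > true_prob

def score_pir(r, true_prob):
    """Price-is-right: earn r for a safe bound, nothing for an error."""
    return 0.0 if is_error(r, true_prob) else r

def score_spir(r, true_prob):
    """Subtle price-is-right: an error costs the amount of the overshoot."""
    return true_prob - r if is_error(r, true_prob) else r

print(score_pir(0.7, 0.8), score_spir(0.7, 0.8))   # safe bound
print(score_pir(0.9, 0.8), score_spir(0.9, 0.8))   # bound overshoots
```

Under PIR an overshooting bound scores zero, while under sPIR it scores negatively, so a system that errs often can accumulate a negative sPIR total.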

8 Minimum Probability for Base Conditionals For each simulation, we required that the probability for each base conditional (premise) exceeded some fixed value, s.

9 Simulations with Randomly Selected Base Conditionals
► $P$ was determined at random, for each run.
► We varied n, the number of base conditionals. (n = 2, 3, 4, 5, or 6)
► We varied s, the minimal probability of base conditionals. (s = 0.5, 0.6, 0.7, 0.8, 0.9, or )
► The base conditionals, for each run, were selected at random from among those conditionals whose associated probability was at least s.
► We ran each combination of s and n one thousand times.
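One run of the premise-selection step described above could look like the following sketch. It is an illustration under stated assumptions, not the authors' procedure: the uniform draws, the helper names, and the choice to return `None` when fewer than n conditionals qualify are all hypothetical. How the lower bound attached to each premise is set is not specified here, so the sketch simply returns the true conditional probability alongside each selected premise.

```python
import random
from itertools import combinations, product

ATOMS = "abcd"
WORLDS = list(product((True, False), repeat=len(ATOMS)))

def random_joint(rng):
    """Chain-rule joint distribution; uniform draws are an assumption."""
    p_true = {prefix: rng.random()
              for n in range(len(ATOMS))
              for prefix in product((True, False), repeat=n)}
    joint = {}
    for world in WORLDS:
        prob = 1.0
        for i, value in enumerate(world):
            p = p_true[world[:i]]
            prob *= p if value else 1.0 - p
        joint[world] = prob
    return joint

def signed(atoms):
    """All sign assignments for `atoms`, as tuples of (atom, polarity)."""
    return [tuple(zip(atoms, signs))
            for signs in product((True, False), repeat=len(atoms))]

def enumerate_conditionals():
    """All (antecedent, consequent) conjunction pairs with no shared atom."""
    out = []
    for k in range(1, len(ATOMS)):
        for ante_atoms in combinations(ATOMS, k):
            rest = [x for x in ATOMS if x not in ante_atoms]
            for m in range(1, len(rest) + 1):
                for cons_atoms in combinations(rest, m):
                    out.extend(product(signed(ante_atoms), signed(cons_atoms)))
    return out

def holds(literals, world):
    """True if every literal in the conjunction is satisfied in `world`."""
    return all(world[ATOMS.index(atom)] == sign for atom, sign in literals)

def cond_prob(joint, ante, cons):
    """P(cons | ante), or None when the antecedent has probability zero."""
    p_ante = sum(p for w, p in joint.items() if holds(ante, w))
    p_both = sum(p for w, p in joint.items()
                 if holds(ante, w) and holds(cons, w))
    return p_both / p_ante if p_ante > 0 else None

def select_premises(joint, n, s, rng):
    """Pick n conditionals at random among those with probability >= s."""
    scored = [(ante, cons, cond_prob(joint, ante, cons))
              for ante, cons in enumerate_conditionals()]
    eligible = [item for item in scored
                if item[2] is not None and item[2] >= s]
    if len(eligible) < n:
        return None          # not enough qualifying conditionals this run
    return rng.sample(eligible, n)

rng = random.Random(1)
premises = select_premises(random_joint(rng), n=3, s=0.5, rng=rng)
for ante, cons, prob in premises:
    print(ante, "=>", cons, round(prob, 3))
```

Repeating the last three lines inside a loop over the values of n and s, one thousand times each, would mirror the run structure listed in the bullets.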

10 Simulations with Predetermined Base Conditionals
► $P$ was determined at random, for each run.
► For each set of predetermined base conditionals, we ran each value of s one thousand times. (s = 0.5, 0.6, 0.7, 0.8, 0.9, and )

11 The Data

12 Observations (Table 1)
1. System O is very conservative, and licenses very few inferences. [YELLOW]

13 Observations (Table 2)
3. System Z almost always outperformed the other systems by the AvG measure (even though the AvG measure sometimes punishes non-errors). [YELLOW]
4. System QC outscored all of the other systems by the PIR measure (save where s = ), though system Z also scores well. [BLUE]

14 Observations (Table 2)
5. System Z generally achieved positive sPIR scores, and outscored all of the other systems by this measure (save in the case where s = ). [GREEN]
6. QC usually obtained negative sPIR scores, due to frequent errors.

15 Observations (Table 3) 7. Where s is fixed, increasing the number of base conditionals tends to increase the scores (for all measures) for systems O, P, and Z (but not for QC [YELLOW]).

16 Observations (Table 4) 8. For systems O, P, and Z, the score earned per inference tends to increase for higher values of s (exceptions in [YELLOW]).

17 Conclusions
1. System P offers the same security as system O, and licenses more inferences.
2. System Z licenses far more inferences than system P, and generally infers lower probability bounds that are close to the true probability values (based on AvG scoring).

18 Conclusions
3. QC inferences are too risky, and very few of the inferences sanctioned by QC, and not by Z, should be made. (Consider the $QC \setminus Z$ inferences, esp. sPIR scores.)
4. Of the four systems, system Z offers the best balance of safety versus inferential power.