Session 1 Probability to confidence intervals By : Allan Chang.

Slides:



Advertisements
Similar presentations
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Advertisements

Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
Inferential Statistics & Hypothesis Testing
Chapter 8 Estimation: Additional Topics
Review: What influences confidence intervals?
1 MF-852 Financial Econometrics Lecture 4 Probability Distributions and Intro. to Hypothesis Tests Roy J. Epstein Fall 2003.
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Audit Tests: Risk, Confidence and Materiality Some basic statistics about Inherent Risk and Control Risk.
BHS Methods in Behavioral Sciences I
Statistical Concepts (continued) Concepts to cover or review today: –Population parameter –Sample statistics –Mean –Standard deviation –Coefficient of.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Inference about a Mean Part II
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.
Standard error of estimate & Confidence interval.
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
Review of normal distribution. Exercise Solution.
Statistical inference: confidence intervals and hypothesis testing.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Topic 5 Statistical inference: point and interval estimate
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Sections 6-1 and 6-2 Overview Estimating a Population Proportion.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Sumukh Deshpande n Lecturer College of Applied Medical Sciences Statistics = Skills for life. BIOSTATISTICS (BST 211) Lecture 7.
Estimating a Population Proportion
Statistical estimation, confidence intervals
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Large sample CI for μ Small sample CI for μ Large sample CI for p
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Confidence intervals. Estimation and uncertainty Theoretical distributions require input parameters. For example, the weight of male students in NUS follows.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Revision of basic statistics Hypothesis testing Principles Testing a proportion Testing a mean Testing the difference between two means Estimation.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Lesoon Statistics for Management Confidence Interval Estimation.
1 Probability and Statistics Confidence Intervals.
Elementary Probability.  Definition  Three Types of Probability  Set operations and Venn Diagrams  Mutually Exclusive, Independent and Dependent Events.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
SPSS Problem and slides Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Confidence Intervals and Hypothesis Testing Mark Dancox Public Health Intelligence Course – Day 3.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
And distribution of sample means
LBSRE1021 Data Interpretation Lecture 9
Confidence Intervals with Means
Inference: Conclusion with Confidence
Estimation & Hypothesis Testing for Two Population Parameters
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing Is It Significant?.
Hypothesis Testing: Hypotheses
When we free ourselves of desire,
Chapter 9 Hypothesis Testing.
CONCEPTS OF ESTIMATION
Lecture 7 Sampling and Sampling Distributions
BUSINESS MATHEMATICS & STATISTICS.
CS639: Data Management for Data Science
Last Update 12th May 2011 SESSION 41 & 42 Hypothesis Testing.
Chapter 9 Estimation: Additional Topics
Presentation transcript:

Session 1 Probability to confidence intervals By : Allan Chang

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Probability Philosophical nature of truth –Aristotle : things can be true or false, but cannot be both –Confucius : everything is in the middle Science and empiricism –Truth cannot be known, only estimated from observations –Things repeatedly observed are more likely to be true –No theory can be accepted unless supported by repeated observations Uncertainties in observations –Multiple influences on any observation –Precisions of measurement –Very unlikely to get the same result every time –Probability provides an expression of confidence in the results –Statistics is the set of tools to summarise the observations

How probability is expressed Convention : a number between 0 and 1. –0 (0%) = absolutely impossible, 1 (100%) = absolute certainty Subjective probability : An expression of opinion. –e.g. the probability of my staying awake in this lecture is 10% Logical probability : mathematical calculations based on assumptions, how most statistical theories are developed –e.g. If the probability of getting heads in tossing a coin is 0.5 getting 2 heads in 2 tosses is 0.5x0.5=0.25, getting 2 tails = 0.25, and getting 1 head and 1 tail = 0.5 Getting at least 1 heads in 2 tosses is (0.5x0.5) = 0.75 Empirical probability : calculated from observations –e.g. if there are 10 women and 20 men in the class, the probability of being male is 20/(10+20) = 0.67 (67%)

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Mean Havara : a system of insurance amongst Phoenecian traders Havara -> average -> mean Mean is the centre of all the measurements

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Gauss De Moivre

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

De Moivre Fisher

z p Standard deviate z Probability p Z=0; p=0.5 Z=1.29; p=0.1 Z=1.65; p=0.05 Z=1.96; p=0.025 Z=2.33; p=0.01 Z=2.58; p=0.005 Z=3.09; p=0.001 Fisher

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Confidence interval Half of excluded each side e.g. in 95% CI 5% is excluded 2.5% each side (z=1.96) 95% CI = mean +/ SD

Confidence interval of measurements The range any single measurement is likely to be within CI measurement = mean +/- z SD –90% CI = mean +/ SD –95% CI = mean +/ SD –99% CI = mean +/ SD

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Population –Measure everyone, the results are true of the population –Very expensive and inconvenient to do, in most cases not possible Sampling –Measure a small but representative portion of the population, and assume that the results are true also of the population as a whole –The results are estimates of the truth, approximations –The degree of approximation (error) is important and needs to be estimated

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Standard Error of the mean –Conceptually, the Standard Deviation of all the mean values, if the samples are taken repeatedly

nmeanSDSE mean and SD of the 10 means Standard Error of the mean SE = SD / sqrt(n) Random number mean=0 SD=1 10 samples of 20 each SE each sample = 0.2 – 0.32 SD of the 10 means = 0.2 SE is an estimate of the SD of the means if repeated sampling of the same size occurs

Standard Error of the mean (SE) –A function of Standard Deviation, and sample size –If the sample size is infinite, or the whole population, then SE = 0, as the mean value has no error –If the sample size =1, then the SE is the same as the Standard Deviation –SE mean = SD / sqrt(n)

Sample size SE = SD / SQRT(n) SE as % of SD

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Confidence interval of the mean –The range within which the true mean value can be expected –CI mean = mean +/- z SE 90% CI = mean +/ SE 95% CI = mean +/ SE 99% CI = mean +/ SE –CI mean = CI measurement / SQRT(n)

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

Theory of sampling –Difference between the means of samples in two groups can be considered as the mean difference between these two groups in the population –e.g. The difference in the mean weights between 200 men and 200 women represents the mean difference in weight between men and women in the population –The problem is the Standard Error and the confidence interval

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

R.A. Fisher devised the Analysis of Variance between groups –Total variations in measurements are partitioned That attributable to groups difference That attributable to individual differences (residual, error)

From the Analysis of variance –Individual or random variation is partitioned –What is left are group variance and difference –From this Further partition to Standard Errors for each group What is left then is the variation or error of the difference From this, the Standard Error of the difference is derived SE difference is a function of the sample size and SD of the two groups, and the difference in the means between the two groups

Probability Mean The normal distribution Standard Deviation (of measurements) Confidence interval of measurements Population and sample Standard Error (of the mean) Confidence interval of the mean Difference between two means (mean of differences) Standard Error (of the difference) Confidence interval of the difference

–The range within which the difference can be expected –CI difference = mean +/- z SE difference 90% CI = difference +/ SE difference 95% CI = difference +/ SE difference 99% CI = difference +/ SE difference

Interpretation of CI difference CI represent the range within which the “true value” is likely to be –CI that does not traverse the null value means that the difference is not null, or that the difference is statistically significant –CI that are inside of a meaningless difference (tolerance) means that the difference is not of practical consequence, and can be considered to be equivalent

An example of CI We do two studies –Study 1, the difference in weight between women and men –Study 2, the difference in weight between those named Jim and all those named Bill –We think that a difference of less than 1Kg is probably not big enough to worry about

Difference in weight (Kg) men – women n = 20 men – women n = 100 Jim – Bill n = 200 Jim – Bill n = 1000 Tolerance +/- 1Kg

Terms you need to know Measurements –Mean, Standard Deviation, confidence interval Means –Mean, Standard Error of the mean, confidence interval Difference –Difference between means, mean difference, Standard Error of the difference, confidence interval, significant difference, not significantly different, equivalence