Stats 242.3(02) Statistical Theory and Methodology.

Slides:



Advertisements
Similar presentations
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Advertisements

Chapter 6 Sampling and Sampling Distributions
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
Chapter 7 Sampling and Sampling Distributions
Programme in Statistics (Courses and Contents). Elementary Probability and Statistics (I) 3(2+1)Stat. 101 College of Science, Computer Science, Education.
Stats Probability Theory.
Evaluating Hypotheses
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Part III: Inference Topic 6 Sampling and Sampling Distributions
Inferences About Process Quality
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
Sampling Theory Determining the distribution of Sample statistics.
Elementary Statistical Concepts
Chapter 10 Hypothesis Testing
Confidence Intervals and Hypothesis Testing - II
Sampling Theory Determining the distribution of Sample statistics.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Elementary Statistical Concepts
STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Random Sampling, Point Estimation and Maximum Likelihood.
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
1 Sampling Distributions Lecture 9. 2 Background  We want to learn about the feature of a population (parameter)  In many situations, it is impossible.
Some definitions In Statistics. A sample: Is a subset of the population.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Areej Jouhar & Hafsa El-Zain Biostatistics BIOS 101 Foundation year.
Chapter 7 Sampling and Point Estimation Sample This Chapter 7A.
Lab 3b: Distribution of the mean
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Stats Probability Theory. Instructor:W.H.Laverty Office:235 McLean Hall Phone: Lectures: M W F 2:30pm - 3:20am Arts 133 Lab: M 3:30 - 4:20.
Distributions of the Sample Mean
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
The two way frequency table The  2 statistic Techniques for examining dependence amongst two categorical variables.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Stats 845 Applied Statistics. This Course will cover: 1.Regression –Non Linear Regression –Multiple Regression 2.Analysis of Variance and Experimental.
Stats 845 Applied Statistics. This Course will cover: 1.Regression –Non Linear Regression –Multiple Regression 2.Analysis of Variance and Experimental.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
© Copyright McGraw-Hill 2004
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Sampling Theory Determining the distribution of Sample statistics.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Stats Introduction to Statistical Methods. Instructor:W.H.Laverty Office:235 McLean Hall Phone: Lectures: M T W Th F 11:00am - 12:20pm Geol.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Chapter 6 Sampling and Sampling Distributions
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
Sampling Distributions
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Statistics in Applied Science and Technology
Introduction to Statistics
STAT 5372: Experimental Statistics
STAT 400 Probability and Statistics 1
Presentation transcript:

Stats 242.3(02) Statistical Theory and Methodology

Instructor:W.H.Laverty Office:235 McLean Hall Phone: Lectures: M W F 2:30pm - 3:20am Arts 206 Lab: M 3:30 - 4:20 Arts 206 Evaluation:Assignments, Labs, Term tests - 40% Final Examination - 60%

Text: Dennis D. Wackerly, William Mendenhall III, Richard L. Scheaffer, Mathematical Statistics with applications, 6th Edition, Duxbury Press

Course Outline

Introduction Chapter 1

Sampling Distributions Chapter 7 Sampling distributions related to the Normal distribution The Central Limit theorem The Normal approximation to the Binomial

Estimation Chapter 8 Properties of estimators Interval estimation Sample size determination

Properties and Methods of Estimation Chapter 9 The method of moments Maximum Likelihood estimation Sufficiency (Sufficient Statistics)

Hypothesis testing Chapter 10 Elements of a statistical test - type I and type II errors The Z test - one and two samples hypothesis testing for the means of the normal distribution with small sample sizes Power and the NeymannPearson Lemma Likelihood ratio tests

Linear and Nonlinear Models Least Squares Estimation Chapter 11 Topics covered dependent on available time

The Analysis of Variance Chapter 13 Topics covered dependent on available time

Nonparametric Statistical Methods Chapter 15 Topics covered dependent on available time

Introduction

What is Statistics? It is the major mathematical tool of scientific inference – methods for drawing conclusion from data. Data that is to some extent corrupted by some component of random variation (random noise)

Phenomena Deterministic Non-deterministic

Deterministic Phenomena A mathematical model exists that allows accurate prediction of outcomes of the phenomena (or observations taken from the phenomena)

Non-deterministic Phenomena Lack of perfect predictability

Non-deterministic Phenomena haphazard Random

Random Phenomena No mathematical model exists that allows accurate prediction of outcomes of the phenomena (or observations) However the outcomes (or observations) exhibit in the long run on the average statistical regularity

Example Tossing of a Coin: No mathematical model exists that allows accurate prediction of outcome of this phenomena However in the long run on the average approximately 50% of the time the coin is a head and 50% of the time the coin is a tail

Haphazard Phenomena No mathematical model exists that allows accurate prediction of outcomes of the phenomena (or observations) No exhibition of statistical regularity in the long run. Do such phenomena exist?

In both Statistics and Probability theory we are concerned with studying random phenomena

In probability theory The model is known and we are interested in predicting the outcomes and observations of the phenomena. model outcomes and observations

In statistics The model is unknown the outcomes and observations of the phenomena have been observed. We are interested in determining the model from the observations model outcomes and observations

Example - Probability A coin is tossed n = 100 times We are interested in the observation, X, the number of times the coin is a head. Assuming the coin is balanced (i.e. p = the probability of a head = ½.)

Example - Statistics We are interested in the success rate, p, of a new surgical procedure. The procedure is performed n = 100 times. X, the number of successful times the procedure is performed is 82. The success rate p is unknown.

If the success rate p was known. Then This equation allows us to predict the value of the observation, X.

In the case when the success rate p was unknown. Then the following equation is still true the success rate We will want to use the value of the observation, X = 82 to make a decision regarding the value of p.

Some definitions important to Statistics

A population: this is the complete collection of subjects (objects) that are of interest in the study. There may be (and frequently are) more than one in which case a major objective is that of comparison.

A case (elementary sampling unit): This is an individual unit (subject) of the population.

A variable: a measurement or type of measurement that is made on each individual case in the population.

Types of variables Some variables may be measured on a numerical scale while others are measured on a categorical scale. The nature of the variables has a great influence on which analysis will be used..

For Variables measured on a numerical scale the measurements will be numbers. Ex: Age, Weight, Systolic Blood Pressure For Variables measured on a categorical scale the measurements will be categories. Ex: Sex, Religion, Heart Disease

Note Sometimes variables can be measured on both a numerical scale and a categorical scale. In fact, variables measured on a numerical scale can always be converted to measurements on a categorical scale.

Example The following variables were evaluated for a study of individuals receiving head injuries in Saskatchewan. 1.Cause of the injury (categorical) Motor vehicle accident Fall Violence other

2.Time of year (date) (numerical or categorical) summer fall winter spring 3.Sex on injured individual (categorical) male female

4.Age (numerical or categorical) < – Mortality (categorical) Died from injury alive

Types of variables In addition some variables are labeled as dependent variables and some variables are labeled as independent variables.

This usually depends on the objectives of the analysis. Dependent variables are output or response variables while the independent variables are the input variables or factors.

Usually one is interested in determining equations that describe how the dependent variables are affected by the independent variables

Example Suppose we are collecting data on Blood Pressure Height Weight Age

Suppose we are interested in how Blood Pressure is influenced by the following factors Height Weight Age

Then Blood Pressure is the dependent variable and Height Weight Age Are the independent variables

Example – Head Injury study Suppose we are interested in how Mortality is influenced by the following factors Cause of head injury Time of year Sex Age

Then Mortality is the dependent variable and Cause of head injury Time of year Sex Age Are the independent variables

dependentResponse variable independentpredictor variable

A sample: Is a subset of the population

In statistics: One draws conclusions about the population based on data collected from a sample

Reasons: Cost It is less costly to collect data from a sample then the entire population Accuracy

Data from a sample sometimes leads to more accurate conclusions then data from the entire population Costs saved from using a sample can be directed to obtaining more accurate observations on each case in the population

Types of Samples different types of samples are determined by how the sample is selected.

Convenience Samples In a convenience sample the subjects that are most convenient to the researcher are selected as objects in the sample. This is not a very good procedure for inferential Statistical Analysis but is useful for exploratory preliminary work.

Quota samples In quota samples subjects are chosen conveniently until quotas are met for different subgroups of the population. This also is useful for exploratory preliminary work.

Random Samples Random samples of a given size are selected in such that all possible samples of that size have the same probability of being selected.

Convenience Samples and Quota samples are useful for preliminary studies. It is however difficult to assess the accuracy of estimates based on this type of sampling scheme. Sometimes however one has to be satisfied with a convenience sample and assume that it is equivalent to a random sampling procedure

Some other definitions

A population statistic (parameter): Any quantity computed from the values of variables for the entire population.

A sample statistic: Any quantity computed from the values of variables for the cases in the sample.