The Scientific Method Probability and Inferential Statistics.

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Chapter 4 Probability and Probability Distributions
Modeling Process Quality
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) What is the uncertainty.
Introduction to Statistics
Hypothesis testing Week 10 Lecture 2.
QUANTITATIVE DATA ANALYSIS
PSY 307 – Statistics for the Behavioral Sciences
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
BHS Methods in Behavioral Sciences I
CHAPTER 6 Statistical Analysis of Experimental Data
Inference about a Mean Part II
Lecture 6: Descriptive Statistics: Probability, Distribution, Univariate Data.
Hypothesis Testing :The Difference between two population mean :
Chapter 9: Introduction to the t statistic
Chapter 6: Probability.
Inferential Statistics
Inferential Statistics
Testing Hypotheses.
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
Hypothesis Testing. Central Limit Theorem Hypotheses and statistics are dependent upon this theorem.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
The smokers’ proportion in H.K. is 40%. How to testify this claim ?
Statistical Analysis Statistical Analysis
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Individual values of X Frequency How many individuals   Distribution of a population.
Mid-Term Review Final Review Statistical for Business (1)(2)
Theory of Probability Statistics for Business and Economics.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Medical Statistics as a science
Two Main Uses of Statistics: 1)Descriptive : To describe or summarize a collection of data points The data set in hand = the population of interest 2)Inferential.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
© Copyright McGraw-Hill 2004
URBDP 591 I Lecture 4: Research Question Objectives How do we define a research question? What is a testable hypothesis? How do we test an hypothesis?
Hypothesis Testing. Central Limit Theorem Hypotheses and statistics are dependent upon this theorem.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Introduction to Medical Statistics. Why Do Statistics? Extrapolate from data collected to make general conclusions about larger population from which.
Probability, Sampling, and Inference Q560: Experimental Methods in Cognitive Science Lecture 5.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Chapter 13 Understanding research results: statistical inference.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Chapter 9 Introduction to the t Statistic
Theoretical distributions: the other distributions.
Theoretical distributions: the Normal distribution.
Confidence Intervals.
AP Biology Statistics From BSCS: Interaction of experiments and ideas, 2nd Edition. Prentice Hall, 1970 and Statistics for the Utterly Confused by Lloyd.
Chapter 9 Hypothesis Testing.
Chapter 4. Inference about Process Quality
LBSRE1021 Data Interpretation Lecture 9
Statistics and Computers
Statistical Inference for the Mean: t-test
Presentation transcript:

The Scientific Method Probability and Inferential Statistics

Scientific investigations sample values of a variable to make inferences (or predictions) about all of its possible values in the population BUT! There is always some doubt as to whether observed values = population values eg Jersey cow serum iron concentrations Inferential statistics quantify the doubt – ► What are the chances of conclusions based on a sample of the population holding true for the population as a whole ? ► are the conclusions safe ? – will the prediction happen in most observed situations? Probability is defined as a relative frequency or proportion – the chance of something happening out of a defined number of opportunities Lecture 6: Probaility and Inferential Statistics Probability and Inferential Statistics

subjectively as a % expectation of an event – a cow has a 60% chance of calving tonight (based on experience, but subject to individual opinions) a priori probability – based on the theoretical model defining the set of all probabilities of an outcome. eg when a coin is tossed, the probability of obtaining a head is ½ or 0.5 defined probability – the proportion of times an event will occur in a very large number of trials (or experiments) performed under similar conditions. e.g. the proportion of times a guinea pig will have a litter of greater than three, based upon the observed frequency of this event All of these approaches are related mathematically Probabilities can be expressed as a percentage (23%), a fraction / proportion (23/100) or a decimal (0.23) as parts of a whole (= parts of a unitised number of opportunities) Lecture 6: Probaility and Inferential Statistics For example:

Addition rule – when two events are mutually exclusive (they can’t occur at the same time) the probabilities of either of them occurring is the sum of the probabilities of each event eg 1/5 + 1/5 = 2/5 or 0.4 for two particular biscuits out of 5 types Multiplication rule - when two events are independent, the probability of both events occurring = the product of their individual probabilities e.g. a Friesian cow inseminated on a particular day has a probability of calving 278 days later (the mean gestation period) of 0.5 (she either calves or she doesn’t!) If two Friesian cows are inseminated on the same day, then the probability of both of them calving on the same day 278 days later is (0.5 x 0.5 ) = 0.25 Probability distributions can derive from discrete or continuous data a discrete random variable with only two possible values (e.g. male/female) is called a binary variable Lecture 6: Probaility and Inferential Statistics Two rules govern probabilities

eg number of spots on ladybirds in a sample The binomial distribution portrays the frequency distribution of data relating to an “all or none” event – whether an animal displays or doesn’t display a characteristic eg pregnant / not pregnant, number of spots on ladybirds (either 3,5,7,9,11,15,18, 21 etc!) For a continuous variable, the probability that its value lies within a particular interval is given by the relevant area under the curve of the probability density function The NORMAL ( or Gaussian ) DISTRIBUTION is a theoretical distribution of a continuous random variable (x) whose properties can be described mathematically by the mean (  ) and standard deviation (σ) Lecture 6: Probaility and Inferential Statistics

the proportion of the values of x lying between + and – 1x (times!), 1.96x and 2.58x the standard deviation on either side of the mean. It means 100% of the data values are included within 3 sd units either side of the mean In a perfectly symmetrical normal distribution, MEAN, MEDIAN and MODE have the same value Normal distributions with the same value of the standard deviation (σ) but different values of the mean (  ) Lecture 6: Probaility and Inferential Statistics

It is possible to make predictions about the likelihood of the mean value of a variable differing from another mean value – whether the difference is likely or unlikely to be due to chance alone This is the basis of significance testing - if the distribution of observed values approximates to the normal distribution, it becomes possible to compare means of variables with the theoretical distribution and estimate whether their observed differences are significantly different from the expected values of each variable if they are truly normally distributed eg Student’s t Test We carry out an experiment on guinea pigs to test the hypothesis that dietary lipid sources rich in ω3 polyunsaturated fatty acids improve coat condition We compare the breaking strength of hairs from two groups of 10 guinea pigs fed a normal mix compared with a diet supplemented with cod liver oil, recording the max. weight their hair will support as tensile strength in g. We want to decide whether the mean strength of hairs from the control and experimental groups differ significantly at the end of the trial Lecture 6: Probaility and Inferential Statistics

The steps for doing this manually are best set out in a table Lecture 6: Probaility and Inferential Statistics First we must calculate the sample mean, variance and standard deviation for each data set (control and test) If the data for the control mean are referred to as a and the test mean as b, then the t statistic is calculated as: Calculating the t statistic

n a = 10 X b = 7.48X a = 6.34 ∑ = 6.69∑ = 1.77∑ = 74.8∑ = ( X b – X b ) 2 X b – X b ( X a – X a ) 2 X a – X a Hair Tensile Strength GP Test Group b (g) Hair Tensile Strength GP Control Group a (g) n b = 10 Lecture 6: Probaility and Inferential Statistics Calculating the t statistic

We then compare our calculated value of t with those in the table of critical values for the value of t Lecture 6: Probaility and Inferential Statistics Calculating the variance: Calculating the t statistic and the standard deviation and finally the t statistic! We can ignore the –ve sign! Degrees of Freedom of the data set

These are the significance levels for the t statistic at 10%, 5%, 1% and 0.1%, (from left to right) If our value for t (2.78) exceeds any of the tabulated values of t for 18 df, which it does for p = 0.05, but not for p = 0.01, we can say “the means are different at the 5% level of significance and we can reject H 0 ” (the null hypothesis of no difference between the two treatments) The confidence level is simply 100 – (significance level) So, alternatively, we could say: “ we can be 95% confident that there is a significant difference between the two means” Lecture 6: Probaility and Inferential Statistics Significance and confidence