Statistics. The usual course of events for conducting scientific work “The Scientific Method” Reformulate or extend hypothesis Develop a Working Hypothesis.

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Measurement, Evaluation, Assessment and Statistics
Statistical Tests Karen H. Hagglund, M.S.
The usual course of events for conducting scientific work “The Scientific Method” Reformulate or extend hypothesis Develop a Working Hypothesis Observation.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
Descriptive Statistics Chapter 3 Numerical Scales Nominal scale-Uses numbers for identification (student ID numbers) Ordinal scale- Uses numbers for.
Descriptive Statistics
Methods and Measurement in Psychology. Statistics THE DESCRIPTION, ORGANIZATION AND INTERPRATATION OF DATA.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
1 Basic statistics Week 10 Lecture 1. Thursday, May 20, 2004 ISYS3015 Analytic methods for IS professionals School of IT, University of Sydney 2 Meanings.
Introduction to Educational Statistics
Data observation and Descriptive Statistics
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Central Tendency and Variability
Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories Ordinal measurement Involves sorting objects.
Introduction to Statistics February 21, Statistics and Research Design Statistics: Theory and method of analyzing quantitative data from samples.
AM Recitation 2/10/11.
Statistical Analysis I have all this data. Now what does it mean?
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Psychometrics.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Chapter Eleven A Primer for Descriptive Statistics.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Psychology 101. Statistics THE DESCRIPTION, ORGANIZATION AND INTERPRATATION OF DATA.
TYPES There are several TYPES of variables that reflect characteristics of the data Ratio Interval Ordinal Nominal.
CHAPTER 3  Descriptive Statistics Measures of Central Tendency 1.
ENGR 610 Applied Statistics Fall Week 2 Marshall University CITE Jack Smith.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Data Analysis.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
RESEARCH & DATA ANALYSIS
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Describing Data: Numerical Measures.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
LIS 570 Summarising and presenting data - Univariate analysis.
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
Chapter 4: Variability. Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Chapter 2 Describing and Presenting a Distribution of Scores.
Welcome to… The Exciting World of Descriptive Statistics in Educational Assessment!
SOC 3155 SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION h458 student
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
Data, Tables & Graphs October 24, 2016 BIOL 260
Statistical Methods Michael J. Watts
Populations.
Doc.RNDr.Iveta Bedáňová, Ph.D.
Statistical Methods Michael J. Watts
Topic 3: Measures of central tendency, dispersion and shape
Measures of Central Tendency
Central Tendency and Variability
Descriptive Statistics
Introduction to Statistics
Univariate Statistics
Descriptive Statistics
Central Tendency & Variability
Presentation transcript:

Statistics

The usual course of events for conducting scientific work “The Scientific Method” Reformulate or extend hypothesis Develop a Working Hypothesis Observation Conduct an experiment or a series of controlled systematic observations Appropriate statistical tests Confirm or reject hypothesis

The usual course of events for conducting scientific work “The Scientific Method” Reformulate or extend hypothesis Develop a Working Hypothesis Observation Conduct an experiment or a series of controlled systematic observations Appropriate statistical tests Confirm or reject hypothesis In a group of crickets, small ones seem to avoid large ones There will be movement away from large cricket by small ones Record the number of times that small crickets move away from small and large crickets. Chi square test There is a significant difference in the number of times small crickets move away from large vs. small ones Avoidance may depend on previous experience

Imagine that you are collecting samples (i.e. a number of individuals) from a population of little ball creatures - Critterus sphericales Little ball creatures come in 3 sizes: Small = Medium = Large =

-sample 1 -sample 2 -sample 3 -sample 4 -sample 5 You end up with a total of five samples

The real population (all the little ball creatures that exist) Your samples

Each sample is a representation of the population BUT No single sample can be expected to accurately represent the whole population

To be statistically valid, each sample must be: 1) Random: Thrown quadrat?? Guppies netted from an aquarium?

Assign numbers from a random number table

To be statistically valid, each sample must be: 2) Replicated:

But not - ‘Pseudoreplication’ Not pseudoreplication Pseudoreplication 10 samples from the same tree 10 samples from 10 different trees Sample size = 1Sample size = 10

TYPES OF DATA

RATIO DATA - constant size interval - a zero point with some reality e.g. Heights, rates, time, volumes, weights

INTERVAL DATA - constant size interval - no true zero point zero point depends on the scale used e.g. Temperature

Ordinal Scale - ranked data -grades, preference surveys

Nominal Scale Team numbers Drosophila eye colour

The kind of data you are dealing with is one determining factor in the kind of statistical test you will use.

Statistics and Parameters

Measures of: Central tendency - mean, median, mode Dispersion - range, mean deviation, variance, standard deviation, coefficient of variation

The real population (all the little ball creatures that exist) Central tendency - Mean

The real population (all the little ball creatures that exist) Your samples

The real population (all the little ball creatures that exist)  =  X i N Central Tendency 1) Arithmetic mean At Population level Measuring the diameters of all the little ball creatures that exist  population mean X i - every measurement in the population N - population size

Your samples X =  X i n n n n n

n Sample mean Sum of all measurements in the sample Sample size

If you have sampled in an unbiased fashion X =  X i n n n n n Each roughly equals 

Central tendency - Median Median - middle value of a population or sample e.g. Lengths of Mayfly (Ephemeroptera) nymphs 5 th value (middle of 9)

Median value Odd number of valuesEven number of values Median = middle value Median = + 2

Odd number of values (i.e. n is odd) Even number of values Or - to put it more formally Median = X (n+1) 2 Median = X (n/2) + X (n/2) + 1 2

Frequency (= number of times each measurement appears in the population Values (= measurements taken) c. Mode - the most frequently occurring measurement Mode Central tendency - Mode

Measures of Dispersion Why worry about this?? -because not all populations are created equal Distribution of values in the populations are clearly different BUT means and medians are the same Mean & median

Measures of Dispersion - 1. Range - difference between the highest and lowest values Remember little ball creatures and the five samples Range = -

Range - crude measure of dispersion Note - three samples do not include the highest value and - two samples do not include the lowest

Measures of Dispersion - 2. Mean Deviation X is a measure of central tendency Take difference between each measure and the mean X i - X BUT  X i - X = 0 So this is not useful as it stands

Measures of Dispersion - 2. Mean Deviation (cont’d) But if you take the absolute value -get a measure of disperson  |X i - X| and n = mean deviation

Measures of Dispersion - 3. Variance -eliminate the sign from deviation from mean Square the difference (X i - X) 2 And if you add up the squared differences - get the “sum of squares”  (X i - X) 2 (hint: you’ll be seeing this a lot!)

Measures of Dispersion - 3. Variance (cont’d) Sum of squares can be considered at both the population and sample level ss =  (X i - X) 2 SamplePopulation SS =  (X i -  ) 2

s 2 =  (X i - X) 2 SamplePopulation  2 =  (X i -  ) 2 Measures of Dispersion - 3. Variance (cont’d) If you divide by the population or sample size - get the mean squared deviation or VARIANCE N n-1 Population variance Sample variance

s 2 =  (X i - X) 2 Note something about the sample variance n-1 Measures of Dispersion - 3. Variance (cont’d) Degrees of freedom or df or

Measures of Dispersion - 4. Standard Deviation - just the square root of the variance  =  (X i -  ) 2 N Population Sample s =  (X i - X) 2 n-1

Standard Deviation - very useful Most data in any population are within one standard deviation of the mean

NORMAL DISTRIBUTION

Type of data Discrete Continuous Other distributions 2 categories & Bernoulli process > 2 categories Use a Binomial model to calculate expected frequencies Use a Poisson distribution to calculate expected frequencies From previous slide show

Type of data Discrete Continuous Other distributions 2 categories & Bernoulli process > 2 categories Use a Binomial model to calculate expected frequencies Use a Poisson distribution to calculate expected frequencies Now we’re dealing with:

Normal Distribution - bell curve

Central Limit Theorem Any continuous variable influenced by numerous random factors will show a normal distribution.

Normal curve is used for: 2) Continuous random data Weight, blood pressure weight, length, area, rates Data points that would be affected by a large number of random (=unpredictable) events Blood pressure age physical activity smokingdiet genes stress

Normal curves can come in different shapes So, for comparison between them, we need to standardize their presentation in some way

Standarize by calculating a Z-Score Z = value of a random variable - mean standard deviation Z = X - µ  or

Example of a z-score calculation The mean grade on the Biometrics midterm is 78.4 and the standard deviation is 6.8. You got a 59.7 on the exam. What is your z-score? Z = X - µ  Z = =

If you look at the formula for z-scores: z = value of a random variable - mean standard deviation z is also the number of standard deviations a value is from the mean

Each standard deviation away from the mean defines a certain area of the normal curve