The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes.

Slides:



Advertisements
Similar presentations
**ESTABLISHING PATTERNS OR TRENDS IN THE DATA COLLECTED** BY DR. ARTEMIO P. SEATRIZ MMSU-CTE LAOAG CITY.
Advertisements

Statistics for the Social Sciences Psychology 340 Fall 2006 Distributions.
The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes.
Math 2260 Introduction to Probability and Statistics
Basic Ideas of Statistics Unit 1.1 Basic Ideas of Statistics Corresponds to Chapter 1 in Triola.
Introduction to Data Analysis
MSS 905 Methods of Missiological Research
Statistical Tests Karen H. Hagglund, M.S.
SPSS Basic stats guide Dr. Craig Jackson Senior Lecturer in Health Psychology Faculty of Health Birmingham City University
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 1 Chicago School of Professional Psychology.
The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes.
The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Presenting information
Introduction to Biostatistics
Statistical Analysis I have all this data. Now what does it mean?
PY550 Research and Statistics Dr. Mary Alberici Central Methodist University.
CHAPTER 4 Research in Psychology: Methods & Design
DATA ANALYSIS FOR RESEARCH PROJECTS
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Collecting, Presenting, and Analyzing Research Data By: Zainal A. Hasibuan Research methodology and Scientific Writing W# 9 Faculty.
Statistics 1 Course Overview
● Midterm exam next Monday in class ● Bring your own blue books ● Closed book. One page cheat sheet and calculators allowed. ● Exam emphasizes understanding.
With Statistics Workshop with Statistics Workshop FunFunFunFun.
Section 1.1, Slide 1 Copyright © 2014, 2010, 2007 Pearson Education, Inc. Section 14.1, Slide 1 14 Descriptive Statistics What a Data Set Tells Us.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Statistics Ch.1: Variables & Measurement. Types Statistics: –Descriptive –Inferential Data: Collections of observations –Population –Sample.
Data Presentation.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
Quantitative Skills 1: Graphing
Biostatistics – A Revisit What are they? Why do we need them? Their relevance and importance.
CSCI N207: Data Analysis Using Spreadsheets Copyright ©2005  Department of Computer & Information Science Univariate Data Analysis.
Statistical Analysis I have all this data. Now what does it mean?
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Areej Jouhar & Hafsa El-Zain Biostatistics BIOS 101 Foundation year.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.
Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.
Dr. Fowler AFM Unit 8-1 Organizing & Visualizing Data Organize data in a frequency table. Visualizing data in a bar chart, and stem and leaf display.
Medical Statistics as a science
Describing Data: Graphical Methods ● So far we have been concerned with moving from asking a research question to collecting good quality empirical data.
By: Asma Al-Oneazi Supervised by… Dr. Amal Fatani.
Statistical Analysis I Mosuk Chow, PhD Senior Scientist and Professor Department of Statistics December 8, 2015 CTSI BERD Research Methods Seminar Series.
Surveillance and Population-based Prevention Department for Prevention of Noncommunicable Diseases Displaying data and interpreting results.
Section 1.1, Slide 1 Copyright © 2014, 2010, 2007 Pearson Education, Inc. Section 14.1, Slide 1 14 Descriptive Statistics What a Data Set Tells Us.
Introduction to Medical Statistics. Why Do Statistics? Extrapolate from data collected to make general conclusions about larger population from which.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Graphs & Charts: The Art of Data Visualisation Alasdair Rutherford SSPC9C6University of StirlingSpring 2016.
Introduction to Biostatistics Lecture 1. Biostatistics Definition: – The application of statistics to biological sciences Is the science which deals with.
Chapter 12 Understanding Research Results: Description and Correlation
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
Measurements Statistics
INTRODUCTION AND DEFINITIONS
CHAPTER 4 Research in Psychology: Methods & Design
LEVELS of DATA.
Quantitative Skills : Graphing
Basic Statistics Overview
Introduction to Statistics
Basic Statistical Terms
NURS 790: Methods for Research and Evidence Based Practice
Quantitative Methods in HPELS HPELS 6210
15.1 The Role of Statistics in the Research Process
Global PaedSurg Research Training Fellowship
Presentation transcript:

The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes that may be thought provoking and challenging It is not intended for the content or delivery to cause offence Any issues raised in the lecture may require the viewer to engage in further thought, insight, reflection or critical evaluation

Background to Statistics Distributions Data collection Data presentation Dr. Craig Jackson Senior Lecturer in Health Psychology Trauma & Critical Care Faculty of Health & Community Care UCE Birmingham

33% 123 Problem: stick with initial choice or choose another door ? 50% ? Solution: probability says that you stand a better chance of finding the cash if you SWAP The Monty Hall Problem

Door 1Door 2Door 3 Never swap WIN LOSE LOSE Always swap LOSE WIN WIN Marilyn vos Savant The Monty Hall Problem

Dispersion RangeSpread of data MeanArithmetic average MedianLocation ModeFrequency SDSpread of data about the mean Range mmHg Mean82mmHgMedian82mmHgMode82mmHg SD± 10mmHg

Types of Data / Variables ContinuousDiscrete BPChildren HeightNo. colds in last 12 months WeightAge last birthday Age OrdinalNominal Grade of conditionSex Positions 1 st 2 nd 3 rd Hair colour “Better - Same – Worse”Blood group Height groupsEye colour Age groups

Conversion & Re-classification Easier to summarise Ordinal / Nominal data Cut-off Points(who decides this?) Allows Continuous variables to be changed into Nominal variables BP> 90mmHg=Hypertensive BP=< 90mmHg=Normotensive Easier clinical decisions Categorisation reduces quality of data Statistical tests may be more sensational Good for summariesBad for analyses

Histograms and Bar-Charts Distinction is often lost Histograms The distribution of a continuous variable No gaps between the bars Bar-Chart Spaces between the bars Distribution of discrete / categorical data

Types of statistics / analyses DESCRIPTIVE STATISTICSDescribing a phenomena FrequenciesHow many… Basic measurementsMeters, seconds, cm 3, IQ INFERENTIAL STATISTICSInferences about phenomena Hypothesis TestingProving or disproving theories Confidence IntervalsIf sample relates to the larger population CorrelationAssociations between phenomena Significance testinge.g diet and health

Types of Data QUALITITATIVE Data expressed by type Data that has been described QUANTITATIVE Data classified by numeric value Data that has been measured or counted QUALITITATIVE and QUANTITATIVE data are not mutually exclusive Use of the two data types in research is ok

values have no “real” meaning values have “real” meaning Categorical Data NOMINAL DATA values that the data may have do not have specific order values act as labels with no real meaning e.g. hair colourbrown =1blond =2black =100 ORDINAL DATA values with some kind of ordering data that has been measured or counted e.g. social class:upper1middle = 2working = 3 e.g. glioblastoma tumor grade:12345 e.g. position in a race:1 st 2 nd 3 rd

Quantitative Data DISCRETE distinct or separate parts, with no finite detail e.g children in family CONTINUOUS between any two values, there would be a third e.g between meters there are centimetres INTERVAL equal intervals between values and an arbitrary zero on the scale e.g temperature gradient RATIO equal intervals between values and an absolute zero e.g body mass index

Quantitative Data COUNTS number of items having a particular shared characteristic PROPORTIONS number of items with a particular characteristic / by the number of the total population PERCENTAGES a proportion multiplied by 100 represents “parts per hundred” RATIO alternative to proportions - number with the characteristic / by the number without RATES A variance of the proportion method, expressed as counts per 1000

Terminology - Variables INDEPENDENT - Working hours, exposure, worker attitudes, policies - Chemical exposure in workplace DEPENDENT - Symptomotology, productivity, accident rates, attitudes, health - Performance on neuropsychological test CONTROLLED - Working hours, temperatures, exposure, diet, class, income - Ambient noise and temperature in testing room

White Hot Red Hot Cold “Dangerous” “Unpleasant” “Uncomfortable” “Tolerable” “Comfortable” “Cold” 80 o C 60 o C 40 o C 20 o C 10 o C Unsafe Safe Levels of Variables Temperature

5’6” 5’7” 5’8” 5’9” 5’10” 5’11” 6’ 6’1” 6’2” 6’3” 6’4” 5’6” 5’7” 5’8” 5’9” 5’10” 5’11” 6’ 6’1” 6’2” 6’3” 6’4”Height % of population Distributions Sir Francis Galton ( ) Alumni of Birmingham University 9 books and > 200 papers Fingerprints, correlation of calculus, twins, neuropsychology, blood transfusions, travel in undeveloped countries, criminality and meteorology) Deeply concerned with improving standards of measurement

balls dropped through a succession of metal pins….. …..a normal distribution of balls do not have a normal distribution here. Why? Quincunx machine 1877

The distribution derived from the quincunx is not perfect It was only made from 18 balls Normal & Non-normal distributions

Galton’s quincunx machine ran with hundreds of balls a more “perfect” shaped normal distribution. Obvious implications for the size of samples of populations used The more lead shot runs through the quincunx machine, the smoother the distribution in the long run.....

A SAMPLE OF VISUAL ABILITIES IN THE UK (SIMPLIFIED DATA) very poor average very good frequency recruiting participants in the “r.n.i.b ” magazine would yield? recruiting participants in “ornathology” magazine would yield? recruiting participants in a gp surgery would yield? Normal & Non-normal distributions bigger samples are best (usually)

Presentation of data Why use tables and graphs FIRST PRINCIPALS OF DATA PRESENTATION enhance understanding clarity avoidance of misunderstanding WHY USE TABLES ? more accurate than graphs more concise than graphs WHY USE GRAPHS ? provide good general overview allows reader to visualise the concept

ExposedControlsT P n=197n=178 Age (yrs)(  9.4)(  7.3) I.Q (  10.8)(  8.7) Speed (ms) (  13.4)(  12.4) (ms) (  13.4)(  12.4) Presentation of data Table of means

ExposedControls Healthy Unwell Chi square (test of association) shows: Chi square = 7.2P = 0.02 Presentation of data Category tables

Bar charts Use for Comparing data and Counts of data Histograms Use for Comparing data and to show spread of data Pie charts Use for Counts of data Scatterplot Use to show spread of data Box plot Confidence intervals Graphical displays

Mean GHQ scores for exposure groups GHQ score Job Type Bar charts

Title of graph x-axis y-axis (ordinate) x-axis (abscissa) scale Data display area Legend key Graphical display components

Movie-goers’ ratings for National Lampoon’s European Vacation (1985) What does the distribution of votes indicate ? What other info is needed ? votes Viewer rating Graphical displays Some real data

Movie-goers’ ratings for The Empire Strikes Back (1980) What does the distribution of votes indicate ? What other info is needed ? Viewer rating votes Graphical displays Some real data

Movie-goers’ ratings (%) Rating Lampoon Empire Movie-goers’ ratings Rating Lampoon Empire Movie data summary Both tables represent the same data..… Do either of them convey the general trend ?

Lampoon Empire What is wrong with this bar chart ? How could it be improved ? votes Viewer rating Movie data summary Back to back comparison

Lampoon Empire Can this be improved ? Viewer rating % of votes Movie data summary Back to back comparison

Lampoon Empire Viewer rating % of votes over-complicated and messy Movie data summary Back to back comparison

Score12345 Frequency Tables determined numbers word processed less space Figures overview at a glance little “processing power” showing trends Clarity vs accuracy

Importance of Sample Size Forgotten in many studies Forgotten in many studies Little consideration given Little consideration given Appropriate size needed to confirm / refute hypotheses Appropriate size needed to confirm / refute hypotheses Small samples far too small to detect anything but the grossest difference Non-significant results are reported – Type 2 errors occur Small samples far too small to detect anything but the grossest difference Non-significant results are reported – Type 2 errors occur Too large a sample – unnecessary waste of (clinical) resources Ethical considerations – waste of patient time, inconvenience, discomfort Too large a sample – unnecessary waste of (clinical) resources Ethical considerations – waste of patient time, inconvenience, discomfort Make assessment of optimal sample size before starting investigation

Quantitative Data Summary What data is needed to answer the larger-scale research question What data is needed to answer the larger-scale research question Combination of quantitative and qualitative ? Combination of quantitative and qualitative ? Cleaning, re-scoring, re-scaling, or re-formatting Cleaning, re-scoring, re-scaling, or re-formatting Measurement of both IV’s and DV’s is complex but can be simplified Measurement of both IV’s and DV’s is complex but can be simplified Binary measurement makes analysis easier but less meaningful Binary measurement makes analysis easier but less meaningful Binary data needs clear parameters e.g exposed vs controls Binary data needs clear parameters e.g exposed vs controls Collecting good quality data at source is vital Collecting good quality data at source is vital

Quantitative Data Summary Continuous & Discrete data can also be converted into Binary data Continuous & Discrete data can also be converted into Binary data Normal distribution of participants / data points desirable Normal distribution of participants / data points desirable Means - age, height, weight, BMI, IQ, attitudes Means - age, height, weight, BMI, IQ, attitudes Frequencies / Classifications - job type, sick vs. healthy, dead vs alive Frequencies / Classifications - job type, sick vs. healthy, dead vs alive Means must be followed by Standard Deviation (SD or ±) Means must be followed by Standard Deviation (SD or ±) Presentation of data must enhance understanding or be redundant Presentation of data must enhance understanding or be redundant

Further Reading Altman DG. Designing Research. In: Altman DG (ed.) Practical Statistics For Medical Research. Chapman and Hall, London 1991; Bland M. The design of experiments. In: Bland M. (ed.) An introduction to medical statistics. Oxford Medical Publications, Oxford 1995; Daly LE, Bourke GJ. Epidemiological and clinical research methods. In: Daly LE, Bourke GJ. (eds.) Interpretation and uses of medical statistics. Blackwell Science Ltd, Oxford 2000; Gao Smith F, Smith J. (eds.) Key Topics in Clinical Research. BIOS scientific Publications, Oxford Jackson CA. Planning Health and Safety Research Projects. Croner Health and Safety at Work Special Report 2002; 62: Jackson CA. Analyzing Statistical Data in Occupational Health Research. Management of Health Risks Special Report, 81 Croner Publications, Surrey, June 2003