Data analysis and basic statistics KSU Fellowship in Clinical Pathology Clinical Biochemistry Unit 2015 - 2016.



Objectives: Understand the main concepts of statistical data analysis. Know the basic statistical techniques: measures of location; measures of variability; and hypothesis testing (Student’s t-test and the chi-squared test).

Preface This presentation focuses on the most common techniques for statistical data analysis.

What does Statistics mean? Statistics is defined as the science of collecting, presenting, analyzing, and reasonably interpreting data. Statistics has traditionally been used for two purposes: (1) to summarize data so that it is readily comprehensible (descriptive statistics); (2) to draw conclusions that can be applied to other cases (statistical inference).

The use of computers and their accompanying graphics programs has made it possible to obtain attractive and meaningful displays of data.

A Taxonomy of Statistics

Descriptive statistics: frequencies; basic measurements. These describe a phenomenon: How many? How much? (BP, HR, BMI, IQ, etc.)
Inferential statistics: hypothesis testing; correlation; confidence intervals; significance testing; prediction. These make inferences about a phenomenon: proving or disproving theories; associations between phenomena; whether a sample relates to the larger population (e.g., diet and health).

Measures of location The mean: It is defined as the sum of all the observations divided by the number of observations. µ is used to denote the mean of a population; x̄ is used to denote the mean of a sample.

Measures of location The median: It is the value that divides the ordered observations in half. For an odd sample size n, the median is the middle observation of the ordered data, i.e. the ((n+1)/2)th observation. For an even sample size n, the median is the mean of the two middle observations of the ordered data, i.e. the mean of the (n/2)th and ((n/2)+1)th observations.

Measures of location Mean or median: The median is less sensitive to outliers (extreme scores) than the mean and is thus a better measure than the mean for highly skewed distributions. Calculate the mean and median of the following values: 20, 30, 40, 990. Answer: Mean = 270. Median = 35.
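The worked answer above can be checked with a short Python sketch. It uses only the standard-library `statistics` module; the variable names are illustrative and not part of the original slides.

```python
import statistics

# The four values from the slide; 990 is the outlier.
values = [20, 30, 40, 990]

# Mean: sum of all observations divided by the number of observations.
mean_value = statistics.mean(values)

# Median: n is even here, so it is the mean of the two middle ordered values (30 and 40).
median_value = statistics.median(values)

print(f"mean = {mean_value}, median = {median_value}")
```

The single extreme value pulls the mean far above three of the four observations, while the median stays among the typical values.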

Measures of location The mode: It is the value of the variable that occurs most frequently. Comparing the mode with the mean and the median can indicate the skewness of the data.

Measures of variability These measure how spread out the data are; e.g., two distributions could have the same mean and look quite different. Examples of variability measures: variance; standard deviation; range; coefficient of variation; interquartile range.

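Variance Sample variance (s²) is defined as the sum of the squared differences between each observation in the sample and the sample mean, divided by one less than the number of observations: s² = Σ(xᵢ − x̄)² / (n − 1). Why n − 1? Because the sample mean is itself estimated from the same data, dividing by n would tend to underestimate the population variance. The sample variance does not shrink systematically as the sample size increases; it becomes a more stable estimate. The population variance is denoted by sigma squared (σ²): σ² = Σ(xᵢ − µ)² / N.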

Standard Deviation (SD) The sample standard deviation is the square root of the sample variance: s = √s². It does not shrink systematically as the sample size increases; the estimate simply becomes more stable. The population standard deviation is denoted by sigma (σ): σ = √σ² = √(Σ(xᵢ − µ)² / N).
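As a minimal sketch of the difference between the sample and population formulas, the following Python example uses the standard-library `statistics` module. The data set is hypothetical and chosen only because its arithmetic is easy to follow (mean 5, sum of squared deviations 32).

```python
import statistics

# Hypothetical observations (not from the slides): mean = 5, sum of squared deviations = 32.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Sample formulas divide the sum of squared deviations by n - 1.
sample_variance = statistics.variance(data)
sample_sd = statistics.stdev(data)

# Population formulas divide by N.
population_variance = statistics.pvariance(data)
population_sd = statistics.pstdev(data)

print(f"sample:     variance = {sample_variance:.4f}, SD = {sample_sd:.4f}")
print(f"population: variance = {population_variance:.4f}, SD = {population_sd:.4f}")
```

For the same data the sample variance (32/7) is slightly larger than the population variance (32/8), which is the effect of dividing by n − 1 instead of N.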

Range It is a measure of variation in data distribution, which is calculated by subtracting the smallest value from the largest value. Unlike SD, the range tends to increase as the sample size increases.

Coefficient of variation (CV) It is a standardized measure of dispersion of a probability distribution or frequency distribution. It is defined as the ratio of the standard deviation to the mean and is often expressed as a percentage. CV = (SD / mean) × 100%.
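A short Python sketch of the CV calculation follows. The helper name `coefficient_of_variation` and the data are hypothetical, and the sample SD (n − 1 denominator) is assumed; the slides do not say which SD is intended.

```python
import statistics

def coefficient_of_variation(values):
    """Return the sample SD expressed as a percentage of the mean."""
    return statistics.stdev(values) / statistics.mean(values) * 100

# Hypothetical observations (not from the slides).
data = [2, 4, 4, 4, 5, 5, 7, 9]
cv_percent = coefficient_of_variation(data)

print(f"CV = {cv_percent:.2f}%")
```

Because the CV is a ratio, it has no units, which makes it convenient for comparing the spread of measurements reported on different scales.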

Interquartile range Quartiles: Data can be divided into four regions that cover the total range of observed values. The cut points for these regions are known as quartiles. In notation, the qth quartile of a data set is the (((n+1)/4) × q)th observation of the ordered data, where q is the desired quartile (1, 2 or 3) and n is the number of observations.

Interquartile range Q1 is the median of the first half of the ordered observations and Q3 is the median of the second half of the ordered observations. The interquartile range is calculated by subtracting Q1 from Q3 (Q3 − Q1). Determine the interquartile range of the following numbers:

Answer In the previous question, Q1 = ((15+1)/4) × 1 = 4th observation of the data. The 4th observation is 11, so Q1 of this data is 11. Q1 = 11, Q2 = 40 (this is also the median) and Q3 = 61. Interquartile range: the difference between Q3 and Q1. The interquartile range of the previous question is 61 − 11 = 50.
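The ((n+1)/4) × q position rule can be sketched in Python as follows. The function name `quartile` and the 15-value data set are hypothetical (the original numbers are not reproduced in this transcript), and the linear interpolation for non-integer positions is an assumption that the slides do not specify.

```python
def quartile(values, q):
    """Return the q-th quartile (q = 1, 2 or 3) using the ((n + 1) / 4) * q position rule."""
    ordered = sorted(values)
    n = len(ordered)
    position = (n + 1) * q / 4      # 1-based position in the ordered data
    lower = int(position)           # whole part of the position
    fraction = position - lower     # fractional part, used for interpolation
    if lower < 1:
        return ordered[0]
    if lower >= n:
        return ordered[-1]
    # Interpolate between neighbouring observations (assumption, see lead-in).
    return ordered[lower - 1] + fraction * (ordered[lower] - ordered[lower - 1])

# Hypothetical data with n = 15: the even numbers 2, 4, ..., 30.
data = list(range(2, 32, 2))

q1, q2, q3 = (quartile(data, q) for q in (1, 2, 3))
iqr = q3 - q1

print(f"Q1 = {q1}, Q2 = {q2}, Q3 = {q3}, IQR = {iqr}")
```

With n = 15 the positions are the 4th, 8th and 12th ordered observations, the same positions used in the answer above. Statistical packages implement several slightly different quartile conventions, so results for small samples can differ between tools.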

Shape of data Two measures of data shape: Skewness measures the asymmetry of the data. Positive (right) skew: longer right tail. Negative (left) skew: longer left tail.

Shape of data Two measures of data shape: Kurtosis measures the peakedness of the distribution of data. The excess kurtosis of a normal distribution is 0 (its kurtosis is 3).
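Both shape measures can be sketched from central moments in plain Python. The functions below use the simple population (moment) definitions, which is an assumption; software such as Excel or SPSS applies small-sample corrections, so its output will differ slightly. All names are illustrative.

```python
def central_moment(values, k):
    """Return the k-th central moment: the mean of (x - mean) ** k."""
    mean = sum(values) / len(values)
    return sum((x - mean) ** k for x in values) / len(values)

def skewness(values):
    """Moment skewness: third central moment over the 1.5 power of the second."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """Moment kurtosis minus 3, so that a normal distribution scores 0."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3

symmetric = [1, 2, 3, 4, 5]      # hypothetical symmetric data
right_tail = [20, 30, 40, 990]   # the values from the mean/median exercise

print(f"skewness (symmetric)  = {skewness(symmetric):.3f}")
print(f"skewness (right tail) = {skewness(right_tail):.3f}")
print(f"excess kurtosis (symmetric) = {excess_kurtosis(symmetric):.3f}")
```

The symmetric data give a skewness of zero, while the data with the large outlier give a positive value, matching the "longer right tail" description.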

The normal distribution curve Features of the curve: the mean, median and mode are in the center; the curve is bell-shaped; the probability that a score is above the mean is 50%, and likewise below it; most of the scores are in the middle.

Confidence intervals A confidence interval is an interval estimate of µ: the sample mean “plus or minus” the margin of error. The commonly used confidence levels are 90%, 95% and 99%; 95% is the most common. The margin of error is based on the standard error of the mean: SD / √n.
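A minimal Python sketch of a z-based confidence interval follows. It assumes the population standard deviation σ is known, and it borrows the figures from the IQ example later in this deck (sample mean 105, σ = 15, n = 75) purely for illustration. The function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def z_confidence_interval(sample_mean, sigma, n, confidence=0.95):
    """Return (lower, upper) limits: sample mean plus or minus z * sigma / sqrt(n)."""
    # Two-sided critical value, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin_of_error = z * sigma / sqrt(n)   # z times the standard error
    return sample_mean - margin_of_error, sample_mean + margin_of_error

lower, upper = z_confidence_interval(sample_mean=105, sigma=15, n=75)
print(f"95% CI: {lower:.2f} to {upper:.2f}")
```

When σ is unknown and the sample is small, the z value would be replaced by the corresponding value from the t table.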

Z or t table to be used Conditions for using “t”: 1. σ is unknown. 2. n < 30.

Student’s t-test A t-test is a hypothesis test of the mean of one or two normally distributed populations. Several types of t-tests exist for different situations, but they all use a test statistic that follows a t-distribution under the null hypothesis.
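As an illustration of one of these types, here is a sketch of a one-sample t-test in Python. The sample values and the hypothesized mean are hypothetical. Because the Python standard library has no t-distribution, the critical value is taken from a printed t table (two-tailed, α = 0.05, 9 degrees of freedom).

```python
from math import sqrt
import statistics

def one_sample_t(values, mu0):
    """Return the t statistic: (sample mean - mu0) / (s / sqrt(n))."""
    n = len(values)
    standard_error = statistics.stdev(values) / sqrt(n)   # s uses the n - 1 denominator
    return (statistics.mean(values) - mu0) / standard_error

# Hypothetical sample of 10 measurements, tested against a hypothesized mean of 100.
sample = [98, 102, 101, 97, 105, 103, 99, 100, 104, 101]
t_value = one_sample_t(sample, mu0=100)

T_CRITICAL = 2.262   # from a t table: two-tailed, alpha = 0.05, df = n - 1 = 9
reject_null = abs(t_value) > T_CRITICAL

print(f"t = {t_value:.4f}, reject H0: {reject_null}")
```

Here the t statistic is smaller than the critical value, so this hypothetical sample gives no evidence that the mean differs from 100.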

t-test types

Hypothesis testing steps 1. State the null (H₀) and alternative (H₁) hypotheses. 2. Choose the level of significance (α) and the rejection regions (tails). 3. Find the critical values from the z or t table. 4. Find the test statistic (the z or t value): z = (x̄ − µ) / (σ / √n) or t = (x̄ − µ) / (s / √n). 5. Draw and write the conclusion: reject H₀ or fail to reject it.

Example: The average IQ for the adult population is 100 with a standard deviation of 15. A researcher believes this value has changed. The researcher decides to test the IQ of 75 random adults. The average IQ of the sample is 105. Is there enough evidence to suggest the average IQ has changed?

Answer: 1. State H₀ and H₁: H₀: µ = 100, H₁: µ ≠ 100. 2. Choose the level of significance: α = 0.05 (two-tailed test). 3. Find the critical values: z = ±1.96. 4. Find the test statistic: z = (x̄ − µ) / (σ / √n) = (105 − 100) / (15 / √75) = 2.89. 5. Draw and write the conclusion: since 2.89 > 1.96, reject H₀ and accept H₁.
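The five steps of the IQ example can be reproduced with a short Python sketch. It uses the standard-library `NormalDist` class in place of a printed z table; the variable names are illustrative, and the p-value is an extra that the slides do not report.

```python
from math import sqrt
from statistics import NormalDist

# Values from the example: H0: mu = 100, H1: mu != 100 (two-tailed).
mu0, sigma, n, sample_mean, alpha = 100, 15, 75, 105, 0.05

# Critical value for a two-tailed test at the chosen significance level.
z_critical = NormalDist().inv_cdf(1 - alpha / 2)

# Test statistic: distance of the sample mean from mu0 in standard errors.
z = (sample_mean - mu0) / (sigma / sqrt(n))

# Two-tailed p-value: probability of a result at least this extreme under H0.
p_value = 2 * (1 - NormalDist().cdf(abs(z)))

reject_null = abs(z) > z_critical
print(f"z = {z:.2f}, critical = ±{z_critical:.2f}, p = {p_value:.4f}, reject H0: {reject_null}")
```

The z statistic falls in the rejection region, which agrees with the conclusion in the answer above.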

Chi-squared test Watch the video.

Software 1. Microsoft Excel. 2. GraphPad Prism. 3. SPSS. 4. ……etc.

References: Basic Statistics: A Primer for the Biomedical Sciences, Dunn and Clark, 4th edition. Basic statistics overview ppt, Danielle Davidov, PhD. Class 1 ppt Lecture. Types of t-tests. Math Meeting. Chi-squared test video.