Fundamentals of Statistical Analysis DR. SUREJ P JOHN.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
Inference for Regression
Statistical Tests Karen H. Hagglund, M.S.
Data Analysis Statistics. Inferential statistics.
QUANTITATIVE DATA ANALYSIS
BHS Methods in Behavioral Sciences I
Descriptive Statistics
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Analysis of Research Data
Introduction to Educational Statistics
Inference.ppt - © Aki Taanila1 Sampling Probability sample Non probability sample Statistical inference Sampling error.
Data Analysis Statistics. Inferential statistics.
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Descriptive Statistics: Part One Farrokh Alemi Ph.D. Kashif Haqqi M.D.
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
1 GE5 Lecture 6 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson.
CHAPTER 4 Research in Psychology: Methods & Design
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Statistics 1 Course Overview
1 Basic Definitions Greg C Elvers, Ph.D.. 2 Statistics Statistics are a set of tools that help us to summarize large sets of data data -- set of systematic.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Ch2: Probability Theory Some basics Definition of Probability Characteristics of Probability Distributions Descriptive statistics.
Chapter Eleven A Primer for Descriptive Statistics.
COMM 250 Agenda - Week 12 Housekeeping RP2 Due Wed. RAT 5 – Wed. (FBK 12, 13) Lecture Experiments Descriptive and Inferential Statistics.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Final review - statistics Spring 03 Also, see final review - research design.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Agenda Descriptive Statistics Measures of Spread - Variability.
Quick and Simple Statistics Peter Kasper. Basic Concepts Variables & Distributions Variables & Distributions Mean & Standard Deviation Mean & Standard.
1 Statistical Analysis – Descriptive Statistics Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Experimental Design and Statistics. Scientific Method
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Inferential Statistics. Coin Flip How many heads in a row would it take to convince you the coin is unfair? 1? 10?
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Lecture 2 Frequency Distribution, Cross-Tabulation, and Hypothesis Testing.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
Data Analysis.
Chapter 6: Analyzing and Interpreting Quantitative Data
PCB 3043L - General Ecology Data Analysis.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Advanced Statistical Methods: Continuous Variables REVIEW Dr. Irina Tomescu-Dubrow.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Chapter 13 Understanding research results: statistical inference.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Lesson 3 Measurement and Scaling. Case: “What is performance?” brandesign.co.za.
Chapter 4 Variability PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter and Larry.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Statistics Vocabulary. 1. STATISTICS Definition The study of collecting, organizing, and interpreting data Example Statistics are used to determine car.
Appendix I A Refresher on some Statistical Terms and Tests.
Data analysis and basic statistics KSU Fellowship in Clinical Pathology Clinical Biochemistry Unit
Statistics & Evidence-Based Practice
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Introduction to Inferential Statistics
Statistics Branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data. Practice or science of.
Basic Statistical Terms
Data analysis and basic statistics
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Fundamentals of Statistical Analysis DR. SUREJ P JOHN

Definition of Variables A variable is an attribute of a person or an object that varies. Measurement are rules for assigning numbers to objects to represent quantities of attributes. Back to Table of Content

Definition Datum is one observation about the variable being measured. Data are a collection of observations. A population consists of all subjects about whom the study is being conducted. A sample is a sub-group of population being examined.

What Is Statistics? Statistics is the science of describing or making inferences about the world from a sample of data. Descriptive statistics are numerical estimates that organize and sum up or present the data. Inferential statistics is the process of inferring from a sample to the population.

1.Descriptive analysis – data distribution 2.Inferential analysis – hypothesis testing 3.Differences analysis – hypothesis testing 4.Association analysis – correlation 5.Predictive analysis – regression Five Types of Statistical Analysis

A Hypothesis: A statement relating to an observation that may be true but for which a proof (or disproof) has not been found The results of a well-designed experiment or data collection may lead to the proof or disproof of a hypothesis Descriptive vs. Inferential Statistics

Population Samples Sub-samples Inferential Statistics

For example, Heights of male vs. female at age of 25. Our observations: male H > female H; it may be linked to genetics, consumption and exercise etc. Is that true for male H> female H? i.e. Null hypothesis: male H ≤ female H Scenario I: Randomly select 1 person from each sex. Male: 170 Female: 175 Then, Female H> Male H ? Scenario II: Randomly select 3 persons from each sex. Male: 171, 163, 168 Female: 160, 172, 173 What is your conclusion then? Which is the better Scenario?

Important messages here: (1)Sample size is very important and will affect your conclusion (2)Measurement results vary among samples (or subjects) – that is “variation” or “uncertainty”. (3)Variation can be due to measurement errors (random or systematic errors) and inherent within samples variation. For example, at age 20, female height varies from 158 to 189 cm. Why? (4)Therefore, in Statistics, we always deal with distributions of data rather than a single point of measurement or event.

Moments of a Normal Distribution Each moment measures a different dimension of the distribution. 1. Mean (1st moment) 2. Standard deviation (2nd moment) 3. Skewness (3rd moment) 4. Kurtosis (4th moment)

Mean n Mean (µ) is equal to the sum of n number of observation divided by the number of observations (sample size) Mean = Sum of values/n =  X i /n e.g. length of 8 fish larvae at day 3 after hatching: 0.6, 0.7, 1.2, 1.5, 1.7, 2.0, 2.2, 2.5 mm mean length = ( )/8 = 1.55 mm mm mean

Standard deviation The standard deviation (SD) (represented by the Greek letter sigma, σ) shows how much variation or dispersion from the average exists.σ A low standard deviation indicates that the data points tend to be very close to the mean (also called expected value); a high standard deviation indicates that the data points are spread out over a large range of values. it is the square root of the Variance.The Variance is the average of the squared differences from the Mean The formula is easy: it is the square root of the Variance. The Variance is defined as: the average of the squared differences from the Mean.

Standard deviation

Calculate SD?

Skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive or negative, or even undefined.

Kurtosis The coefficient of Kurtosis is a measure for the degree of peakedness /flatness in the variable distribution. The coefficient of Kurtosis is a measure for the degree of peakedness /flatness in the variable distribution. Kurtosis <0 Kurtosis = 0 Kurtosis > 0

Frequency Distribution In statistics, a frequency distribution is an arrangement of the values that one or more variables take in a sample. Each entry in the table contains the frequency or count of the occurrences of values within a particular group or interval, and in this way, the table summarizes the distribution of values in the sample. Frequency distribution tables can be used for both categorical and numeric variables.

Cross Tabulation A cross-tabulation (or cross-tab for short) is a display of data that shows how many cases in each category of one variable are divided among the categories of one or more additional variables. In a cross-tab, a cell is a combination of two or more characteristics, one from each variable. If one variable has two categories and the second variable has four categories, for instance, the cross-tab will have 6 cells, each with a number specific to that category

Left-handedRight-handedTotal Males 235 Females 145 Total 3710

Comparing Means We need to compare the means of groups in Inferential statistics. T-tests and ANOVA (Analysis of Variance) are the methods commonly used for comparing means. Independent T tests Independent T tests Independent T tests are used for testing the difference between the means of two independent groups. For Independent T-tests, there should be only one independent variable but it can have two levels. There should be only one dependant variable. Ex: gender (male and female) How male and female students differ in academic performance?

Anova (Analysis of Variance) Anova is used as the extension of Independent t-tests. This is used when the researcher is interested in whether the means from several ( >2) independent groups differ. For Avova, only one dependant variable should be present. There should be only ONE independent variable present (but it can have many levels unlike in independent t-tests)

Statistical errors in hypothesis testing

Statistical Errors in Hypothesis Testing Consider court judgments where the accused is presumed innocent until proved guilty beyond reasonable doubt (I.e. Ho = innocent)

Statistical Errors in Hypothesis Testing Similar to court judgments, in testing a null hypothesis in statistics, we also suffer from the similar kind of errors: