Statistical Modelling

Slides:



Advertisements
Similar presentations
Quantitative Techniques Lecture 1: Economic data 30 September 2004.
Advertisements

Lesson 10: Linear Regression and Correlation
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Chapter 13 Conducting & Reading Research Baumgartner et al Data Analysis.
Chapter 10 Simple Regression.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Business Statistics - QBM117 Statistical inference for regression.
Simple Linear Regression Analysis
The Practice of Social Research
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Chapter 11 Simple Regression
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Advanced Higher Statistics Data Analysis and Modelling Hypothesis Testing Statistical Inference AH.
Statistical Methods Statistical Methods Descriptive Inferential
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Examining Relationships in Quantitative Research
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Advanced Statistical Methods: Continuous Variables REVIEW Dr. Irina Tomescu-Dubrow.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
Research Methods. Define the Milgram experiment An experiment in which Milgram wanted to determine whether participants would administer painful shocks.
Howard Community College
Chapter 1: Statistics, Data and Statistical Thinking
Inference about the slope parameter and correlation
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
Lecture 9-I Data Analysis: Bivariate Analysis and Hypothesis Testing
Samples, Surveys, and Results UNIT SELF-TEST QUESTIONS
Advanced Higher Statistics
Lecture 10 Regression Analysis
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Bivariate & Multivariate Regression Analysis
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Statistical tests for quantitative variables
Dr. Siti Nor Binti Yaacob
Introduction to Statistics
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Dr. Siti Nor Binti Yaacob
12 Inferential Analysis.
BIVARIATE REGRESSION AND CORRELATION
© 2011 The McGraw-Hill Companies, Inc.
chance Learning impeded by two processes: Bias , Chance
The Nature of Probability and Statistics
Summarising and presenting data - Univariate analysis continued
Correlation and Regression
Dependent and Independent Variables, and Distributions
Regression Models - Introduction
Building Models: Mediation and Moderation Analysis
BA 275 Quantitative Business Methods
Section 7.7 Introduction to Inference
Simple Linear Regression
Multiple Regression Chapter 14.
12 Inferential Analysis.
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
MG3117 Issues and Controversies in Accounting
1. Homework #2 (not on posted slides) 2. Inferential Statistics 3
15.1 The Role of Statistics in the Research Process
Producing good data through sampling and experimentation
Statistics Review (It’s not so scary).
Chapter 1: Statistics, Data and Statistical Thinking
Presentation transcript:

Statistical Modelling Dr. Yang Hu yang.hu@Lancaster.ac.uk Dr. Stuart Bedston s.bedston@lancaster.ac.uk

Overview of the day Session 1: Introduction to statistical modelling Session 2: Dependent variables, independent variables, and distributions Session 3: From variables to models: Model choice, fitting and interpretation Session 4: Fitting models: diagnostics and further cautions Session 5: Building models: Mediation and moderation analysis Session 6: Advanced modelling techniques and learning resources

Introduction to Statistical Modelling Dr. Yang Hu yang.hu@Lancaster.ac.uk Dr. Stuart Bedston s.bedston@lancaster.ac.uk

Session overview Practice and example dataset for the day Inferential statistics: What is it? How is it different from descriptive statistics? Why and when to use statistical modelling? Workflow of statistical modelling

Example dataset British Crime Survey unrestricted teaching dataset 2007–2008 (see www.crimesurvey.co.uk for more information on the BCS) 11,676 random respondents (~25%) out of the 46,983 respondents who answered the follow-up Module B. 35 variables covering different types/substantive topics Download the teaching dataset: https://discover.ukdataservice.ac.uk/Catalogue/?sn=6891&type=Data%20catalogue)

Inferential statistics: What is it? Inferential statistics: Inferential statistics are techniques that allow us to use sample data to make generalizations about the populations from which the samples were drawn. Descriptive statistics: Analysis that helps describe, show or summarise data. Descriptive statistics do not allow us to make conclusions beyond the data.

Inferential statistics: What is it? Identify and extract underlying rules/regularities of relationships between different factors/ variables

From descriptive to inferential statistics From description of statistical “reality” to the generalization of underlying “rules”. Example: age and worry of being victim of personal crime (N = 8,434) Variable Mean (Median) Range SD Skewness Kurtosis Age 50.42 (49) 16 to 101 18.54 .11 2.10 Level of worry about being victim of personal crime .46 (–.12) –1.70 to 2.85 1.00 .83 3.54

From descriptive to inferential statistics Descriptive statistics (continued)

From descriptive to inferential statistics Descriptive statistics (continued) Each dot represents an observed data point Bivariate correlation: –.066

From descriptive to inferential statistics Descriptive statistics (continued) Summarised statistics by cohort group

From descriptive to inferential statistics Inferential statistical modelling (Ordinary least squares linear regression—to be covered in Session 4)

From descriptive to inferential statistics Inferential statistical modelling (Predictive margin with 95% confidence intervals) From descriptive observation to generalisable equation Worry=–.006*Age + .346

From descriptive to inferential statistics What if we also need to take account of multiple factors: gender, ethnicity, experience of crime in past 12 months, etc.

Why/when to use statistical modelling? Finding the underlying rules, patterns and regularities, instead of just describing the data. Such rules are transferrable—can be modified and used to predict and test hypothesis. The “fitness” of the rule can be assessed. Unravel complex/multivariate relationships (that cannot be easily summarised with descriptive statistics) Go beyond the dataset/sample to infer about a broader population Going beyond the past and present: predict future trends and (not yet known) outcomes Test hypothesis Assess the relative importance of predictors Assess the relationship between factors in determining the outcome :

Why/when to use statistical modelling? Why do we need multivariate analysis? To avoid omitted variable bias (get the most accurate estimates of the net role of a given variable) To get a more holistic understanding of the outcome of interest :

Workflow of statistical modelling Before Define concepts and measurements/variables—dependent and independent variables Clean and prepare variables During Devise research question/hypothesis, specifying “relationship (equation)” between variables (so usually 2+ variables) Fit models Build models After Diagnose models and statistical assumptions Report and interpret models