Course Review Questions will not be all on one topic, i.e. questions may have parts covering more than one area.

Slides:



Advertisements
Similar presentations
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
10-3 Inferences.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chapter Seventeen HYPOTHESIS TESTING
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of scientific research When you know the system: Estimation.
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of a scientific research When you know the system: Estimation.
Today Concepts underlying inferential statistics
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression Analysis
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 11 Simple Regression
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
Chapter 9 Comparing More than Two Means. Review of Simulation-Based Tests  One proportion:  We created a null distribution by flipping a coin, rolling.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Hypothesis Testing Quantitative Methods in HPELS 440:210.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 15 – Analysis of Variance Math 22 Introductory Statistics.
ANOVA: Analysis of Variance.
ANOVA, Regression and Multiple Regression March
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
One-way ANOVA Example Analysis of Variance Hypotheses Model & Assumptions Analysis of Variance Multiple Comparisons Checking Assumptions.
Chapter 13 Understanding research results: statistical inference.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 16 : Summary Marshall University Genomics Core Facility.
Estimating standard error using bootstrap
Advanced Data Analytics
Chapter 13 Simple Linear Regression
Why Model? Make predictions or forecasts where we don’t have data.
T-Tests and ANOVA I Class 15.
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 3: Comparing between groups Lecturer: Mahrita Harahap
Statistical Inference
Step 1: Specify a null hypothesis
Computer aided teaching of statistics: advantages and disadvantages
Probability and Statistics
Comparing Two Means Prof. Andy Field.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
One-way ANALYSIS OF VARIANCE (ANOVA)
Part Four ANALYSIS AND PRESENTATION OF DATA
STA 291 Spring 2010 Lecture 21 Dustin Lueker.
Estimation & Hypothesis Testing for Two Population Parameters
General Linear Model & Classical Inference
Comparing Three or More Means
PCB 3043L - General Ecology Data Analysis.
BIVARIATE REGRESSION AND CORRELATION
One-Way Analysis of Variance: Comparing Several Means
STAT120C: Final Review.
Hypothesis Tests for a Population Mean in Practice
CHAPTER 29: Multiple Regression*
Chapter 11: The ANalysis Of Variance (ANOVA)
I. Statistical Tests: Why do we use them? What do they involve?
CS 594: Empirical Methods in HCC Experimental Research in HCI (Part 2)
Statistical Thinking and Applications
One-Factor Experiments
Making Inferences about Slopes
Analyzing and Interpreting Quantitative Data
Chapter 13: Using Statistics
STA 291 Summer 2008 Lecture 21 Dustin Lueker.
STA 291 Spring 2008 Lecture 21 Dustin Lueker.
Introductory Statistics
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Course Review Questions will not be all on one topic, i.e. questions may have parts covering more than one area.

Overall Could be given an output from R and asked to interpret should be able to spot which type of output it is. Could be given a Set of Data. Do you use a t-test or an ANOVA and why? Pick the test and explain why you chose it. Give your own examples, if possible, otherwise the examples in the notes are fine once you can understand and explain them. There will be no manual calculation questions in the exam. It does say on the paper that you are allowed to use a calculator – extra marks to be given to those who actually manage to incorporate the use of a calculator into the writing of their exam script.

Thoughts when Analysing Data What tests would you use? What are the conclusions you draw? Most interested in the explanation. How would you write a paper etc…

Fisher F-test Equality of Variance. Why look for it? Assumptions behind the test. 2 Assumptions: equal variance and unequal variance. When would you test for equality of variance? Give example.

Multiple Linear Regression What do you understand about it? What does it do? Give a real world example. How would you explain it to a customer? Explain about a model, how one is built etc. Will not be asked to build a model.

Multi Colinearity Multiple linear regression. Looking for correlations between variances. Don’t want high correlations between x values – messes things up in terms of prediction and building a model. Perfect correlation – therefore statistics packages may not be able to build a suitable model.

ANOVA How do you formulate the null and alternative hypotheses? We are not saying that all means are different – we are saying that at least 2 are. Which Groups Differ? Post Hoc Testing - TukeyHSD test. Look at p-values. Why is this test done? ANOVA does not tell which groups differ.

2-Way ANOVA Extension of 1-way ANOVA – addition of a factor, interaction effect. How do you interpret? Draw examples using plots – lines crossing – if lines are parallel there is no interaction. Explore and explain interactions and interaction effects – focus on interactions if significant. Why do a 2-way ANOVA?

Anscombe quartet No specific questions but there may be a sub question. E.g. if you plot the data what if there is high correlation?

Quantile-Quantile (qq) Shown a plot – interpret it. What is a qq plot and what is it used for? Testing for Normality of Distribution. Shapiro-Wilkes test – what is the null hypothesis – different to most null hypotheses we have looked at – looking for a high p-value as opposed to a low p-value etc.

Non Parametric Name non parametric alternatives to usual parametric tests. E.g. ANOVA vs C-Wallace. What are the alternatives used for? E.g. Mann Whitney test – what are the advantages and disadvantages? Assumption of normal distribution. Assumption that population variances are equal. Heterogeneity and homogeneity of variance.

Factor Analysis & Principal Component Analysis What is a latent (hidden) variable? What is a component and a factor? Theories behind factor analysis. Explain factor analysis – what questions relate to each other etc.? What is a scree plot (no. of components, factors)? Eigenvalues. Principal component analysis – regarding dimension analysis. Write the stages for exploratory factor analysis

Sampling Distribution Sampling distribution represents null hypothesis. Calculating statistics from samples. Sampling error.

Confidence Interval What is a confidence interval Explain why the low-high grows the more confident I predict

Effect Size Compare means of two groups over a standard deviation – Cohen’s D. Why is this important? What influence does sample size have? What influence does effect size have?

Power What is it? We usually assume that the null hypothesis is true. What if the alternative hypothesis is true and what does this mean in terms of the likelihood of finding something and significance? Before carrying out statistical work based on effect size and sample size, what is the power? Maybe unable to find what you want or something significant – low potential of succeeding – may need to change sample size etc.

Machine Learning What is machine learning? For each algorithm know its definition and an example of where to use it, with sample R for more marks Linear regression Logistic Regression K-Means Clustering K-Nearest Neighbors (KNN) Classification Naive Bayes Classification Decison Trees Support Vector Machine (SVM) Artifical Neural Network (ANN) Apriori AdaBoost

Time Series A little book of time series in R Trend component Season component Irregular component Additive model Normal distribution Box test