Small N - Large N: Some Alternatives Ray Kent University of Stirling Research Methods Festival, Oxford, July 2006.

Slides:



Advertisements
Similar presentations
Agenda of Week V Review of Week IV Inference on MV Mean Vector One population Two populations Multi-populations: MANOVA.
Advertisements

Contingency Table Analysis Mary Whiteside, Ph.D..
What, When and How? Dumitrela Negură BA. Introduced by Charles Ragin in 1987, when stumbling upon the causal inference problems generated by a small sample.
What is Chi-Square? Used to examine differences in the distributions of nominal data A mathematical comparison between expected frequencies and observed.
Significance Testing.  A statistical method that uses sample data to evaluate a hypothesis about a population  1. State a hypothesis  2. Use the hypothesis.
Causal-Comparative Research Designs
CGeMM – University of Louisville Mining gene-gene interactions from microarray data - Coefficient of Determination Marcel Brun – CGeMM - UofL.
Fuzzy Logic Based on a system of non-digital (continuous & fuzzy without crisp boundaries) set theory and rules. Developed by Lotfi Zadeh in 1965 Its advantage.
1 Counting in probability Permutations The number of orderings of different events Combinations The number ways that outcomes can be grouped.
Statistical Methods Chichang Jou Tamkang University.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Statistics 200b. Chapter 5. Chapter 4: inference via likelihood now Chapter 5: applications to particular situations.
Correlation Patterns. Correlation Coefficient A statistical measure of the covariation or association between two variables. Are dollar sales.
Linear Regression and Correlation Analysis
10-2 Correlation A correlation exists between two variables when the values of one are somehow associated with the values of the other in some way. A.
Discriminant Analysis Objective Classify sample objects into two or more groups on the basis of a priori information.
Social Research Methods
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
The Practice of Social Research
Leedy and Ormrod Ch. 11 Gray Ch. 14
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Chapter 8 Introduction to Hypothesis Testing
Chapter 3 The Research Design. Research Design A research design is a plan of action for executing a research project, specifying The theory to be tested.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Ragin’s comparative method. Characteristics comparative method Combinations of conditions are attributed causal value Cases are studied as unique combinations.
Ragin’s comparative method. Characteristics comparative method Combinations of conditions are attributed causal value Cases are studied as unique combinations.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 8: Quantitative.
The Logic of Statistical Analysis Lesson 2 Population APopulation B Sample 1Sample 2 OR.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Measures of Variability Objective: Students should know what a variance and standard deviation are and for what type of data they typically used.
Two Variable Statistics
Bayesian Networks for Data Mining David Heckerman Microsoft Research (Data Mining and Knowledge Discovery 1, (1997))
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Copyright © 2012, SAS Institute Inc. All rights reserved. ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY,
Inferential Statistics Body of statistical computations relevant to making inferences from findings based on sample observations to some larger population.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Analyzing the Results of an Experiment… -not straightforward.. –Why not?
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
 Descriptive Methods ◦ Observation ◦ Survey Research  Experimental Methods ◦ Independent Groups Designs ◦ Repeated Measures Designs ◦ Complex Designs.
AP Statistics Semester One Review Part 2 Chapters 4-6 Semester One Review Part 2 Chapters 4-6.
Chapter 16 Social Statistics. Chapter Outline The Origins of the Elaboration Model The Elaboration Paradigm Elaboration and Ex Post Facto Hypothesizing.
Wei Sun and KC Chang George Mason University March 2008 Convergence Study of Message Passing In Arbitrary Continuous Bayesian.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
2008/9/15fuzzy set theory chap01.ppt1 Introduction to Fuzzy Set Theory.
International Conference on Fuzzy Systems and Knowledge Discovery, p.p ,July 2011.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
5 Questions What is Theory? Why do we have theory? What is the relationship between theory and research? What is the relationship between theory and reality?
DATA ANALYSIS Data analysis helps discover and substantiate patterns and relationships, test our expectations, and draw inferences that make our research.
Test of independence: Contingency Table
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
CHAPTER 5 Handling Uncertainty BIC 3337 EXPERT SYSTEM.
SESRI Workshop on Survey-based Experiments
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis mutually exclusive exhaustive.
Artificial Intelligence and Adaptive Systems
Summarising and presenting data - Univariate analysis continued
Different Scales, Different Measures of Association
You need: Pencil Agenda Scrap Paper AP log Math book Calculator
SESRI Workshop on Survey-based Experiments
Statistical Inference about Regression
Ch11 Curve Fitting II.
Fuzzy Logic Bai Xiao.
MGSE7.SP.3/MGSE7.SP.4: I can use measure of center and measures of variability for numerical data from random samples to draw informal comparative inferences.
Fuzzy Logic Based on a system of non-digital (continuous & fuzzy without crisp boundaries) set theory and rules. Developed by Lotfi Zadeh in 1965 Its advantage.
Inference Concepts 1-Sample Z-Tests.
Structural Equation Modeling
Presentation transcript:

Small N - Large N: Some Alternatives Ray Kent University of Stirling Research Methods Festival, Oxford, July 2006

Limitations of mainstream quantitative methods The focus is on the variableThe focus is on the variable The thinking in linearThe thinking in linear The main pattern sought is covariationThe main pattern sought is covariation

Cramers V =0.96 Traditional analysis expects to see this:

Or this: r = 0.86 (Var X) (Var Y)

Heavy television viewing is a sufficient, but not necessary condition for large expenditure on convenience food Phi (Cramers V) = 0.37 Lambda = 0.0 But we often get this:

Or this: r = 0.3

Further limitations Not good at handling causal or logical relationshipsNot good at handling causal or logical relationships Poor at handling complexityPoor at handling complexity

Some common misuses The use (even reliance) on statistical inference on non-random samples or total populationsThe use (even reliance) on statistical inference on non-random samples or total populations Causal inferences based on establishing covariationCausal inferences based on establishing covariation Poor, vague wording of hypothesesPoor, vague wording of hypotheses

Some alternatives to mainstream statistics Combinatorial logicCombinatorial logic Fuzzy-set analysisFuzzy-set analysis Neural network analysisNeural network analysis Data miningData mining Bayesian methodsBayesian methods Chaos/tipping point theoryChaos/tipping point theory

Combinatorial logic Instead of comparing variable distributions, we see cases as combinations of characteristics

A data matrix on SPSS

X 1 is a necessary, but not sufficient, cause of Y The frequency of 2 k combinations of 3 binary causal variables plus binary outcome

A fuzzy set

X 1 is a necessary, but not sufficient, condition for Y to occur The degree of membership of X 1 sets a ceiling on the degree of membership of Y

X 1 is a sufficient, but not necessary, condition for Y to occur High membership of X 1 acts as a floor for high membership of Y

Some other alternatives Neural network analysisNeural network analysis Data miningData mining Bayesian methodsBayesian methods Chaos/tipping point theoryChaos/tipping point theory