Elang 273: Statistics September 15, 2008. Statistics The scientific method is defined by: 1. The research question is empirical 2. The data we collect.

Slides:



Advertisements
Similar presentations
Bivariate Analysis Cross-tabulation and chi-square.
Advertisements

Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Statistical Tests Karen H. Hagglund, M.S.
QUANTITATIVE DATA ANALYSIS
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
Chi-square Test of Independence
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
CHAPTER 2 Basic Descriptive Statistics: Percentages, Ratios and rates, Tables, Charts and Graphs.
Naked mole rats are a burrowing rodent
Data Analysis Statistics. Inferential statistics.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
 Raw data is generated by the process of collecting information  From 20-question survey of 100 people, for example, 2000 ‘bits’ of information are.
DESIGNING, CONDUCTING, ANALYZING & INTERPRETING DESCRIPTIVE RESEARCH CHAPTERS 7 & 11 Kristina Feldner.
Meaning of Measurement and Scaling
Learning Objective Chapter 13 Data Processing, Basic Data Analysis, and Statistical Testing of Differences CHAPTER thirteen Data Processing, Basic Data.
Inferential Statistics
Understanding Research Results
AM Recitation 2/10/11.
Statistical Analysis I have all this data. Now what does it mean?
Fundamentals of Data Analysis. Four Types of Data Alphabetical / Categorical / Nominal data: –Information falls only in certain categories, not in-between.
Statistics Ch.1: Variables & Measurement. Types Statistics: –Descriptive –Inferential Data: Collections of observations –Population –Sample.
Data Presentation.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
1 Inference for Categorical Data William P. Wattles, Ph. D. Francis Marion University.
Statistics Definition Methods of organizing and analyzing quantitative data Types Descriptive statistics –Central tendency, variability, etc. Inferential.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Statistical Analysis I have all this data. Now what does it mean?
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
Chapter Twelve Copyright © 2006 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Review Hints for Final. Descriptive Statistics: Describing a data set.
Chi-square Test of Independence
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Chi-Square Test James A. Pershing, Ph.D. Indiana University.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Reasoning in Psychology Using Statistics Psychology
4 normal probability plots at once par(mfrow=c(2,2)) for(i in 1:4) { qqnorm(dataframe[,1] [dataframe[,2]==i],ylab=“Data quantiles”) title(paste(“yourchoice”,i,sep=“”))}
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Chapter Eight: Using Statistics to Answer Questions.
Elang 273: Statistics. Review: Scientific Method 1. Observe something 2. Speculated why it is so and form hypothesis 3. Test hypothesis by getting data.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
A. Chi-square (Goodness of fit) Question answered: Is the actual distribution of items into categories different from what you could get by chance?
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Cross Tabs and Chi-Squared Testing for a Relationship Between Nominal/Ordinal Variables.
Chapter 13 Understanding research results: statistical inference.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Scatterplots & Correlations Chapter 4. What we are going to cover Explanatory (Independent) and Response (Dependent) variables Displaying relationships.
1.1 ANALYZING CATEGORICAL DATA. FREQUENCY TABLE VS. RELATIVE FREQUENCY TABLE.
Chi Square Chi square is employed to test the difference between an actual sample and another hypothetical or previously established distribution such.
Cross Tabulation with Chi Square
Chapter 12 Understanding Research Results: Description and Correlation
CHAPTER 13 Data Processing, Basic Data Analysis, and the Statistical Testing Of Differences Copyright © 2000 by John Wiley & Sons, Inc.
INF397C Introduction to Research in Information Studies Spring, Day 12
Inferential Statistics
Spearman’s rho Chi-square (χ2)
Inferential Statistics
Inferential statistics,
NURS 790: Methods for Research and Evidence Based Practice
Inference for Categorical Data
15.1 The Role of Statistics in the Research Process
Chapter Nine: Using Statistics to Answer Questions
Research Methods: Data analysis and reporting investigations.
Psych 2 – Statistical Methods for Psychology and Social Science
Presentation transcript:

Elang 273: Statistics September 15, 2008

Statistics The scientific method is defined by: 1. The research question is empirical 2. The data we collect is public 3. The data is falsifiable  Statistics helps most with this  But also with this

Statistics Research question: Is the word glistening used more often in one register (as shown in COCA) than another? SECTIONSPOKENFICTIONMAGAZINENEWSPAPERACADEMIC PER MIL SIZE (MW) FREQ How much different do these frequencies have to be before we can say they are different?

Statistics Researchers have agreed that if the chance that the difference between two groups is greater than a certain percentage, then we will consider the difference to be statistically significant. A significant difference is better than one in twenty of happening by chance (p <.05). The opposite of significance is random chance.

Two types of statistics 1. Descriptive a. nominal (categorical) b. ordinal (rank order) c. continuous 2. Inferential a. chi-square b. t-tests/ANOVA c. correlations d. varbrul

1. Descriptive Statistics These are the types of statistics you are familiar with—showing means, percentages, quartiles, usually through bars, pie charts, and graphs

1. Descriptive Statistics Three types of data 1.Nominal (Categorical): sex, race, national origin, native speaker, how often you choose one thing over another, how often a word occurs in one register versus another 2.Continuous: height, weight, age, scores on a language test, IQ, working memory span 3.Ordinal (Rank Order): No fixed interval (first, second, third place in a race)—what order people choose their favorite dialect

1. Descriptive Statistics How could you depict the data for each of these types? 1.Nominal 2.Continuous 3.Ordinal (rank order)

1. Nominal (Categorical) Answers to “Where is this speaker from?” (native listeners)

1. Nominal (Categorical) correct dialect identification by American English speakers

2. Continuous

Native listeners: status vs. solidarity Status RP Birmingham Network NYC West Yorkshire Alabama Solidarity RP Birmingham Network New York West Yorkshire Alabama

3. Ordinal (Rank Order) Coupland & Bishop, 2007

2. Inferential Statistics a.Chi square b.ANOVA/t-test c.Correlations (rank order correlations) d.Logical regression e.Varbrul

2. Inferential Statistics For each type of statistics we need to know 1.Statistical value (chi value, F statistic, t statistic) 2.Probability value (p value) 3.Degrees of Freedom (df)

2. Inferential Statistics Research question: Is the word glistening used more often in one register (as shown in COCA) than another? SECTIONSPOKENFICTIONMAGAZINENEWSPAPERACADEMIC PER MIL SIZE (MW) FREQ

2. Inferential Statistics Research question: Is the word glistening used more often in one register (as shown in COCA) than another? What kind of data is this? Nominal (categorical) For this kind of data we use a chi square

a. Chi-square Tells us whether something happened more often than chance would predict bremen.de/~anatol/qnt/qnt_chi.html Use with multiple choice questions, percentage of time respondents choose specific choice, more corpora or frequency data

a. Chi-square What chi-square statistic answers: Is the distribution into categories random or not? (Uses counts of nominal data) For example, multiple choice questions. Jill loves the taste of coffee. A-c[æ]fi-186 B-c[^]fi-113 C-c[a]fi-70 Is 186, 113, 70 really different from what random choice would give?

a. Chi square To compute chi square, you need to know what is observed (the responses you got from your survey, corpus) and the expected frequencies. To calculate expected frequencies, you add up all the observed frequencies and divide by the number of data points Observed Data point 1Data point 2 Expected

a. Chi-square (Invented) frequency of use of dude in four million word spoken corpora: US NZ AU UK Random distribution would be: Observed (what the actually did) US NZ AU UK Expected (what you would expect by random chance) USNZ AU UK

a. Chi Square ncy_NROW_NCOLUMN_form.html chi-square = 2.77 degrees of freedom = 3 probability = We want this to be large We want this to be small The larger the chi value and the smaller the p value the more likely that the difference between the observed and the expected did not occur by chance

a. Chi square Practice: Is the word glistening used more often in one register (as shown in COCA) than another? SECTIONSPOKENFICTIONMAGAZINENEWSPAPERACADEMIC PER MIL SIZE (MW) FREQ To do this, you need to times each number by 10 and use only whole numbers

a. Chi Square Results: chi-square = 97.2 degrees of freedom = 4 probability = 0.000

a. Chi square More practice 1. Multiple choice question: Jill loves the taste of coffee. A-c[æ]fi-186 B-c[^]fi-113 C-c[a]fi-70 did respondents choose number A more often than the other two choices? 2. Identification: American Listeners choose the following choices when asked “where is this speaker from” (he was from Birmingham UK): London: 45% England: 25% Scotland: 25% Ireland: 5%

Chi-square Homework