INF 397C Introduction to Research in Information Studies Fall, Day 2

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

SPSS Review CENTRAL TENDENCY & DISPERSION
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Descriptive Statistics
SOC 3155 SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Data observation and Descriptive Statistics
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Measures of Central Tendency
Today: Central Tendency & Dispersion
Statistics for Linguistics Students Michaelmas 2004 Week 1 Bettina Braun.
Objective To understand measures of central tendency and use them to analyze data.
BIOSTATISTICS II. RECAP ROLE OF BIOSATTISTICS IN PUBLIC HEALTH SOURCES AND FUNCTIONS OF VITAL STATISTICS RATES/ RATIOS/PROPORTIONS TYPES OF DATA CATEGORICAL.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Psychometrics.
R.G. Bias | | Name that tune. Song title? Performer(s)? 1.
Measures of Central Tendency or Measures of Location or Measures of Averages.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Types of data and how to present them 47:269: Research Methods I Dr. Leonard March 31, :269: Research Methods I Dr. Leonard March 31, 2010.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Basic Statistics. Scales of measurement Nominal The one that has names Ordinal Rank ordered Interval Equal differences in the scores Ratio Has a true.
Statistics Recording the results from our studies.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
1 Describing distributions with numbers William P. Wattles Psychology 302.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
R. G. Bias | School of Information | SZB 562BB | Phone: | i 1 INF 397C Introduction to Research in Library and Information.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Central Tendency & Dispersion
R. G. Bias | School of Information | SZB 562BB | Phone: | i 1 LIS Introduction to Research in Library and Information.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
LIS 570 Summarising and presenting data - Univariate analysis.
Descriptive Statistics(Summary and Variability measures)
R. G. Bias | School of Information | UTA | Phone: | i 1 INF 397C Introduction to Research in Information Studies.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 10 Descriptive Statistics Numbers –One tool for collecting data about communication.
Outline Sampling Measurement Descriptive Statistics:
Statistics as a Tool in Scientific Research
Analysis of Quantitative Data
SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Numerical Descriptive Measures
Practice Page Practice Page Positive Skew.
Descriptive Statistics: Overview
Module 6: Descriptive Statistics
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Central Tendency and Variability
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Statistics Review (Appendix A) Bring all three text books Bring index cards Chalk? White-board.
IB Psychology Today’s Agenda: Turn in:
IB Psychology Today’s Agenda: Turn in:
Description of Data (Summary and Variability measures)
Statistical Analysis How do we make sense of the data we collect during a study or an experiment?
Summary descriptive statistics: means and standard deviations:
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Descriptive Statistics
Introduction to Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Descriptive Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Summary descriptive statistics: means and standard deviations:
Numerical Descriptive Measures
Describing distributions with numbers
DESCRIPTIVE STATISTICS BORAM KANG
Descriptive Statistics
Lecture 4 Psyc 300A.
Central Tendency & Variability
Presentation transcript:

INF 397C Introduction to Research in Information Studies Fall, 2005 Day 2

σ = SQRT(Σ(X - µ)2/N) Standard Deviation (Does that give you a headache?)

USA Today has come out with a new survey - apparently, three out of every four people make up 75% of the population. David Letterman

Statistics: The only science that enables different experts using the same figures to draw different conclusions. Evan Esar (1899 - 1995), US humorist

Scales (last week) Nominal Ordinal Interval Ratio Name = Mutually-exclusive Ordered Equal interval + abs. 0 Gender, Yes/No Class rank, ratings Days of wk., temp. Inches, dollars

Critical Skepticism Remember the Rabbit Pie example from last week? The “critical consumer” of statistics asked “what do you mean by ’50/50’”?

Remember . . . I do NOT want you to become cynical. Not all “media bias” (nor bad research) is intentional. Just be sensible, critical, skeptical. As you “consume” statistics, ask some questions . . .

Ask yourself. . . Who says so? (A Zest commercial is unlikely to tell you that Irish Spring is best.) How does he/she know? (That Zest is “the best soap for you.”) What’s missing? (One year, 33% of female grad students at Johns Hopkins married faculty.) Did somebody change the subject? (“Camrys are bigger than Accords.” “Accords are bigger than Camrys.”) Does it make sense? (“Study in NYC: Working woman with family needed $40.13/week for adequate support.”)

What were . . . . . . some claims you all heard this week?

Last week . . . We learned about frequency distributions. I asserted that a frequency distribution, and/or a histogram (a graphical representation of a frequency distribution), was a good way to summarize a collection of data. And I asserted there’s another, even shorter-hand way.

Measures of Central Tendency Mode Most frequent score (or scores – a distribution can have multiple modes) Median “Middle score” 50th percentile Mean - µ (“mu”) “Arithmetic average” ΣX/N

OK, so which do we use? Means allow further arithmetic/statistical manipulation. But . . . It depends on: The type of scale of your data Can’t use means with nominal or ordinal scale data With nominal data, must use mode The distribution of your data Tend to use medians with distributions bounded at one end but not the other (e.g., salary). (Look at our “Number of MLB games” distribution.) The question you want to answer “Most popular score” vs. “middle score” vs. “middle of the see-saw” “Statistics can tell us which measures are technically correct. It cannot tell us which are ‘meaningful’” (Tal, 2001, p. 52).

Scales (which measure of CT?) Nominal (mode) Ordinal (mode, median) Interval (any) Ratio Name = Mutually-exclusive Ordered Equal interval + abs. 0 Gender, Yes/No Class rank, ratings Days of wk., temp. Inches, dollars

Mean – “see saw” (from Tal, 2001)

Have sidled up to SHAPES of distributions Symmetrical Skewed – positive and negative Flat

“Pulling up the mean”

Why . . . . . . isn’t a “measure of central tendency” all we need to characterize a distribution of scores/numbers/data/stuff? “The price for using measures of central tendency is loss of information” (Tal, 2001, p. 49).

Note . . . We started with a bunch of specific scores. We put them in order. We drew their distribution. Now we can report their central tendency. So, we’ve moved AWAY from specifics, to a summary. But with Central Tendency, alone, we’ve ignored the specifics altogether. Note MANY distributions could have a particular central tendency! If we went back to ALL the specifics, we’d be back at square one.

Measures of Dispersion Range Semi-interquartile range Standard deviation σ (sigma)

Range Highest score minus the lowest score. Like the mode . . . Easy to calculate Potentially misleading Doesn’t take EVERY score into account. What we need to do is calculate one number that will capture HOW spread out our numbers are from that measure of Central Tendency. ‘Cause MANY different distributions of scores can have the same central tendency! “Standard Deviation”

Back to our data – MLB games Let’s take just the men in this class xls spreadsheet. Measures of central tendency. Go with mean. So, how much do the actual scores deviate from the mean?

So . . . Add up all the deviations and we should have a feel for how disperse, how spread, how deviant, our distribution is. Let’s calculate the Standard Deviation. As always, start inside the parentheses. Σ(X - µ)

Damn! OK, let’s try it on a smaller set of numbers. X 2 3 5 6

Damn! (cont’d.) OK, let’s try it on a smaller set of numbers. X X - µ 2 -2 3 -1 5 1 6 Σ = 16 Σ = 0 µ = 4 Hmm.

OK . . . . . . so mathematicians at this point do one of two things. Take the absolute value or square ‘em. We square ‘em. Σ(X - µ)2

Standard Deviation (cont’d.) Then take the average of the squared deviations. Σ(X - µ)2/N But this number is so BIG!

OK . . . . . . take the square root (to make up for squaring the deviations earlier). σ = SQRT(Σ(X - µ)2/N) Now this doesn’t give you a headache, right? I said “right”?

Hmmm . . . Mode Range Median ????? Mean Standard Deviation

We need . . . A measure of spread that is NOT sensitive to every little score, just as median is not. SIQR: Semi-interquartile range. (Q3 – Q1)/2

To summarize Mode Range Easy to calculate. May be misleading. Median SIQR Capture the center. Not influenced by extreme scores. Mean (µ) SD (σ) Take every score into account. Allow later manipulations.

Practice Problems

Who wants to guess . . . . . . What I think is the most important sentence in S, Z, & Z (2003), Chapter 2?

p. 19 Penultimate paragraph, first sentence: “If differences in the dependent variable are to be interpreted unambiguously as a result of the different independent variable conditions, proper control techniques must be used.”

http://highered. mcgraw-hill http://highered.mcgraw-hill.com/sites/0072494468/student_view0/statistics_primer.html Click on Statistics Primer.

Homework LOTS of reading. See syllabus. Send a table/graph/chart that you’ve read this past week. Send email by noon, Friday, 9/16/2005. See you next week.