Lecture 5.

Slides:



Advertisements
Similar presentations
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Advertisements

AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Normal Distribution Links Standard Deviation The Normal Distribution Finding a Probability Standard Normal Distribution Inverse Normal Distribution.
Statistics Recording the results from our studies.
Chapter 2 The Normal Distributions “Statistically thinking will one day be as necessary for efficient citizenship as the ability to read and write.” H.
MAT 1000 Mathematics in Today's World. Last Time 1.Three keys to summarize a collection of data: shape, center, spread. 2.Can measure spread with the.
Psychology’s Statistics. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
§ 5.3 Normal Distributions: Finding Values. Probability and Normal Distributions If a random variable, x, is normally distributed, you can find the probability.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
AP Statistics HW: p.83 #1, 4, 6, 8 Obj: to understand density functions Do Now: Use your calculator to make a histogram for the following data on the height.
The pictures of Statistics.  Central Tendencies -  Mean –  Median –  Mode -  Statistics -
The Normal Distribution Chapter 3. When Exploring Data Always start by plotting your individual variables Look for overall patterns (shape, centre, spread)
Copyright © Cengage Learning. All rights reserved.
Chapter 2: Modeling Distributions of Data
2.2 Normal Distributions
Different Types of Data
Modeling Distributions of Data
Copyright © Cengage Learning. All rights reserved.
Chapter 2: Modeling Distributions of Data
Stat 2411 Statistical Methods
Chapter 6 The Normal Curve.
Descriptive Statistics (Part 2)
Descriptive Statistics: Overview
Reasoning in Psychology Using Statistics
Do-Now-Day 2 Section 2.2 Find the mean, median, mode, and IQR from the following set of data values: 60, 64, 69, 73, 76, 122 Mean- Median- Mode- InterQuartile.
Unit 4: A Brief Look at the World of Statistics
Ninth grade students in an English class were surveyed to find out about how many times during the last year they saw a movie in a theater. The results.
Describing Location in a Distribution
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Standard Normal Calculations
Normal Distribution Links Standard Deviation The Normal Distribution
Standard Deviation.
Standard Deviation Calculate the mean Given a Data Set 12, 8, 7, 14, 4
Chapter 5: Describing Distributions Numerically
Chapter 4 – Part 3.
Chapter 2: Modeling Distributions of Data
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2017 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Decision Errors and Power
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Mean Deviation Standard Deviation Variance.
Chapter 2: Modeling Distributions of Data
Pull 2 samples of 5 pennies and record both averages (2 dots).
Do Now In BIG CLEAR numbers, please write your height in inches on the index card.
Chapter 2: Modeling Distributions of Data
Good morning! Please get out your homework for a check.
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Xbar Chart By Farrokh Alemi Ph.D
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Center and Spread IB SL: Statistics Day 2.
Normal Distribution.
Advanced Algebra Unit 1 Vocabulary
Standard Deviation.
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Compare and contrast histograms to bar graphs
Chapter 2: Modeling Distributions of Data
Measures of Dispersion
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Presentation transcript:

Lecture 5

The center of symmetric distributions: the mean Besides the median, there is one more good measurement of the “center” It works especially well if the distribution is symmetric 𝑦= 𝑇𝑜𝑡𝑎𝑙 𝑛 = 𝑖=1 𝑛 𝑦 𝑖 𝑛 - the average of all collected values

Example 75.60946 47.67863 13.4834 68.27895 67.70208 27.63657 10.97026 52.80463 86.75896 35.4131 92.91208 89.27507 50.14048 13.405 49.27678 65.42801 13.67298 Median ≈50.14 The mean is 50.61

Interpretation of mean (or average) “Center of mass”: if we have points with assigned masses, the “center of mass” is

In the same way, the mean is a point where the histogram balances

Mean or median? This is a tough question. For many “scientific” purposes, the mean is a lot better than the median. The notion of “Expectation” (which is a generalized mean) is central in Probability and Statistics

Mean or median vol. 2 However, in some situations median is more “stable”. For example, if we collect ages of students in a class, and typically it’s, say, 18, 19, 20; but then there is a 70 y.o. student.

18,18,18,18,18,19,19,19,19,19,20,20,20, 20,20, 70 With or without the 70, median is = 19. With the 70, mean is = 22.19 Without the 70, mean is = 19 So with the obvious outlier, mean does not represent an “average” student.

The reason behind this situation is not that the distribution is not symmetric. In fact, it is not symmetric in a special way

Draw a picture! It always helps to draw a good picture and look at the histogram. It often clarifies, should we trust mean or median (or both). Sometimes people throw away top and bottom 10% of the data and average the rest.

The spread: the standard deviation This should tell us how far actual values are from the mean, in average Take ( 𝑦 𝑖 − 𝑦) 𝑛 . This is always equal to 0 The reason is: some of the terms are positive, and some are negative. Since the mean perfectly balances things, they add up to 0

To destroy all negative terms, we square them. Staying positive To destroy all negative terms, we square them. 𝑠 2 = 𝑦 𝑖 − 𝑦 2 𝑛 or 𝑠 2 = 𝑦 𝑖 − 𝑦 2 𝑛−1 𝑠 2 is called the “variance”, and 𝑠= 𝑠 2 is called “standard deviation” It is certainly correct to divide by n, but the book suggests to divide by n-1.

Now find (data value – mean): 14-17=-3, 13-17=-4, 20-17=3, 22-17=5, Example 14, 13, 20, 22, 18, 19, 13 First find the mean: (14+13+20+22+18+19+13)/7 = 17 Now find (data value – mean): 14-17=-3, 13-17=-4, 20-17=3, 22-17=5, 18-17=1, 19-17=2, 13-17=-4

14, 13, 20, 22, 18, 19, 13, mean = 17 14-17=-3, 13-17=-4, 20-17=3, 22-17=5, 18-17=1, 19-17=2, 13-17=-4 −3 2 + −4 2 + 3 2 + 5 2 + 1 2 + 2 2 + −4 2 =9+16+9+25+1+4+16= 80 Now divide by 7 (book suggests 7-1=6) 80/7 ≈11.43, 80/6 ≈13.33 – this is the variance. St. dev. = square root ≈3.38 (3.65)

Glance into future: why do we need all that? We will rely on the following fact: if the distribution is “normal”, then most of the data should be between (mean – 3*st.dev.) and (mean+3*st.dev.) That is, if we know that our distribution is nice, then it is more or less enough to know only mean and standard deviation

On professors side, most of the scores (in a single test and total) should fall into mentioned interval In a sense, “curve” means that professor adjusts the scores to make this happen.

Why it’s good to pay attention On the quiz tomorrow you will be allowed to use a calculator. Please bring one!

Understanding and comparing distributions

Magnitudes only South America: 6.7 8.2 7.6 5.1 4.9 7.1 8.3 5.3 6.9 7.6 7.6 Median = 7.1 Mean = 6.84 North America: 5.1 7.2 6.4 7.9 6.9 6.1 6.3 6.0 6.9 Median = 6.4 Mean = 6.53

The distribution for North America is more symmetric; also in South America a typical magnitude is higher