Presentation is loading. Please wait.

Presentation is loading. Please wait.

Correlation and Regression Stats. T-Test Recap T Test is used to compare two categories of data – Ex. Size of finch beaks on Baltra island vs. Isabela.

Similar presentations


Presentation on theme: "Correlation and Regression Stats. T-Test Recap T Test is used to compare two categories of data – Ex. Size of finch beaks on Baltra island vs. Isabela."— Presentation transcript:

1 Correlation and Regression Stats

2 T-Test Recap T Test is used to compare two categories of data – Ex. Size of finch beaks on Baltra island vs. Isabela island – Ex. Heights of boys vs. girls – Ex. (add your own)

3 Regression Stats A regression is used to test for a correlation in data – Ex: Are higher math MAPS scores related to a higher grade in algebra class? – Ex: Are the lengths of women’s boots related to the temperature outside? – Ex: Add your own

4 *correlation means that there is a relationship between the two sets of data, it DOES NOT necessarily mean that one causes the other. – MAP scores don’t cause a higher grade in algebra, and women’s boot lengths don’t control the weather

5 Question: As you go further up in the atmosphere do temperatures become colder? Independent variable: Dependent variable: *In order to test for a correlation there has to be at least 5 different values for the independent variable

6 The best way to visualize a relationship between two sets of data is with a scatter plot and a line of best fit Altitude (km)Temp (°C) 37 104 222 280 32-3 *Add a line of best fit to the chart Is this a linear relationship (correlation)? Is is a positive or negative relationship (correlation)?

7 Create a scatter graph for the data: Open up an Excel spreadsheet Copy and paste the table data into the workbook Highlight the numbers and insert a marked scatter graph Back click on one of the data points to add a trendline click the option tab and add the equation of the line Copy and paste your graph into the next slide

8 How close was your line of best fit compared to Excel’s? Looking at the equation of the line, can you tell me on average how temperature changed for every km rise in in altitude?

9 *Our altitude data seemed to have a negative linear correlation: as altitudes got higher, temperatures dropped. But…. 1.Is there really a significant correlation? 2.If there is a correlation how strong or weak is it?

10 1. Is there a significant correlation? If the p-value is below 0.05 there is a significant correlation between the two data sets. *It means that we have 95% certainty that there is a relationship between the two variables

11 2. How strong is the correlation? The strength of the correlation can be measured as a value between 1 and -1 (called an R – value) 1 is a 100% (very strong) positive correlation -1 is a 100% (very strong) negative correlation 0 is a 0% correlation (no correlation or no relationship between your variables)

12 Look back at your altitude/temperature graph… with your partner estimate a correlation strength and write it here: R value = *Was it positive or negative? Why? *Was it close to zero or close to 1 (or -1)? Why?

13 Time to test! Download the excel sheet called ‘Regression Test’ Open the workbook and select the tab called “input” Input in your values from the boot/temp table Click the tab that says “output” Although these tables have a lot of information in them, we will only be looking at several numbers from the first two tables…. NOTE: pay attention to how excel is displaying decimals (period or comma) and ensure you keep the same format when entering in data

14 Standard deviation of the average – how much the data points vary from the average P-value: the probability of no correlation again this needs to be below 0.05 to indicate a significant correlation Average change in Y value for every unit of the X value (is the same number as the slope in the equation of the line) … For our example, how much the temperature changes as altitude increase by 1 km. R - The correlation strength: the “puffiness” of the scatterplot. A strong correlation has a value close to 1 or -1. A weak correlation has a value close to 0 R 2 Value: What you report in your formal conclusion. (Literally the R value squared, to get rid of the negative for clarity).

15 Write the p-value here: Is there a significant correlation between altitude and temperature? How do you know? Write the r-value here: Is there a strong correlation between altitude and temperature? How do you know?

16 Time to Conclude! Formal results makes a clear and direct statement about the outcome your statistical test while reporting key statistics: “The temperature changed an average of -0.3±0.04 °c per kilometer rise in altitude. This marks a significant temperature decrease as altitude increases in this study (p=0.0052, n=5) There is a strong, negative correlation between altitude and temperature (r 2 = 0.9477)

17 Note: Multiple Trials AltitudeTemp 37 36 37 104 4 3 221 2 When you have multiple trials, your data table should still be in two columns for excel to present and process the data correctly.


Download ppt "Correlation and Regression Stats. T-Test Recap T Test is used to compare two categories of data – Ex. Size of finch beaks on Baltra island vs. Isabela."

Similar presentations


Ads by Google