One-way Analysis of Variance (ANOVA) Note: In this version, we’ll use Excel to do all the calculations. If you wanted the tutorial that worked through.

Slides:



Advertisements
Similar presentations
The essentials managers need to know about Excel
Advertisements

Microsoft ® Office Excel ® 2003 Training Enter formulas CSNT, Inc. presents:
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Formulas and Functions. Type Data Into Spreadsheet ◦Open Excel ◦Create Blank Workbook ◦Type data shown as in example ◦ To get 2 lines in a cell ◦Type.
1 1 Slide © 2003 South-Western /Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Word Processing First Steps
Introduction to Excel 2007 Part 2: Bar Graphs and Histograms February 5, 2008.
Math 115a Mathematics for Business Decisions, part I Histograms Math 115a.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 5 Analyzing.
Significance Testing Chapter 13 Victor Katch Kinesiology.
A Tutorial by Aaron Price
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
XP New Perspectives on Microsoft Office Excel 2003, Second Edition- Tutorial 11 1 Microsoft Office Excel 2003 Tutorial 11 – Importing Data Into Excel.
ABC’s of PowerPoint (Office 2007) Part 1: Basic Vocabulary Part 2: Cursors Part 3: Insert Your Text Part 4: Insert Your Pictures Part 5: Basic Tools &
1 Confidence Interval for Population Mean The case when the population standard deviation is unknown (the more common case).
P-value Method 2 means, sigmas unknown. Sodium levels are measured in millimoles per liter (mmol/L) and a score between 136 and 145 is considered normal.
Traditional Method 2 means, σ’s known. The makers of a standardized exam have two versions of the exam: version A and version B. They believe the two.
Introduction to Excel 2007 Part 3: Bar Graphs and Histograms Psych 209.
P-value Method One Mean, sigma known. The average length of a certain insect has been determined to be.52 cm with a standard deviation of.03 cm. A researcher.
Test for Goodness of Fit. The math department at a community college offers 3 classes that satisfy the math requirement for transfer in majors that do.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
Traditional Method 2 means, dependent samples. A data entry office finds itself plagued by inefficiency. In an attempt to improve things the office manager.
Traditional Method 2 proportions. The President of a homeowners’ association believes that pink flamingos on lawns are tacky, or as she puts it, “detrimental.
P-value method 1 mean, σ unknown. A student claims that the average statistics textbook has fewer than 650 pages. In a sample of 52 statistics texts,
P-value method 2 means, both σ’s known. An economist is comparing credit card debt from two recent years. She has gathered the following data: Year 1.
The following slide has the same photo at three resolutions. Each version has been resized in Powerpoint to occupy the entire width of the screen. Remember.
P-value Method 2 proportions. A resident of a small housing complex has a pet monkey who likes to sit out on the porch and smoke cigarettes. Some of the.
P-value method dependent samples. A group of friends wants to compare two energy drinks. They agree to meet on consecutive Saturdays to run a mile. One.
P-value method One Proportion. The mayor of Pleasantville has just signed a contract allowing a biohazards company to build a waste disposal site on what.
Traditional Method One mean, sigma known. The Problem In 2004, the average monthly Social Security benefit for retired workers was $ with a standard.
Traditional Method One Proportion. A researcher claims that the majority of the population supports a proposition raising taxes to help fund education.
Making Tables and Graphs with Excel The Basics. Where do my IV and DV go? Just like you would create a data table on paper, your IV goes in the leftmost.
Traditional method 2 means, σ’s unknown. Scientists studying the effect of diet on cognitive ability are comparing two groups of mice. The first group.
Analyzing Descriptive Data using Excel © Schachman.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.2.
Traditional Method 1 mean, sigma unknown. In a national phone survey conducted in May 2012, adults were asked: Thinking about social issues, would you.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.2.
EDT 608 Unit 3 Using Microsoft Excel Under “Start”, go to “Programs” and select “Microsoft Excel”. Go the next slide to see the Excel Screen template.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Procedure for Hypothesis Testing 1. Establish the null hypothesis, H 0. 2.Establish.
Two Way ANOVA ©2005 Dr. B. C. Paul. ANOVA Application ANOVA allows us to review data and determine whether a particular effect is changing our results.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Excel CREATING A WORKSHEET AND CHART. Personal Budget Worksheet We will create a personal budget worksheet that shows you income each month and your expenses.
Processing Lab 3 – Header issues and trace editing Bryce Hutchinson Objectives: Fixing elevation issues Define an LMO function Pick first breaks Kill traces.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
6/4/2016Slide 1 The one sample t-test compares two values for the population mean of a single variable. The two-sample t-test of population means (aka.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
June 21, Objectives  Enable the Data Analysis Add-In  Quickly calculate descriptive statistics using the Data Analysis Add-In  Create a histogram.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Using the Pythagorean Theorem Sarah Katko ICL 7062.
Welcome to MM570 Psychological Statistics
Learning PowerPoint Presenting your ideas as a slide show… …on the computer!
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
 Columns  Rows  Cells  Ranges  Cell addresses  Column headers  Row headers  Formulas  Spreadsheet.
To view this in “presentation” mode, go to Slide Show  View Show (the toolbar at the top of the page) Use the “Enter” key to advance to the next slide.
Statistical hypothesis Statistical hypothesis is a method for testing a claim or hypothesis about a parameter in a papulation The statement H 0 is called.
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
COMPARISON OF MEANS QUESTION #10c
Microsoft Excel.
Hypothesis Testing for Population Means (s Unknown)
Hypothesis Tests for a Population Mean,
Statistical Analysis Determining the Significance of Data
Lesson Comparing Two Means.
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Spreadsheets.
Hypothesis Tests for Two Population Standard Deviations
Testing Hypotheses about a Population Proportion
Getting started – Example 1
Introduction to Excel 2007 Part 3: Bar Graphs and Histograms
Presentation transcript:

One-way Analysis of Variance (ANOVA) Note: In this version, we’ll use Excel to do all the calculations. If you wanted the tutorial that worked through the calculations using only a calculator, go back to the menu of tutorials to select that one.

Once upon a time…. (that time was Fall 2012) At a community college far, far away… (actually, right here at Saddleback College, but “far, far away” sounds better)

An evil biology professor… That’s me!

…enslaved an entire class in a room full of fancy equipment and made them do work! This was called….

The students were trying to determine whether light conditions had a significant effect on the concentration of chlorophyll in geranium leaves. For their experiment, they had three groups of plants: Acknowledgement: This image was found on Google images. It originated at

The first group was exposed to normal light conditions. Aaah…. I love sunshine!

The second group was kept in the dark for 72 hours. Yikes! I’ve always been afraid of the dark!

The third group was kept in the dark for 72 hours and then exposed to light for 4 hours.

Then the students measured the chlorophyll concentration in the leaves. Click on Dr. Huntley if you’d like to read about how they did this.

LIGHTDARKDARK + 4 hr. Light

Perform an ANOVA test with α =.05 to determine whether there is a significant difference in the mean concentration of chlorophyll between the 3 groups. You may assume that chlorophyll concentration is approximately normally distributed. If you want to work through this problem on your own and just check your work when you’re done, go ahead. Click on the leaf to the right when you’re ready to check your answer.

Set-up Open a new Excel workbook and enter in the data. Your worksheet should look like this.

From the top menu, select “Data.”

When this menu opens up, select “Data Analysis” from the far right.

If “Data Analysis” doesn’t appear here, yell at your computer in frustration! Then install the Analysis Toolpak (you should have directions for doing so in one of your labs.)

The dialogue box to the left will open up. If “Anova: Single Factor” isn’t the first option, move this bar up until you see it.

The dialogue box to the right will open up. Click on “Anova: Single Factor” to highlight it and then click on “OK.”

The above dialogue box will appear. In the “Input Range” box, type in the cells that contain the headings (light, dark, and 4- hour) and the data, starting with the cell in the upper left- hand corner, followed by a colon (:), followed by the cell in the lower right-hand corner.

From the screen shot below, we see that for this example that is A1:C13. A1:C13

From the screen shot below, we see that for this example that is A1:C13. A1:C13 Note that you can also enter the input range by using the red arrow and highlighting the appropriate cells; feel free to use this option if you are familiar with it.

A1:C13 Make sure you’ve checked the bubble that says the data is sorted by columns.

A1:C13 Click on this box to indicate that the first row contains the headings (labels) for our columns; the actual data starts in row 2. When you click on the box, a check mark will appear.

A1:C13 Click on this box to indicate that the first row contains the headings (labels) for our columns; the actual data starts in row 2. When you click on the box, a check mark will appear. This is a neat feature of Excel; when we type in the headings and check this box, we’ll see these headings in the output generated by Excel. If we didn’t do this, the data would be described as “Column 1, Column 2,” and “Column 3” instead of “Light, Dark,” and “4-hour.”

A1:C13 Be sure the correct value of alpha appears in this box. If it doesn’t, click on the box and type in the correct value---for this problem,

A1:C13.05 Under “Output options,” select “Output Range” by clicking on the bubble next to it.

A1:C13.05 Pick an empty cell and enter it here. This is where your results will appear. I’ve chosen to enter E3. Again, if you prefer to use the red arrow and highlight the desired cell, that’s fine. E3

A1:C13.05 E3 Finally, click “OK.”

You’ll see the output boxes below, positioned to start in the cell you selected. Notice that it’s a little hard to read, since not all the information fits in the space allotted for it.

To fix this, click on “Home” at the top of your screen.

When this menu opens up, click on “Format” in the options above “Cells.”

Then select “Autofit Column Width” from the drop-down menu that opens up.

Now we can read everything in the output box.

Now we’re ready to perform our hypothesis test! We’ll work through the usual 6-step process, and refer to the output from Excel whenever we need a calculated value.

Step 1: State the hypotheses and identify the claim (if there is one) The Null hypothesis is always that there is no difference between the means. We can describe this in symbols. Remember to use μ, the symbol for population mean. Notice that I’m using a subscript of “l” for the light group, “d” for the dark group, and “4” for the 4-hour group.

The Alternate hypothesis is that there is a difference in the means. This includes the possibility a)that we have 3 different means, or b)that 2 means are the same and only 1 is different from the others. To allow for both possibilities, we state the Alternate hypothesis in words. Note that this problem was posed as a question so that there is no claim.

Step (*) Draw the F distribution and mark off the area in the right tail. Slow Down! There are 3 things we have to check first!

1. Chlorophyll concentration is approximately normally distributed. We were told to assume this. 2. We have independent samples.We can trust Dr. Huntley to set up a valid experiment. 3. The population variances are equal. This is a tough one, since we don’t know any of the population variances.

We’ll rely on the rule of thumb that it’s ok to proceed if the largest sample standard deviation (not the variance!) is no more than twice the smallest. Excel gives us the sample variances. To calculate the sample standard deviations, we need to take the square roots.

We’ll rely on the rule of thumb that it’s ok to proceed if the largest sample standard deviation (not the variance!) is no more than twice the smallest. LARGEST smallest The largest is less than twice the smallest.

So we can go ahead and draw the F distribution and mark off the area in the right tail..05 This test is always right-tailed (like me!) since if the means are different, that will make the F- value BIG.

Step 2 Mark off the critical value..05 The critical value is the boundary of the right tail; it goes here.

Find the critical value in the Excel output here.

Then add the critical value to our picture …

Step 3 Mark off the test value, F.

Find the test value in the Excel output here.

Then add the test value to our picture … … Since … > 3.284…, it goes to the right.

Step 4: Decide whether or not to reject the Null …33.411… The test value is in the critical region; reject the Null.

Step 5: Answer the question in plain English There is enough evidence to support a claim that light conditions affect chlorophyll concentration. Equivalently, there is enough evidence to reject a claim that light conditions don’t affect chlorophyll concentration.

I’m exhausted! Could we just see a quick summary of all that?

First we entered the data into Excel, selected the “Anova: Single Factor” option, and got the above output. Then we used this output to perform the usual 6 steps of hypothesis testing.

Each click will give you one step; step (*) is broken into two clicks. Step(*) …Step …Step 3 Step 4 Reject the Null. Step 5 There is enough evidence to support a claim that light conditions affect chlorophyll concentration.

And there was much rejoicing.

Press escape to exit the slide show If you keep clicking through the slides, you’ll go to the explanation of the experiment.

Step 1. Cut two 5 mm disks from a geranium leaf. Acknowledgement: The image of the geranium leaf was found on Google images. It originated at davesgarden.com 5 ml 8 % acetone solution

Acknowledgement: This image was found on google images, which credits it with coming from flushgordon.info

Thanks Dr. Huntley!! Please don’t mention it. Especially not to my 3b students; I make them do the conversions themselves.

Click on the picture below to see the data the students collected. Top Secret Lab Results