Analyzing Cancer Data 1.What type of data—counts or rates? 2.Before you analyze, look at the data. 3.If appropriate, check the data for linear trends.

Slides:



Advertisements
Similar presentations
Exercise 7.5 (p. 343) Consider the hotel occupancy data in Table 6.4 of Chapter 6 (p. 297)
Advertisements

UNLOCKING THE SECRETS HIDDEN IN YOUR DATA Part 3 Data Analysis.
Introduction to Excel Formulas, Functions and References.
Loading Excel Double click the Excel icon on the desktop (if you have this) OR Click on Start All Programs Microsoft Office Microsoft Office Excel 2003.
©2004, 2006, 2008 UIW Department of Instructional Technology Meat and Potatoes SPSS Presented by Terence Peak.
1 Using Excel for Statistics Office 2010 and 2013.
1 Excel and Regression. 2 3 On the next screen you can see I typed in the data for the schooling / income study. I even put in labels in the first row.
DIRECTIONS FOR ACCESSING THE AT-RISK ALERTS’ REPORT IN MYDATA.
Chapter 10 Comparisons Involving Means Part A Estimation of the Difference between the Means of Two Populations: Independent Samples Hypothesis Tests about.
Building a Histogram in Excel IE 1225 R. Lindeke.
Math 115a Mathematics for Business Decisions, part I Histograms Math 115a.
HOW TO CREATE A HISTOGRAM IN EXCEL. STEP 1: INSTALL ANALYSIS TOOLPAK 1.Click on the Microsoft Office Button (circle button) 2.Click on Excel Options.
Excel Charts – Basic Skills Creating Charts in Excel.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Spreadsheets and Non- Spatial Databases Unit 4: Module 15, Lecture 2- Advanced Microsoft Excel.
Section 6.3 Suppose that X 1, X 2, …, X n are a random sample from a N(  X,  X 2 ) distribution, that Y 1, Y 2, …, Y m are a random sample from a N(
Histograms & Summary Data.  Summarizing large of amounts of data in two ways: Histograms: graphs give a pictorial representation of the data Numerical.
Chapter 10b Hypothesis Tests About the Difference Between the Means of Two Populations: Independent Samples, Small-Sample CaseHypothesis Tests About the.
Chapter 12b Testing for significance—the t-test Developing confidence intervals for estimates of β 1. Testing for significance—the f-test Using Excel’s.
Chapter 12a Simple Linear Regression
Histograms & Summary Data.  Summarizing large of amounts of data in two ways: Histograms: graphs give a pictorial representation of the data Numerical.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
T T20-06 Control Chart (with Runs Tests) Purpose Allows the analyst create and analyze a "Control Chart". A visual analysis of the control time.
Multiple Regression and Correlation Analysis
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
BCOR 1020 Business Statistics Lecture 24 – April 17, 2008.
Spreadsheet Problem Solving
E XCEL P ROJECT T UTORIAL. G ETTING YOUR UNIQUE DATA SET … Go to the stat 216 homepage: and.
Linear Regression with Excel  Brief Tutorial for Forecasting Assignment MIT Airport Systems Planning Course Shiro Yamanaka.
Example 16.3 Estimating Total Cost for Several Products.
Excel For MATH 125 Histograms. If you have Excel 2003…
1 Introduction to Spreadsheets Bent Thomsen. 2 What is an electronic spreadsheet? It is the electronic equivalent of an accounting worksheet, comprised.
Moving data on the worksheet Copying and Pasting Data, Slide 1Copyright © 2004, Jim Schwab, University of Texas at Austin Like other windows applications,
Regression Basics For Business Analysis If you've ever wondered how two or more things relate to each other, or if you've ever had your boss ask you to.
1 Spreadsheet Problem Solving  applied statistics  table lookup.
Technology Basics Creating Worksheet Formulas. 2 Understand Formulas Equations used to calculate values in cells are called formulas. Formulas consist.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
C opyright  2007 by Oxford University Press, Inc. PowerPoint Slides Prepared by Robert F. Brooker, Ph.D.Slide 1 1.
Analyzing Descriptive Data using Excel © Schachman.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Demographic Profiles of Agency Clients - Part 2 Next, we will create a table and a column chart for the conservator field in my database. Because we are.
Project 6 Using The Analysis ToolPak To Analyze Sales Transactions Jason C. H. Chen, Ph.D. Professor of Management Information Systems School of Business.
C opyright  2007 by Oxford University Press, Inc. PowerPoint Slides Prepared by Robert F. Brooker, Ph.D.Slide 1 1.
การใช้ Solver ใน Excel. Solver Part of a suite of commands sometimes called what-if analysis tools what-if analysis Find an optimal value for a Formula.
What is SPSS  SPSS is a program software used for statistical analysis.  Statistical Package for Social Sciences.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
Math & Formula Do math under any tab ■ Click the cell (B32) where put the result mean, and next to f x type: average(B2:B31) ■ (B2:B31) is define the data.
June 21, Objectives  Enable the Data Analysis Add-In  Quickly calculate descriptive statistics using the Data Analysis Add-In  Create a histogram.
11/16/2015Slide 1 We will use a two-sample test of proportions to test whether or not there are group differences in the proportions of cases that have.
1. 2 Word Processing Word Processing is writing words and sentences on the computer. It is easy to change or move text in a word document. People use.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
MEASURE : Measurement System Analysis
Sensitivity Analysis A systematic way of asking “what-if” scenario questions in order to understand what outcomes could possibly occur that would effect.
1 Using Excel for Statistics in Gateway Cases Office 2007 & 2010.
Pasewark & Pasewark Microsoft Office 2003: Introductory 1 INTRODUCTORY MICROSOFT EXCEL Lesson 4 – Worksheet Formulas.
Regression Analysis in Microsoft Excel MS&T Physics 1135 and 2135 Labs.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Ing. Martina Majorová, FEM SUA Statistics Lecture 2 – Introduction to MS Excel 2003.
Installing Analysis Tool Pak
PowerPoint Slides Prepared by Robert F. Brooker, Ph.D. Slide 1
Principles and Worldwide Applications, 7th Edition
Microsoft Office Illustrated
Obtaining Summary Measures with Add-Ins
Creating Scatterplots
Creating Scatterplots
STATISTICS What is statistics? Statistics is the art and science of learning from data. Statistics is made up of research studies and the analysis of.
Installing Analysis Tool Pak
Presentation transcript:

Analyzing Cancer Data 1.What type of data—counts or rates? 2.Before you analyze, look at the data. 3.If appropriate, check the data for linear trends. Do the data increase or decrease over time? 4.Is a given rate significantly different from another rate? (For future consideration.)

1. What type of data (counts or rates)? Which is appropriate for what and why? Counts for resource capacity planning; Rates for comparisons—over time or with other jurisdictions or between subgroups (e.g. race, ethnicity, gender, age).

2. Look at the data. Do you see a notable pattern? Example: Cervical cancer, all women, New Jersey, incidence per 100, Not obvious? Look at a picture.

Cervical Cancer, Females

What type of pattern? Linear? Unimodal? Bimodal? Random (no pattern)?

3.Is there really a significant linear trend in rates? Find out in Excel. Note that the data must be entered in columns, i.e. with the years in one column and the rates in another column, for Excel’s analysis tools to work.

A.Is the Analysis ToolPak installed in Excel? a.From the menu bar, go to Tools | Add-Ins…. b.Is the Analysis ToolPak listed? c.If not, use your original Microsoft Office installation CD-ROMs to do a complete install of Excel, and then come back to this step. Otherwise, verify that the checkbox next to Analysis ToolPak is checked. d.If it is not checked, check it and click Load. You may still need the installation CD-ROMs at this point; if so, follow the directions on the screen to install the Analysis ToolPak.

B.Go to Tools | Data Analysis…. You will get this dialog box: Select Regression and click OK.

C.The Regression dialog box appears: The cells with the rates go here. The cells with the list of years (1996 – 2000) go here. Put the output here (upper left corner cell of output).

D.Excel computes the regression output: Negative means a decreasing trend; positive means an increasing trend. The smaller the number here, the more significant the trend (i.e. the less likely that the observed trend arose from mere chance variations in the data). A typical cutoff is 5% (0.05). This example shows a highly significant trend since this value is less than (Next two slides show enlargements of key statistics.)

Detail of Significance of Regression

Detail of Regression Coefficient This number gives the amount by which the cervical cancer rate among white females in New Jersey changes each year. In other words, for each year from 1996 to 2000, the rate dropped by about half a person per 100,000.

Beware of Extrapolation! Based on our regression, if the cervical cancer rate per 100,000 white females goes down by about 0.5 per year, and it was about 10 per 100,000 in 2000, what would it be in 2020? Zero How about in 2025?Minus five (–5) Moral: Regression estimates are useful only within or very near the range in which the regression was estimated. In this case, the range was 1996 – Extrapolation outside of this range is not likely to give meaningful results.

If you cannot install the Analysis ToolPak, you can use the built-in Excel function LINEST, but it is much less friendly: a.Select a 5 x 2 block of cells, starting at the upper left cell in the box, for instance A7:B11, as shown. b.Type in the formula =LINEST(B1:B5,A1:A5,,TRUE) (assuming the data are in the upper left corner of the worksheet, as shown). Here B1:B5 are the cells with the rates, and A1:A5 are the cells with the years. c.End the formula with Ctrl- Shift-Enter (instead of the usual Enter). d.The output will appear in the 5 x 2 block of cells:

Negative means a decreasing trend; positive means an increasing trend. Unlike Regression, LINEST does not automatically compute significance information. You can get that information from the numbers in cells A10 ( ) and B10 (3). Choose an empty cell—say C10—and type the formula =FDIST(A10,1,B10). The relevant significance will appear in cell C10. (Not shown here.)

4.Is a given rate significantly different from another rate? This is a complex area on which we will give you some guidance at a future time. There are, however, things you can do now that do not involve statistical tests….

If the county rates appear to be unreasonably different from state rates, check carefully for errors in your county’s data, but… bear in mind that if your county has a small relevant population (for instance, there are very few Hispanic males), the rates are more likely to differ widely from state rates because each case has such a large impact. Look at the counts, too! You can use the graphs to see if there are obvious differences between your county’s rate patterns and the statewide rate patterns; for instance, if a cancer rate is increasing for the state but decreasing in your county or vice versa.