EPID 623-88 Introduction to Analysis and Interpretation of HIV/STD Data LECTURE 1: Examining Your Data and Steps in Data Analysis Manya Magnus, Ph.D. Summer.

Slides:



Advertisements
Similar presentations
From study objectives to analysis plan Helen Maguire.
Advertisements

Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Processing and Fundamental Data Analysis CHAPTER fourteen.
Florida Department of Health HIV/AIDS Section Division of Disease Control and Health Protection Annual data trends as of 12/31/2013 Living (Prevalence)
EPID Introduction to Analysis and Interpretation of HIV/STD Data Confounding Manya Magnus, Ph.D. Summer 2001 adapted from M. O’Brien and P. Kissinger.
Bivariate Analysis Cross-tabulation and chi-square.
National Center for HIV/AIDS, Viral Hepatitis, STD & TB Prevention Division of HIV/AIDS Prevention Epidemiology of HIV Infection through 2012.
Late HIV Diagnoses, Georgia,
HIV/AIDS Prevalence & Mortality Report, 2010 Minnesota Department of Health HIV/AIDS Surveillance System Minnesota Department of Health HIV/AIDS Surveillance.
HIV Surveillance – Persons Who Inject Drugs National Center for HIV/AIDS, Viral Hepatitis, STD & TB Prevention Division of HIV/AIDS Prevention.
Data analysis: cross-tabulation GAP Toolkit 5 Training in basic drug abuse data management and analysis Training session 11.
SUBSTANCE ABUSE & HIV DISEASE Janet L. Mitchell, M.D., M.P.H., F.A.C.O.G. Consultant on Women’s Health Addiction, Research & Treatment Corporation Brooklyn,
Data analysis Incorporating slides from IS208 (© Yale Braunstein) to show you how 208 and 214 are telling you many of the the same things; and how to use.
Tables and graphs for frequencies and summary statistics
Blood and Airborne Pathogens HIV/AIDS/STDs/HepatitisTB/SARS/Influenza.
A Simple Guide to Using SPSS© for Windows
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
HIV/AIDS Prevalence & Mortality Report, 2001 Minnesota Department of Health HIV/AIDS Surveillance System Minnesota Department of Health HIV/AIDS Surveillance.
CHAPTER 14, QUANTITATIVE DATA ANALYSIS. Chapter Outline  Quantification of Data  Univariate Analysis  Subgroup Comparisons  Bivariate Analysis  Introduction.
Quantifying Data.
Rates of Adults and Adolescents Living with Diagnosed HIV Infection, Year-end 2012—United States and 6 Dependent Areas N = 931,449Total Rate =
HIV Infection Among Those with an Injection Drug Use*- Associated Risk, Florida, 2012 Florida Department of Health HIV/AIDS and Hepatitis Section Division.
Epidemiology of HIV among Asians & Pacific Islanders Reported in Florida, Through 2012 Florida Department of Health HIV/AIDS and Hepatitis Section Annual.
Epidemiology of HIV Among Asians and Pacific Islanders Reported in Florida, Through 2014 Florida Department of Health HIV/AIDS Section Division of Disease.
HIV Surveillance Report, 2012 Minnesota Department of Health HIV/AIDS Surveillance System Minnesota Department of Health HIV/AIDS Surveillance System.
HIV/AIDS in Michigan January 1, 2010 Michigan Department of Community Health HIV/STD/VH/TB Epidemiology Section Division of Communicable Diseases Bureau.
Centers for Disease Control and Prevention CDC Half-Year of Diagnosis** Number of Cases **Adjusted for reporting delay AIDS Cases in Racial/Ethnic Minorities*
HIV-Infected Women: An Update June 28, 2007 Kathleen McDavid, PhD, MPH HIV Incidence and Case Surveillance Branch Division of HIV/AIDS Prevention, CDC.
EPID Introduction to Analysis and Interpretation of HIV/STD Data Introduction: STDs and HIV—a brief background Manya Magnus, Ph.D. Summer 2001.
Statistics for Decision Making Bivariate Descriptive Statistics QM Fall 2003 Instructor: John Seydel, Ph.D.
Survey Methodology Data interpretation and presentation EPID 626 Lecture 11.
Wisconsin Department of Health Services HIV/AIDS Surveillance Annual Review New diagnoses, prevalent cases, and deaths through December 31, 2013 April.
HIV Care Continuum, District 8- 2 Southwest (Albany), Georgia, 2012.
HIV/AIDS among Women in Texas Enhanced Perinatal Surveillance May 30, 2007 Nita Ngo, MPH.
Epidemiology of HIV Among Men in Florida, Reported through 2013 Florida Department of Health HIV/AIDS Section Division of Disease Control and Health Protection.
1 PHSKC 4/01 Epidemiology of HIV/AIDS Seattle-King County, WA HIV/AIDS Epidemiology Program Public Health - Seattle & King Co. (206) On the web.
Epidemiology of HIV Among Men in Florida, Reported through 2012 Florida Department of Health HIV/AIDS and Hepatitis Section Division of Disease Control.
HIV Care Continuum, Georgia, United States, 2011 Presented to American Public Health Association, Annual Meeting Presented by Deepali Rane, MBBS, MPH,
HIV Infection Among Those with an Injection Drug Use*-Associated Risk, Florida, 2014 Florida Department of Health HIV/AIDS Section Division of Disease.
Florida Department of Health HIV/AIDS and Hepatitis Section Division of Disease Control and Health Protection Annual data trends as of 12/31/2012 Living.
Arnold School of Public Health Health Services, Policy, and Management 1 Drug Treatment Disparities Among African Americans Living with HIV/AIDS Carleen.
Healthy People 2010 Focus Area 13: HIV Progress Review August 30, 2007.
HIV Surveillance Report, 2010 Minnesota Department of Health HIV/AIDS Surveillance System Minnesota Department of Health HIV/AIDS Surveillance System.
HIV Care Continuum New Diagnoses, 2011, Fulton County, Georgia.
HIV Care Continuum, District 10 Northeast (Athens), Georgia, 2012.
Introduction to Quantitative Research Analysis and SPSS SW242 – Session 6 Slides.
Perform Descriptive Statistics Section 6. Descriptive Statistics Descriptive statistics describe the status of variables. How you describe the status.
Aim: How do we analyze data with a two-way table?
HIV Infection among Those with an Injection Drug Use*-Associated Risk, Florida, 2011 HIV/AIDS & Hepatitis Program *Injection Drug Use (IDU) data in this.
AIDS Cases with TB Florida 2013 TB Created: 12/27/13 Revision: 10/16/14 To protect, promote and improve the health of all people in Florida through integrated.
Discovering Mathematics Week 5 BOOK A - Unit 4: Statistical Summaries 1.
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
Epidemiology of HIV Infection Trends Among Adults Living in Rural and Non-Rural Counties in Florida (excl DOC*) Diagnosed through 2014 Created:12/04/14.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
Florida Department of Health HIV/AIDS Section Division of Disease Control and Health Protection Annual data trends as of 12/31/2014 Living (Prevalence)
HIV Infection Among Those with an Injection Drug Use*-Associated Risk, Florida, 2014 Florida Department of Health HIV/AIDS Section Division of Disease.
HIV Surveillance by Race/Ethnicity
Elementary Analysis Richard LeGates URBS 492. Univariate Analysis Distributions –SPSS Command Statistics | Summarize | Frequencies Presents label, total.
Epidemiology of HIV Among Men in Florida, Reported through 2014 Florida Department of Health HIV/AIDS Section Division of Disease Control and Health Protection.
HIV Care Continuum New Diagnoses, 2011, Georgia. Persons with HIV Engaged in Selected Stages of the Continuum of Care, United States Percent
Epidemiology of HIV Among Men in Florida, Reported through 2011 Florida Department of Health HIV/AIDS & Hepatitis Program Annual data as of 12/31/2011.
Viral load distribution 2012 among persons living with HIV and persons newly diagnosed Georgia, 2011.
AIDS Cases with Tuberculosis – Florida, 2014 TB Created: 09/25/15 Revision: 01/26/16 To protect, promote and improve the health of all people in Florida.
HIV Surveillance in Women National Center for HIV/AIDS, Viral Hepatitis, STD & TB Prevention Division of HIV/AIDS Prevention.
Epidemiology of HIV Among Asians and Pacific Islanders Reported in Florida, Through 2012 Florida Department of Health HIV/AIDS and Hepatitis Section Division.
Katarina Grande, HIV Surveillance Coordinator Casey Schumann, HIV Epidemiologist Wisconsin Department of Health Services Statewide Action Planning Group.
Impact of HIV Disease, Among the Caribbean-Born, Florida, 2014 Florida Department of Health HIV/AIDS Section Division of Disease Control and Health Protection.
HIV/AIDS Surveillance System HIV Surveillance Report, 2015 Sexually Transmitted Diseases, HIV and Tuberculosis Section, Epidemiology and Surveillance Unit.
Introduction to SPSS July 28, :00-4:00 pm 112A Stright Hall
An estimated 10,706 AIDS cases diagnosed in 2006 were associated with injection drug use. Approximately 84% of AIDS cases associated with injection drug.
Presentation transcript:

EPID Introduction to Analysis and Interpretation of HIV/STD Data LECTURE 1: Examining Your Data and Steps in Data Analysis Manya Magnus, Ph.D. Summer 2001

Lecture Outline The relationship between data, “answers,” and graphics Examining data –Investigating variables and raw data –Plotting –Frequencies/crosstabs Data analysis –Deciding on a question –Dummy (aka: mock) tables –Data management –Data analysis

Getting from Point A to Point B How do you go from this: PIDDOBGenderGAMOTCounty /1/ /8/ /19/ /5/ /29/

Getting from Point A to Point B To this?

Getting from Point A to Point B Or this: AKLATNFL… PCP … LIP … SBI … Wasting …

Getting from Point A to Point B To this?

{Please note that these data were made up for the purpose of examples only… they are not real. The slides are real, however, and from the following website if you are interested:

Each of the previous slides uses graphic means to communicate data. We are most interested in the data underlying the graphics. The purpose of graphics should be to INFORM and COMMUNICATE effectively and truthfully—not confuse or mislead. Starting with the data themselves is the best way to begin. But having an idea of the question you are asking, or the table or chart you want to present, can be helpful.

What question is each of the following slides answering?

*>1 diagnosis reported for some children Mycobacterium avium infection Condition Lymphoid interstitial pneumonitis Recurrent bacterial infections HIV wasting syndrome Candida esophagitis Number % of Cases* Pneumocystis carinii pneumonia HIV encephalopathy Cytomegalovirus disease Pulmonary candidiasis 4185 Severe herpes simplex infection4225 Cryptosporidiosis AIDS-Defining Conditions Most Commonly Reported for Children <13 Years of Age, N=8,718, Reported through 1999, United States

AIDS in Children <13 Years of Age by Exposure Category Reported in 1999 and Cumulative, United States Exposure Category % Perinatally acquired Transfusion-associated Hemophilia Number Other/not reported Number % , ,718 Total Cumulative

June 1999 Year of Diagnosis Adult/Adolescent AIDS Cases by Exposure Category and Year of Diagnosis, June 1999 United States Other includes cases with other or unreported risk exposure. Data adjusted for reporting delays and risk redistribution MSM & IDU Men who have sex with men (MSM) Injection drug use (IDU) Other Heterosexual contact Percent of Cases

AIDS Cases in Adult/Adolescent Men, Reported July June 1999, and Estimated AIDS incidence,* Diagnosed July 1998 – June 1999 by Risk Exposure United States * Data adjusted for reporting delays and estimated proportional redistribution of cases initially reported without risk. Data reported through March AIDS Incidence Reported July June 1999 Estimated AIDS Incidence* Diagnosed July June 1999 <1% 1% <1% 45% 21% 5% 8% 53% 13% 27% 6% Risk Exposure Injection drug use (IDU) Men who have sex with men Hemophilia Transfusion Heterosexual contact MSM/IDU Other/not identified

36% 62% 28% 31% 40% AIDS Incidence Reported July June 1999 Estimated AIDS Incidence* Diagnosed July June % <1% 1% <1% 1% Injection drug use (IDU) Hemophilia Transfusion Heterosexual contact Other/not identified Risk Exposure * Data adjusted for reporting delays and estimated proportional redistribution of cases initially reported without risk. Data reported through March AIDS Cases in Adult/Adolescent Women, Reported July June 1999, and Estimated AIDS Incidence,* Diagnosed July June 1999, by Risk Exposure United States

Sample Dataset Human development data (adapted from StataQuest text and disk). Comparing average life expectancy in different groups of countries. From UN Human Development Report dataset.

Examining Data What are your data and your variables? –Type—numeric, string, counts –Individual level or aggregate –What do they indicate—labels, questionnaire responses, categories –Which variables are predictors –Which variables are outcomes –Which variables are potential confounders

Examining Data (1) Type—numeric, string, counts First assess what type of variables you are working with and how they are stored in your dataset. For each variable in your dataset, see if it is a continuous variable (e.g., age in years), a categorical variable (e.g., age in categories), or a count. How are the variables stored? Are they stored as string or numeric variables?

Dataset: Humandev.sav

Examining Data (2) Individual level or aggregate data Do the data relate to a person or to a group (e.g., a country)? What is the source of each line of data? Is there more than one line of data per id number?

Examining Data (3) What do they indicate—labels, questionnaire responses, categories Where do the variables come from? Are they from a coding guide? If so, have all codes used available for reference. Are they from a survey or questionnaire? If so, have all codes used as well as questions available for reference.

Examining Data (4) Which variables are predictors, outcomes, confounders? –Variables are variables—meaning is imposed by research question and researcher –[Lecture on confounders upcoming…] –Helpful to decide in advance which are which –Can organize data to reflect these decisions

Examining Data (5) Summarize data first. Mean Median Mode Standard deviation Variance Range, min, max

Examining Data (6) Plotting data and examining visually are very important. Look at raw data. Check normality visually and using statistics (kurtosis—peakedness—3==normal, skewness—symmetry—0==normal, etc.). Transform as needed. Check residuals. Remember assumptions for statistical testing.

Examining Data (7) Frequencies and crosstabs How many in each category? How many for each value? Looking at how many in each of new grouping. Know which is dependent, which is independent. Study design can dictate whether want row or column percentages. Be careful, because interpretation differs.

Steps in data management Make a back up copy first Refer to codebook/card layout Recode missing Make categorical variables from continuous Make summary variables/outcome variables Save your work as you go Confirm that new variables are correctly generated each time; QA

Steps in data analysis (1) Decide what question you are asking Plan the analysis on paper –Determine type of analysis –Identify and define outcome, predictors, confounders and effect modifiers for your analysis –Make dummy tables

Example of dummy table Trich no (n=) Trich yes (n=) OR (95% CI)P-value Gender Male Female Race AA Non-AA

Steps in data analysis (2) Perform Univariate Analysis –Frequencies –Descriptives (means and s.d., medians and ranges) Perform Bivariate Analysis –Do cross-tabulations –Check for confounding Perform multivariate analyses

Helpful Hints Save data file frequently Give *.sav, *.spo, *.sps files the same prefix (eg. NIR.sav, NIR.spo) Keep a notebook of what you did so that you remember what the new variables mean and add new variables to your coding guide. Use labels.