Instances of Influenza in the United States Visualized
Dr. Johann Thiel, Parth Patel - New York City College of Technology, CUNY – Fall 2018

Introduction
The Tycho Project collects large data sets related to healthcare, in particular the instance counts and geographic information of diseases. We look at the instance counts and locations of influenza from 1919 to 1951 across the United States. We hope to gain seasonal and geographical insight into the spread of the disease.

Research Questions
Is there seasonal behavior in the historical instance counts of influenza?
Is there any pattern of the disease spreading from one geographical area to another?
Are there any sharp drops or increases in instance counts that could be explained by the introduction of vaccination or other preventative health measures?

Methodology
We use Pandas to parse and analyze the data, and present the results as a Jupyter notebook. Analysis and results are available at https://github.com/parthpatel1001/tycho_influenza/blob/master/influenza_visualization.ipynb. We encountered an interesting data deduplication problem and created helper functions to sanitize the data.

Figure 2 - Fatal and Non-Fatal Instances of Influenza
Figure 3 - Heat Map Distribution of Instances of Influenza by State with Normalization
Figure 4 - Sample of Data Collected

Discussion
We needed to perform a data sanitization procedure to obtain a normalized and de-duplicated dataset (see Figure 4). This was accomplished efficiently using Pandas sorting capabilities and custom Python iterators for validation. Data sanitization is an important component of the analysis process and is critical to ensuring valid and verifiable results. In future analysis, we can use predictive seasonal time series tools such as Facebook’s Prophet to explore the possibility of creating forecast ranges with confidence intervals.
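
The de-duplication step described in the Discussion could look roughly like the sketch below. It is an illustration only, not the code from the notebook: the column names (`state`, `event`, `from_date`, `number`) are hypothetical, and the rule of keeping the largest reported count among duplicate rows is an assumption made for the example. The sort groups candidate duplicates next to each other, and a small generator then validates that no duplicate keys remain.

```python
import pandas as pd


def sanitize(df):
    """Normalize state labels and drop duplicate weekly reports.

    Column names are hypothetical. Keeping the largest count per
    (state, event, from_date) key is an assumed rule for this sketch.
    """
    df = df.copy()
    df["state"] = df["state"].str.strip().str.upper()
    df["from_date"] = pd.to_datetime(df["from_date"])
    # Sorting places duplicate keys side by side, with the largest count last.
    df = df.sort_values(["state", "event", "from_date", "number"])
    df = df.drop_duplicates(subset=["state", "event", "from_date"], keep="last")
    return df.reset_index(drop=True)


def iter_duplicate_keys(df):
    """Yield every key that repeats in consecutive rows of a sorted frame."""
    previous = None
    for key in zip(df["state"], df["event"], df["from_date"]):
        if key == previous:
            yield key
        previous = key


# Made-up rows, used only to exercise the functions above.
raw = pd.DataFrame(
    {
        "state": ["ny", "NY ", "NJ", "NY"],
        "event": ["CASES", "CASES", "CASES", "CASES"],
        "from_date": ["1928-01-01", "1928-01-01", "1928-01-01", "1928-01-08"],
        "number": [10, 12, 5, 7],
    }
)
clean = sanitize(raw)
remaining = list(iter_duplicate_keys(clean))
print(clean)
print("duplicate keys remaining:", remaining)
```

Running the validation generator after the sort is cheap because it makes a single pass over the rows, and an empty result confirms that each state, event, and week combination appears only once.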
Additional research is needed to incorporate the number of flu cases occurring in the US after 1951. The Centers for Disease Control and Prevention (CDC) publishes such numbers periodically, and they should be incorporated into the current project. Vaccine data can also be incorporated, particularly the introduction of vaccination programs in particular geographic locations and their impact on instance counts, both in those areas and in surrounding areas.

Results
The following figures represent some of the graphical results obtained by using various scientific computing modules in Python.

Figure 1 - Fatal Instances of Influenza by State

Conclusion
As expected, the data show a seasonal trend in flu cases (see Figures 1 and 2). Furthermore, the heat map (Figure 3) lets us identify particular years in which there was a national flu epidemic. We can also see that some geographically close states appear to have had localized outbreaks.

References
https://www.tycho.pitt.edu/