Google Flu Trends Terminology –Influenza = flu –ILI = influenza like illness CDC ILI time series –Weekly –1-2 week publication lag Predicting it using.

Slides:



Advertisements
Similar presentations
Surveillance in a Pandemic: Situational Awareness
Advertisements

Predicting Flu Trends using Twitter Data Harshavardhan Achrekar [1] Avinash Gandhe [ 2 ] Ross Lazarus [3] Ssu-Hsin Yu [2] Benyuan Liu [1] SNEFT – Social.
Predicting Flu Trends using Twitter Data Harshavardhan Achrekar [1] Avinash Gandhe [ 2 ] Ross Lazarus [3] Ssu-Hsin Yu [2] Benyuan Liu [1] Workshop on Cyber-Physical.
Twitter Catches The Flu: Detecting Influenza Epidemics using Twitter Eiji ARAMAKI * Sachiko MASKAWA * Mizuki MORITA ** * The University of Tokyo ** National.
CS 315 – Web Search and Data Mining. Overview The power of crowdsourcing Predicting flu outbreaks Predicting “the present” through Google Insights! Predicting.
Preparing Data for Analysis National Center for Immunization & Respiratory Diseases Influenza Division Nishan Ahmed Regional Training Workshop on Influenza.
Influenza Outbreaks and Cruise Ships Laura Martin 25 April 2002.
GEO SYMPTOM SOLUTIONS Anurag Jain. Method of reach Content Categorization User Categorization based on site usage and declared information Scale for WebMD.
Surveillance of gastroenteritis using drug sales data in France Mathilde Pivette, PharmD, MPH Pr Avner Bar-Hen Dr Pascal Crépey.
U.S. Surveillance Update Anthony Fiore, MD, MPH CAPT, USPHS Influenza Division National Center for Immunizations and Respiratory Disease Centers for Disease.
WHAT IS THE H1N1? Today we are going to speak about the h1n1 (swing) flu. It is a new desease that has appeared in 2009 around the world, and we are going.
George A. Ralls M.D. Dave Freeman Health Services Department September 1st, 2009 INFLUENZA UPDATE.
Forecasting World Wide Pandemics Using Google Flu Data to Forecast the Flu Brian Abe Dan Helling Eric Howard Ting Zheng Laura Braeutigam Noelle Hirneise.
Towards Detecting Influenza Epidemics by Analyzing Twitter Massages Aron Culotta Jedsada Chartree.
TANEY COUNTY HEALTH DEPARTMENT AUGUST 2009 Situation Update: H1N1 Influenza A.
1 INFLUENZA (AND OTHER RESPIRATORY VIRUS) SURVEILLANCE IN WISCONSIN Thomas Haupt M.S. Wisconsin Influenza Surveillance Coordinator.
Swine Flu:/ Leslie Cepeda:). About Virus What is the disease: The swine flu is a new influenza virus causing illness in people. This new virus was first.
12 August 2003 The new QFLU daily analysis service supporting pandemic influenza planning Porter A, Hippisley-Cox J QResearch, Nottingham University Smith.
Influenza A H1N1: A Pandemic in Real Time – What’s Next? Danny Chen, MD FRCPC MSc Infectious Disease Specialist Grand Rounds, York Central Hospital September.
Influenza-like Illness Surveillance at the National Level
Detecting influenza epidemics using search engine query data Jeremy Ginsberg1, Matthew H. Mohebbi1, Rajan S. Patel1, Lynnette Brammer2, Mark S. Smolinski1.
Query Routing in Peer-to-Peer Web Search Engine Speaker: Pavel Serdyukov Supervisors: Gerhard Weikum Christian Zimmer Matthias Bender International Max.
Epidemic and Pandemic Disease Outbreaks. How do we define an Epidemic?  An epidemic is an out break of disease that affects many individuals at the same.
Math in the News Having the flu is no fun! Flu season is here, and this year is a bad one. The season started early and is still going strong. The Centers.
Sore throat? Sniffles?Sore throat? Sniffles?  Google it! Duh!  During flu season, more people enter search queries concerning the flu.  Each year 90.
Influenza Activity for FY IDAHO: Flu season is starting to wind down State Activity Code: Regional Vital Statistics: As of MMWR week 6, 12 influenza-
Mapping Influenza-Like Illness (ILI) Using Physician Claims Data Larry Svenson Public Health Surveillance and Environmental Health Branch Public Health.
Dr. Zhen XU Branch of Respiratory Disease Prevention and Control Division for Disease Control and Emergency Response Chinese Center for Disease Control.
Successful Alerts and Responses: Real-time Monitoring of ED Chief Complaints and Investigation of Anomalies CDC Public Health Preparedness Conference February.
Eurostat Web activity evidence to increase timeliness of official statistics IAOS – 10 October.
Correlation and Prediction Error The amount of prediction error is associated with the strength of the correlation between X and Y.
US Situation Update and CDC International Response H1N1 Pandemic US Situation Update and CDC International Response Peter Nsubuga, MD, MPH On behalf of.
Swine Flu Presenter: Ali Azarashk. Overview Introduction Classification History Transmission Signs and Symptoms Treatment.
The Vermont Department of Health Overview of Pandemic Influenza Regional Pandemic Planning Summits 2006 Guidance Support Prevention Protection.
Introduction for Basic Epidemiological Analysis for Surveillance Data National Center for Immunization & Respiratory Diseases Influenza Division.
Detecting Influenza Outbreaks by Analyzing Twitter Messages By Aron Culotta Jedsada Chartree 02/28/11.
Mr. C’s Joke/Riddle of the Day. The Role Canada is Playing How the World Keeps Diseases from Spreading Pt.2.
Diseases Unit 3. Disease Outbreak  A disease outbreak happens when a disease occurs in greater numbers than expected in a community, region or during.
Eurostat WebDataNet Conference 2015 Salamanca, 26 th – 28 th May 2015 Fernando Reis, Big Data Task-Force European Commission (Eurostat) Web activity evidence.
Internet Search Term Surveillance for Influenza Internet Search Term Surveillance for Influenza Philip M. Polgreen 1, Yiling.
Swine Flu & You! Information Regarding the Possible Approaching Swine Flu Pandemic.
Bringing Together the Social and Technical in Big Data Analytics: Why You Can't Predict the Flu from Twitter, and Here's How David A. Broniatowski Asst.
Center for Computational Analysis of Social and Organizational Systems Dynamic Network Approach to Health Surveillance Prof.
SAN DIEGO COUNTY INFLUENZA SURVEILLANCE October 14, 2015.
Some Final Material. GOOGLE FLU TRENDS Sore throat? Sniffles? Google it! Duh! During flu season, more people enter search queries concerning the flu.
What Is H1N1 (Swine Flu) Pandemic Influenza? Colorized image of H1N1 from a transmission electron micrograph. Source: CDC.
INFLUENZA DIVISION Update on the Epidemiology and Clinical Features of Novel H1N1 Joseph Bresee, MD Chief, Epidemiology and Prevention Branch Influenza.
Hot Topic Meeting by: Royal College of Physicians of Edinburgh & The Scottish Executive Health Department Pandemic Flu Planning Scotland’s Health Response.
Seoul Metropolitan Government Health & Welfare Headquarters Health Policy Division Kui-Nam Kim.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Avian Influenza A (H5N1) “Bird Flu”
Epidemic and Pandemic Disease Outbreaks.
Diseases Unit 3.
C.-S. Lee, J.-H. Lee  Clinical Microbiology and Infection 
Epidemic Alerts EECS E6898: TOPICS – INFORMATION PROCESSING: From Data to Solutions Alexander Loh May 5, 2016.
Michael M. Wagner, MD PhD Professor, Department of Biomedical Informatics, University of Pittsburgh School of Medicine
One Health Early Warning Alert
Influenza-like Illness Surveillance at the National Level
C.-S. Lee, J.-H. Lee  Clinical Microbiology and Infection 
University of Washington, Autumn 2018
Toby L. Merlin, M.D. Deputy Incident Manager CDC H1N1 Response
These slides are excerpted from a presentation titled “I Don’t Need A Flu Shot!” By Bill Rogers, Ball State University.
Diseases Unit 3.
U.S. Influenza Sentinel Provider Surveillance Network
C. Oprica, C.E. Nord  Clinical Microbiology and Infection 
Analyzing social media data to monitor public health trends
The difference between epidemic & pandemic
Geographical and Temporal Correlations in the Incidence of Lyme Disease, RMSF, Ehrlichiosis, and Coccidioidomycosis with Search Data  Vladimir Ratushny,
2015 NINR Big Data in Symptoms Research Boot Camp Overview
Exponential Smoothing
Presentation transcript:

Google Flu Trends Terminology –Influenza = flu –ILI = influenza like illness CDC ILI time series –Weekly –1-2 week publication lag Predicting it using frequency of search queries –Machine learning problem –Logit Regression

Flu a big problem 10s of millions of cases every year, worldwide k deaths every year, worldwide Swine flu pandemic is worse Surveillance –CDC –European Influenza Surveillance Scheme (EISS) –Also, monitoring volume of calls to help-lines, and –Volume of over-the-counter sales

Regression I(t) = fraction of doctor’s visits due to flu Q(t) = fraction of search queries related to flu logit(·) used to map I(t), Q(t) from [0,1] to R –logit(p) = log [p/(1-p)] Regression: logit(I(t)) =  logit(Q(t)) +  +  (t) –  = error –Correlation is a performance measure

Which Queries? Training data: I 1 (t),…,I 9 (t) –9 regions –Some subset of 9/28/2003 to 3/11/2007 for which > 0 Candidate queries –Database of 50m most common search queries in US –Q j (t) = volume fraction of candidate query in region j –Calculate correlations  j = corr( logit(Q j (·)), logit(I j (·)) ) –Z-transform the correlations to make them normally distributed –Average the Z-transformed correlations –Rank them

Which Queries? Q j (t) = sum of volume of top n=45 queries –Out-of-sample validation to choose n Avg. correlation of 0.9 over 9/28/2003 to 3/11/2007 Avg. correlation of 0.97 over 3/18/2007 to 5/11/2008 Query type# among top 45 Influenza complications11 Cold/flu remedy8 General flu symptoms5 Term for influenza4 Specific flu symptoms4 Symptoms of a flu complication4 Antibiotics3 General flu remedies2 Symptoms of a related disease2 Antivirals1 Related disease1

Extensions Geographic granularity –Example: predictions specific to state of Utah Not including query variants, misspellings, etc. Not using a weighted sum of query volumes –Just a plain sum –Queries volumes are very correlated

Discussion Use of early warning –New strain? Extra capacity needed? Public awareness? Searches may not indicate infection –But flu related news –Keeping secret the actual queries in the regression Not a substitute to actual surveillance –Demographics, genotype, …