Forecasting the Future using Diverse Social Media Sources

Slides:



Advertisements
Similar presentations
Chapter 11 user support. Issues –different types of support at different times –implementation and presentation both important –all need careful design.
Advertisements

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Weather Forecasting and Prediction. Methods of Forecasting  Step 1 - Outside!  Step 2 - Satellite maps  Step 3 - Radar imagery  Step 4 - Surface and.
AOSC 200 Sections 0203 and 0204 TA: Javier Amezcua
1 CS 430 / INFO 430 Information Retrieval Lecture 24 Usability 2.
Chapter 4 Foundations of Decision Making
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Media Relations in a Social Media World By Julie DeBardelaben Deputy Director of Public Affairs CAP National Headquarters.
Next generation library catalogs and the integration of gazetteer information for geographical research Julie Sweetkind-Singer Assistant Director of Geospatial,
Observing Strategy and Observation Targeting for Tropical Cyclones Using Ensemble-Based Sensitivity Analysis and Data Assimilation Chen, Deng-Shun 3 Dec,
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
 Detecting system  Training system Human Emotions Estimation by Adaboost based on Jinhui Chen, Tetsuya Takiguchi, Yasuo Ariki ( Kobe University ) User's.
Prediction of Influencers from Word Use Chan Shing Hei.
Material from Authors of Human Computer Interaction Alan Dix, et al
Automated Solar Cavity Detection
©2001 Southern Illinois University, Edwardsville All rights reserved. CS 321 Human-Computer Interaction Today Fun with Icons Wednesday Project Milestone:
Similarity Access for Networked Media Connectivity Pavel Zezula Masaryk University Brno, Czech Republic.
Network Community Behavior to Infer Human Activities.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
يادگيري ماشين Machine Learning Lecturer: A. Rabiee
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Classification Ensemble Methods 1
Units of Measurement SI units (Systeme Internationale d’Unites) were developed so that scientists could duplicate and communicate their work. Base UnitsDerived.
RESEARCH MOTHODOLOGY SZRZ6014 Dr. Farzana Kabir Ahmad Taqiyah Khadijah Ghazali (814537) SENTIMENT ANALYSIS FOR VOICE OF THE CUSTOMER.
PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang 1,2 Manohar Paluri 1 Marć Aurelio Ranzato 1 Trevor Darrell 2 Lumbomir Boudev 1 1 Facebook.
Chapter 11 user support. Overview Users require different types of support at different times. There are four main types of assistance that users require:
Regional-scale OSSEs used to explore the impact of infrared brightness temperature observations Jason Otkin UW-Madison/CIMSS 06 February 2013.
Recent developments in object detection
Design and Use of Earth Observation Image Content Tools Mihai Datcu(1, 2), Daniele Cerra(1), Houda Chaabouni-Chouayakh(1), Amaia de Miguel(1), Daniela.
Beyond the Words: Predicting User Personality from Heterogeneous Information Presenter: Benyi Gong.
Predicting Azure Consumption using Ensemble Learning
Best Made In Korea Analysis URL: Bestmadeinkorea.com By.
Jingcheng Du, B.S., Jun Xu, Ph.D., Hsingyi Song, MPH, Cui Tao, Ph.D.
Effects of User Similarity in Social Media Ashton Anderson Jure Leskovec Daniel Huttenlocher Jon Kleinberg Stanford University Cornell University Avia.
Chilimbi, et al. (2014) Microsoft Research
Contextual Intelligence as a Driver of Services Innovation
The Problem: Classification
Synoptic Weather.
Summary Presented by : Aishwarya Deep Shukla
Visualizing Spatiotemporal Embeddings Demo
Lexical: Words vs. Characters Syntactic and Stylistic
Deceptive News Prediction Clickbait Score Inference
Deep Residual Learning for Image Recognition
D. Arendt (presenter), S. Volkova, E. Bell
Population Information Integration, Analysis and Modeling
A Tutorial on HOG Human Detection
Quantifying Deception Propagation on Social Networks
Proportion of Original Tweets
Quanzeng You, Jiebo Luo, Hailin Jin and Jianchao Yang
Authors: S. Volkova, J. JanG, Presenter: Maria Glenski
REVEAL Total cost: EUR EU contribution: EUR
Show suggestions and borderlines Hierarchical Clustering
Twitter as a novel source of mobility indicators
CornerNet: Detecting Objects as Paired Keypoints
Making a good thematic map – Extracting or collecting geographic data
Predicting Prevalence of Influenza-Like Illness From Geo-Tagged Tweets
Socialized Word Embeddings
Creative Activity and Research Day (CARD)
John H.L. Hansen & Taufiq Al Babba Hasan
RCNN, Fast-RCNN, Faster-RCNN
Interactive media.
Lecture 29: Face Detection Revisited
Air Masses Create a list of climate zones starting at the top of the globe. Include temperatures, moisture level, and latitudes on Earth. Create a list.
FOCUS PRIOR ESTIMATION FOR SALIENT OBJECT DETECTION
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Jianbo Chen*, Le Song†✦, Martin J. Wainwright*◇ , Michael I. Jordan*
Do Better ImageNet Models Transfer Better?
Presentation transcript:

Forecasting the Future using Diverse Social Media Sources Katherine Porterfield, Dustin Arendt, Nathan Hodas and Svitlana Volkova Data Sciences and Analytics, National Security Directorate NOTE: Print this poster file at 100% SCALE to result in a physical print measuring 48” wide x 36” tall. All type size notations shown above are based on the final printed size of the poster. • Contact Digital Duplicating (375-2969, http://digitalduplicating.pnl.gov) to order poster printing and finishing services for your completed poster design. • Remember to have your poster cleared for public display/distribution through the ERICA Information Release system (http://erica.pnl.gov). • Sidebar “About PNNL” box is considered optional, and can be removed if space is needed for technical content. MOTIVATION Social media has been used to develop large-scale predictive and forecasting analytics – influenza, events, and elections. Current analytics fail to contrast generalizability across different social media signals, models, tasks and metrics. Goal: extensively evaluate the predictive power of social media signals and contrast performance across: predictive models; forecasting tasks, geographic locations, evaluation metrics. FORECASTING VARIABLES WEATHER, TWITTER AND FLICKR DATA Weather data1: Global Summary of the Day (GSOD) Twitter data2: 171M tweets, 25 U.S. and 6 international locations, 25-mile radius from military bases Flickr data3: geo-tags, timestamps and users tags IMAGE FEATURE EXTRACTOR Flickr data was used to extract weather features based off classifying images into four weather categories: cloudy, clear, snow and storm. ResNet4 (He at al., 2016) + ImageNet5 (Deng et al., 2013) FORECASTING MODELS ML: AdaBoost Regressor DL: LSTM models training window size is 12 days, forecasting up to 7 days Text + Image Ensemble Text + Image Merged ImageOnly: 2,048-dim vectors TextOnly Tweet content + affects:  what emotions users express Behavior: how people interact Tweets: what people talk about Tweet style: how people talk WeatherOnly (Y from Y) FORECASTING RESULTS Contrasting model performance across the forecasting tasks and geolocations: #3 Max daily and dew-point temperature easier to predict than wind speed and visibility miles #4 Performance varies across geo locations **Example Task** Weather Forecast Metrics: Pearson, RMSE, MAPE, RMSPE, R2 Evaluate model accuracy, robustness and the ability to forecast upward and downward weather tendency Max Daily Temperature (C) Visibility (Miles) Dew Point Temperature (C) Wind Speed #5 ML models (AdaBoost) outperform DL models (LSTM) when trained on the same data across locations. #6 Locations with more tweets yield better forecasts. #7 Higher variance in weather variable leads to higher error. #8 Social media signals yield comparable results to the upper bound weather models. Contrasting the predictive power of social media signals: #1 Models learned from content and affects significantly outperforms content only, behavior and style models #2 TextOnly model outperformed ImageOnly, ensemble and merged image + text models Model Pearson RMSE R2 Weather .92 3.02 .84 ImageOnly .44 7.57 .01 TextOnly .74 5.16 .54 Merge I + T .68 5.87 .40 Ensample I + T .73 .28 File Name // File Date // PNNL-SA-##### 1GSOD Weather data: https://www7.ncdc.noaa.gov/CDO/cdoselect.cmd?datasetabbv=GSOD&countryabbv=&georegionabbv 2Volkova et. al., Uncovering the Relationships Between Military Community Health and Affects Expressed in Social Media. EPJ Data Science 2017. 4Resnet: He, Kaiming, et al. "Deep residual learning for image recognition. CVPR 2016. 5ImageNet: Deng, Jia, et al. "Imagenet: A large-scale hierarchical image database. CVPR 2009. 3Flickr data: https://www.flickr.com/groups/projectweather/