Analyzing and Visualizing Disaster Phases from Social Media Streams

Slides:



Advertisements
Similar presentations
HOW DISASTER ORGANIZATIONS FIT TOGETHER IN MY COMMUNITY JUNE 13, 2013 Connecting Disaster Resources with Disaster Needs.
Advertisements

Sequential Minimal Optimization Advanced Machine Learning Course 2012 Fall Semester Tsinghua University.
Large-Scale Entity-Based Online Social Network Profile Linkage.
Distant Supervision for Emotion Classification in Twitter posts 1/17.
Utilizing Social Media to Understand Human Interaction with Extreme Media Events - The Superstorm Sandy Beta Test Arthur G. Cosby Somya D. Mohanty National.
Hurricane Isaac X X X ◘ Isaac began as a tropical wave on August 16 th off the coast of Africa & was classified as a tropical storm on August 21 st ◘
CS771 Machine Learning : Tools, Techniques & Application Gaurav Krishna Y Harshit Maheshwari Pulkit Jain Sayantan Marik
On feature distributional clustering for text categorization Bekkerman, El-Yaniv, Tishby and Winter The Technion. June, 27, 2001.
Mapping Between Taxonomies Elena Eneva 11 Dec 2001 Advanced IR Seminar.
CS 5604 Spring 2015 Classification Xuewen Cui Rongrong Tao Ruide Zhang May 5th, 2015.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
Text Classification using SVM- light DSSI 2008 Jing Jiang.
Introducing the Hurricane Preparedness and Recovery Web Portal - October 8, Presented by Charles R. McClure, PhD Director, FSU Information Institute.
Qatar Content Classification Presenter Mohamed Handosa VT, CS6604 May 6, 2014 Client Tarek Kanan 1.
Extracting a Keyword Network of Flood Disaster Measures Motoki Miura, Mitsuhiro Tokuda, and Daiki Kuwahara Department of Civil and Architectural Engineering,
AUTOMATED TEXT CATEGORIZATION: THE TWO-DIMENSIONAL PROBABILITY MODE Abdulaziz alsharikh.
Medical Data Classifier undergraduate project By: Avikam Agur and Maayan Zehavi Advisors: Prof. Michael Elhadad and Mr. Tal Baumel.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
Project Final Presentation – Dec. 6, 2012 CS 5604 : Information Storage and Retrieval Instructor: Prof. Edward Fox GTA : Tarek Kanan ProjArabic Team Ahmed.
The Community Eye of the Hurricane: Gulf Coast Public Libraries’ Experiences and Needs Related to Hurricane Response Karen Vargas, Michelle Malizia and.
A National Hazards Information Strategy (NHIS) Helen M. Wood Director, Office of Satellite Data Processing & Distribution “A coordinated approach for using.
Sentiment Analysis with Incremental Human-in-the-Loop Learning and Lexical Resource Customization Shubhanshu Mishra 1, Jana Diesner 1, Jason Byrne 2, Elizabeth.
ProjFocusedCrawler CS5604 Information Storage and Retrieval, Fall 2012 Virginia Tech December 4, 2012 Mohamed M. G. Farag Mohammed Saquib Khan Prasad Krishnamurthi.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
CSE 534 Final Project Internet Outage Analysis Name: Guanyu Zhu, Wei-Ting Lin, Zhaowei Sun Professor: Phillipa Gill.
Text Document Categorization by Term Association Maria-luiza Antonie Osmar R. Zaiane University of Alberta, Canada 2002 IEEE International Conference on.
Nuhi BESIMI, Adrian BESIMI, Visar SHEHU
Question Classification using Support Vector Machine Dell Zhang National University of Singapore Wee Sun Lee National University of Singapore SIGIR2003.
Competition II: Springleaf Sha Li (Team leader) Xiaoyan Chong, Minglu Ma, Yue Wang CAMCOS Fall 2015 San Jose State University.
Musical Genre Categorization Using Support Vector Machines Shu Wang.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
CTRnet Digital Library for Disaster Information Services Seungwon Yang 1, Andrea Kavanaugh 1, Nádia P. Kozievitch 4, Lin Tzy Li 1,4,5, Venkat Srinivasan.
Ping-Tsun Chang Intelligent Systems Laboratory NTU/CSIE Using Support Vector Machine for Integrating Catalogs.
Linked Data Profiling Andrejs Abele UNLP PhD Day Supervisor: Paul Buitelaar.
Information Storage and Retrieval(CS 5604) Collaborative Filtering 4/28/2016 Tianyi Li, Pranav Nakate, Ziqian Song Department of Computer Science Blacksburg,
Big Data Processing of School Shooting Archives
Detecting Web Attacks Using Multi-Stage Log Analysis
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
A Simple Approach for Author Profiling in MapReduce
Classify A to Z Problem Statement Technical Approach Results Dataset
SAFE 101 NSC Chapter 18.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Using Social Media to Enhance Emergency Situation Awareness
Sentiment Analysis of Twitter Data(using HadoopMapreduce)
Recognition of bumblebee species by their buzzing sound
Disaster Preparation and Apps
University of Rochester
Efficient Image Classification on Vertically Decomposed Data
ArcGIS for Emergency Management– An Overview
Juweek Adolphe Zhaoyu Li Ressi Miranda Dr. Shang
CS6604 Project Ensemble Classification
Natural Language Processing of Knee MRI Reports
Text Classification CS5604 Information Retrieval and Storage – Spring 2016 Virginia Polytechnic Institute and State University Blacksburg, VA Professor:
Classifying enterprises by economic activity
Efficient Image Classification on Vertically Decomposed Data
Tracking FEMA Kevin Kays, Emily Maier, Tyler Leskanic, Seth Cannon
iSRD Spam Review Detection with Imbalanced Data Distributions
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Classification Breakdown
MTBI Personality Predictor using ML
CMPT 733, SPRING 2017 Jiannan Wang
Elena Mikhalkova, Nadezhda Ganzherli, Yuri Karyakin, Dmitriy Grigoryev
Kanchana Ihalagedara Rajitha Kithuldeniya Supun weerasekara
NAÏVE BAYES CLASSIFICATION
Practice Project Overview
Extracting Why Text Segment from Web Based on Grammar-gram
Wil Collins, Will Dickerson Client: Mohamed Magdy and CTRnet
Credit Card Fraudulent Transaction Detection
Austin Karingada, Jacob Handy, Adviser : Dr
Presentation transcript:

Analyzing and Visualizing Disaster Phases from Social Media Streams Group VizDisasters: Liangzhe Chen, Xaio Lin, Andrew Wood Client: Seungwon Yang Information Storage & Retrieval Final Presentation 12/4/2012 Virginia Tech

Motivation CTRnet: archiving disaster-related online data in collaboration with the Internet Archive Tweets during disasters: quick alternative to cell phones Large dataset to pull from for researchers & responders

Four Phases of Emergency Management Response Recovery Mitigation Preparedness Professional and personal activities

Four Phases in Tweets Reporting situation / sharing information Majority For hurricane: rain, flood, wind, cloud, weather forecast Photographs (Instagram) Reporting personal activities Very few 11/22/2018 ProjVisDisaster

Four Phases in Tweets Reporting professional activities Response More than 4,700 people in as many as 80 shelters in 7 states overnight; more than 3,000 #RedCross workers (37 from KC region) at #Isaac Recovery FEMA announces that federal aid has been made available for the state of Louisiana. #Isaac Mitigation FEMA mitigations advisers to offer rebuilding tips in St. Bernard and Ascension Parishes. http://t.co/ZziRGOGw #Isaac Preparedness Very cool app! MT @redcross: Our hurricane app has info on #RedCross shelters, a toolkit w flashlight, alarm http://t.co/E7o1rtJK #Isaac 11/22/2018 ProjVisDisaster

Our Approach 11/22/2018 ProjVisDisaster

Our Approach Machine learning Visualization Use case / Demo Extract professional activities Classify professional activities into four phases Visualization Phase view, tweet view, social network view, map view Use case / Demo 11/22/2018 ProjVisDisaster

Learning Professional Activities in Four Phases 11/22/2018 ProjVisDisaster

Learning Professional Activities in Four Phases Preprocessing Building dataset Vectorization Classification Algorithms Evaluation 11/22/2018 ProjVisDisaster

Building dataset Focus on tweets about professional activities Based on keywords of known organizations FEMA Red Cross (RedCross) Salvation Army (SalvationArmy) 11/22/2018 ProjVisDisaster

Building dataset Combining tweet and resource title Mitigation specialists are offering free rebuilding tips in five parishes. http://t.co/hwXajm6X #Isaac 11/22/2018 ProjVisDisaster

Building dataset Overview of Issac dataset About 56,000 English tweets during hurricane Issac 5,677 tweets with reference to FEMA, Red Cross or Salvation Army 1,453 without re-tweets 1,121 manually labeled explicitly with one of the four phases, response, recovery, mitigation or preparedness 11/22/2018 ProjVisDisaster

Vectorization tf transform idf transform Normalization Stemming (Porter stemmer) 11/22/2018 ProjVisDisaster

Algorithms Naïve Bayes Naïve Bayes Multinomial Random Forest SVM Multiclass 11/22/2018 ProjVisDisaster

Evaluation Tuned classifier, 10 fold cross-validation Accuracy Weighted F Measure Naïve Bayes 70.47% 0.723 Naïve Bayes Multinomial 77.87% 0.782 Random Forest 76.27% 0.754 SVM Multiclass 80.82% Reported slightly lower than naïve bayes multinomial 11/22/2018 ProjVisDisaster

Evaluation Preprocessing v.s. Accuracy TF IDF Normalization Naïve Bayes Multinomial SVM Multiclass 76% 80.1% X 77% 80.4% 60% 78.8% 78.1% 75% 78% 80.8% 63% 78.9% 79.0% 11/22/2018 ProjVisDisaster

Visualizing Four Phases 11/22/2018 ProjVisDisaster

Visualizing Four Phases Phase view ThemeRiver, D3 library Tweet view JqGrid Library Social Network View Gephi Map View Google Geocoding API 11/22/2018 ProjVisDisaster

Phase view 11/22/2018 ProjVisDisaster

Tweet view 11/22/2018 ProjVisDisaster

Social Network View 11/22/2018 ProjVisDisaster

Map view 11/22/2018 ProjVisDisaster

Use Case & Demo http://spare05.dlib.vt.edu/~ctrvis/phasevis/ 11/22/2018 ProjVisDisaster

Use Case 11/22/2018 ProjVisDisaster

Use Case 11/22/2018 ProjVisDisaster

Summary and Future Work Analysis/classification of disaster tweets into phases Multi-view visualization Future challenges: Automated professional organization extraction Processing of personal tweets Application to other disasters 11/22/2018 ProjVisDisaster

Acknowledgements Haeyong Chung Sunshin Lee 11/22/2018 ProjVisDisaster