Tasking the Tweeters: Obtaining Actionable Information from Human Sensors Alun Preece, Will Webberley (Cardiff) Dave Braines (IBM UK)

Slides:



Advertisements
Similar presentations
Fuzzy Angle Fuzzy Distance + Angle AG = 90 DG = 1 Annual Conference of ITA ACITA 2009 Exact and Fuzzy Sensor Assignment Hosam Rowaih 1 Matthew P. Johnson.
Advertisements

SWG Strategy (C) Copyright IBM Corp. 2006, All Rights Reserved. P4 Task 2 Fact Extraction using a CNL Current Status David Mott, Dave Braines, ETS,
Baltimore April 2012 International Technology Alliance in Network and Information Sciences Tasking and Sharing Sensing Assets Using Controlled Natural.
International Technology Alliance in Network & Information Sciences Dave Braines, John Ibbotson, Graham White (IBM UK) SPIE Defense Security & Sensing.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
International Technology Alliance In Network & Information Sciences International Technology Alliance In Network & Information Sciences Paul Smart, Ali.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Information Society Technologies Third Call for Proposals Norbert Brinkhoff-Button DG Information Society European Commission Key action III: Multmedia.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
By Lee Betancourt Director of Communications and Public Relations Jane Myers Public Relations, Communications and Social Media Coordinator Social Media.
Sentinel Alun Preece Irena Spasić David Rogers Cardiff School of Computer Science & Informatics.
August 23, 2013 Social Media Audit. Overview  Goals –Evaluate current social networking status –Identify trending topics and social influencers –Provide.
Face Processing in Social Networking Services NTIA Privacy Multistakeholder Process: Commercial Facial Recognition Technology March 25, 2014 Olga Raskin.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
TaRDiS Tackling Radicalisation in Dispersed Societies Knowledge Sharing Briefing Cardiff 18 March 2014 With the financial support of the Prevention of.
I-Room : Integrating Intelligent Agents and Virtual Worlds.
® ® Global Advisory Council (GAC) Outreach overview, Jan 2011 Mark Reichardt, President and CEO Open Geospatial Consortium © 2011 Open Geospatial Consortium.
SEO & SMO Working Plan Copyright © Orion Computech | | - | Skype: - vishal.orion.
Information | Analytics | Expertise SOCIAL MEDIA INTELLIGENCE Practical Strategies for Using Social Media to Enhance Security AUGUST 2014 © 2014 IHS IHS.
ENTERFACE’08 Multimodal high-level data integration Project 2 1.
Consistency of Assessment
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
International Technology Alliance In Network & Information Sciences International Technology Alliance In Network & Information Sciences Paul R. Smart,
The Decision-Making Process IT Brainpower
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Brent Dingle Marco A. Morales Texas A&M University, Spring 2002
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
ONS Big Data Project. Plan for today Introduce the ONS Big Data Project Provide a overview of our work to date Provide information about our future plans.
Personal Transparency and self- analytic tools for online habits Mark Johnson David Sherlock David Griffiths The Institute for Educational Cybernetics.
Semantic Web for E-Science and Education Enrico Motta Knowledge Media Institute The Open University, UK.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Emotional Intelligence and Agents – Survey and Possible Applications Mirjana Ivanovic, Milos Radovanovic, Zoran Budimac, Dejan Mitrovic, Vladimir Kurbalija,
Lean Supply Chain Action Learning Program September 2007.
Information Literacy in the workplace: implications for trainers By Dr. Mark Hepworth Department of Information Science Loughborough University.
9 Closing the Project Teaching Strategies
Blaz Fortuna, Marko Grobelnik, Dunja Mladenic Jozef Stefan Institute ONTOGEN SEMI-AUTOMATIC ONTOLOGY EDITOR.
Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 5)
1Copyright CEIS Preliminary Approach for the PSIP May2004 CEIS, WTC Tour B, 2 rue A Fresnel - METZ.
Presentation Outline (hidden slide) Technical Level: 100 Intended Audience: TDMs, ITPros, ITDMs, BI specialists Objectives (what do you want the audience.
Knowing Your Facebook From Your Flickr Dan O’ Neill – -
© 2007 Tom Beckman Features:  Are autonomous software entities that act as a user’s assistant to perform discrete tasks, simplifying or completely automating.
Potential Value Adding capacity to logistics by utilising Geographical Information Systems Presented by Lorraine Hall Logistics Research Fellow Monash.
© 2008 IBM Corporation ® Atlas for Lotus Connections Unlock the power of your social network! Customer Overview Presentation An IBM Software Services for.
Pete Bohman Adam Kunk. What is real-time search? What do you think as a class?
December 2010iTEC - Designing the future classroom1 אלף כיתות משתתפות במיזם לעיצוב כיתת הלימוד העתידית – iTEC דב וינר iTEC – Designing the future classroom.
PUBLIC RELEASE – DISTRIBUTION UNLIMITED SPIE 2015 The International Technology Alliance in Network and Information Sciences Collaborative human- machine.
Pete Bohman Adam Kunk. Real-Time Search  Definition: A search mechanism capable of finding information in an online fashion as it is produced. Technology.
CLARIN work packages. Conference Place yyyy-mm-dd
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
Management of Digital Content in Business Environments Constantine D. Spyropoulos Director of Institute of Informatics & Telecommunications NCSR “Demokritos”
A Cross-Sensor Evaluation of Three Commercial Iris Cameras for Iris Biometrics Ryan Connaughton and Amanda Sgroi June 20, 2011 CVPR Biometrics Workshop.
Visual Analytics with Linked Open Data and Social Media for e- Governance Vitaveska Lanfranchi Suvodeep Mazumdar Tomi Kauppinen Anna Lisa Gentile Updated.
1 What is OO Design? OO Design is a process of invention, where developers create the abstractions necessary to meet the system’s requirements OO Design.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
ICT TOOLS AND SOCIETY INVOLVEMENT AMONG THE EUPAN NETWORK HIGHLIGHTS FROM THE SURVEY RESULTS TANYA CHETCUTI AND MARCO FICHERA - WORKSHOP EUROPEAN COMMISSION.
Improving the Social Nature of OnLine Learning Tap into what students are already doing Tap into what students are already doing Educause SWRC07 Copyright.
Tasking the Tweeters: Obtaining Actionable Information from Human Sensors Alun Preece, Will Webberley (Cardiff) Dave Braines (IBM UK)
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Marine Metadata Interoperability Acknowledgements Ongoing funding for this project is provided by the National Science Foundation.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
© Nous Infosystems Pvt. Ltd. – Confidential Social Engagement for Banks and Financial Services Leveraging 19 years of expertise in global software services.
Online communication channels Project: Efficient Managers for Efficient Natura2000 Network Created by EUROPARC Federation The production of this presentation.
1 APPROVED FOR PUBLIC RELEASE U.S. Army Research, Development and Engineering Command ARL-CISD Social Network Analysis Team Leader Visual Analytics Consortium,
10 Trends in Online Disaster Outreach. Trends 1) Find your role: Passive or Active? Passive - broadcast and disseminate information Active & Interactive:
1 This project has received funding from the EU’s FP7 Programme under grant agreement no A DEcision Support Tool for Reconstruction and recovery.
CS10K Community Facilitators and Social Learning Team Meeting January 14, 2013 Portland, OR.
Introduction C.Eng 714 Spring 2010.
Data Warehousing and Data Mining
A Network Science Approach to Fake News Detection on Social Media
Presentation transcript:

Tasking the Tweeters: Obtaining Actionable Information from Human Sensors Alun Preece, Will Webberley (Cardiff) Dave Braines (IBM UK)

Introduction

Social Media for Real-time Intel Social Media streams as sources of actionable intelligence for Situation Awareness (SA) Acknowledging a human-based sensor network A good source today: Twitter –Real-time characteristics –Follower-based model –Open APIs Real-world examples from recent events: –Boston marathon bombing (US) –Lee Rigby murder (UK) Some SA platforms emerging: Twitcident, Apollo, ReDites, Sentinel

Mapping Social Media to DCPD Direction - what data to collect from where Collection e.g. Twitter: APIs for streaming, searching sampling (other platforms available too) Post-filtering for noise reduction Processing e.g. probabilistic, NLP, sentiment, event detection Provide semantic enrichment (contextual) for Shared Understanding. Detect trends, clusters, anomalies etc Dissemination Visualisation, alerting, summarisation Further querying Direct further collections A generic social media processing pipeline mapped to DCPD steps

Dynamic ISR asset management  Missions-and-means framework formalised as a collection of ontologies  Tasks characterised by the data needed to achieve them  type of data (visual, IR, radar etc )  “quality” rating 0 to 9  Assets rated by the data they provide MMF framework NIIRS-based approach  Software tool for agile sensor-task assignment  Extensible knowledge-base of sensor-task suitability  Uses existing models and frameworks to map capabilities Sensor Assignment to Missions (SAM) In previous work we have defined a framework for dynamic ISR asset management:

A Pilot Study

July 26 th, 2014: Cardiff protest march Planned protest march in Cardiff, UK Against Israeli incursions into Gaza Potential for public order disruption Approximately 2,000 people Some limited local trouble Evidence of protest and activities found on Twitter –Real-time during the event –In various stages afterwards Source: Wales online –

Social Media Timeline Timeline of the July 26 th 2014 protest and its aftermath Verbal and physical abuse at bars [15:15] March ends [15:40] Tweeting after the march Police mentions increase after broadcast news UK-wide tweets Important: We are observing perception of the event, not the event itself…

Practical details Sentinel Twitter Stream Analysis Geo-tagged tweets Topical search terms Mentions of local places People on the ground Access to live twitter (+ search) Manually identify “key” tweets Some issues Generality of tweets Crowd size estimation: “a few hundred”, “thousands” Very few tweets geo-tagged The Sentinel application

Observations from the pilot Sweet spot for initial relevancy: Search terms + geo- spatial Social Media reflects perception, not reality We are not claiming that this simple study is representative. Key events and activities can be detected: –…but how early can these be found through “small signals”? Some issues with Social Media: –Propagation of misinformation –Detection of bias –Quantification of contextual factors There is the potential to inform action via this kind of situation awareness

Modeling Tweets and Tweeters

Background: CNL for conversation Need an appropriate form for human-machine interaction:  humans prefer natural language (NL) or images  these forms are difficult for machines to process, leading to ambiguity and miscommunication Compromise: controlled natural language (CNL) there is a person named p1 that is known as ‘John Smith’ and is a person of interest. low complexity | no ambiguity ITA Controlled English (CE)

Defining sources and people Sources, e.g. a Twitter account: conceptualise a ~ twitter account ~ A that is an online identity and is a temporal thing and has the value L as ~ location ~ and has the value NT as ~ number of tweets ~ and has the web image PP as ~ profile picture ~ and has the value NT as ~ number of tweets ~ and has the value NFR as ~ number of friends ~ and has the value NFO as ~ number of followers ~. conceptualise a ~ twitter account ~ A that is an online identity and is a temporal thing and has the value L as ~ location ~ and has the value NT as ~ number of tweets ~ and has the web image PP as ~ profile picture ~ and has the value NT as ~ number of tweets ~ and has the value NFR as ~ number of friends ~ and has the value NFO as ~ number of followers ~. there is a journalist named ‘Paul Heaney’ that uses the twitter account ‘paulheaney67’ and works for the media organization ‘bbc’. there is a journalist named ‘Paul Heaney’ that uses the twitter account ‘paulheaney67’ and works for the media organization ‘bbc’. People (and their derivation from a source): …we are actually building profiles of “human sensors”.

Human Sensor profiles The following information is available for inclusion in the human sensor profile: All data from their Twitter profile (including location) Who they frequently interact with Who they talk about Who are their influencers Recently posted media (photos, videos) Terms names from recent tweets Locations from recent tweets –Including travel to/from locations Sentiment analysis for tweets and terms The use of our human friendly CNL means that additional “local knowledge” can easily be added too. e.g. “stance” – to capture some importance contextual detail This is a dynamic social network

Talking to Moira An example Moira query showing some elements of the tweeter model All this information (people, sources, tweets, terms, events etc) is available in a CNL knowledge base. The Moira agent is able to access this and support conversation with human team members…

Tasking Tweeters

Defining ISR tasks From our previous work: conceptualise the task T ~ requires ~ the intelligence capability IC and ~ is looking for ~ the detectable thing DT and ~ operates in ~ the spatial area SA and ~ operates during ~ the time period TP and ~ is ranked with ~ the task priority PR. conceptualise the task T ~ requires ~ the intelligence capability IC and ~ is looking for ~ the detectable thing DT and ~ operates in ~ the spatial area SA and ~ operates during ~ the time period TP and ~ is ranked with ~ the task priority PR. The “action” – what you are trying to achieve What you are trying to do, e.g. “detect”, “localize” From a predefined ISR ontology From a gazetteer or similar To establish temporal bounds For simple resource scheduling

Defining Social Media ISR tasks conceptualise the task T ~ requires ~ the intelligence capability IC and ~ is looking for ~ the detectable thing DT and ~ operates in ~ the spatial area SA and ~ operates during ~ the time period TP and ~ is ranked with ~ the task priority PR. conceptualise the task T ~ requires ~ the intelligence capability IC and ~ is looking for ~ the detectable thing DT and ~ operates in ~ the spatial area SA and ~ operates during ~ the time period TP and ~ is ranked with ~ the task priority PR. Direction: The search terms (topics) are derived from the “detectable” The spatial extent from the “spatial area” Processing: The required “intelligence capability” determines the type of processing: “localization” – derive location data from tweets or tweeter “detection” – use existing event detection algorithms. Collection: Stream-processing of tweets based on “direction” phase. Dissemination: Alerting (or otherwise) via contextual application such as Sentinel, or agent such as Moira.

Identifying “key tweeters” In practice “key tweeters” emerge: –Use spatial terms: they want people to know where they are –Use terms/hashtags: they want their tweets to be found –Social network: who are they and who they connect to From these we can determine: –Whether they are in a “position to know” –Their skills in Twitter usage –Their influence and reach All of this helps build knowledge of trust and information quality

Findings so far Existing ISR task representation can drive Twitter collection Human & machine agents can use this information in many ways The Moira agent helps us to interact with the knowledge base: –Engage the system in a conversation –Assert new local knowledge –Extend the model –Invoke additional functions such as “fact extraction” Use of the “stance” relationship in a conversation with Moira An example of fact extraction from tweet text using Moira

Wrapping up

Related work Conversational interaction: –Bi-directional chains for ISR pipelines –Humans and machines in collaboration Experiments with Human subjects: –Using the Moira interface –Crowd-sourced Situational Understanding –Combine Human input and physical sensors –Handling incomplete and conflicting information –Use of relevancy criteria to minimise resource utility

Some conclusions Streamed insight from Social Media could be incorporated into traditional ISR asset management. This could be streamlined through: –Automatic assignment of assets (for stream processing) –Automatic identification of Social Media collections Lots of issues: –e.g. misinformation and coordinated rumours Awareness improves potential for action: –Early countering strategies, opportunities for community intervention Limitations and opportunities: –We have focused on text-based analysis –Imagery potential: image processing, face detection, object recognition etc

Tasking the Tweeters: Obtaining Actionable Information from Human Sensors SPIE DSS 2015 – Ground/Air Multisensor Interoperability, Integration & Networking for Persistent ISR IV Any questions? Research was sponsored by US Army Research Laboratory and the UK Ministry of Defence and was accomplished under Agreement Number W911NF The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the US Army Research Laboratory, the U.S. Government, the UK Ministry of Defense, or the UK Government. The US and UK Governments are authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation hereon. Development of the Sentinel platform was funded by the European Commission under the project “Tackling Radicalisation in Dispersed Societies (TaRDiS)”, and the ESRC via the project “After Woolwich: Social Re- actions on Social Media” (ES/L008181/1). Cardiff University provided funding for the pilot study examining community impacts of the NATO Summit. We thank Kieran Evans and David Rogers (Cardiff University) for setting up the data collection pipeline for the pilot study in Section 2 and assistance with the data analysis. We thank Darren Shaw (IBM Emerging Technology Services, UK) for creating the tweeter locator service in Section 3. Valuable insights on policing and community reaction to events such as the ones featured in our pilot study were provided by Martin Innes, Colin Roberts and Sarah Tucker (Cardiff Universities Police Science Institute,