Predictive Semantic Social Media Analysis David A. Ostrowski System Analytics and Environmental Sciences Research and Advanced Engineering Ford Motor Company.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Advertisements

Multi-label Relational Neighbor Classification using Social Context Features Xi Wang and Gita Sukthankar Department of EECS University of Central Florida.
Conceptual Graph Analysis Chapter 20 Lori Nuth | EDIT 730 | Fall 2005.
An Overview of Machine Learning
Social Media Mining Chapter 5 1 Chapter 5, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010.
Modeling Relationship Strength in Online Social Networks Rongjian Xiang 1, Jennifer Neville 1, Monica Rogati 2 1 Purdue University, 2 LinkedIn WWW 2010.
CIS 678 Artificial Intelligence problems deduction, reasoning knowledge representation planning learning natural language processing motion and manipulation.
CS347 Review Slides (IR Part II) June 6, 2001 ©Prabhakar Raghavan.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Lesson 8: Machine Learning (and the Legionella as a case study) Biological Sequences Analysis, MTA.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Chapter 5 Data mining : A Closer Look.
#1 Google #2 Facebook #3 Youtube #7 Ebay #8 Twitter #9 Craigslist.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
SPAM DETECTION USING MACHINE LEARNING Lydia Song, Lauren Steimle, Xiaoxiao Xu.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
B.Ramamurthy. Data Analytics (Data Science) EDA Data Intuition/ understand ing Big-data analytics StatsAlgs Discoveries / intelligence Statistical Inference.
Social Network Analysis via Factor Graph Model
Data Mining Chun-Hung Chou
From Devices to People: Attribution of Search Activity in Multi-User Settings Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz Microsoft Research,
MACHINE LEARNING 張銘軒 譚恆力 1. OUTLINE OVERVIEW HOW DOSE THE MACHINE “ LEARN ” ? ADVANTAGE OF MACHINE LEARNING ALGORITHM TYPES  SUPERVISED.
Text mining.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Transfer Learning with Applications to Text Classification Jing Peng Computer Science Department.
Text mining. The Standard Data Mining process Text Mining Machine learning on text data Text Data mining Text analysis Part of Web mining Typical tasks.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Prediction of Influencers from Word Use Chan Shing Hei.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Advanced Analytics on Hadoop Spring 2014 WPI, Mohamed Eltabakh 1.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
V. Clustering 인공지능 연구실 이승희 Text: Text mining Page:82-93.
Predicting Voice Elicited Emotions
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
Mining information from social media
ECE 471/571 - Lecture 19 Review 11/12/15. A Roadmap 2 Pattern Classification Statistical ApproachNon-Statistical Approach SupervisedUnsupervised Basic.
11 A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 1, Michael R. Lyu 1, Irwin King 1,2 1 The Chinese.
A RESEARCH SUPPORT SYSTEM FRAMEWORK FOR WEB DATA MINING Jin Xu, Yingping Huang, Gregory Madey Department of Computer Science and Engineering University.
Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign.
Don’t Follow me : Spam Detection in Twitter January 12, 2011 In-seok An SNU Internet Database Lab. Alex Hai Wang The Pensylvania State University International.
Scalable Learning of Collective Behavior Based on Sparse Social Dimensions Lei Tang, Huan Liu CIKM ’ 09 Speaker: Hsin-Lan, Wang Date: 2010/02/01.
Machine Learning in Practice Lecture 21 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Understanding unstructured texts via Latent Dirichlet Allocation Raphael Cohen DSaaS, EMC IT June 2015.
Data Mining and Text Mining. The Standard Data Mining process.
High resolution product by SVM. L’Aquila experience and prospects for the validation site R. Anniballe DIET- Sapienza University of Rome.
Experience Report: System Log Analysis for Anomaly Detection
PREDICT 422: Practical Machine Learning
Semi-Supervised Clustering
Machine Learning overview Chapter 18, 21
DATA MINING © Prentice Hall.
Data Mining 101 with Scikit-Learn
MACHINE LEARNING.
Introduction to Azure Machine Learning Studio
Machine Learning Week 1.
Intro to Machine Learning
Vanquishing the Measurement Dragon
INTRODUCTION.
3.1.1 Introduction to Machine Learning
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
Intro to Machine Learning
Motivation Semantic Transformation Module Most of the existing works neglect the semantic relationship between the visual feature and linguistic knowledge,
Presentation transcript:

Predictive Semantic Social Media Analysis David A. Ostrowski System Analytics and Environmental Sciences Research and Advanced Engineering Ford Motor Company

Social media Influential Sample of the web –News driven CRM –Real-time –Less biased Unique opportunities for analytics

Opportunities Old Model –Reactionary Damage control Inquiries Confirm positive reaction New Model –Preemptive Focused engagement –Promotions –Events –Media Anticipatory

Social Dimensions Describes affiliations across a network Values / Community Reinforced by relationships Utilize to predict purchase behavior

Relational Learning ‘Birds of a Feather’ Leverage each local network to semantic understanding Relational Learning =>Social dimensions

Framework Overview Relational learning –Strengthen representation –Support knowledge Unsupervised classification –Generation of dimensions Supervised classification –Dimensions => behavior

Framework Overview Local network taxonomy labels Social Dimension RN classification K-means cluster features Supv. classification behaviors features Higher level features

Case Study One 4000 facebook identifiers Associations to two vehicle lines Question: –What can we extract to characterize between these two purchase behaviors

Relational Learning Step Extracted data from FB Consolidated interests Applied the RN algorithm Guided by taxonomy

Preliminary cluster statistics normalized differences between vehicle lines

Extracted social dimensions Applied feature sets to k-means (3-6) Each classification attempt to characterize between vehicle line and a social dimension (value / interest..) All classification to be considered towards behavioral training Also considered community detection –Via maximization of a modularity matrix via leading eigenvectors

Applied Supervised Classification for the Behavior prediction Applied sets through three Machine Learning algorithm Simple Bayes precision.7, recall.69 Weightily Averaged One-dependence Estimators (WAODE) precision.69 recall.70 J48 precision.69 recall.70

Case Study Facebook IDs across four vehicle lines Relational modeling –Similar performance as first case study Social Dimensions generated for k=(3-7) –Not as much separation after k=6 clustering Precision recall (among simple bayes, WAODE, J48).469, , ,.536

Next Steps Institutionalization –Extract / define exactly what our dimensions are explaining in our data sets. Relate to specific association –Values –community

Q/A See me for friends and neighbors discount….

Appendix (software) ‘R’ igraph ‘R’ km module Weka Ruby -Watir