Fifth International Workshop on Knowledge Discovery, Knowledge Management and Decision Support – EUREKA
Big Data architecture for Social Media Sentiment Analysis supporting Context Aware Recommendation Systems
Authors: Lic. Giosvany Miranda - Cuba, Dra. Tatiana Delgado - Cuba
Mexico City. April 23, 2015
Introduction
Mark Troester, 2012, SAS, white paper “Big Data Meets Big Data Analytics”
Architecture Approach
An architecture approach with the following layers: Data Source Layer, Ingestion Layer, Infrastructure Layer, Hadoop Storage Layer, Analysis Layer, Visualization Layer.
Architecture Approach: Data Source Layer
Sources: Internal Structured Data, Social Web Data.
Internal Data: Structured Data. External Data: Semi-Structured Data.
Architecture Approach: Ingestion Layer
Identification, Filtration, Validation, Transformation, Integration.
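The five ingestion stages named on this slide can be read as a simple pipeline. The sketch below is only an illustration under stated assumptions: the `Post` record, the `KNOWN_SOURCES` whitelist, and the field names `source`, `entity`, and `text` are hypothetical and do not come from the presentation, which does not say how each stage is implemented.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass
class Post:
    """Hypothetical normalised record produced by the ingestion layer."""
    source: str
    entity: str
    text: str


# Assumption: only these social sources are of interest.
KNOWN_SOURCES = {"twitter", "facebook"}


def identify(raw: dict) -> Optional[dict]:
    """Identification: keep only records that come from a known source."""
    return raw if raw.get("source") in KNOWN_SOURCES else None


def filtrate(raw: dict) -> Optional[dict]:
    """Filtration: drop records whose text is missing or blank."""
    return raw if (raw.get("text") or "").strip() else None


def validate(raw: dict) -> Optional[dict]:
    """Validation: require every field the later stages rely on."""
    return raw if all(k in raw for k in ("source", "entity", "text")) else None


def transform(raw: dict) -> Post:
    """Transformation: normalise casing and whitespace into one record type."""
    return Post(
        source=raw["source"],
        entity=raw["entity"].strip().lower(),
        text=" ".join(raw["text"].split()),
    )


def ingest(records: Iterable[dict]) -> List[Post]:
    """Integration: merge all surviving records into a single collection."""
    integrated = []
    for raw in records:
        for stage in (identify, filtrate, validate):
            raw = stage(raw)
            if raw is None:
                break  # record rejected by this stage
        else:
            integrated.append(transform(raw))
    return integrated
```

In this sketch a record rejected by any stage is silently dropped; a real ingestion layer would more likely log or quarantine such records.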
Architecture Approach: Infrastructure Layer
Virtualization, Cloud Services.
Architecture Approach: Hadoop Storage Layer
NoSQL: Key-Value Pair (Structured), Column-Oriented (Semi-Structured).
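To make the difference between the two NoSQL models on this slide concrete, here is a minimal in-memory sketch. It is not the API of any Hadoop-ecosystem store: `kv_put`, `kv_get`, and `ColumnStore` are hypothetical stand-ins built on plain dictionaries, and the row keys and column names in the usage lines are invented for illustration.

```python
from collections import defaultdict

# Key-value model: each key maps to one opaque value.
kv_store = {}


def kv_put(key, value):
    kv_store[key] = value


def kv_get(key):
    return kv_store.get(key)


class ColumnStore:
    """Column-oriented model: row key -> column family -> qualifier -> value.

    Rows need not share the same columns, which is why this model
    suits semi-structured data such as social posts.
    """

    def __init__(self):
        self._rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row, family, qualifier, value):
        self._rows[row][family][qualifier] = value

    def get_row(self, row):
        # Return a plain-dict copy; an unknown row yields an empty dict.
        families = self._rows.get(row, {})
        return {fam: dict(cols) for fam, cols in families.items()}


# Usage: a structured rating as a key-value pair, and two posts whose
# columns differ from row to row.
kv_put("entity:cafe-a:stars", 4.5)
posts = ColumnStore()
posts.put("post:1", "content", "text", "great coffee")
posts.put("post:1", "meta", "lang", "en")
posts.put("post:2", "content", "text", "too crowded")
```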
Architecture Approach: Analysis Layer
Data Evaluation, Similarity, Recommendation.
Semantic Sentiment Analysis, Star Rating Evaluation, Collaborative & Content Filter.
Architecture Approach: Analysis Layer, Data Evaluation
Entity Quality Rating, Entity Rating, Entity Polarity.
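The slide names three outputs of data evaluation but not how they are computed. The sketch below shows one possible reading, entirely under assumptions: the word lists `POSITIVE` and `NEGATIVE` are a toy lexicon, polarity is scored in [-1, 1], ratings are on a 1 to 5 star scale, and the blending rule in `entity_quality_rating` (including its `weight` parameter) is invented for illustration rather than taken from the authors.

```python
# Toy lexicon; a real system would use a proper sentiment resource.
POSITIVE = {"good", "great", "excellent", "love"}
NEGATIVE = {"bad", "poor", "terrible", "hate"}


def polarity(text):
    """Score one text in [-1, 1] from counts of lexicon words."""
    words = text.lower().split()
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total


def entity_polarity(texts):
    """Entity Polarity: mean polarity over all texts about an entity."""
    scores = [polarity(t) for t in texts]
    return sum(scores) / len(scores) if scores else 0.0


def entity_rating(stars):
    """Entity Rating: mean of explicit star ratings (assumed 1 to 5)."""
    return sum(stars) / len(stars) if stars else 0.0


def entity_quality_rating(texts, stars, weight=0.5):
    """Entity Quality Rating: assumed blend of rating and polarity.

    Polarity is rescaled from [-1, 1] to the 1 to 5 star range so the
    two signals are comparable before they are averaged.
    """
    rescaled = 1 + 2 * (entity_polarity(texts) + 1)
    return weight * entity_rating(stars) + (1 - weight) * rescaled
```

A fixed `weight` is the simplest choice here; the future work on fuzzy-logic metrics suggests the authors may have a different combination in mind.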
Architecture Approach: Analysis Layer
Similarity Recommendation Service Output Service
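As an illustration of how similarity can feed recommendation, the following sketch uses user-based collaborative filtering with cosine similarity. This is a common textbook technique chosen here as an assumption; the presentation does not state which similarity measure or filtering scheme is used, and the sketch ignores both the content filter and the context-aware aspects. The user and place names are invented.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse rating dicts (item -> rating)."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[k] * b[k] for k in common)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def recommend(user, ratings, top_n=3):
    """Rank items the user has not rated by similarity-weighted mean rating."""
    scores, weights = {}, {}
    for other, other_ratings in ratings.items():
        if other == user:
            continue
        sim = cosine(ratings[user], other_ratings)
        if sim <= 0:
            continue  # ignore users with nothing in common
        for item, rating in other_ratings.items():
            if item in ratings[user]:
                continue  # already rated by the target user
            scores[item] = scores.get(item, 0.0) + sim * rating
            weights[item] = weights.get(item, 0.0) + sim
    predicted = {item: scores[item] / weights[item] for item in scores}
    # Highest predicted rating first; ties broken alphabetically.
    return sorted(predicted, key=lambda item: (-predicted[item], item))[:top_n]


# Usage with invented data: which unrated places might suit "ana"?
ratings = {
    "ana": {"cafe": 5, "museum": 3},
    "luis": {"cafe": 5, "museum": 3, "park": 4},
    "eva": {"cafe": 1, "beach": 5},
}
suggestions = recommend("ana", ratings)
```

Because the prediction is a weighted mean, an item rated highly by a single weakly similar user can still rank first; a production system would typically require a minimum number of supporting neighbours.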
Architecture Approach: Visualization Layer
CARS & Hadoop Administration, Social Web Ranking, ANDARIEGO CARS.
Architecture Approach: Visualization Layer
CARS Hadoop Administration, ANDARIEGO Top Entities, Twitter Ranking, Facebook Ranking, ANDARIEGO Social Web Ranking, ANDARIEGO CARS.
Future Work
Implement the proposed architecture in a real scenario such as the ANDARIEGO geographical application.
Evaluate methods for implementing the infrastructure layer based on cloud computing.
Evaluate metrics based on fuzzy logic to contribute to the semantic sentiment analysis.
Conclusions
The proposed big data architecture provides elements for efficient storage, management and analysis of social streaming data.
The proposed context-aware recommendation system has emerged as a very useful element for general users of mobile systems.