2015 SLA IT Webinar Using Analytics to Understand Social Media Activity Michelle Chen School of Information San José State University February 4 th, 2015.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Usage Statistics in Context: related standards and tools Oliver Pesch Chief Strategist, E-Resources EBSCO Information Services Usage Statistics and Publishers:
Social media for business by Frank Flores Hash Cloud Studio A Creative Marketing Agency 200 Industrial Rd. Suite 155 San Carlos, CA (650)
Chapter 5: Introduction to Information Retrieval
Learning more about Facebook and Twitter. Introduction  What we’ve covered in the Social Media webinar series so far  Agenda for this call Facebook.
A journey into Text Analytics John McConnell Analytical People ASC Winchester 7th September 2013 © analytical-people 2013.
PoliWeb project (PEPS'14) Geraldine Castel CEMRA, Université Stendhal, France Genoveva Vargas-Solar CNRS, LIG-LAFMIA, France Towards a cloud infrastructure.
Creating Collaborative Partnerships
By Lee Betancourt Director of Communications and Public Relations Jane Myers Public Relations, Communications and Social Media Coordinator Social Media.
Text mining Extract from various presentations: Temis, URI-INIST-CNRS, Aster Data …
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
1 Causal Analytics with Social Media Content Lipika Dey Innovation Labs, Delhi.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Data Mining By Archana Ketkar.
UCB CS Research Fair Search Text Mining Web Site Usability Marti Hearst SIMS.
ICT TOOLS AND SOCIETY INVOLVEMENT AMONG THE EUPAN NETWORK HIGHLIGHTS FROM THE SURVEY RESULTS TANYA CHETCUTI EUROPEAN COMMISSION.
Overview of Web Data Mining and Applications Part I
Overview of Search Engines
Text mining in social media for participatory sensing data A dissertation by Georgios Keikoglou SID:
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
8/16/2015 Search Engine Optimization (SEO). Keyword Research After closely monitoring the competitors we have come up with the business keywords that.
Social Media Marketing Plan By D.U.E.S. (Tony, Sara, Shannon, MaryAnn)
July 30th – August 1st, 2013 McCormick Place, Chicago, IL Integrating Social Media at Live Events David Brull July 30,
More than words: Social networks’ text mining for consumer brand sentiments A Case on Text Mining Key words: Sentiment analysis, SNS Mining Opinion Mining,
Opinion mining in social networks Student: Aleksandar Ponjavić 3244/2014 Mentor: Profesor dr Veljko Milutinović.
«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Twitter and the Major News Network: Understanding the Online Strategies of Major News networks through Twitter BY: SANAI MELES.
Chapter 6: Information Retrieval and Web Search
Virtual Platform for Education Cooperation in the Americas Webinar Technical Secretariat of the Inter-American Committee on Education-CIE Department of.
Taking Stock of CAR in your Library Researchers & CAR Specialists Margot Williams The Washington Post SLA June 2004.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
SOCIAL MEDIA CREATED BY: NATALIA LAWCZYS, DALE LEGGE, HOSSEIN DANESHYAR.
How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Prediction of Influencers from Word Use Chan Shing Hei.
OCLC Online Computer Library Center 1 Social Media and Advocacy.
ICT TOOLS AND SOCIETY INVOLVEMENT AMONG THE EUPAN NETWORK HIGHLIGHTS FROM THE SURVEY RESULTS TANYA CHETCUTI AND MARCO FICHERA - WORKSHOP EUROPEAN COMMISSION.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
What Is Text Mining? Also known as Text Data Mining Process of examining large collections of unstructured textual resources in order to generate new.
© Copyright 2008 STI INNSBRUCK TrustYou Ioan Toma.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Individual Project by Nora-Marie Myers May 3, 2011 Social Media Communities in the Media King 5 Seattle The Huffington Post.
We.b : The web of short URLs Demetris Antoniades, lasonas Polakis, Gerogios Kontaxis, Elias Athansapoulos, Sotiris loannidis, Evangelos P.Markatos, Thomas.
Social Media Policies and Evaluation Tools. We can acquire a sense of who makes up our community We can gain more direct information about what people.
Info Sabanci University start-up company founded in March 2013 by academicians and graduate students from Sabanci University. We develop social media.
Info Start-up company founded by academicians and graduate students from Sabanci University. We offer social media analysis tools and services including.
© 2014 Deluxe Enterprise Operations, Inc. All rights reserved. Proprietary and Confidential. How to Own the First Page of Google for Your Brand …and dominate.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
13 Social Media and Networking. Introduction Social Media Types of Social Media Benefits and Challenges Measuring Social Media Performance.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
More than words: Social network’s text mining for consumer brand sentiments Expert Systems with Applications 40 (2013) 4241–4251 Mohamed M. Mostafa Reporter.
Thotwaves Innovations Welcome To SMM & SMO Activity Plan
Introduction BIM Data Mining.
The Sellout: Readers Sentiment Analysis of 2016 Man Booker Prize Winner Paper ID : 748.
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
DATA MINING © Prentice Hall.
Future-oriented Benchmarking Through Social Media Analysis
Power of Social Media Analytics
Mining the Data Charu C. Aggarwal, ChengXiang Zhai
Social Media Marketing Analytics 社群網路行銷分析
Text Mining & Natural Language Processing
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
Course Introduction CSC 576: Data Mining.
Text Mining & Natural Language Processing
Knowledge Sharing Mechanism in Social Networking for Learning
Ungraded quiz Unit 9.
Presentation transcript:

2015 SLA IT Webinar Using Analytics to Understand Social Media Activity Michelle Chen School of Information San José State University February 4 th, 2015

SJSU Washington Square Agenda Introduction Research projects Social media analytics Government use of social media: Sentiment analysis Social media and participatory services: Twitter and Pinterest Useful tools Conclusions

SJSU Washington Square Introduction Research initiatives and areas Understanding user behavior in virtual, networked environments (e.g., social media) and its implications Enhancing knowledge discovery and content analysis through information visualization Data mining Information visualization Social media

SJSU Washington Square Research Project #1 Twitter Sentiment Analysis for Understanding Citizens’ Trust in Government (with Franks) Collected over 1m tweets from January 2013 from 60 accounts 20 cities, 20 mayors, 20 police departments Analysis was done using R (for data retrieval, preparation, and computation) and Excel (for plotting)

SJSU Washington Square Sentiment Analysis What is “sentiment analysis”? o Using natural language processing, statistics, or machine learning methods to extract, identify, or otherwise characterize the sentiment content of a text unit Some examples o Is the product review positive or negative? o Have the bloggers’ attitudes about the stock changed since the acquisition? o How are people responding to the government’s post/news item on Twitter? o Other possible tasks: question answering, summarization o Sentiment analysis in business, politics, law/policy making, sociology, and psychology

SJSU Washington Square Limitations & Challenges Due to fair usage policy of web API’s, all tweets were not available. Twitter search and streaming API’s could not be used: limitations of 100 top tweets only (i.e., historical tweets were not available) Use topsy.com as an alternative: lists top 1000 tweets from historical data A programming looping logic was applied to repeat the retrieval process Limitations of topsy.com: re-tweet counts, geographic locations, etc. were not available

SJSU Washington Square Methodology

SJSU Washington Square Methodology: Data Collection Topsy API was used to retrieve the tweets An API URL example: 5001&type=tweet&nohidden=0&perpage=100&page=1&apikey=09C43A9B270A470B8EB8F2946 A9369F3 A batch script in R was executed to retrieve these tweets The API response: a JSON data file (a tree/XML like format)

SJSU Washington Square Methodology: Data Preparation The retrieved data was cleansed by removing: symbols punctuations special characters URLs numbers

SJSU Washington Square Methodology: Sentiment Analysis Bag of Words approach was used for sentiment analysis. Stemming: Each tweet was stemmed into the group of English words Matching: A match of each word was searched in the lexicon database (total 6135 words in the lexicon; 2230 positive and 3905 negative) Scoring: Positive and negative matches were summed to define a score of each tweet Polarity: (P-N)/(P+N), where P=total sum of positive sentiment words; N=total sum of negative sentiment words Results were grouped and combined on a monthly basis.

SJSU Washington Square Analysis Results Overall sentiment classification

SJSU Washington Square Analysis Results Sentiment analysis plot

SJSU Washington Square Analysis Results Word cloud

SJSU Washington Square Research Project #2 Quantitative Analytics for Library User Engagement Strategies through Social Media: Pinterest and Twitter (with Zou and Dey) Among many social media platforms, Pinterest is a new visual discovery social medium in which people can upload images and then collect ideas coming from different users with the same interests. User feedback can be considered a significant resource for libraries to customize their services to better engage their users.

SJSU Washington Square Research Project #2 (cont.) Quantitative Analytics for Library User Engagement Strategies through Social Media: Pinterest and Twitter (with Zou and Dey) The “20 Great Ways Libraries Are Using Pinterest Right Now” (Dunn 2012) Categorize 20 user engagement methods into four categories: Literature exhibits Engaging topic Community building Library showcasing 10 selected libraries were studied: NYPL, SJPL, SFPL, LAPL, BHPL, CAPL, SEPL, HTPL, OHPL, NDPL

SJSU Washington Square Research Project #2 (cont.) Quantitative Analytics for Library User Engagement Strategies through Social Media: Pinterest and Twitter (with Zou and Dey) Pinterest metrics Pin, board, followers, following, repin, like, comment Preliminary findings

SJSU Washington Square Research Project #2 (cont.) Quantitative Analytics for Library User Engagement Strategies through Social Media: Pinterest and Twitter (with Zou and Dey) Twitter analysis: topic modeling Looks for patterns in the use of words and injects semantic meaning into vocabulary; a topic consists of a cluster of words that frequently occur together “MALLET” was adopted Results

SJSU Washington Square Useful Tools R (check out R Programming on Coursera!) Splunk Data mining techniques (check out Data visualizations (check out Many Eyes: ibm.com/software/analytics/manyeyes/) 969.ibm.com/software/analytics/manyeyes/

SJSU Washington Square Conclusions Social media analytics for… Capturing user data to understand attitudes, opinions and trends, and manage online reputation Predicting customer behavior and improve customer satisfaction by anticipating customer needs and recommending next best actions Creating customized campaigns that resonate with social media participants Identifying the primary influencers within specific social network channels

SJSU Washington Square Q&A Thank you very much!