Sentiment Analysis: The Emotionality of Discourse .

Slides:



Advertisements
Similar presentations
Detecting and analysing emotion in social network sites Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton, UK Virtual Knowledge.
Advertisements

Recording Audio with Audacity Workshop by Dr. Luba Iskold Fulvia Alderiso and Kellen Mickley August 2007 Dept. of Languages, Literatures and Cultures.
SentiStrength: Sentiment Strength Detection in MySpace and Twitter Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton, UK.
Pollyanna Gonçalves (UFMG, Brazil) Matheus Araújo (UFMG, Brazil) Fabrício Benevenuto (UFMG, Brazil) Meeyoung Cha (KAIST, Korea) Comparing and Combining.
Linkedin “Your Professional Networking Hub”. What is linkedin Linkedin is a social networking website for professionals. It’s highly homogenous with most.
Time series Word frequencies Sentiment analysis Twitter Analysis for the Social Sciences and Humanities.
Solutions for Multilingual Literature by XSL Formatter 6,800 known languages.
Clients for XProtect VMS What’s new presentation
< Translator Team > 25+ Languages, …and growing!.
Welcome to Warwick Study Abroad students!. An international student society.
What's on the Web? The Web as a Linguistic Corpus Adam Kilgarriff Lexical Computing Ltd University of Leeds.
Good morning! Welcome to Visual Christian music: More than the meaning.
Evaluations Submit your evals online.
Digital audio editing software (Audacity) Audacity Instructions Introduction What is Audacity What can you do with Audacity Audacity Control Panel How-To.
The Influence of First Language on Reading and Spelling in English Linda Siegel University of British Columbia Vancouver, CANADA
V OKI I NTRODUCTION W EB 2.0 T OOLS EDTC 6340 Technology Advances in PK-12 Classroom Presenter: Blanca E. Pena.
Overview of REALNEO Technologies REALNEO Web Platform Architecture Overview of Drupal.
Week 4 Number Systems.
Тема урока: «Talking about countries and Nationalities»
WISER: learning a language Lucile Deslignères Librarian, Language Centre Johanneke Sytsema Linguistics Subject Consultant
1 Translate and Translator Toolkit Universally accessible information through translation Jeff Chin Product Manager Michael Galvez Product Manager.
Copyright © IBM Corp., The Eclipse™ Babel Project Translation Server Kit Lo IBM™ Corporation.
Hungarian Italian Armenian Russian English Finnish Irish Polish Portuguese Chinese Japanese Dutch French Arabic Greek People Speak English All Over the.
“Limba dultsi multu adutsi”. Sweet language brings much. Aromanian proverb Primary Languages Session 2: Teaching & Learning Primary Languages Session.
Nationalities. Questions Where do you live? I live in Ashdod. Where are you from? I am from Israe l. What nationality are you? I am Israeli What language.
New RCLayout. Do product layout 3 improvements All products Local databases New functionalities.
5 th EI World Congress - Berlin, July 2007 Use of the Web and Internet Technologies to enhance Teacher Union Work.
Why Study Languages Produced by the Subject Centre for Languages, Linguistics and Area Studies …When Everyone Speaks English?
ONLINE COMMUNICATIONS 1 Lesson 4. Contemporary social media  People with common interests tend to gather together to exchange views and put forward ideas.
Luis Avila Tics. We have to recognize all the operating systems we have nowadays in the different smartphones Blackberry: Bb OS Iphone: iOS Nokia: symbian.
Look of the new IPPOG Resources database website Proposal by BG + HP based on structure proposed (BG+RL+HP) 2/11/2015 Following and evolving from the discussion.
Transana. General For qualitative analysis Transana is cross-platform. Runs on both Windows and Apple OS X Transana is Open Source. – Researchers can.
Key vocabulary: Unit 5. International Business Styles (međunarodni stilovi poslovanja ), New Insights into Business, str. 44.
IS THE IDIOM PRINCIPLE BLOCKED IN BILINGUAL L2 PRODUCTION? Hiroki Tsuchimochi.
ELanguages creative collaboration for teachers globally.
Social Media by: Nicole Naylor
GCSE reform in England Parent Partnership Tuesday 8 November 2016
RECENT TRENDS IN SMT By M.Balamurugan, Phd Research Scholar,
REVISION UNITS 1-4 Our Class and School Languages The Norman Conquest
V E R B O T He is… You are … She is… I am… We are…
Online Educational tool #2 and #3
Profiling Web Archive Coverage for Top-Level Domain & Content Language
Overview of REALNEO Technologies
V E R B O T He is… You are … She is… I am… We are…
On the way to school. Teenage problems.
Supporting Your Child in their Writing
Rosta Farzan and Keyang Zheng, School of Computing and Information
Culture in Business ELL 실용 비즈니스 영어.
Designing Classroom Language Tests
If you are speaking another language at home
Writing an informal (describing an event)
Digital Asset Management Part 11: Access
Dances Or in the form of cultural dances Youth Alive Training.
Retrieval of audio testimonials via voice search
Looking at Texts from a Reader’s Point of View
Source Info Card Workshop in Ecosystem Ecology Insights into Climate Change: Past, Present, and Future First published in 1923, Time.

Big Data Sources – Web, Social media and Text Analytics
Text Mining & Natural Language Processing
Collecting Globally, Connecting Locally: 21st Century Libraries
PolyAnalyst Web Report Training
COUNTRIES NATIONALITIES LANGUAGES.
PolyAnalyst Web Report Training
NATIONALITIES  « What’s your nationality? » « I’m French »
Claro ScanPen Reader By Claro Software Limited
Personal Responses Year Nine.
Countries and nationalities
TYPE TOPIC HERE Type general description here Type number Type number

Before language competency After language competency.
Presentation transcript:

Sentiment Analysis: The Emotionality of Discourse . Information Studies Sentiment Analysis: The Emotionality of Discourse . Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton, UK

Sentiment Strength Detection in the Social Web with SentiStrength Detect positive and negative sentiment strength in short informal text Does not rely on standard grammar and spelling Uses nonstandard emotion expression forms from the social web (e.g., :-) or haaappppyyy!!!) Classifies positive 1 to 5 AND negative -1 to -5 sentiment Thelwall, M., Buckley, K., & Paltoglou, G. (2012). Sentiment strength detection for the social Web. Journal of the American Society for Information Science and Technology , 63(1), 163-173 Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., & Kappas, A. (2010). Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology, 61(12), 2544-2558.

SentiStrength Algorithm - Core List of 2,489 positive and negative sentiment terms and strengths (1 to 5), e.g. ache = -2, dislike = -3, hate=-4, excruciating -5 encourage = 2, coolest = 3, lover = 4 Sentiment strength is highest in sentence; or highest sentence if multiple sentences

I hate Paul but encourage him. positive, negative -2 1, -2 My legs ache. You are the coolest. I hate Paul but encourage him. 3 3, -1 2, -4 -4 2

Extra rules (total: about 20) spelling correction nicce -> nice booster words alter strength very happy negating words flip sentiments not nice repeated letters boost sentiment/+ve niiiice emoticon list :) =+2 exclamation marks count as +2 unless –ve hi! repeated punctuation boosts sentiment good!!! Negativity ignored in questions u h8 me? sentiment idiom list shock horror = -2 Online as http://sentistrength.wlv.ac.uk/

Tests against human coders on data sets of >1000 texts Positive scores -correlation with humans Negative YouTube 0.589 0.521 MySpace 0.647 0.599 Twitter 0.541 0.499 Sports forum 0.567 Digg.com news 0.352 0.552 BBC forums 0.296 0.591 All 6 data sets 0.556 0.565 SentiStrength agrees with typical humans as much as they agree with each other 1 is perfect agreement, 0 is random agreement

Why the bad results for news and politics (BBC, Digg)? Irony, sarcasm and expressive language: Theresa May must be very happy that I have lost my job. It is really interesting that Theresa May and most of her ministers are millionaires. Your argument is a joke.

Other SentiStrength Languages OK: Spanish, Finnish, German, Dutch, Russian, Turkish, Italian Untested: French, Polish, Greek, Swedish, Portuguese, Persian, Arabic, Welsh, Irish Basic: Chinese, Filipino, Hausa, Indonesian, Japanese, Korean, Shona, Swahili

Data and instructions: http://sentistrength.wlv.ac.uk/jkpop/ Workshop task Classify YouTube comment sentiment with SentiStrength in Python Read the very strongly positive or negative comments to discover What strong sentiment is expressed about and How sentiment differs between groups or videos. Data and instructions: http://sentistrength.wlv.ac.uk/jkpop/

Workshop task, more details Extract a random large file of video comments from each of TWICE, BTS, EXO, BLACKPINK and use the instructions to classify their sentiment. Load the results into a spreadsheet and then read the strongly positive or negative comments: What is sentiment is expressed about? How does sentiment differ between groups in terms of strength or aspects?

Example results BLACKPINK: Praise of the group and their dancing BTS: Praise of the group themselves and how they make the commenter feel. Negative sentiment is from crying (positive) EXO: More critical analysis of the video and group compared to the other videos. TWICE: The song and the dance that goes with it.

Python code Download Python and SentiStrength programs and other files to your computer Enter file locations on your computer here Other code…. Calls the SentiStrength java program & sends the file to process

All code

Search for videos by keyword or download entire channels Collect your own YouTube (or Twitter) data after the workshop with Mozdeh (free, Windows only) mozdeh.wlv.ac.uk Search for videos by keyword or download entire channels Includes advanced analytics Thelwall, M. Social media analytics for YouTube comments: Potential and limitations. International J. Social Research Methodology

Starting the workshop Follow the instructions in the Word/PDF file at http://sentistrength.wlv.ac.uk/jkpop/