Analysis of Online Hate Communities in Social Networks
Presented by: Ruchi Bhindwale


OUTLINE
o Introduction
o Related Work
o Analysis
o Our Approach
o Data Preprocessing
o Graph Creation
o Manual Mining Results
o Advantages/Disadvantages
o Conclusion

Introduction
o Web 2.0
o Blogosphere
o Social networking sites
o Hate groups

Related Work
o Social networks are often represented as a graph.
o Approaches to identify communities:
    o Co-citation analysis
    o Hidden Markov model
    o Content analysis

Analysis
o One supporter and many opponents.
o 98% were in the categories "Countries and Regional" and "Religion and Belief".
o Not all communities with a hate title have posts with hate content.
o Such communities contained foreign-language words.

Our Approach
o A combination of content (text) mining and graph mining.
o Text mining deals with the posts, while graph mining considers the communication pattern within these communities.

Data Preprocessing
o Select communities related to country and politics.
o Mine the community titles for a "hate keyword".
o Consider only communities with a substantial number of members.
o Mine the thread titles to select relevant posts.
o Consider only posts with a substantial number of replies.
o Text-mine each post to determine its hate content.
o Represent the communication as a graph.
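The community-level filters above can be sketched in Python as follows. This is a minimal illustration, not the presenter's implementation: the `Community` record, the category labels, the keyword list, and the member threshold are all hypothetical, since the slides do not give concrete values.

```python
from dataclasses import dataclass


@dataclass
class Community:
    title: str
    category: str
    members: int


def select_communities(communities,
                       categories=("country", "politics"),  # assumed labels
                       keywords=("hate",),                   # assumed keyword list
                       min_members=100):                     # assumed threshold
    """Keep communities that pass the category, title-keyword and size filters."""
    selected = []
    for c in communities:
        if c.category not in categories:
            continue  # wrong category
        if not any(k in c.title.lower() for k in keywords):
            continue  # no hate keyword in the title
        if c.members < min_members:
            continue  # too few members to be considered
        selected.append(c)
    return selected


# Toy data, made up purely for illustration.
communities = [
    Community("We hate X", "country", 500),
    Community("Hate Y", "country", 5),
    Community("Fans of Z", "country", 900),
    Community("We hate exams", "education", 800),
]
selected_titles = [c.title for c in select_communities(communities)]
```

The same pattern (a chain of simple filters) would apply to threads and posts, with reply counts taking the place of member counts.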

Rules for Generating Nodes and Edges
o Each member is a node.
o A directed edge is added between two nodes for each message posted by one member and addressed to the other in a particular discussion thread.
o A self-loop edge is added for the member who creates a new hate thread.
o A message not addressed to anybody is treated as addressed to the creator of the thread.
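These rules can be expressed as a short Python sketch. The input layout (a list of threads, each with a `creator` and a list of `(author, addressee)` pairs) is an assumption made for illustration, as is the choice to count repeated messages between the same pair of members on a single edge.

```python
from collections import defaultdict


def build_graph(threads):
    """Build (nodes, edges) from threads.

    threads: list of dicts with keys
      'creator'  - member who started the thread
      'messages' - list of (author, addressee) pairs; addressee may be None
    edges maps (source, target) to the number of messages sent (assumed encoding).
    """
    nodes = set()
    edges = defaultdict(int)
    for thread in threads:
        creator = thread["creator"]
        nodes.add(creator)
        edges[(creator, creator)] += 1  # self-loop for creating the thread
        for author, addressee in thread["messages"]:
            # An unaddressed message is treated as addressed to the creator.
            target = addressee if addressee is not None else creator
            nodes.update((author, target))
            edges[(author, target)] += 1
    return nodes, dict(edges)


# Toy thread, made up for illustration.
threads = [{"creator": "a",
            "messages": [("b", None), ("c", "b"), ("b", "c")]}]
nodes, edges = build_graph(threads)
```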

Weighting Scheme
o Weights are assigned to edges according to the degree of hate content of the corresponding messages.
o A message that supports the topic of the community gets a positive weight; a message that opposes it gets a negative weight.
o Different weight values are assigned, e.g. 1 for normal, 2 for high, and 3 for very high hate or anti-hate content.
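A minimal sketch of this scheme in Python, using the example values from the slide (1, 2, 3). The level names and the `supports_topic` flag are hypothetical; how a message is judged to support or oppose the topic is not specified here.

```python
# Example magnitudes from the slide: 1 = normal, 2 = high, 3 = very high.
LEVELS = {"normal": 1, "high": 2, "very high": 3}


def edge_weight(level, supports_topic):
    """Return a signed weight: positive for support, negative for opposition."""
    magnitude = LEVELS[level]
    return magnitude if supports_topic else -magnitude


w_support = edge_weight("high", supports_topic=True)
w_oppose = edge_weight("very high", supports_topic=False)
```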

Graph Characteristics
o The graph reveals two sub-communities inside one community: one that supports the community and one that opposes it.
o Very little communication takes place inside these sub-communities.
o Members who spread hate heavily are easy to identify from the weights of the edges going out from their nodes.
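One way to read these characteristics off a weighted graph is sketched below in Python. It assumes edges are given as `(source, target, weight)` triples with signed weights, and it classifies a member by the sign of their total outgoing weight. That classification rule is an assumption for illustration; the slides only state that the two groups are visible in the graph.

```python
from collections import defaultdict


def outgoing_totals(weighted_edges):
    """Sum the signed weights of the edges leaving each member."""
    totals = defaultdict(int)
    for source, _target, weight in weighted_edges:
        totals[source] += weight
    return dict(totals)


def split_members(totals):
    """Split members into supporters (total > 0) and opponents (total < 0)."""
    supporters = {m for m, t in totals.items() if t > 0}
    opponents = {m for m, t in totals.items() if t < 0}
    return supporters, opponents


def top_spreaders(totals, n=3):
    """Return up to n supporters ranked by total outgoing weight, highest first."""
    positive = [m for m, t in totals.items() if t > 0]
    return sorted(positive, key=lambda m: totals[m], reverse=True)[:n]


# Toy edges, made up for illustration.
weighted_edges = [("a", "a", 3), ("a", "b", 2), ("b", "a", -1), ("c", "a", 1)]
totals = outgoing_totals(weighted_edges)
supporters, opponents = split_members(totals)
heaviest = top_spreaders(totals, n=1)
```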

Manual Mining Results
o 25 communities were selected.
o The resulting set was manually validated.

Communities:
o ASU MS 2006
o Microsoft Corporation
o Cricket Fans
o Linux Kernel Programmers
o We hate India
o USA Democrats
o Communism
o Hate Israel
o Data Mining and KDD
o We hate exams
o Hate Pakistan
o Brad Pitt Fan club
o For those who hate idol worship
o Hate Indian Muslims
o Buddhism

Step 1 (Select Category)
o We hate India
o Hate Israel
o We hate exams
o Communism
o USA Democrats
o Hate Pakistan
o For those who hate idol worship
o Hate Indian Muslims
o Buddhism
o ASU MS 2006
o Microsoft Corporation
o Cricket Fans
o Linux Kernel Programmers
o Data Mining and KDD
o Brad Pitt Fan club

Step 2
o We hate India
o Hate Israel
o Hate Pakistan
o For those who hate idol worship
o Hate Indian Muslims
o Communism
o USA Democrats
o Buddhism

Step 3
o We hate India
o Hate Pakistan
o For those who hate idol worship
o Hate Indian Muslims
o Hate Israel

Step 4 (Number of threads)
o We hate India
o Hate Pakistan
o Hate Indian Muslims
o For those who hate idol worship

The Graph

Advantages and Disadvantages of the Approach
Advantages:
o The approach clearly reveals the basic communication pattern in a hate community.
o It can easily identify the people who spread hate.
Disadvantages:
o It is difficult to measure the degree of hate content, as hate content tends to be very subjective.
o It is not easy to determine to whom a particular message is addressed in an ongoing discussion when the addressee is not explicitly cited.

Conclusion
o A hate community targeting a country or a religion usually contains a large amount of offensive content.
o For social networking websites that let users create communities and discussion boards inside them, detecting hate communities has become very important.
o We have tried to give a model for analyzing such offensive hate communities.

Thanks to Nitin and Lei