ANALYZING THE SOCIAL WEB an introduction 1. OUTLINE 1.Introduction 2.Network Structure and Measures 3.Social Information Filtering 2.

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

Fawaz Ghali Web 2.0 for the Adaptive Web.
Google News Personalization: Scalable Online Collaborative Filtering
Options for communicating. A social network is a social structure made of individuals (or organizations) called "nodes," which are tied (connected) by.
1 Working with Social Media in Research Settings Victoria Wade Careers Consultant.
Analysis and Modeling of Social Networks Foudalis Ilias.
Comparison of Social Networks by Likhitha Ravi. Outline What is a social network? Elements of social network Previous studies What is missing in previous.
Relationship Mining Network Analysis Week 5 Video 5.
Developing a Social Media Strategy Ashley Schaffer Ebe Randeree For Your Organization.
Sean Blong Presents: 1. What are they…?  “[…] specific type of information filtering (IF) technique that attempts to present information items (movies,
LYRIC-BASED ARTIST NETWORK METHODOLOGY Derek Gossi CS 765 Fall 2014.
Social Media Networking Sites Charlotte Jenkins Designing the Social Web
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
By: Roma Mohibullah Shahrukh Qureshi
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
Computing Trust in Social Networks
Agent Technology for e-Commerce
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Recommender systems Ram Akella November 26 th 2008.
Social Context Based Recommendation Systems and Trust Inference Student: Andrea Manrique ID: ITEC810, Macquarie University1 Advisor: A/Prof. Yan.
How to Analyse Social Network? : Part 2 Power Laws and Rich-Get-Richer Phenomena Thank you for all referred contexts and figures.
Social Networking – The Ways and Means Rosey Broderick May 2011.
By: Aaron Gustafson Owner Computers N’ Stuff.  Facebook is FREE!!!  Youtube is FREE!!!  Twitter is FREE!!!  Google Plus is FREE!!!  Website hosting.
Recommender Systems and Collaborative Filtering
Chapter 12 (Section 12.4) : Recommender Systems Second edition of the book, coming soon.
Recommender Systems. >1,000,000,000 Finding Trusted Information How many cows in Texas?
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 9.1 Chapter 9 : Social Networks What is a social.
How To Manage Your Social Media Presence in Just 10 Minutes A Day! And be more popular than the local news guy!
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
1 Computing with Social Networks on the Web (2008 slide deck) Jennifer Golbeck University of Maryland, College Park Jim Hendler Rensselaer Polytechnic.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
MASTER THESIS num. 802 ANALYSIS OF ALGORITHMS FOR DETERMINING TRUST AMONG FRIENDS ON SOCIAL NETWORKS Mirjam Šitum Ao. Univ. Prof. Dr. Dieter Merkl Univ.
Understanding Cross-site Linking in Online Social Networks Yang Chen 1, Chenfan Zhuang 2, Qiang Cao 1, Pan Hui 3 1 Duke University 2 Tsinghua University.
Pete Bohman Adam Kunk. What is real-time search? What do you think as a class?
Social Networking Algorithms related sections to read in Networked Life: 2.1,
1,000,000,000 This is the number of items shared on Facebook in a single day – photos, comments, thoughts and links. Increasingly, these are being shared.
Pete Bohman Adam Kunk. Real-Time Search  Definition: A search mechanism capable of finding information in an online fashion as it is produced. Technology.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Social Media Getting Social in a Digital World. (And, why it matters to your business!)
Date: 2012/4/23 Source: Michael J. Welch. al(WSDM’11) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou Topical semantics of twitter links 1.
Collaborative Filtering  Introduction  Search or Content based Method  User-Based Collaborative Filtering  Item-to-Item Collaborative Filtering  Using.
The Birth & Growth of Web 2.0 COM 415-Fall II Ashley Velasco (Prince)
Finding high-Quality contents in Social media BY : APARNA TODWAL GUIDED BY : PROF. M. WANJARI.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
Social networks: facebook lianne lindeke charlie mayfield january 25, 2012 uwsmc.
Recommendation Algorithms for E-Commerce. Introduction Millions of products are sold over the web. Choosing among so many options is proving challenging.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
CS 590 Term Project Epidemic model on Facebook
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
How to Analyse Social Network? Social networks can be represented by complex networks.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
How Chapters Can use Social Media Mark Storace Sacramento Chapter March 2013.
Presented By: Madiha Saleem Sunniya Rizvi.  Collaborative filtering is a technique used by recommender systems to combine different users' opinions and.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
Comparison of Social Networks by Likhitha Ravi
E-Commerce Theories & Practices
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
Communication Technology in a Changing World
Movie Recommendation System
Graph and Link Mining.
A Glimpse of Recommender Systems on the Web
Presentation transcript:

ANALYZING THE SOCIAL WEB an introduction 1

OUTLINE 1.Introduction 2.Network Structure and Measures 3.Social Information Filtering 2

1. INTRODUCTION Social media has become the dominant method of using the Internet, & it has infiltrated & changed the way millions of people interact & communicate. Social networking in particular has become extremely popular, with > 1 billion users on Facebook alone & billions more accounts across thousands of social networking sites online. 3

Understanding social networks—both those explicitly formed on social networking websites & those implicitly formed in many other types of social media—has taken on new importance in light of this astounding popularity. Analysis of these social connections and interactions can help us understand who the important people are in a network, what roles a person plays, what subgroups of users are highly interconnected, how things like diseases or rumors will spread through a network, and how users participate. 4

APPLICATIONS OF THESE ANALYSES Organizations can prevent or control the spread of disease outbreaks. Websites can support participation & contributions from many types of users. Businesses can provide immediate assistance to customers who have problems or complaints. Users can band together to better understand their communities & government or take collective action. Content providers online can filter & sort information to show users the most relevant, interesting, & trusted content. 5

The methods for analyzing social networks have been around for decades or longer, but social media provides new challenges and opportunities. Networks online are orders of magnitude larger than the networks analyzed in the past. Often, networks are simply too big to be analyzed in their entirety. A good social network analyst working with social media needs to know how to analyze the structure of networks, apply sociological principles to understand user behavior, & deal w/ size, scope, & application of networks. 6

ANALYZING THE SOCIAL WEB Classic social network analysis studies a network’s structure. In a social network, a person is considered a ‘node’ or ‘vertex’, & relationship b/w people is a ‘link’ or ‘edge’. When all the people and relationships are identified, there are many statistics that can provide insight into the network. However, even before learning those statistics or anything about social network analysis, you can probably identify some important & interesting things in a network. 7

8

TIE STRENGTH & TRUST Tie strength, which is the strength of the relationship b/w 2 people, & trust are 2 relationship features that have great impact on what happens in a social network. Furthermore, learning what role a person plays in a network by analyzing his or her behavior can link quantitative measures w/ qualitative analysis to help better understand what goes on in a social group. 9

With those analysis methods at hand, the next step is to use them to understand network phenomena. One of the most important of these phenomena is propagation : How do things like information, diseases, or rumors spread in a network? A combination of quantitative and qualitative features inform our understanding of propagation, & another set of analysis techniques is available to study the spread of things through networks. 10

NODES, EDGES, & NETWORK MEASURES The term ‘social network’ has entered common language & is understood to describe circles of friends, acquaintances, colleagues, & so on. However, networks are well grounded in mathematics, and understanding how to represent, describe, and measure properties of networks will be the foundations of quantitative network analysis. 11

12

13

14

15

16

REPRESENTING NETWORKS Adjacency lists 17

Adjacency matrix 18

19

2. NETWORK STRUCTURE & MEASURES Describing nodes and edges Degree(undirected), in-degree & out-degree(directed) 20

CENTRALITY Centrality is one of the core principles of network analysis. It measures how “central” a node is in the network. This is used as an estimate of its importance in the network. However, depending on the application and point of view, what counts as “central” may vary depending on the context. Correspondingly, there are a number of ways to measure centrality of a node. degree centrality, closeness centrality, betweenness centrality, and eigenvector centrality 21

DEGREE CENTRALITY Degree centrality is one of the easiest to calculate. The degree centrality of a node is simply its degree—the number of edges it has. The higher the degree, the more central the node is. This can be an effective measure, since many nodes with high degrees also have high centrality by other measures. 22

23

24

25

CLOSENESS CENTRALITY Closeness centrality indicates how close a node is to all other nodes in the network. It is calculated as the average of the shortest path length from the node to every other node in the network. 26

27

28

BETWEENNESS CENTRALITY Betweenness centrality measures how important a node is to the shortest paths through the network. To compute betweenness for a node N, we select a pair of nodes & find all the shortest paths between those nodes. Then we compute the fraction of those shortest paths that include node N. 29

If there were 5 shortest paths between a pair of nodes, & 3 of them went through node N, then the fraction would be 3/5 = 0.6. We repeat this process for every pair of nodes in the network. We then add up the fractions we computed, and this is the betweenness centrality for node N. 30

31 Betweenness Centrality The betweenness centrality of A is zero, since no shortest paths between B, C, D, E, and F go through A. Betweenness centrality is one of the most frequently used centrality measures. It captures how important a node is in the flow of information from one part of the network to another.

EIGENVECTOR CENTRALITY Eigenvector centrality measures a node’s importance while giving consideration to the importance of its neighbors. For example, a node with 300 relatively unpopular friends on Facebook would have lower eigenvector centrality than someone with 300 very popular friends (like Barak Obama). 32

It is sometimes used to measure a node’s influence in the network. It is determined by performing a matrix calculation to determine what is called the principal eigenvector using the adjacency matrix. Not only is it used to determine influence in social networks, but a variant of eigenvector centrality is at the core of Google’s PageRank algorithm, which they use to rank web pages. 33

The main principle is that links from important nodes (as measured by degree centrality) are worth more than links from unimportant nodes. All nodes start off equal, but as the computation progresses, nodes with more edges start gaining importance. Their importance propagates out to the nodes to which they are connected. After re-computing many times, the values stabilize, resulting in the final values for eigenvector centrality. 34

3. SOCIAL INFORMATION FILTERING 35

SOCIAL SHARING AND SOCIAL FILTERING One way to find useful info among all the links, news, videos, & photos posted each day is to rely on other people to find it for us. Social sharing & social filtering use the interests of others, especially friends on social networks, to highlight info that is more likely to be of interest. 36

Social-sharing websites, like Digg, Slashdot, & reddit, are designed for people to share interesting content. The community then votes items up or down, & the most interesting links are highlighted. The reliance on large # of people to help complete a task like this is a type of crowdsourcing. From the “crowd” of people online, each contributes a tiny amount of work by sharing or voting on content, & the aggregate results are a valuable contribution. 37

AUTOMATED RECOMMENDER SYSTEMS Recommender systems are major parts of e-commerce sites & social media sites. We introduce the major types here & discuss how they take advantage of social patterns & connections to suggest items that users might like. 38

Even if the term recommender system is not a familiar one, nearly all Internet users will be familiar w/ them. These are the features of websites that suggest items a user might like. Amazon.com uses to suggest other items a customer might want to buy. 39

TRADITIONAL RECOMMENDER SYSTEMS Recommender systems basically work in one of two ways: suggesting items similar to the ones a person likes (content- based approach) suggesting items liked by people who are similar to the user. (collaborative filtering technique) [… or hybrid] 40

CONTENT-BASED APPROACH 41

They might look at all the items that a user has rated & then look for items that are similar to the things the user likes. This is how Pandora, the online music streaming service, works. 42

A user starts with a song or artist, & Pandora creates a musical profile of it. Then, Pandora selects songs that are similar in profile & plays those. 43

COLLABORATIVE FILTERING Collaborative filtering looks at each pair of users, finds the items that both people have rated, & computes a similarity score for the 2 people based on their ratings. That similarity measure is then used to give similar people more say in how much the user might like a new item. 44

Consider this simple example of collaborative filtering. A user, Alice, has rated a set of movies. Two other users, Bob and Chuck, have also rated those movies. These are shown in Table

Now assume Alice wants to know how much she might like the movie Vertigo, which she has never seen. Both Bob & Chuck have seen it. Bob rated it a 3 and Chuck rated it a 5. What would be a good recommendation to Alice for how much she will like it? 46

One option is to show the average rating for the movie, which is a 4 in this case. However, that does not take into account that Chuck is more similar to Alice than Bob is. A simple example of collaborative filtering will use the correlation # to compute a weighted average. Bob and Chuck’s ratings will be multiplied by their correlation w/ Alice, and that total will be divided by the sum of the weights. 47

48

Examples of explicit data collection include the following: Asking a user to rate an item on a sliding scale. Asking a user to rank a collection of items from favorite to least favorite. Presenting two items to a user and asking him/her to choose the better one of them. Asking a user to create a list of items that he/she likes. 49

Examples of implicit data collection include the following: Observing the items that a user views in an online store. Analyzing item/user viewing times Keeping a record of the items that a user purchases online. Obtaining a list of items that a user has listened to or watched on his/her computer. Analyzing the user's social network and discovering similar likes and dislikes 50

One of the most famous examples of collaborative filtering is item-to-item collaborative filtering (people who buy x also buy y), an algorithm popularized by Amazon.com's recommender system. Other examples include: Amazon.com As previously detailed, Last.fm recommends music based on a comparison of the listening habits of similar users.Last.fm Facebook, MySpace, LinkedIn, and other social networks use collaborative filtering to recommend new friends, groups, and other social connections (by examining the network of connections between a user and their friends). FacebookMySpaceLinkedIn 51

Collaborative filtering approaches often suffer from three problems: cold start, scalability, and sparsity. Cold Start: These systems often require a large amount of existing data on a user in order to make accurate recommendations. Scalability: In many of the environments that these systems make recommendations in, there are millions of users and products. Thus, a large amount of computation power is often necessary to calculate recommendations. Sparsity: The number of items sold on major e-commerce sites is extremely large. The most active users will only have rated a small subset of the overall database. Thus, even the most popular items have very few ratings. 52

SOCIAL RECOMMENDER SYSTEMS Collaborative filtering is an early example of how algorithms can leverage data from the crowd. Info from a lot of people online is collected & used to generate personalized suggestions for any user. 53

These techniques were originally developed in the 1990s and early 2000s. Since the availability of this data has increased w/ the rise of social media, recommender systems have started to consider social connections in addition to similarity. 54

Simple examples of social recommendations can be found on many social networking websites. For example, on Twitter, when a user searches for a term, the search results can be shown in 3 ways: all tweets that match the search, “top” tweets, as determined by Twitter, or tweets only posted by people the user knows. This simple social filter excludes anything from unknown people, since it may be of less interest. 55

Friend recommenders are also common in social networking websites. Facebook prominently features a “People You May Know” section, which is essentially a recommendation of people to add as friends. 56

CASE STUDY: REDDIT VOTING SYSTEM 57

AMAZON BOOK RECOMMENDATIONS 58

59