Social Computing and Big Data Analytics 社群運算與大數據分析 1 1042SCBDA12 MIS MBA (M2226) (8628) Wed, 8,9, (15:10-17:00) (B309) Social Network Analysis ( 社會網絡分析.

Slides:



Advertisements
Similar presentations
Complex Networks Albert Diaz Guilera Universitat de Barcelona.
Advertisements

Analysis and Modeling of Social Networks Foudalis Ilias.
Introduction to Social Network Analysis Lluís Coromina Departament d’Economia. Universitat de Girona Girona, 18/01/2005.
Social Network Analysis and Its Applications By Paul Rossman Indiana University of Pennsylvania.
Relationship Mining Network Analysis Week 5 Video 5.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Interactive Data Visualization for Rapid Understanding of Scientific Literature Cody Dunne Dept. of Computer Science and Human-Computer Interaction Lab,
San Francisco Bay Area News Ecology Daniel Ramos CS790G Fall 2010.
Social Network Analysis: Tasks and Tools Steven Loscalzo and Lei Yu Department of Computer Science Watson School of Engineering and Applied Science State.
Midterm Presentation Undergraduate Researchers: Graduate Student Mentor: Faculty Mentor: Jordan Cowart, Katie Allmeroth Krist Culmer Dr. Wenjun (Kevin)
Department of Computer Science, University of California, Irvine Site Visit for UC Irvine KD-D Project, April 21 st 2004 The Java Universal Network/Graph.
Social Networks Corina Ciubuc.
Research Meeting Seungseok Kang Center for E-Business Technology Seoul National University Seoul, Korea.
Pascal Visualization Challenge Blaž Fortuna, IJS Marko Grobelnik, IJS Steve Gunn, US.
Social Network Analysis: A Non- Technical Introduction José Luis Molina Universitat Autònoma de Barcelona
Social Media Marketing Analytics 社群網路行銷分析
Automated Social Hierarchy Detection through Network Analysis (SNAKDD07) Ryan Rowe, Germ´an Creamer, Shlomo Hershkop, Salvatore J Stolfo 1 Advisor:
2015/10/111 DBconnect: Mining Research Community on DBLP Data Osmar R. Zaïane, Jiyang Chen, Randy Goebel Web Mining and Social Network Analysis Workshop.
Science: Graph theory and networks Dr Andy Evans.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 25, 2012.
Presentation Template Social Network Analysis using Socilyzer.
Structural Properties of Networks: Introduction Networked Life NETS 112 Fall 2015 Prof. Michael Kearns.
Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012.
Data Warehousing 資料倉儲 Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of Information Management, Tamkang University Dept. of Information ManagementTamkang.
Data Mining 資料探勘 DM06 MI4 Thu. 9,10 (16:10-18:00) B513 社會網路分析、意見分析 (Social Network Analysis, Opinion Mining) Min-Yuh Day 戴敏育 Assistant Professor.
How to Analyse Social Network?
A project from the Social Media Research Foundation: Finding direction in a sea of connection:
Mining real world data Web data. World Wide Web Hypertext documents –Text –Links Web –billions of documents –authored by millions of diverse people –edited.
Slides are modified from Lada Adamic
Web Mining ( 網路探勘 ) WM07 TLMXM1A Wed 8,9 (15:10-17:00) U705 Social Network Analysis ( 社會網路分析 ) Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of.
The Structure of the Web. Getting to knowing the Web How big is the web and how do you measure it? How many people use the web? How many use search engines?
Introduction to Graph Theory
商業智慧 Business Intelligence BI08 IM EMBA Fri 12,13,14 (19:20-22:10) D502 社會網路分析 (Social Network Analysis) Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授.
(社群媒體大數據分析) Big Data Analytics on Social Media 李御璽 教授 銘傳大學資訊工程學系
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Mining information from social media
商業智慧實務 Practices of Business Intelligence BI09 MI4 Wed, 9,10 (16:10-18:00) (B130) 社會網路分析 (Social Network Analysis) Min-Yuh Day 戴敏育 Assistant Professor.
CSCE 5073 Section 001: Data Mining Spring Overview Class hour 12:30 – 1:45pm, Tuesday & Thur, JBHT 239 Office hour 2:00 – 4:00pm, Tuesday & Thur,
How to Analyse Social Network? Social networks can be represented by complex networks.
Informatics tools in network science
商業智慧實務 Practices of Business Intelligence BI09 MI4 Wed, 9,10 (16:10-18:00) (B113) 社會網路分析 (Social Network Analysis) Min-Yuh Day 戴敏育 Assistant Professor.
Class 2: Graph Theory IST402.
Social Computing and Big Data Analytics 社群運算與大數據分析
Social Computing and Big Data Analytics 社群運算與大數據分析 SCBDA06 MIS MBA (M2226) (8628) Wed, 8,9, (15:10-17:00) (B309) Finance Big Data Analytics with.
Social Network Mining for Digital Library Application Dr. Thanachart Ritbumroong King Mongkut’s University of Technology Thonburi Assist. Prof. Dr. Satidchoke.
SOCIAL NETWORK ANALYSIS: INTRODUCTION FEATURING R & NETDRAW WITH APPLICATIONS FROM LITERATURE.
Social Computing and Big Data Analytics 社群運算與大數據分析 SCBDA10 MIS MBA (M2226) (8628) Wed, 8,9, (15:10-17:00) (B309) Deep Learning with Google TensorFlow.
社群網路行銷管理 Social Media Marketing Management SMMM07 MIS EMBA (M2200) (8615) Thu, 12,13,14 (19:20-22:10) (D309) Min-Yuh Day 戴敏育 Assistant Professor.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
社群網路行銷管理 Social Media Marketing Management SMMM09 MIS EMBA (M2200) (8615) Thu, 12,13,14 (19:20-22:10) (D309) Min-Yuh Day 戴敏育 Assistant Professor.
1 Social Computing and Big Data Analytics ( 社群運算大數據分析 ) Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of Information Management, Tamkang University.
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Structural Properties of Networks: Introduction
Social Computing and Big Data Analytics 社群運算與大數據分析
Tutorial: Big Data Algorithms and Applications Under Hadoop
Structural Properties of Networks: Introduction
Comparison of Social Networks by Likhitha Ravi
Core Methods in Educational Data Mining
Structural Properties of Networks: Introduction
Network Visualization
Network Science: A Short Introduction i3 Workshop
Centrality in Social Networks
Methodology & Current Results
Centralities (4) Ralucca Gera,
Practical Applications Using igraph in R Roger Stanton
CSCE 4143 Section 001: Data Mining Spring 2019.
Analyzing Massive Graphs - ParT I
Presentation transcript:

Social Computing and Big Data Analytics 社群運算與大數據分析 SCBDA12 MIS MBA (M2226) (8628) Wed, 8,9, (15:10-17:00) (B309) Social Network Analysis ( 社會網絡分析 ) Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of Information Management, Tamkang University Dept. of Information ManagementTamkang University 淡江大學 淡江大學 資訊管理學系 資訊管理學系 tku.edu.tw/myday/ Tamkang University

週次 (Week) 日期 (Date) 內容 (Subject/Topics) /02/17 Course Orientation for Social Computing and Big Data Analytics ( 社群運算與大數據分析課程介紹 ) /02/24 Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data ( 資料科學與大數據分析: 探索、分析、視覺化與呈現資料 ) /03/02 Fundamental Big Data: MapReduce Paradigm, Hadoop and Spark Ecosystem ( 大數據基礎: MapReduce 典範、 Hadoop 與 Spark 生態系統 ) 課程大綱 (Syllabus) 2

週次 (Week) 日期 (Date) 內容 (Subject/Topics) /03/09 Big Data Processing Platforms with SMACK: Spark, Mesos, Akka, Cassandra and Kafka ( 大數據處理平台 SMACK : Spark, Mesos, Akka, Cassandra, Kafka) /03/16 Big Data Analytics with Numpy in Python (Python Numpy 大數據分析 ) /03/23 Finance Big Data Analytics with Pandas in Python (Python Pandas 財務大數據分析 ) /03/30 Text Mining Techniques and Natural Language Processing ( 文字探勘分析技術與自然語言處理 ) /04/06 Off-campus study ( 教學行政觀摩日 ) 課程大綱 (Syllabus) 3

週次 (Week) 日期 (Date) 內容 (Subject/Topics) /04/13 Social Media Marketing Analytics ( 社群媒體行銷分析 ) /04/20 期中報告 (Midterm Project Report) /04/27 Deep Learning with Theano and Keras in Python (Python Theano 和 Keras 深度學習 ) /05/04 Deep Learning with Google TensorFlow (Google TensorFlow 深度學習 ) /05/11 Sentiment Analysis on Social Media with Deep Learning ( 深度學習社群媒體情感分析 ) 課程大綱 (Syllabus) 4

週次 (Week) 日期 (Date) 內容 (Subject/Topics) /05/18 Social Network Analysis ( 社會網絡分析 ) /05/25 Measurements of Social Network ( 社會網絡量測 ) /06/01 Tools of Social Network Analysis ( 社會網絡分析工具 ) /06/08 Final Project Presentation I ( 期末報告 I) /06/15 Final Project Presentation II ( 期末報告 II) 課程大綱 (Syllabus) 5

Social Computing 6

Social Network Analysis 7

Social Computing Social Network Analysis Link mining Community Detection Social Recommendation 8

Business Insights with Social Analytics 9

Analyzing the Social Web: Social Network Analysis 10

11 Source: Jennifer Golbeck (2013), Analyzing the Social Web, Morgan Kaufmann

Social Network Analysis (SNA) Facebook TouchGraph 12

Social Network Analysis 13 Source:

Social Network Analysis A social network is a social structure of people, related (directly or indirectly) to each other through a common relation or interest Social network analysis (SNA) is the study of social networks to understand their structure and behavior 14 Source: (c) Jaideep Srivastava, Data Mining for Social Network Analysis

Centrality Prestige 15 Social Network Analysis (SNA)

Graph Theory 16

Graph 17 A A B B D D E E C C

Graph g = (V, E) 18

Vertex (Node) 19

Vertices (Nodes) 20

Edge 21

Edges 22

Arc 23

Arcs 24

Undirected Graph 25 Source: A A B B D D E E C C

Directed Graph 26 Source: A A B B D D E E C C

Degree 27 Source: A A B B D D E E C C

Degree 28 Source: A A B B D D E E C C A: 2 B: 4 C: 2 D:1 E: 1

Density 29 Source: A A B B D D E E C C

Density 30 Source: A A B B D D E E C C Edges (Links): 5 Total Possible Edges: 10 Density: 5/10 = 0.5

Density 31 Nodes (n): 10 Edges (Links): 13 Total Possible Edges: (n * (n-1)) / 2 = (10 * 9) / 2 = 45 Density: 13/45 = 0.29 A A B B D D C C E E F F G G H H I I J J

Which Node is Most Important? 32 A A B B D D C C E E F F G G H H I I J J

Centrality Important or prominent actors are those that are linked or involved with other actors extensively. A person with extensive contacts (links) or communications with many other people in the organization is considered more important than a person with relatively fewer contacts. The links can also be called ties. A central actor is one involved in many ties. 33 Source: Bing Liu (2011), “Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data”

Social Network Analysis (SNA) Degree Centrality Betweenness Centrality Closeness Centrality 34

Degree Centrality 35

36 Social Network Analysis: Degree Centrality A A B B D D C C E E F F G G H H I I J J

37 Social Network Analysis: Degree Centrality A A B B D D C C E E F F G G H H I I J J NodeScore Standardized Score A2 2/10 = 0.2 B2 C5 5/10 = 0.5 D3 3/10 = 0.3 E3 F2 2/10 = 0.2 G4 4/10 = 0.4 H3 3/10 = 0.3 I1 1/10 = 0.1 J1

Betweenness Centrality 38

Betweenness centrality: Connectivity Number of shortest paths going through the actor 39

Betweenness Centrality 40 Where g jk = the number of shortest paths connecting jk g jk (i) = the number that actor i is on. Normalized Betweenness Centrality Number of pairs of vertices excluding the vertex itself Source:

Betweenness Centrality 41 A A B B D D E E C C A: B  C: 0/1 = 0 B  D: 0/1 = 0 B  E: 0/1 = 0 C  D: 0/1 = 0 C  E: 0/1 = 0 D  E: 0/1 = 0 Total: 0 A: Betweenness Centrality = 0

Betweenness Centrality 42 A A B B D D E E C C B: A  C: 0/1 = 0 A  D: 1/1 = 1 A  E: 1/1 = 1 C  D: 1/1 = 1 C  E: 1/1 = 1 D  E: 1/1 = 1 Total: 5 B: Betweenness Centrality = 5

Betweenness Centrality 43 A A B B D D E E C C C: A  B: 0/1 = 0 A  D: 0/1 = 0 A  E: 0/1 = 0 B  D: 0/1 = 0 B  E: 0/1 = 0 D  E: 0/1 = 0 Total: 0 C: Betweenness Centrality = 0

Betweenness Centrality 44 A A B B D D E E C C A: 0 B: 5 C: 0 D: 0 E: 0

45 A A G G D D B B J J C C H H I I E E F F A A D D B B J J C C H H E E F F Which Node is Most Important?

46 A A G G D D B B J J C C H H I I E E F F A A G G D D J J H H I I E E F F Which Node is Most Important?

47 A A D D B B C C E E Betweenness Centrality

48 A A B B D D E E C C A: B  C: 0/1 = 0 B  D: 0/1 = 0 B  E: 0/1 = 0 C  D: 0/1 = 0 C  E: 0/1 = 0 D  E: 0/1 = 0 Total: 0 A: Betweenness Centrality = 0

Closeness Centrality 49

50 Social Network Analysis: Closeness Centrality A A B B D D C C E E F F G G H H I I J J C  A: 1 C  B: 1 C  D: 1 C  E: 1 C  F: 2 C  G: 1 C  H: 2 C  I: 3 C  J: 3 Total=15 C: Closeness Centrality = 15/9 = 1.67

51 Social Network Analysis: Closeness Centrality A A B B D D C C E E F F G G H H I I J J G  A: 2 G  B: 2 G  C: 1 G  D: 2 G  E: 1 G  F: 1 G  H: 1 G  I: 2 G  J: 2 Total=14 G: Closeness Centrality = 14/9 = 1.56

52 Social Network Analysis: Closeness Centrality A A B B D D C C E E F F G G H H I I J J H  A: 3 H  B: 3 H  C: 2 H  D: 2 H  E: 2 H  F: 2 H  G: 1 H  I: 1 H  J: 1 Total=17 H: Closeness Centrality = 17/9 = 1.89

53 Social Network Analysis: Closeness Centrality A A B B D D C C E E F F G G H H I I J J H: Closeness Centrality = 17/9 = 1.89 C: Closeness Centrality = 15/9 = 1.67 G: Closeness Centrality = 14/9 =

Eigenvector centrality: Importance of a node depends on the importance of its neighbors 54

55 where d(p j, p k ) is the geodesic distance (shortest paths) linking p j, p k Social Network Analysis: Closeness Centrality Sum of the reciprocal distances

56 Source: Abbasi, A., Hossain, L., & Leydesdorff, L. (2012). Betweenness centrality as a driver of preferential attachment in the evolution of research collaboration networks. Journal of Informetrics, 6(3), where g ij is the geodesic distance (shortest paths) linking p i and p j and g ij (p k ) is the geodesic distance linking p i and p j that contains p k. Social Network Analysis: Betweenness Centrality

57 where a (p i, p k ) = 1 if and only if p i and p k are connected by a line 0 otherwise Social Network Analysis: Degree Centrality

Social Network Analysis: Hub and Authority 58 Source: Hubs are entities that point to a relatively large number of authorities. They are essentially the mutually reinforcing analogues to authorities. Authorities point to high hubs. Hubs point to high authorities. You cannot have one without the other.

Application of SNA Social Network Analysis of Research Collaboration in Information Reuse and Integration 59 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Example of SNA Data Source 60 Source:

Research Question RQ1: What are the scientific collaboration patterns in the IRI research community? RQ2: Who are the prominent researchers in the IRI community? 61 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Methodology Developed a simple web focused crawler program to download literature information about all IRI papers published between 2003 and 2010 from IEEE Xplore and DBLP. – 767 paper – 1599 distinct author Developed a program to convert the list of coauthors into the format of a network file which can be readable by social network analysis software. UCINet and Pajek were used in this study for the social network analysis. 62 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Top10 prolific authors (IRI ) 1.Stuart Harvey Rubin 2.Taghi M. Khoshgoftaar 3.Shu-Ching Chen 4.Mei-Ling Shyu 5.Mohamed E. Fayad 6.Reda Alhajj 7.Du Zhang 8.Wen-Lian Hsu 9.Jason Van Hulse 10.Min-Yuh Day 63 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Data Analysis and Discussion Closeness Centrality – Collaborated widely Betweenness Centrality – Collaborated diversely Degree Centrality – Collaborated frequently Visualization of Social Network Analysis – Insight into the structural characteristics of research collaboration networks 64 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Top 20 authors with the highest closeness scores RankIDClosenessAuthor Shu-Ching Chen Stuart Harvey Rubin Mei-Ling Shyu Reda Alhajj Na Zhao Min Chen Gordon K. Lee Chengcui Zhang Isai Michel Lombera Michael Armella James B. Law Keqi Zhang Shahid Hamid Walter Z. Tang Chengjun Zhan Lin Luo Guo Chen Xin Huang Sneh Gulati Sheng-Tun Li 65 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Top 20 authors with the highest betweeness scores RankIDBetweennessAuthor Stuart Harvey Rubin Shu-Ching Chen Taghi M. Khoshgoftaar Xingquan Zhu Mei-Ling Shyu Reda Alhajj Xindong Wu Chengcui Zhang Wei Dai Narayan C. Debnath Qianhui Althea Liang Gordon K. Lee Du Zhang Baowen Xu Hongji Yang Zhiwei Xu Mohamed E. Fayad Abhijit S. Pandya Sam Hsu Wen-Lian Hsu 66 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Top 20 authors with the highest degree scores RankIDDegreeAuthor Shu-Ching Chen Stuart Harvey Rubin Taghi M. Khoshgoftaar Reda Alhajj Wen-Lian Hsu Min-Yuh Day Mei-Ling Shyu Richard Tzong-Han Tsai Eduardo Santana de Almeida Roumen Kountchev Hong-Jie Dai Narayan C. Debnath Jason Van Hulse Roumiana Kountcheva Silvio Romero de Lemos Meira Vladimir Todorov Mariofanna G. Milanova Mohamed E. Fayad Chengcui Zhang Waleed W. Smari 67 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Visualization of IRI (IEEE IRI ) co-authorship network (global view) 68 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

69 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

Visualization of Social Network Analysis 70 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration"

71 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration" Visualization of Social Network Analysis

72 Source: Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration" Visualization of Social Network Analysis

Social Network Analysis (SNA) Tools NetworkX igraph Gephi UCINet Pajek 73

NetworkX 74

igraph 75

Gephi The Open Graph Viz Platform 76 Source:

SNA Tool: UCINet 77

SNA Tool: Pajek 78

SNA Tool: Pajek 79

80 Source:

81 Source:

References Bing Liu (2011), “Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data,” 2 nd Edition, Springer. Jennifer Golbeck (2013), Analyzing the Social Web, Morgan Kaufmann. Sentinel Visualizer, Min-Yuh Day, Sheng-Pao Shih, Weide Chang (2011), "Social Network Analysis of Research Collaboration in Information Reuse and Integration," The First International Workshop on Issues and Challenges in Social Computing (WICSOC 2011), August 2, 2011, in Proceedings of the IEEE International Conference on Information Reuse and Integration (IEEE IRI 2011), Las Vegas, Nevada, USA, August 3-5, 2011, pp