Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Finding a Team of Experts in Social Networks Theodoros.

Slides:



Advertisements
Similar presentations
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A 24-h forecast of solar irradiance using artificial neural.
Advertisements

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A novel document similarity measure based on earth mover’s.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 SCAN: A Structural Clustering Algorithm for Networks Xiaowei.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Discovering Leaders from Community Actions Presenter : Wu, Jia-Hao Authors : Amit Goyal, Francesco Bonchi,
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Fast exact k nearest neighbors search using an orthogonal search tree Presenter : Chun-Ping Wu Authors.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology U*F clustering : a new performant “ clustering-mining ”
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Human eye sclera detection and tracking using a modified.
Mehdi Kargar Aijun An York University, Toronto, Canada Discovering Top-k Teams of Experts with/without a Leader in Social Networks.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 On-line Learning of Sequence Data Based on Self-Organizing.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. A new student performance analysing system using knowledge discovery in higher educational databases.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 TANGENT: A Novel, “Surprise-me”, Recommendation Algorithm.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Comparison of neural network models with ARIMA and regression models for prediction of Houston's daily.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology HE-Tree: a framework for detecting changes in clustering.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Web usage mining: extracting unexpected periods from web.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Positive and Negative Patterns for Relevance Feature.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology On Data Labeling for Clustering Categorical Data Hung-Leng.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. OpinionMiner: A Novel Machine Learning System for Web Opinion Mining and Extraction Presenter : Jiang-Shan.
Presentation : Finding a Team of Experts in Social Networks Jack Cheng Ka Ho The Chinese University of Hong Kong SEEM 5010 Advanced Database and Information.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Topology Preservation in Self-Organizing Feature Maps: Exact.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. A quantitative stock prediction system based on financial news Presenter : Chun-Jung Shih Authors :Robert.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 An Empirical Study of Learning from Imbalanced Data Using.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. A semantic similarity metric combining features and intrinsic information content Presenter: Chun-Ping.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Automatic Recommendations for E-Learning Personalization.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. An IPC-based vector space model for patent retrieval Presenter: Jun-Yi Wu Authors: Yen-Liang Chen, Yu-Ting.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 AC-ViSOM: Hybridising the Modified Adaptive Coordinate.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A Plagiarism Detection Technique for Java Program Using.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A k-mean clustering algorithm for mixed numeric and categorical.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. TurSOM: A Turing Inspired Self-organizing Map Presenter: Tsai Tzung Ruei Authors: Derek Beaton, Iren.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 An Adaptation of the Vector-Space Model for Ontology-Based.
國立雲林科技大學 National Yunlin University of Science and Technology Self-organizing map learning nonlinearly embedded manifoldsmanifolds Author :Timo Simila.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A Study on Automatic Recognition of Road Signs Presenter.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 2007.SIGIR.8 New Event Detection Based on Indexing-tree.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology SEP/COP: An efficient method to find the best partition.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Utilizing Marginal Net Utility for Recommendation in E-commerce.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Extreme Visualization: Squeezing a Billion Records into a Million Pixels Presenter : Jiang-Shan Wang.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Extending the Growing Hierarchal SOM for Clustering Documents.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Visualization of multi-algorithm clustering for better economic decisions - The case of car pricing.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Information Loss of the Mahalanobis Distance in High Dimensions-
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Multiclass boosting with repartitioning Graduate : Chen,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 An initialization method to simultaneously find initial.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Region-based image retrieval using integrated color, shape,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Adaptive FIR Neural Model for Centroid Learning in Self-Organizing.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining concept maps from news stories for measuring civic scientific literacy in media Presenter :
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Source Code Elements for Comprehending Object- Oriented.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Direct mining of discriminative patterns for classifying.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Self Organizing Maps and Bit Signature: a study applied.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Modeling Semantic Similarities in Multiple Maps Presenter.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Towards comprehensive support for organizational mining Presenter : Yu-hui Huang Authors : Minseok Song,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Providing Justifications in Recommender Systems Presenter.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Predicting corporate bankruptcy using a self-organizing map: An empirical study to improve the forecasting.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology ACM SIGMOD1 Subsequence Matching on Structured Time Series.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Text Classification Improved through Multigram Models.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Growing Hierarchical Tree SOM: An unsupervised neural.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Using annotation services in a ubiquitous Jigsaw cooperative learning environment Presenter : Su, Wun-Huei.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Dual clustering : integrating data clustering over optimization.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Sanghamitra.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Discovering Interesting Usage Patterns in Text Collections:
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Text Classification, Business Intelligence, and Interactivity:
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Information Extraction from Wikipedia: Moving Down the Long.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Visualizing social network concepts Presenter : Chun-Ping Wu Authors :Bin Zhu, Stephanie Watts, Hsinchun.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Chun Kai Chen Author : Andrew.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Named Entity Disambiguation by Leveraging Wikipedia Semantic Knowledge Presenter : Jiang-Shan Wang Authors.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A support system for predicting eBay end prices Presenter.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A New Cluster Validity Index for Data with Merged Clusters.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 f-information measures in medical image registration Presenter.
Expert Team Formation made by Zhang, Cheng Social Network Data Analytics[M] Charu C. Aggarwal.
Finding a team of Experts in Social Networks
Finding a team of Experts in Social Networks
Presentation transcript:

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Finding a Team of Experts in Social Networks Theodoros Lappas, Kun Liu, and Evimaria Terzi KDD, 2009 Reported by Wen-Chung Liao, 2009/12/22

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 2 Outlines  Motivation  Objective  Preliminary  Problems  Algorithms  Experiments  Conclusions  Comments

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 3 Motivation  The success of a project depends not only on the expertise of the people who are involved, but also on how effectively they collaborate, communicate and work together as a team. Figure 1: Network of connections between individuals in {a, b, c, d, e}. X a ={algorithms}, X b ={web programming}, X c ={software engineering, distributed systems}, X d ={software engineering} X e ={software engineering, distributed systems, web programming}. T={algorithms, software engineering, distributed systems, web programming}. X’ = {a, b, c} or X” = {a, e}

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 4 Objectives  Given a task T that requires a set of skills, our goal is to find a set of individuals X ’ X, such that every required skill in T is exhibited by at least one individual in X ’.  Additionally, the members of team X ’ should define a subgraph in G with low communication cost.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 5 Preliminaries X = {1,..., n}: n individuals A = {a 1,..., a m }: a universe of m skills X i A T: a task, a subset of skills required to perform a job. T A. S(a): the support set of the skill a, the set of individuals in X that has the skill a. S(a) = {i | i X and a X i }. G(X,E): an undirected and weighted graph d(i, i ’ ) Path(i, i ’ ) d(i,X ’ ) = min i’ X’ d(i, i ’ ) Path(i, X ’ ) G[X ’ ]: the subgraph of G that contains only the nodes in X ’

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 6 Problems Problem [Team Formation]: Given the set of n individuals X = {1,..., n}, a graph G(X,E), and task T, find X’ X, so that C (X’, T) = T, and the communication cost Cc(X’) is minimized. Diameter (R): Cc-R(X’) Minimum Spanning Tree (Mst): Cc-Mst (X’), Proposition 1. The Diameter-TF problem is NP-complete. Proposition 2. The Mst-TF problem is NP-complete.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 7 Algorithms S(a 0 ) ={1,7} S(a 1 ) R 1 =max{1, 2, 2}=2 R 7 =max{1, 0, 1}=1 X’= {7} ∪ {2, 8} Proposition 3. Cc-R(X’) ≦ 2 Cc-R(X*) O(|S(a rare )| × n) S(a 2 ) S(a 3 )

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 8 X0X v X’ v* O(|X 0 | × |E|) O(|T| × |X|)

Intelligent Database Systems Lab N.Y.U.S.T. I. M Y3Y3 3 Y2Y2 Y1Y1 v v* X’ O(k × |E|)

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 10 Experiments GreedyDiameter (GreedyMST) DBLP dataset: papers in DB, DM, AI and T conferences. X dblp : authors that have at least three papers individuals. X i : terms that appear in at least two titles of papers that author i has co-authored distinct skills. Authors i, i’ are connected in G dblp (X dblp,E) if they appear as co-authors in at least two papers. A task T(t, s) is generated: (1) select S from {DB, DM, AI,T} with |S| = s. (2) randomly pick t required skills. For every (s, t), generate 100 random tasks, t=2, 4,…, 20 and s = 1,…,4.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 11 Experiments

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 12 Conclusion  Address the problem of forming a team of skilled individuals to perform a given task, while minimizing the communication cost among the members of the team.  Prove that the Team Formation problem is NP- Hard.  Propose appropriate approximation algorithms.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 13 Comments  Advantage  Shortage  Applications ─ Team formation