1 SEMEF : A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks Delroy Cameron Masters Thesis Computer Science, University of Georgia.

Slides:



Advertisements
Similar presentations
Google News Personalization: Scalable Online Collaborative Filtering
Advertisements

Research Issues in Web Services CS 4244 Lecture Zaki Malik Department of Computer Science Virginia Tech
Darrell W. Gunter EVP / CMO Collexis Holdings, Inc. March 23, 2010 Spring Conference CONTENT: Uncovering the Value and Benefits of Semantic Technology.
The Small World of Software Reverse Engineering Ahmed E. Hassan and Richard C. Holt SoftWare Architecture Group (SWAG) University Of Waterloo.
Analysis and Modeling of Social Networks Foudalis Ilias.
LYRIC-BASED ARTIST NETWORK METHODOLOGY Derek Gossi CS 765 Fall 2014.
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
T HE S TRUCTURE OF S CIENTIFIC C OLLABORATION N ETWORKS & R ESEARCH F UNDING N ETWORKS CS790g Complex Networks Jigar Patel November 30 th 2009.
A Taxonomy-based Model for Expertise Extrapolation Delroy Cameron, Amit P. Sheth Ohio Center for Excellence in Knowledge-enabled Computing (Kno.e.sis)
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
1 Secure Indexes Author : Eu-Jin Goh Presented by Yi Cheng Lin.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
CS8803-NS Network Science Fall 2013
Bibliometrics in Computer Science MyRI project team.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection Boanerges Aleman-Meza, Meenakshi Nagarajan,
Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection Boanerges Aleman-Meza, Meenakshi Nagarajan,
Social Networking and On-Line Communities: Classification and Research Trends Maria Ioannidou, Eugenia Raptotasiou, Ioannis Anagnostopoulos.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
Pascal Visualization Challenge Blaž Fortuna, IJS Marko Grobelnik, IJS Steve Gunn, US.
Bibliometrics and Impact Analyses at the National Institute of Standards and Technology Stacy Bruss and Susan Makar Research Librarians SLA Pharmaceutical.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
 CiteGraph: A Citation Network System for MEDLINE Articles and Analysis Qing Zhang 1,2, Hong Yu 1,3 1 University of Massachusetts Medical School, Worcester,
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
2015/10/111 DBconnect: Mining Research Community on DBLP Data Osmar R. Zaïane, Jiyang Chen, Randy Goebel Web Mining and Social Network Analysis Workshop.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Integrated Collaborative Information Systems Ahmet E. Topcu Advisor: Prof Dr. Geoffrey Fox 1.
WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.
Google News Personalization: Scalable Online Collaborative Filtering
On Finding Fine-Granularity User Communities by Profile Decomposition Seulki Lee, Minsam Ko, Keejun Han, Jae-Gil Lee Department of Knowledge Service Engineering.
Presentation : Finding a Team of Experts in Social Networks Jack Cheng Ka Ho The Chinese University of Hong Kong SEEM 5010 Advanced Database and Information.
LOGO A comparison of two web-based document management systems ShaoxinYu Columbia University March 31, 2009.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
The Structure of the Web. Getting to knowing the Web How big is the web and how do you measure it? How many people use the web? How many use search engines?
2015/12/121 Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Proceeding of the 18th International.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
1 Finding Spread Blockers in Dynamic Networks (SNAKDD08)Habiba, Yintao Yu, Tanya Y., Berger-Wolf, Jared Saia Speaker: Hsu, Yu-wen Advisor: Dr. Koh, Jia-Ling.
Your caption here POLYPHONET: An Advanced Social Network Extraction System from the Web Yutaka Matsuo Junichiro Mori Masahiro Hamasaki National Institute.
Topical Scientific Community —A combined perspective of topic and topology Jin Mao Postdoc, School of Information, University of Arizona Sept 4, 2015.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
CIS750 – Seminar in Advanced Topics in Computer Science Advanced topics in databases – Multimedia Databases V. Megalooikonomou Link mining ( based on slides.
Corpus Exploitation from Wikipedia for Ontology Construction Gaoying Cui, Qin Lu, Wenjie Li, Yirong Chen The Department of Computing The Hong Kong Polytechnic.
Ontology Evaluation and Ranking using OntoQA Samir Tartir and I. Budak Arpinar Large-Scale Distributed Information Systems Lab University of Georgia The.
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,
Presented by Edith Ngai MPhil Term 3 Presentation
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Web Routing Designing an Interface
Ming-Yao Chen#, Wen-Ta Chiu and Yuh-Shan Ho*
Knowledge Discovery in the Semantic Web
Jian Wang Assistant Professor Science Based Business Program LIACS, Leiden University
A research literature search engine with abbreviation recognition
CSC 682: Advanced Computer Security
Weakly Learning to Match Experts in Online Community
KDD Reviews 周天烁 2018年5月9日.
Google News Personalization: Scalable Online Collaborative Filtering
Peer-to-Peer Information Systems Week 6: Performance
Introduction of KNS55 Platform
Jiawei Han Department of Computer Science
Graph and Link Mining.
Presentation transcript:

1 SEMEF : A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks Delroy Cameron Masters Thesis Computer Science, University of Georgia 11/27/2007 Advisor: I. Budak Arpinar Committee: Prashant Doshi Robert J. Woods

2 OUTLINE  Background  Expertise Profiles  Ranking Experts  Collaboration Networks Expansion  Results and Evaluation  Conclusion  Demo SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

3 BACKGROUND  Semantic Web  What ?  Extension of current Web  Attach Meaning to Data  Why ?  Under Utilization of Current Web  HTML Limitations  Goal  Enhance Information Exchange  Automatic Information Discovery  Interoperability of Services SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

4 BACKGROUND  Semantic Web  Technologies  XML  RDF/RDFS/OWL  URI  Ontology SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks “David Billington is a Professor of Mathematics” David Billington Mathematics David Billington Mathematics David Billington

5 BACKGROUND  Semantic Web  Common Challenges  Entity Disambiguation  Ontology Mapping/Alignment  Trust/Provenance  Semantic Association Discovery  Application  Social Networks  Bio-Informatics  National Security  GPS Data Mining SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

6 BACKGROUND  Social Networks  What ?  Connected through Social Relationships  Characteristics  Clustering Coefficient (connectedness to neighbors) ‏  Centrality (average shortest path length) ‏  Geodesic (shortest path length) ‏ SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

7 BACKGROUND  Peer-Review Process  What ?  Review scholarly manuscripts  Challenges  Slow  Conflict of Interest  Finding Suitable Reviewers  Arbitrary Knowledge Approach  Research Diversification  Emerging Fields SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

8 CONTRIBUTIONS  Applicability of Semantics  Finding Expertise  Fine Levels of Granularity  Finding Experts  Taxonomy  Collaboration Networks  Discovery of Unknown Experts SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

9 SEMEF  SEMantic Expert Finder  Finding Expertise (Expertise Profiles) ‏  Collecting Expertise  Quantifying Expertise  Finding (Ranking) Experts  w/ and w/o taxonomy  Collaboration Networks  Geodesic  C-Nets SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

10 EXPERTISE PROFILES  Collecting Expertise  Collect All Publication  Map papers to topic  Quantify all papers  Publications Dataset  DBLP 473,296 papers (conference/session names - Nov. 2007) ‏  ACM, IEEE, Science Direct 29,454 papers (abstracts/index terms) ‏  Combined 476,299 papers SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

11 EXPERTISE PROFILES  Collecting Expertise  Papers-to-Topics Dataset  Combined (476,299) ‏  Topics (320) ‏  Relationships (676,569) ‏  Expertise Profiles (560,792) ‏ SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

12 EXPERTISE PROFILES  Quantifying Expertise  Mapping each paper to distinct value  Publication Impact  Hector Garcia-Molina (248 papers ) ‏  E. F. Codd (49 papers ) ‏  Citeseer Impact Statistics (1221 venues) ‏  DBLP URIs SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

13 EXPERTISE PROFILES Figure 1: Expertise Profile author_A topic 1 (4.50)‏ paper topic 2 (1.86)‏topic 3 (3.08)‏ paper 2 paper paper 4 paper 6 paper

14 RANKING EXPERTS  Taxonomy of Topics  Session names  Conference Names  O’CoMMA  Paper Abstracts  Index Terms Figure 2: Taxonomy of Topics

15 RANKING EXPERTS  Case 1  Single Topic without Taxonomy  Traverse all Expertise Profiles  Sum impact, (papers  topics) ‏  Case 2  Single Topic with Taxonomy  Traverse all Expertise Profiles  Sum impact, (papers  topics, subtopics) ‏ SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks Prevent Expertise Overestimation 1) Map 2) Papers to leaf nodes only

16 RANKING EXPERTS  Case 3  Array of Topics without Taxonomy  Same as Case 2  Case 4  Array of Topics with Taxonomy  Filter input topics  Sum impact, (papers  topics, subtopics) ‏ SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

17 COLLABORATION NETWORKS EXPANSION  Geodesic Figure 3: Geodesic Relationships author_A author_1 author_Bauthor_A author_B author_2author_A author_B opus:Article_in_Proceedings_179 opus:Proceedings_543 opus:Article_in_Proceedings_35opus:Article_in_Proceedings_8 author_A STRONG MEDIUMUNKNOWN WEAK opus:author opus:Article_in_Proceedings_291 opus:author opus:Article_in_Proceedings_3 opus:author opus:isIncludedIn opus:author

18 COLLABORATION NETWORKS EXPANSION  C-Net  Ordering Cluster of Experts  Collaboration Strength* * Newman, M. E. J.: Coauthorship Networks and Patterns of Scientific Collaboration. National Academy of Sciences of the United States of America, 1(101): , (2004). coauthor_1 {0.73, 0.5} Super Node {14.80} coauthor_2 {1.81, 1.0} coauthor_3 {0.73, 0.5} coauthor_4 {0.73, 0.5} coauthor_5 {1.54, 1.0} coauthor_n {1.1, 0.8} Figure 3: Geodesic Relationships

19 RESULTS AND EVALUATION  Evaluation  WWW Search Track (2005/6/7) ‏  Input Topics Call For Papers  SWETO-DBLP Subset (67,366 authors) ‏  DBLP (560,792) ‏  Validation  Collaboration Networks Expansion SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

20 RESULTS AND EVALUATION  Validation Table 1: Past PC Lists comparison with SEMEF 52% % 58% % 65% % 73% % 79% % 82% % 85% % 85% % 85% % 29/3421/2526/2940/48 Total Search Search Average 83 35%10(top) 0-10% Search 2005 Cumulative Percentage in PC List Search Track (Number of PC Members in SEMEF List) Percentage in SEMEF List

21 RESULTS AND EVALUATION  Validation Figure 4: Average Number of PC in SEMEF List

22 RESULTS AND EVALUATION  Validation Figure 5: Average PC Distribution in SEMEF List

23 RESULTS AND EVALUATION  Collaboration Networks Expansion Table 4: PC Chair – SEMEF List Geodesic Relationships WEAK Chair Chair1 Search Chair1 Search 2007 PC List (Number of Expert Relationships) ‏ EXTREMELY WEAK MEDIUM STRONG Relationships Chair1 Search Chair Above Average Expertise (in PC) ‏ WEAK Chair Chair1 Search Chair1 Search 2007 SEMEF (Number of Expert Relationships) ‏ EXTREMELY WEAK MEDIUM STRONG Relationships Chair1 Search Chair Chair2 Above Average Expertise (in PC) ‏ Table 3: PC Chair – PC Member Geodesic Relationships

24 CONCLUSION  Expertise Profiles  Publication Data  Publication Impact Statistics  Papers-to-Topics Relationships  Ranking Experts  w/ and w/o Taxonomy  Single and Array of Topics  Collaboration Networks Expansion  Semantic Association Discovery  Geodesic  C-Nets SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

25 DEMO  Web Application  Apache Tomcat 6.0  Java Server Pages  Ubuntu 7.10 Delroy Cameron Masters Thesis Computer Science, University of Georgia

26 RELATED WORK  Particle Swarm Algorithm  ExpertiseNets  Expertise Browser  Experience Atoms  Expertise Recommender  Change history  Tech Support Heuristics  Profiling, Identification, Supervisor SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

27 RELATED WORK  Web-Based Communities  Expert Rank  Formal Probabilistic Models  Candidate Models  Document Models  RDF-Matcher SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

28 EXPERTISE PROFILE ALGORITHM Algorithm findExpertiseProfile(researcherURI, list of publications) ‏ create ‘empty expertise profile’ foreach paper of researcher do get ‘topics’ list of paper (using papers-to-topics dataset) get ‘publication impact’ if ‘publication impact’ is null do ‘publication impact’  default weight else ‘weight’  ‘publication impact’ + existing ‘weight’ from expertise profile if ‘expertise profile’ contains ‘topic’ do update ‘expertise profile’ with else add pair to ‘expertise profile’ end return ‘expertise profile’ SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks

29 RANKING EXPERTS ALGORITHM Algorithm rankValue(researcherURI, list of topics) ‏ set expertRank to zero create temp ‘expertise profile’ filter topics foreach topic in filtered topics list do get ‘papers’ for this topic (using papers-to-topics dataset) foreach paper in papers list do if researcher is author do get ‘publication impact’ as ‘weight’ expertRankValue = expertRankValue + ‘publication impact’ add pair to temporary ‘expertise profile’ end if end return ‘rankValue’ SEMEF: A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks