The Small World of Software Reverse Engineering Ahmed E. Hassan and Richard C. Holt SoftWare Architecture Group (SWAG) University Of Waterloo.

Slides:



Advertisements
Similar presentations
Peer-to-Peer and Social Networks Power law graphs Small world graphs.
Advertisements

Complex Networks Advanced Computer Networks: Part1.
Small-world networks.
Collective Dynamics of ‘Small World’ Networks C+ Elegans: Ilhan Savut, Spencer Telford, Melody Lim 29/10/13.
Milgram-Routing in Social Networks
Analysis and Modeling of Social Networks Foudalis Ilias.
Online Social Networks and Media Navigation in a small world.
P2P Topologies Centralized Ring Hierarchical Decentralized Hybrid.
Sociology and CS Philip Chan. How close are people connected? Are people closely connected, not closely connected, isolated into groups, …
Advanced Topics in Data Mining Special focus: Social Networks.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
T HE S TRUCTURE OF S CIENTIFIC C OLLABORATION N ETWORKS & R ESEARCH F UNDING N ETWORKS CS790g Complex Networks Jigar Patel November 30 th 2009.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Search in a Small World JIN Xiaolong Based on [1].
Small World Networks Somsubhra Sharangi Computing Science, Simon Fraser University.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Peer-to-Peer and Social Networks Introduction. What is a P2P network Uses the vast resource of the machines at the edge of the Internet to build a network.
Department of Computer Science, University of California, Irvine Site Visit for UC Irvine KD-D Project, April 21 st 2004 The Java Universal Network/Graph.
Community Structure and Rumor Blocking Ding-Zhu Du University of Texas at Dallas.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Analysis and Modeling of the Open Source Software Community Yongqin Gao, Greg Madey Computer Science & Engineering University of Notre Dame Vincent Freeh.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Eric Horvitz, Michael Mahoney,
Approximation Algorithms in Computational Social Networks Weili Wu Ding-Zhu Du University of Texas at Dallas.
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Advanced Software Engineering PROJECT. 1. MapReduce Join (2 students)  Focused on performance analysis on different implementation of join processors.
Small World Problem Christopher McCarty. Small World Phenomenon You meet someone, seemingly randomly, who has a connection to someone you know – Person.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Science: Graph theory and networks Dr Andy Evans.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Topology and Evolution of the Open Source Software Community Advisors: Dr. Vincent W. Freeh Dr. Kevin Bowyer Supported in part by the National Science.
Online Social Networks and Media
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
3. SMALL WORLDS The Watts-Strogatz model. Watts-Strogatz, Nature 1998 Small world: the average shortest path length in a real network is small Six degrees.
Neural Network of C. elegans is a Small-World Network Masroor Hossain Wednesday, February 29 th, 2012 Introduction to Complex Systems.
1. 2 CIShell Features A framework for easy integration of new and existing algorithms written in any programming language. CIShell Sci2 Tool NWB Tool.
Complex Network Theory – An Introduction Niloy Ganguly.
Complex Network Theory – An Introduction Niloy Ganguly.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
What Is A Network? (and why do we care?). An Introduction to Network Theory | Kyle Findlay | SAMRA 2010 | 2 “A collection of objects (nodes) connected.
Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.
1 SEMEF : A Taxonomy-Based Discovery of Experts, Expertise and Collaboration Networks Delroy Cameron Masters Thesis Computer Science, University of Georgia.
CS:4980:0005 Peer-to-Peer and Social Networks Fall 2015 Introduction.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
March 3, 2009 Network Analysis Valerie Cardenas Nicolson Assistant Adjunct Professor Department of Radiology and Biomedical Imaging.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
The Structure of Scientific Collaboration Networks by M. E. J. Newman CMSC 601 Paper Summary Marie desJardins January 27, 2009.
Analyzing Networks. Milgram’s Experiments “Six degrees of Separation” Milgram’s letters to various recruits in Nebraska who were asked to forward the.
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
CS:4980:0001 Peer-to-Peer and Social Networks Fall 2017
CS:4980:0001 Peer-to-Peer and Social Networks Fall 2017
Lecture 1: Complex Networks
Peer-to-Peer and Social Networks
The Watts-Strogatz model
Section 8.2: Shortest path and small world effect
Shortest path and small world effect
Peer-to-Peer and Social Networks Fall 2017
Graph and Link Mining.
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

The Small World of Software Reverse Engineering Ahmed E. Hassan and Richard C. Holt SoftWare Architecture Group (SWAG) University Of Waterloo

Publications We study the evolution of a field through its publications Publications give a picture of –Collaboration: High degree in academia in contrast to industry –Emergence of topics: Hot topics and their effects on collaborations

DBLP DBLP: –DataBase systems and Logic Programming –Digital Bibliography and Library Project Tracks publications in several conferences and communities, such as: –WCRE –Reengineering and maintenance –Software engineering Records for each publication: –Title –Authors –Conference: name and year –Abstract Data available online as an XML file

Studying Collaboration Develop a social collaboration network using co- authorship data from DBLP: –A node exists for each author –An edge exists between two nodes (authors), if they co- authored a paper together –Size of node proportional to # of pubs –Edges have a weight proportional to # of co-authored papers Use a force based algorithm to layout the network

Co-Authorship Graph

1: E. Burd 2: E. Stroulia 3: M. Munro 4: M. Harman 5: E. Merlo 6: G. Canfora 7: K. Kontogiannis 8: R. Koschke 9: R. Holt Co-Authorship Graph

The Largest Component over Time

Small World Graphs Large graphs with small paths connecting its nodes Stanley Milgram studied them in the 60s: –Letters were given to people in Nebraska –Each person hands letter to someone they knew and whom they believe can eventually deliver the letter to a stockbroker in Pittsburgh –Average chain of people between both cities is 6 – six degrees of separation Collaboration networks which are small world graphs: –Good indicator of ease of communication of knowledge between members of a community

Small World Graph Characteristic Path Length (L): measures on average how many individuals an author has to go through to reach other authors –The average shortest path from any node in the graph to any other node in the largest component of the graph Clustering Coefficient (C): measures how collaborative on average are the co-authors of an author –For a node, C is the ratio of edges to neighbors of that node to the maximum number of edges between these neighboring nodes Watts and Strogatz give a formal definition of small world graphs using C, L, and random graphs: L>L random and C >> C random

WCRE is a small world graph! Clustering coefficient is 0.76 Characteristic path length is 4.3 Towards a Standard Schema for C/C++ By Ferenc, Sim, Holt, Koschke and Gyimothy 3.94  4.32 AuthorCentrality Canfora 2.76 Koschke 2.88 Merlo 2.94 De Lucia 3.1 Holt 3.2

Paper Titles Analysis for Emerging Terms

Bigger Small Worlds in SE We compare results against another two research communities: –Maintenance and Reengineering (MR): WCRE, IWPC, CSMR, ICSM –Software Engineering: MR + 17 Conferences DBLP data is not as complete for these conferences

The Largest Component over Time Slow constant growth then rapid growth once researchers know each other Soft Eng has slower growth –Less conferences in early days –Incomplete DBLP data –Wider scope MR and Soft Eng rapid growth since 1996 –Internet and ? MR and WCRE growing since late 90s: –Y2K?

Most Central Authors over Time

WCRE, MR, SE vs. other fields

Title Spectrograph Reverse Java Compon orient object software program experi system engin data design Joint work with J. Wu abstract

Conclusion A meta paper on publications and collaboration networks in WCRE, MR and SE communities Small World collaboration networks facilitate the exchange of ideas and results in a community Many of the techniques presented could be used to study the evolution of software systems (files or developers as nodes)

Generating Small World Graphs Using Random Re-wiring Large L Large C Small L Small C Small L Large C

Percentage of Papers in a year

# of new co-authors in a year