Conceptual Framework for Agent- Based Modeling and Simulation: The Computer Experiment Yongqin GaoVincent Freeh Greg Madey CSE DepartmentCS Department.

Slides:



Advertisements
Similar presentations
Overview of Free/Open Source Software for Librarians Eric Goldhagen
Advertisements

Chapter 1 Business Driven Technology
Emergence of Scaling in Random Networks Albert-Laszlo Barabsi & Reka Albert.
Publishing Scientific Data for Electronic Books: Challenges and Opportunities Expanding the Accessibility of Critically Evaluated Data Dr. Joan Fuller,
Software Engineering Institute Carnegie Mellon University Pittsburgh, PA Sponsored by the U.S. Department of Defense © 1998 by Carnegie Mellon.
Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle.
MANAGEMENT RICHARD L. DAFT.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
16 October 2003 Romania, ICT and FP5,6 1 Romanian Experiences Related to ICT and the Fifth Framework Program of the EU and Expectations From the Sixth.
NAACSOS 2005Scott Christley, Temporal Analysis of Social Positions An Algorithm for Temporal Analysis of Social Positions Scott Christley, Greg Madey Dept.
Well, Sort-of.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
An Open Source Community Christina K Pikas LBSC708P November 10, 2005.
Innovation and IS Kieran Mathieson. What is Innovation?  Long definition Successful innovation is the creation and implementation of new processes, products,
Towards Understanding: A Study of the SourceForge.net Community using Modeling and Simulation Yongqin Gao Greg Madey Computer Science & Engineering University.
© 2007 The MITRE Corporation. All rights reserved Approved for Public Release; Distribution Unlimited Potential New Ideas from Complexity Science.
What is an Information System? Input of DataResourcesProcessing Data Data Control of System Performance Storage of Data Resources Output of InformationProducts.
Supported in part by the National Science Foundation – ISS/Digital Science & Technology Analysis of the Open Source Software development community using.
Introduction to E-Commerce and E-Marketplaces
Agent-Based Modeling and Simulation of Collaborative Social Networks Research in Progress Greg Madey Yongqin Gao Computer Science & Engineering University.
Patrick Adam Wagstrom October 2004 Community Building in Open Source Software Ecosystems Patrick Adam Wagstrom Department.
E-Business Technology Adoption Assessing B2B and e-Procurement in Canada Sandra Charles Raymond Lepage Presentation for the OECD Electronic Commerce Business.
Computers Are Your Future Eleventh Edition Chapter 10: Careers & Certification Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall1.
Analysis and Modeling of the Open Source Software Community Yongqin Gao, Greg Madey Computer Science & Engineering University of Notre Dame Vincent Freeh.
Eleventh Edition 1 Introduction to Information Systems Essentials for the Internetworked E-Business Enterprise Irwin/McGraw-Hill Copyright © 2002, The.
What is Enterprise Architecture?
Computers and Society Examine the extent to which Richard Stallman’s GNU manifesto has succeeded in challenging the dominance of conventionally distributed.
How I learned to stop worrying and love Open Source Software... Colin M. Sharples Advisory IT Specialist IBM Business Consulting Services SQNZ 21 October.
Open Source Web Services Peer to Peer Networking and Canada’s Innovation Agenda Bill St. Arnaud CANARIE Inc
MINING AND MODELING THE OPEN SOURCE SOFTWARE COMMUNITY
Accelerating Development Using Open Source Software Black Duck Software Company Presentation.
An Investigation into the Free/Open Source Software Phenomenon using Data Mining, Social Network Theory, and Agent-Based Greg Madey Computer Science &
CPS 82, Fall Open Source, Copyright, Copyleft.
Task Group on development of e-Government indicators (TGEG) 2008 Global Event on Measuring the Information Society Report on e-Government indicators 2008.
NATIONAL POLICIES AND INTERNATIONAL COOPERATION IN SOUTH EASTERN EUROPE ON THE WAY TO INFORMATION SOCIETY Prof. Marius GURAN - Romania UNCTAD-UNECE High-Level.
A Self-Manageable Infrastructure for Supporting Web-based Simulations Yingping Huang Xiaorong Xiang Gregory Madey Computer Science & Engineering University.
Topology and Evolution of the Open Source Software Community Advisors: Dr. Vincent W. Freeh Dr. Kevin Bowyer Supported in part by the National Science.
Emergence of Scaling and Assortative Mixing by Altruism Li Ping The Hong Kong PolyU
EU funded R&D collaboration networks in the area of Information Society Technologies and the role of Greek actors Aimilia Protogerou Team for the Technological,
Department of Information Business Discussion of a Large-Scale Open Source Data Collection Methodology Michael Hahsler and Stefan Koch Department of Information.
Library 2.06 February 2009 Linux for Librarians Nishtha Anilkumar Librarian Physical Research Laboratory Ahmedabad.
1.less than 3 million. 2.less than 10 million. 3.over 23 million. 4.over 100 million. 5.Not sure In the U.S., the number of managers that rely on Information.
Interorganisational Systems. Interorganisational Systems Information Partnering It is the driving force behind the emerging electronic marketplace. Case:
Agent 2004Scott Christley, Public Goods Theory of Open Source Community Public Goods Theory of the Open Source Development Community using Agent-based.
Overview: Electronic Commerce Electronic Commerce, Seventh Annual Edition.
Complexity Bruce Kogut October We are entering the epoch of the digitalization of knowledge: past, present, and future Sciences bring to this new.
Quiz  What does Eric Morris mean by “shoehorning”?  How does Ford use Big Data for marketing?  Name 3 things we can learn from Wikinomics according.
Powerful utilization of open source software in digital preservation, maintenance and utilization: an example of the creation of Union Catalogue of Serials.
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
Open Source Examples – Linux; Apache; Firefox Requirements – Distributed w/ source code – License allows for modifications (GPL) – License remains w/ any.
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
IT and Network Organization Ecommerce. IT and Network Organization OPTIMIZING INTERNAL COLLABORATIONS IN NETWORK ORGANIZATIONS.
A RESEARCH SUPPORT SYSTEM FRAMEWORK FOR WEB DATA MINING Jin Xu, Yingping Huang, Gregory Madey Department of Computer Science and Engineering University.
Mass Collaboration and Crowd-sourcing: How does the Internet help Innovate?
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
1 Dr. Michael D. Featherstone Introduction to e-Commerce Network Theory 101.
A Research Collaboratory for Open Source Software Research Yongqin Gao, Matt van Antwerp, Scott Christley, Greg Madey Computer Science & Engineering University.
Lake Arrowhead 2005Scott Christley, Understanding Open Source Understanding the Open Source Software Community Presented by Scott Christley Dept. of Computer.
"The views expressed in this presentation are those of the author and do not necessarily reflect the views of the European Commission" Ilkka Lakaniemi.
State and Future of Computing Mary Lou Soffa
Sandra Charles Raymond Lepage
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
open source and free software Najeeb Ullah Student ID
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Overview of Electronic Commerce
Peer-to-Peer and Social Networks Fall 2017
Department of Computer Science University of York
Open Source Software Development Processes
Peer-to-Peer and Social Networks
A MULTI-MODEL DOCKING EXPERIMENT OF DYNAMIC SOCIAL NETWORK SIMULATIONS
Presentation transcript:

Conceptual Framework for Agent- Based Modeling and Simulation: The Computer Experiment Yongqin GaoVincent Freeh Greg Madey CSE DepartmentCS Department CSE Department University of Notre DameNCSU University of Notre Dame NAACSOS Conference Pittsburgh, PA June 25, 2003 Supported in part by the National Science Foundation - Digital Society & Technology Program

The Computer Experiment

Agent-Based Simulation as a Component of the Scientific Method Modeling (Hypothesis) Agent -Based Simulation (Experiment) Observation Social Network Model of F/OSS Grow Artificial SourceForge Analysis of SourceForge Data

Outline Investigation: Free/Open Source Software (F/OSS)Investigation: Free/Open Source Software (F/OSS) Conceptual framework(s)Conceptual framework(s) Model descriptionModel description ER modelER model BA modelBA model BA model with constant fitnessBA model with constant fitness BA model with dynamic fitnessBA model with dynamic fitness SummarySummary

Open Source Software (OSS) Free …Free … –to view source –to modify –to share –of cost ExamplesExamples –Apache –Perl –GNU –Linux –Sendmail –Python –KDE –GNOME –Mozilla –Thousands more Linux GNU Savannah

Free Open Source Software (F/OSS) DevelopmentDevelopment –Mostly volunteer –Global teams –Virtual teams –Self-organized - often peer-based meritocracy –Self-managed - but often a “ charismatic ” leader –Often large numbers of developers, testers, support help, end user participation –Rapid, frequent releases –Mostly unpaid

Typical Charismatic Leaders? Linus Tolvalds Linux Larry Wall Perl Richard Stallman GNU Manifesto Eric Raymond Cathedral and Bazaar

F/OSS: Significance Contradicts traditional wisdom:Contradicts traditional wisdom: –Software engineering –Coordination, large numbers –Motivation of developers –Quality –Security –Business strategy Almost everything is done electronically and available in digital formAlmost everything is done electronically and available in digital form Opportunity for Social Science Research -- large amounts of online data availableOpportunity for Social Science Research -- large amounts of online data available Research issues: –Understanding motives –Understanding processes –Intellectual property –Digital divide –Self-organization –Government policy –Impact on innovation –Ethics –Economic models –Cultural issues –International factors

SourceForge VA Software Part of OSDN Started 12/1999 Collaboration tools 58,685 Projects 80,000 Developers 590,00 Registered Users

Savannah Uses SourceForge Software Free Software Foundation 1,508 Projects 15,265 Registered Users

F/OSS: Importance Major Component of e-Technology Infrastructure with major presence in e-Commerce e-Science e-Government e-Learning Apache has over 65% market share of Internet Web servers Linux on over 7 million computers Most Internet runs on Sendmail Tens of thousands of quality products Part of product offerings of companies like IBM, Apple Apache in WebSphere, Linux on mainframe, FreeBSD in OSX Corporate employees participating on OSS projects

Free/Open Source Software Seems to challenge traditional economic assumptionsSeems to challenge traditional economic assumptions Model for software engineeringModel for software engineering New business strategiesNew business strategies –Cooperation with competitors –Beyond trade associations, shared industry research, and standards processes — shared product development! Virtual, self-organizing and self-managing teamsVirtual, self-organizing and self-managing teams Social issues, e.g., digital divide, international participationSocial issues, e.g., digital divide, international participation Government policy issues, e.g., US software industry, impact on innovation, security, intellectual propertyGovernment policy issues, e.g., US software industry, impact on innovation, security, intellectual property

Research Model

Data Collection — Monthly Web crawler (scripts)Web crawler (scripts) –Python –Perl –AWK –Sed MonthlyMonthly Since Jan 2001Since Jan 2001 ProjectIDProjectID DeveloperIDDeveloperID Almost 2 million recordsAlmost 2 million records Relational databaseRelational database PROJ|DEVELOPER 8001|dev |dev |dev |dev |dev |dev |dev |dev |dev |dev8975

dev[59] dev[54] dev[49] dev[64] dev[61] Project 6882 Project 9859 Project 7597 Project 7028 Project F/OSS Developers - Social Network Component Developers are nodes / Projects are links 24 Developers 5 Projects 2 Linchpin Developers 1 Cluster

Models of the F/OSS Social Network (Alternative Hypotheses) General model featuresGeneral model features –Agents are nodes on a graph (developers or projects) –Behaviors: Create, join, abandon and idle –Edges are relationships (joint project participation) –Growth of network: random or types of preferential attachment, formation of clusters –Fitness –Network attributes: diameter, average degree, power law, clustering coefficient Four specific modelsFour specific models –ER (random graph) –BA (scale free) –BA ( + constant fitness) –BA ( + dynamic fitness)

ER model – degree distribution Degree distribution is binomial distribution while it is power law in empirical data R 2 = for developer network R 2 = for project network

ER model - diameter Average degree is decreasing while it is increasing in empirical data Diameter is increasing while it is decreasing in empirical data

ER model – clustering coefficient Clustering coefficient is relatively low around 0.4 while it is around 0.7 in empirical data. Clustering coefficient is decreasing while it is increasing in empirical data

ER model – cluster distribution Cluster distribution in ER model also have power law distribution with R 2 as ( without the major cluster) while R 2 in empirical data is ( without the major cluster) The actual distribution is different from empirical data The later models (BA and further models) have similar behaviors

BA model – degree distribution Power laws in degree distribution, similar to empirical data (+ for simulated data and x for empirical data). For developer distribution: simulated data has R 2 as and empirical data has R 2 as For project distribution: simulated data has R 2 as and empirical data has R 2 as

BA model – diameter and CC Small diameter and high clustering coefficient like empirical data Diameter and clustering coefficient are both decreasing like empirical data

BA model with constant fitness Power laws in degree distribution, similar to empirical data (+ for simulated data and x for empirical data). For developer distribution: simulated data has R2 as and empirical data has R2 as For project distribution: simulated data has R2 as and empirical data has R2 as Diameter and CC are similar to simple BA model.

BA model with dynamic fitness Power laws in degree distribution, similar to empirical data (+ for simulated data and x for empirical data). For developer distribution: simulated data has R2 as and empirical data has R2 as For project distribution: simulated data has R2 as and empirical data has R2 as

Advantage of BA with dynamic fitness Intuition: Fitness should decreasing with time.Intuition: Fitness should decreasing with time. Statistics: project has life cycle behavior which can not be replicated by BA model with constant fitness but can be replicated by BA model with dynamic fitnessStatistics: project has life cycle behavior which can not be replicated by BA model with constant fitness but can be replicated by BA model with dynamic fitness

Conceptual Framework Agent-Based Modeling and Simulation as Components of the Scientific Method Observation Hypothesis Experiment

Summary We use ABM to model and simulate the SourceForge collaboration network.We use ABM to model and simulate the SourceForge collaboration network. Conceptual framework is proposed for agent- based modeling and simulation.Conceptual framework is proposed for agent- based modeling and simulation. Case study of this framework: SourceForge study through ER, BA, BA with constant fitness and BA with dynamic fitness.Case study of this framework: SourceForge study through ER, BA, BA with constant fitness and BA with dynamic fitness.

Thank you