Samad Paydar, Web Technology Lab, Ferdowsi University of Mashhad, 10th August 2011.

Presentation transcript:

• Introduction
• Software ontology models
• Semantic web query methods for software analysis
• Experimental evaluation
• Conclusion

• For software to be developed, maintained, and evolved, it must first be understood:
    • How the code works
    • The developers’ decisions
• Some reasons:
    • Development team changes
    • Programmers forget what they have done
    • Undocumented code
    • Outdated comments
    • Multiple versions

• Therefore, a code comprehension framework is needed
• It is mainly composed of two major steps:
    ▪ Converting source code to an internal representation
    ▪ Performing queries
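
The two steps above can be illustrated with a minimal sketch. It is not the framework described in the slides: it assumes Python source as input, uses plain (subject, predicate, object) tuples as the internal representation, and the predicate names `type` and `hasMethod` are hypothetical.

```python
import ast


def source_to_triples(source):
    """Step 1: convert Python source into (subject, predicate, object) triples."""
    triples = set()
    tree = ast.parse(source)
    for cls in (n for n in tree.body if isinstance(n, ast.ClassDef)):
        triples.add((cls.name, "type", "Class"))
        for item in cls.body:
            if isinstance(item, ast.FunctionDef):
                method = f"{cls.name}.{item.name}"
                triples.add((method, "type", "Method"))
                triples.add((cls.name, "hasMethod", method))
    return triples


def query(triples, s=None, p=None, o=None):
    """Step 2: return triples matching a pattern; None acts as a wildcard."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


# Hypothetical input used only for illustration.
SAMPLE = '''
class Account:
    def deposit(self, amount): ...
    def withdraw(self, amount): ...
'''

triples = source_to_triples(SAMPLE)
methods = [o for _, _, o in query(triples, s="Account", p="hasMethod")]
print(methods)
```

A real setup would store the triples as RDF and query them with SPARQL; the wildcard matcher here only mimics a single triple pattern.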

• Further:
    • The open source movement
    • Software complexity
    • Libraries that depend on other libraries
• Software that is developed locally is a node in a worldwide network of interlinked source code
• Global Call Graph

• Each node in this cloud should expose its information in an open, accessible, and uniquely identifiable way
• Therefore, “we propose the usage of semantic technologies such as OWL, RDF and SPARQL as a software comprehension framework with the abilities to be interlinked with other projects”

• Three models for different aspects of code:
    1. Software Ontology Model (SOM)
    2. Bug Ontology Model (BOM)
    3. Version Ontology Model (VOM)
• Connected to related ontologies:
    • DOAP
    • SIOC
    • FOAF
    • WF

• Software Ontology Model (SOM): based on FAMIX (FAMOOS Information Exchange Model)
• FAMIX is a programming-language-independent model for representing object-oriented source code

• Version Ontology Model (VOM): specifies the relations between files, releases, and revisions of software projects
• Based on the data model of Subversion

• Bug Ontology Model (BOM): based on the bug-tracking system Bugzilla

• Two non-standard extensions of SPARQL:
    • iSPARQL (Imprecise SPARQL)
    • SPARQL-ML (SPARQL Machine Learning)

• iSPARQL introduces the idea of “virtual triples”
• Virtual triples are not matched against the underlying ontology graph; they are used to configure similarity joins
• They specify which pairs of variables should be joined and compared using a certain type of similarity measure
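
The idea of a similarity join can be sketched outside SPARQL. The example below is an illustration under stated assumptions, not iSPARQL syntax: `difflib.SequenceMatcher` stands in for whatever similarity measure a virtual triple would select, and the class names and threshold are hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """String similarity in [0, 1]; a stand-in for a configurable similarity measure."""
    return SequenceMatcher(None, a, b).ratio()


def similarity_join(left, right, threshold):
    """Pair each left item with each right item whose similarity meets the threshold."""
    pairs = []
    for a in left:
        for b in right:
            score = similarity(a, b)
            if score >= threshold:
                pairs.append((a, b, round(score, 2)))
    # Most similar pairs first.
    return sorted(pairs, key=lambda p: -p[2])


# Hypothetical class names taken from two releases of the same project.
release_a = ["CompareEditor", "DiffViewer", "PatchWizard"]
release_b = ["CompareEditor", "DiffTreeViewer", "MergeSource"]

matches = similarity_join(release_a, release_b, threshold=0.7)
for a, b, score in matches:
    print(a, "~", b, score)
```

Unlike an exact join, which would keep only identical names, the threshold lets renamed or slightly changed entities still be paired.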

• SPARQL-ML is an extension of SPARQL with knowledge discovery capabilities
• A tool for efficient relational data mining on Semantic Web data
• Enables Statistical Relational Learning (SRL) methods such as Relational Probability Trees (RPTs) and Relational Bayesian Classifiers (RBCs)

• Learning phase (building the prediction model)

• Test phase (making predictions)
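
The split between a learning phase and a test phase can be sketched as follows. This is a plain naive Bayes classifier over categorical features, used here only as a simple stand-in; it is not an RPT or RBC and not SPARQL-ML syntax. The feature names, values, and labels are hypothetical.

```python
import math
from collections import Counter, defaultdict


def learn(examples):
    """Learning phase: count labels and (feature, value) pairs per label."""
    label_counts = Counter(label for _, label in examples)
    value_counts = defaultdict(Counter)  # label -> Counter of (feature, value)
    values = defaultdict(set)            # feature -> set of values seen in training
    for features, label in examples:
        for name, value in features.items():
            value_counts[label][(name, value)] += 1
            values[name].add(value)
    return label_counts, value_counts, values


def predict(model, features):
    """Test phase: return the label with the highest smoothed log-probability."""
    label_counts, value_counts, values = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in label_counts.items():
        score = math.log(count / total)
        for name, value in features.items():
            seen = value_counts[label][(name, value)]
            # Laplace smoothing keeps unseen values from zeroing the product.
            score += math.log((seen + 1) / (count + len(values[name])))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical training data: coarse file properties and a defect label.
training = [
    ({"revisions": "many", "size": "large"}, "buggy"),
    ({"revisions": "many", "size": "small"}, "buggy"),
    ({"revisions": "few", "size": "small"}, "clean"),
    ({"revisions": "few", "size": "large"}, "clean"),
]

model = learn(training)
prediction = predict(model, {"revisions": "many", "size": "large"})
print(prediction)
```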

• 4 years ( ) of the proceedings of the ICSE Workshop on Mining Software Repositories (MSR) were surveyed
• The most actively investigated software analysis tasks were determined

• Dataset: 206 releases of the org.eclipse.compare plug-in for Eclipse (about 150 Java classes per version on average), plus bug-tracking information
• Exported to OWL

• Task 1: software evolution analysis
    • Applicability of iSPARQL to software evolution visualization (i.e., visualization of code changes for a certain time span)
    • All classes of one major release were compared with those of another major release using different similarity strategies

• Task 2: computing source code metrics
    • Calculating object-oriented (OO) software design metrics

• Changing methods (CM) and changing classes (CC)
• A method that is invoked by many other methods has a higher risk of causing defects in the presence of change
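
A minimal sketch of how CM and CC could be computed from invocation facts. It assumes the usual reading of the two metrics (CM as the number of distinct methods that call a method, CC as the number of distinct classes those callers belong to); the `Class.method` naming and the sample data are hypothetical.

```python
from collections import defaultdict

# Hypothetical invocation facts: (calling method, invoked method).
invocations = [
    ("Editor.save", "Buffer.flush"),
    ("Editor.close", "Buffer.flush"),
    ("Viewer.refresh", "Buffer.flush"),
    ("Editor.save", "Log.write"),
]


def changing_metrics(invocations):
    """Return {method: (CM, CC)} for every invoked method."""
    callers = defaultdict(set)
    for caller, callee in invocations:
        callers[callee].add(caller)
    return {
        callee: (len(ms), len({m.split(".")[0] for m in ms}))
        for callee, ms in callers.items()
    }


metrics = changing_metrics(invocations)
print(metrics["Buffer.flush"])  # (3, 2): three calling methods in two classes
```

Methods with high CM and CC values are the ones where a change is most likely to ripple through many callers.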

• Number of methods (NOM) and number of attributes (NOA)
• Used as indicators of God classes
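
A minimal sketch of NOM and NOA as simple counts over class membership facts. The predicate names `hasMethod` and `hasAttribute`, the sample facts, and the thresholds are assumptions for illustration; the slides do not state which thresholds mark a God class.

```python
from collections import Counter

# Hypothetical membership facts: (class, predicate, member).
facts = [
    ("Editor", "hasMethod", "open"),
    ("Editor", "hasMethod", "save"),
    ("Editor", "hasMethod", "close"),
    ("Editor", "hasAttribute", "buffer"),
    ("Editor", "hasAttribute", "path"),
    ("Node", "hasMethod", "children"),
    ("Node", "hasAttribute", "parent"),
]


def size_metrics(facts):
    """Return {class: (NOM, NOA)} by counting method and attribute facts."""
    nom = Counter(c for c, p, _ in facts if p == "hasMethod")
    noa = Counter(c for c, p, _ in facts if p == "hasAttribute")
    return {c: (nom[c], noa[c]) for c in sorted(set(nom) | set(noa))}


def god_class_candidates(metrics, nom_threshold, noa_threshold):
    """Flag classes whose NOM and NOA both reach the (assumed) thresholds."""
    return [
        c for c, (nom, noa) in metrics.items()
        if nom >= nom_threshold and noa >= noa_threshold
    ]


metrics = size_metrics(facts)
candidates = god_class_candidates(metrics, nom_threshold=3, noa_threshold=2)
print(metrics, candidates)
```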

• Number of bugs (NOB) and number of revisions (NOR)
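
NOB and NOR can likewise be sketched as per-file counts over version and bug facts. The file names, revision numbers, and bug identifiers below are hypothetical, and linking bugs directly to files is a simplifying assumption.

```python
from collections import defaultdict

# Hypothetical facts: (file, revision) and (bug id, affected file).
revisions = [
    ("Differencer.java", "1.1"),
    ("Differencer.java", "1.2"),
    ("Differencer.java", "1.3"),
    ("Patcher.java", "1.1"),
]
bug_links = [
    ("bug-101", "Differencer.java"),
    ("bug-102", "Differencer.java"),
    ("bug-103", "Patcher.java"),
]


def number_of_revisions(revisions):
    """NOR: distinct revisions recorded for each file."""
    seen = defaultdict(set)
    for file, revision in revisions:
        seen[file].add(revision)
    return {file: len(revs) for file, revs in seen.items()}


def number_of_bugs(bug_links):
    """NOB: distinct bugs linked to each file."""
    seen = defaultdict(set)
    for bug, file in bug_links:
        seen[file].add(bug)
    return {file: len(bugs) for file, bugs in seen.items()}


nor = number_of_revisions(revisions)
nob = number_of_bugs(bug_links)
print(nor, nob)
```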

• Task 3: detection of code smells
• Task 4: defect and evolution density
• Task 5: bug prediction

• A novel approach to analyzing software systems using Semantic Web technologies
• EvoOnt provides the basis for representing source code and metadata in OWL
• This representation reduces analysis tasks to simple queries in SPARQL (or its extensions)
• A limitation: some information is lost because of the FAMIX-based ontology model

• Language constructs like if-else are not modeled
• Measurements cannot be conducted at the level of statements
• One of the greatest impediments to widespread use of EvoOnt: the current lack of high-performance, industrial-strength triple stores and reasoning engines