TA-RE 1 : An Exchange Language for Mining Software Repositories Sunghun Kim, Thomas Zimmermann, Miryung Kim, Ahmed Hassan, Audris Mockus, Tudor Girba,

Presentation transcript:

TA-RE [1]: An Exchange Language for Mining Software Repositories
Sunghun Kim, Thomas Zimmermann, Miryung Kim, Ahmed Hassan, Audris Mockus, Tudor Girba, Martin Pinzger, E. James Whitehead, Jr., and Andreas Zeller
[1] TA-RE is a Korean word meaning "group" or "cluster".

Software repositories have been getting a lot of attention..., but
[Diagram: SCM Repository, Extraction, Intermediate Data, Analysis]
□ Extraction requires a non-trivial effort
□ Extracted data depend on the heuristics
□ Difficult to reproduce existing repository mining results

Our proposal
TA-RE Corpus (extracted data): transactions, changes, snapshots, nature, counts, references
Analyses: change statistic, change pattern analysis, origin analysis, co-change analysis, code clone analysis, bug prediction
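To make the idea of a shared corpus concrete, here is a minimal sketch of how extracted transactions and changes might be represented and fed into one of the listed analyses (co-change analysis). The slides do not give the actual TA-RE schema, so every class, field name, and label below is a hypothetical stand-in, and snapshots are left out for brevity.

```python
from collections import Counter
from dataclasses import dataclass, field
from itertools import combinations
from typing import List


@dataclass
class Change:
    # One file-level change. Field names are illustrative assumptions,
    # not taken from the TA-RE exchange language.
    path: str
    nature: str              # hypothetical labels such as "fix" or "feature"
    lines_added: int = 0     # stands in for the "counts" element
    lines_deleted: int = 0


@dataclass
class Transaction:
    # A group of changes committed together.
    tx_id: str
    author: str
    changes: List[Change] = field(default_factory=list)
    references: List[str] = field(default_factory=list)  # e.g. bug report ids


def co_change_counts(transactions):
    """Count how often each pair of files changes in the same transaction."""
    pairs = Counter()
    for tx in transactions:
        # Deduplicate and sort so each pair has one canonical ordering.
        paths = sorted({c.path for c in tx.changes})
        pairs.update(combinations(paths, 2))
    return pairs


# A tiny made-up corpus, purely for illustration.
corpus = [
    Transaction("t1", "alice",
                [Change("a.c", "fix", 3, 1), Change("b.c", "fix", 2, 0)],
                ["BUG-1"]),
    Transaction("t2", "bob",
                [Change("a.c", "feature", 10, 0),
                 Change("b.c", "feature", 4, 2),
                 Change("c.c", "feature", 1, 0)]),
    Transaction("t3", "alice", [Change("c.c", "fix", 1, 1)]),
]

counts = co_change_counts(corpus)
for pair, n in counts.most_common():
    print(pair, n)
```

Because the analysis reads only the intermediate records, a second researcher could rerun it on the same corpus without repeating the extraction step, which is the reproducibility benefit the slides argue for.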

TA-RE is a work in progress
□ Future work
  □ Finalize the exchange language
  □ Extract data in TA-RE exchange language
□ Contributions and discussions are welcome
  □ TA-RE Working Group
WE NEED YOU!!!