System Model Acquisition from Requirements Text


SMART: System Model Acquisition from Requirements Text
Technion – Israel Institute of Technology

System Model Acquisition from Requirements Text
- Operates on free-text documentation, such as business process specifications or user requirements
- Results depend critically on the quality of the processed documentation
- Based on Object-Process Methodology (OPM), which has two semantically equivalent modalities:
  - Textual: Object-Process Language (OPL)
  - Graphic: Object-Process Diagram (OPD)

System Model Acquisition from Requirements Text
- Significantly reduces the quantity of material that needs to be processed manually
- Reduces the initial level of conceptual complexity
- Graphic manipulation (OPD) is much easier than text editing
- Quality, accuracy, and conciseness of the system architecture are higher due to the discipline OPM introduces
- Capable of automatic generation of UML diagrams

SMART - System Diagram
[Diagram. Labeled elements: SMART, OPCAT, Categorization Engine, OPL Generator, System Model Acquisition, System Requirements Unstructured Text, System Architecting Team, System Model]

System Model Acquisition In-zoomed
[Diagram. Labeled elements: SMART, System Requirements Unstructured Text, Category Extraction, Categorization Engine, Category List (raw, edited), System Architecting Team, List Editing, Relation Set, Relation Formulating, OPL Generator, OPL Sentence Generating, OPL Sentence Set, OPCAT, OPD Constructing, System Model]

SMART – Procedural Steps
1. Automatic Extraction of Categories from Unstructured Text
2. Manual Editing of Categories
3. Automatic Search of OPM Relations
4. Automatic Generation of OPL Sentences
5. Manual Editing of the Results

Automatic Extraction of Categories from Unstructured Text
- Categorization engine implemented in Common LISP
- Categories = idiomatic phrases (word sequences) reflecting the underlying topics in a given corpus of documents
- Based on heuristics
- Could combine external ontologies, taxonomies, and thesauri
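
The slide does not spell out the engine's heuristics, so the following is only a minimal sketch of the general idea of treating recurring word sequences as candidate categories. It is written in Python rather than Common LISP, and the stop-word list, the function names, and the frequency threshold are all assumptions made for illustration, not SMART's actual rules.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real engine would use a far larger one.
STOP_WORDS = {"the", "a", "an", "of", "in", "into", "to", "and", "is", "are", "for", "by"}


def tokenize(text):
    """Lower-case the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def extract_categories(documents, max_len=3, min_count=2):
    """Return (phrase, count) pairs for word sequences that recur in the corpus.

    Assumed heuristic: keep n-grams of 2..max_len words that neither start
    nor end with a stop word and that occur at least min_count times.
    """
    counts = Counter()
    for doc in documents:
        tokens = tokenize(doc)
        for n in range(2, max_len + 1):
            for i in range(len(tokens) - n + 1):
                gram = tokens[i:i + n]
                if gram[0] in STOP_WORDS or gram[-1] in STOP_WORDS:
                    continue
                counts[" ".join(gram)] += 1
    return [(phrase, c) for phrase, c in counts.most_common() if c >= min_count]


docs = [
    "Actual documents are cached into the document repository.",
    "The document repository stores actual documents.",
]
categories = extract_categories(docs)
```

On this two-sentence corpus only "actual documents" and "document repository" occur twice, so they are the only candidates returned. An external thesaurus could be plugged in at the filtering step, for example to merge synonyms before counting.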

Manual Editing of Categories
- Selecting categories that can serve as things in the OPM model, and classifying them as either objects or processes
- Clustering alternative formulations of the selected OPM things based on their semantic similarity
- Optionally adding OPM things that did not show up among the extracted categories
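
This step is manual, but its outcome can be pictured as a small data structure: each OPM thing has a canonical name, a kind (object or process), and the alternative formulations clustered under it. The sketch below assumes that shape; the class and function names are hypothetical and do not come from SMART or OPCAT.

```python
from dataclasses import dataclass, field
from enum import Enum


class ThingKind(Enum):
    OBJECT = "object"
    PROCESS = "process"


@dataclass
class Thing:
    name: str                      # canonical name chosen by the editor
    kind: ThingKind                # object or process
    aliases: set = field(default_factory=set)  # alternative formulations


def build_things(decisions):
    """Fold editor decisions into a dictionary of OPM things.

    decisions: iterable of (extracted_category, canonical_name, kind).
    A thing that was not extracted is added by passing the canonical name
    as its own category.
    """
    things = {}
    for category, canonical, kind in decisions:
        thing = things.setdefault(canonical, Thing(canonical, kind))
        if thing.kind is not kind:
            raise ValueError(f"{canonical!r} classified as both object and process")
        if category != canonical:
            thing.aliases.add(category)
    return things


things = build_things([
    ("document repositories", "Document Repository", ThingKind.OBJECT),
    ("doc repository", "Document Repository", ThingKind.OBJECT),
    ("Caching", "Caching", ThingKind.PROCESS),  # added by hand
])
```

Keeping the aliases alongside the canonical name means the later relation search can recognize any of the formulations while the model shows only one thing.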

Automatic Search of OPM Relations
- Utilizes a set of configurable, predefined templates
  - A template consists of two things and the relation between them, expressed in alternative ways
- Utilizes second-order regular expressions defined on any lexical or grammatical attribute (part-of-speech, capitalization, punctuation)
- Implemented as a finite-state automaton that operates on a suffix-tree index consisting of tokens
- Compares word sequences instead of character strings
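
To make the template idea concrete, here is a minimal sketch that matches on token sequences and part-of-speech tags instead of characters. It deliberately swaps the finite-state automaton and suffix-tree index for a naive linear scan, and the `Token` class, the tag names, and the single "consists of" template with its alternative phrasings are illustrative assumptions rather than SMART's configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    text: str
    pos: str  # part-of-speech tag; assigned by hand in this sketch


# Tags assumed to be allowed inside the name of an OPM thing.
THING_TAGS = {"NOUN", "PROPN", "ADJ"}

# One relation, expressed in alternative ways (hypothetical template set).
TEMPLATES = {
    "consists of": [("consists", "of"), ("comprises",), ("is", "composed", "of")],
}


def phrase_of(tokens):
    return " ".join(t.text for t in tokens)


def find_relations(tokens):
    """Return (thing, relation, thing) triples found in a token sequence."""
    words = [t.text.lower() for t in tokens]
    found = []
    for i in range(len(tokens)):
        for relation, phrasings in TEMPLATES.items():
            for phrasing in phrasings:
                j = i + len(phrasing)
                if tuple(words[i:j]) != phrasing:
                    continue
                # Left thing: run of thing tokens ending just before the relation.
                left = i
                while left > 0 and tokens[left - 1].pos in THING_TAGS:
                    left -= 1
                # Right thing: skip determiners, then take a run of thing tokens.
                start = j
                while start < len(tokens) and tokens[start].pos == "DET":
                    start += 1
                end = start
                while end < len(tokens) and tokens[end].pos in THING_TAGS:
                    end += 1
                if left < i and end > start:
                    found.append(
                        (phrase_of(tokens[left:i]), relation, phrase_of(tokens[start:end]))
                    )
    return found


sentence = [
    Token("SMART", "PROPN"), Token("comprises", "VERB"), Token("a", "DET"),
    Token("Categorization", "NOUN"), Token("Engine", "NOUN"),
]
relations = find_relations(sentence)
```

Because the comparison unit is the token, matching conditions can be placed on any token attribute (here only the tag and the lower-cased word); a production matcher would index the corpus once and run all templates against that index rather than rescanning every sentence.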

Automatic Generation of OPL Sentences
- Every extracted natural language (NL) sentence is straightforwardly translated into OPL
- The outcome is reformulated to better reflect the underlying relations:
  - Custom relations are transformed into processes (cached into => Caching)
  - Complex relations are transformed into two equivalent simple sentences (Actual Documents Cached into Document Repositories => (1) Caching requires Actual Documents, (2) Caching yields Document Repositories)
- Transformations do not modify the underlying semantics of the NL sentences
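
The reformulation in the slide's example can be sketched as follows. This is a minimal illustration, assuming the custom relation starts with a regular past participle ending in "ed"; the function names and the naive participle-to-gerund rule are assumptions, not the OPL Generator's actual logic.

```python
def to_process_name(relation):
    """Turn a custom relation such as 'cached into' into a process name.

    Naive rule (an assumption): replace the trailing 'ed' of the first word
    with 'ing'. Irregular verbs and forms like 'copied' are not handled.
    """
    verb = relation.split()[0].lower()
    if not verb.endswith("ed"):
        raise ValueError(f"cannot derive a process name from {relation!r}")
    return (verb[:-2] + "ing").capitalize()


def to_opl(source, relation, target):
    """Split one complex relation into two simple OPL-style sentences."""
    process = to_process_name(relation)
    return [
        f"{process} requires {source}",
        f"{process} yields {target}",
    ]


sentences = to_opl("Actual Documents", "cached into", "Document Repositories")
# sentences == ["Caching requires Actual Documents",
#               "Caching yields Document Repositories"]
```

The two output sentences carry the same information as the original relation: the source thing becomes what the process requires and the target thing becomes what it yields.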

Manual Editing of the Results
- Non-semantic corrections: extraction did not depict all of the existing or implied relations
- Additions and eliminations: semantically modify the original output
- Scaling: applied to simplify the results without losing details

Benefits
- Significant cut-down in time and resources
- Minimizes effort
- Focus on the system overview (the "big picture")
- High-quality results
- Minimizes time-to-market

Future Research Directions
- Tested on EEC IST-2001-38100 GRACE (Grid Retrieval and Categorization Engine)
- To be utilized for system design in EEC IST-202-507126 COCOON (Building Knowledge-driven and Dynamically Networked Communities within European Healthcare Systems)
- Looking for a commercial pilot application