An agile process for the creation of conceptual models from content descriptions Hans-Werner Sehring Centre for Sustainable Content Logistics TuTech Innovation.

Slides:



Advertisements
Similar presentations
Objectives Create an action query to create a table
Advertisements

Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
Elliott / October Understanding the Construct to be Assessed Stephen N. Elliott, PhD Learning Science Institute & Dept. of Special Education Vanderbilt.
Objectives Explore a structured range of data Freeze rows and columns
Chapter 10: Designing Databases
Database Planning, Design, and Administration
Lecture plan Outline of DB design process Entity-relationship model
Word Tutorial 5 Working with Templates and Outlines
Database Systems: Design, Implementation, and Management Tenth Edition
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 12Slide 1 Software Design l Objectives To explain how a software design may be represented.
Lecture # 2 : Process Models
The software process A software process is a set of activities and associated results which lead to the production of a software product. This may involve.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
Object-Oriented Analysis and Design
Software Testing and Quality Assurance
7M701 1 Software Engineering Object-oriented Design Sommerville, Ian (2001) Software Engineering, 6 th edition: Chapter 12 )
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 4- 1.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 The Enhanced Entity- Relationship (EER) Model.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 7 Conceptual Data Modeling Using Entities and Relationships.
COST G9 - Work group 2 Cadastral science meeting Aalborg, Dk Modeling methodology for real estate transactions Radoš Šumrada Faculty.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 5 Understanding Entity Relationship Diagrams.
Object-Oriented Databases
Creating Architectural Descriptions. Outline Standardizing architectural descriptions: The IEEE has published, “Recommended Practice for Architectural.
Information Modeling: The process and the required competencies of its participants Paul Frederiks Theo van der Weide.
Copyright 2004 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Second Edition Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Appendix.
7M822 Software Engineering: System Models 14 September 2009.
Chapter 4 Entity Relationship (E-R) Modeling
Lecture Nine Database Planning, Design, and Administration
1 Prototype Hierarchy Based Clustering for the Categorization and Navigation of Web Collections Zhao-Yan Ming, Kai Wang and Tat-Seng Chua School of Computing,
Chapter 7: The Object-Oriented Approach to Requirements
Introduction To System Analysis and design
Chapter 9 Database Planning, Design, and Administration Sungchul Hong.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Object-oriented methodology object models use case modeling unified modeling language the data dictionary the cornucopia case portfolio project Systems.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 6 Slide 1 Requirements Engineering Processes l Processes used to discover, analyse and.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 4 Slide 1 Software Processes.
Software Processes lecture 8. Topics covered Software process models Process iteration Process activities The Rational Unified Process Computer-aided.
Copyright 2001 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Appendix A Object-Oriented.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 20 Object-Oriented.
Software Engineering Management Lecture 1 The Software Process.
11 Chapter 11 Object-Oriented Databases Database Systems: Design, Implementation, and Management 4th Edition Peter Rob & Carlos Coronel.
Approaching a Problem Where do we start? How do we proceed?
 Three-Schema Architecture Three-Schema Architecture  Internal Level Internal Level  Conceptual Level Conceptual Level  External Level External Level.
Requirements as Usecases Capturing the REQUIREMENT ANALYSIS DESIGN IMPLEMENTATION TEST.
Review of Software Process Models Review Class 1 Software Process Models CEN 4021 Class 2 – 01/12.
1 What is OO Design? OO Design is a process of invention, where developers create the abstractions necessary to meet the system’s requirements OO Design.
PROC-1 1. Software Development Process. PROC-2 A Process Software Development Process User’s Requirements Software System Unified Process: Component Based.
Object-Oriented Modeling: Static Models. Object-Oriented Modeling Model the system as interacting objects Model the system as interacting objects Match.
Ch- 8. Class Diagrams Class diagrams are the most common diagram found in modeling object- oriented systems. Class diagrams are important not only for.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 4 Slide 1 Software Processes.
Fusion Design Overview Object Interaction Graph Visibility Graph Class Descriptions Inheritance Graphs Fusion: Design The overall goal of Design is to.
Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall Appendix A Object-Oriented Analysis and Design A.1.
OBJECT-ORIENTED TESTING. TESTING OOA AND OOD MODELS Analysis and design models cannot be tested in the conventional sense. However, formal technical reviews.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 4- 1.
Database Design, Application Development, and Administration, 6 th Edition Copyright © 2015 by Michael V. Mannino. All rights reserved. Chapter 5 Understanding.
Chapter 9 Database Planning, Design, and Administration Transparencies © Pearson Education Limited 1995, 2005.
Introduction to Spatial Computing CSE 555
Elaboration popo.
UNIT-IV Designing Classes – Access Layer ‐ Object Storage ‐ Object Interoperability.
Chapter (12) – Old Version
The Enhanced Entity- Relationship (EER) Model
Chapter 12 Outline Overview of Object Database Concepts
Object-Oriented Analysis
Methontology: From Ontological art to Ontological Engineering
Chapter 20 Object-Oriented Analysis and Design
ITEC 3220A Using and Designing Database Systems
Copyright 2007 Oxford Consulting, Ltd
Enhanced Entity-Relationship (EER) Modeling
Practical Database Design and Tuning Objectives
Presentation transcript:

An agile process for the creation of conceptual models from content descriptions Hans-Werner Sehring Centre for Sustainable Content Logistics TuTech Innovation GmbH / Hamburg University of Technology Joint work with: Sebastian Boßung Henner Carl Joachim W. Schmidt

30 September 2007An agile modelling process - Hans-Werner Sehring, Outline 1.Conceptual Content Management 2.Asset expressions and schemata 3.The Asset Schema Inference Process 4.Straight-forward schema inference 5.Cluster-based schema inference 6.Process evaluation 7.Summary and outlook

30 September 2007An agile modelling process - Hans-Werner Sehring, Conceptual Content Management Conceptual Content Management (CCM) –an approach to domain modelling –inspired by epistemology: entity description by classes and instances, called Assets –Assets are dual entity descriptions consisting of content visualising it and a conceptual model describing it –model-based system generation Features: –modelling is carried out by domain experts –domain models are open to changes –existing work is preserved, even if changes are applied –communication between domain experts with individual models is maintained

30 September 2007An agile modelling process - Hans-Werner Sehring, CCM dynamics CCM systems (CCMSs) are dynamically generated from domain models: –immediately realizing model changes –preserving existing Assets –maintaining communication Key contributions to this end: –modelling language –model compiler –architecture for evolvable systems model Historiography from Time import Timestamp from Topology import Place class Professor { content image concept characteristic n :String relationship publs :Work* } Intermediate model (parse tree) ………… a:AssetClassb:AssetClass m:Model superClass Political_Iconography (PI) ArtistsRegents m client1 client ( Regents ) m client client ( PI ) m med2 mediation ( Regents, Artists ) DB ( Regents ) m client2 client ( Artists ) DB ( Artists ) m med1 mediation ( PI, ( Regents, Artists )) m distrib1 distribution ( PI, Regents ) m distrib2 distribution ( PI, Artists ) DB ( PI )

30 September 2007An agile modelling process - Hans-Werner Sehring, Model-driven development All SW development starts with a conceptual model –especially model-driven development approaches call for models with a sufficient degree of formality –CCM is similar to model-driven development in the respect that software creation is highly automated –in CCM, software generation is even dynamic A CCM model is required as a starting point for CCMSs –usually, some modelling expert (analyst) is consulted –due to dynamics requirement, such a modelling expert cannot be employed in CCM –domain experts are not modelling experts; usually have problems with, e.g., sufficient formality –but: experts can “tell their story” by providing examples

30 September 2007An agile modelling process - Hans-Werner Sehring, Asset expressions and schemata In many domains research starts by regarding instances (samples), not concepts

30 September 2007An agile modelling process - Hans-Werner Sehring, Asset model from the example Manually defined classes for the example: model Historiography from Time import Timestamp from Topology import Place class Professor { content image concept characteristic name :String relationship publications :Work* } class Work { content scan concept characteristic title :String relationship concerns :Professor* relationship issued :Issuing relationship reviewers :Professor*} class Issuing { concept relationship issued :Place relationship issuedBy :Professor relationship issuedWhen :Timestamp } Models consisting of classes Classes with content handles and attributes (and constraints) characteristics relationships Models consisting of classes Classes with content handles and attributes (and constraints) characteristics relationships

30 September 2007An agile modelling process - Hans-Werner Sehring, Asset model from the example (cont’d) Example of personalisation: a domain expert introduces the distinction of documents: model MyHistoriography from Historiography import Work, Professor class Work { concept relationship reviewer unused } class Dissertation refines Work { concept relationship reviewer :Professor* } Import and redefinition of classes for schema evolution (user communities) personalisation (single users) … Import and redefinition of classes for schema evolution (user communities) personalisation (single users) …

30 September 2007An agile modelling process - Hans-Werner Sehring, Asset Schema Inference Process (ASIP) Bootstrapping: CCM itself requires an initial model as a starting point for the open dynamic modelling process Required: sytematic support for domain experts in finding suitable models Start with Asset Expressions: –content abstractions and applications: assigned names and bound values –semantic types (concepts): no inner structure Concepts and classes are not distinguished in CCM models, intensional and extensional definitions Free-form entity descriptions are used as samples; later they become instances of classes reviewer: Professor : Professor

30 September 2007An agile modelling process - Hans-Werner Sehring, Agile CCMS development Agility: –based on the possibility to generate CCMSs dynamically –domain experts review their models based on experiences with an operational CCMS –if changes to the model are required, another iteration of the process is started –entity descriptions created within the CCMS can be used as samples for the next iteration of the process Create Asset expressions Construct schema Generate CCMS

30 September 2007An agile modelling process - Hans-Werner Sehring, ASIP phases The ASIP has four phases Sample acquisition Schema inference Feedback questions Prototype generation System generation unhappy with schema: -modify samples (- modify schema) answer questions Phase 1 Phase 2 Phase 3 Phase 4

30 September 2007An agile modelling process - Hans-Werner Sehring, Two schema inference experiments Experiments with alternatives for phases 2 and 3: –(traditional) schema inference plus user feedback straight-forward approach starting from singletons –clustering, supervised by domain experts statistical approach, semi-supervised learning Phase 3 (generation of questions to gather feedback) is determined by the alternative chosen Result of phases 1-3 is a CCM model: –prototype generation and system generation (phase 4) are carried out by the CCM model compiler –the domain expert can modify the inferred schema (openness and dynamics)

30 September 2007An agile modelling process - Hans-Werner Sehring, Straight-forward schema inference Schema construction by traditional schema inference 1. derive naive classes directly from the set of samples 2. apply simplifications 3. if changes where applied to the schema, repeat step 2 Step 1: for each sample create an Asset class with –a content handle whose type is determined by the encoding format of the sample’s content –attributes for all abstractions over the content characteristics for certain known types relationships for other types no further constraints

30 September 2007An agile modelling process - Hans-Werner Sehring, Schema simplification Step 2: simplifications, repeatedly applied in the specified order –identical class: unify classes with attributes and content handles with identical names and types –inheritance: subtype relationship of classes whose sets of attributes are in a subset relationship –type match: if two classes have attributes and content handles of identical types, prompt expert for unification –inheritance orphan: ask domain expert about removal of classes with only few instances Note: –often classes considered equal if the attributes’ types match –here the name is considered, or else feedback is collected

30 September 2007An agile modelling process - Hans-Werner Sehring, Cluster-based schema inference Schema construction by clustering: –cluster samples, create classes from clusters –experiment based on k-means algorithm Clustering steps: –classification: assign classes to clusters based on distance measure d : d(s,c) = α  d sem (s,c) + (1-α)  d struct (s,c),α  [0..1] –optimisation: recompute the cluster centres –inheritance hierarchy creation: like in the simple approach –feedback: visualise the clusters, allow to partition clusters => semi-supervised learning Less user interaction than in the traditional approach

30 September 2007An agile modelling process - Hans-Werner Sehring, Structural distance measure d struct is based on the length of the shortest edit script (similar to string matching) Costs like: edit operationcost magnitude add attributelow remove attributehigh change attribute namelow broaden attribute typemedium narrow attribute typevery low increase cardinality of attribute valuemedium decrease cardinality of attribute valuevery low

30 September 2007An agile modelling process - Hans-Werner Sehring, Semantic distance measure d sem is determined by the shortest paths in the class hierarchy 1/2 h(T1) if T 1 is direct supertype of T C d sem (T 1,T m ) + d sem (T m,T C ) if T 1 is direct supertype of T m d sem (s,c) = and T m is supertype of T C d sem (T S,T 1 ) + d sem (T S,T C ) if T S is the most specific common supertype of T 1 and T C

30 September 2007An agile modelling process - Hans-Werner Sehring, Process evaluation Schema quality: –generally difficult to judge –for domain modelling: not a schema that describes sample best, but model that best represents the application domain Criteria [Cherfi, Akoka, Comyn-Wattiau]: –specification: graphical legibility simplicity expressiveness syntactical correctness semantic correctness –usage: completeness, understandability –implementation: implementability, maintainability

30 September 2007An agile modelling process - Hans-Werner Sehring, Process evaluation (cont’d) Selected parameters: –simplicity: in general depends on the given sample set domain expert’s answers in feedback phase –syntactical correctness: granted by model generation –semantic correctness: can be negatively impacted by structurally coinciding classes with different meanings –understandability: generated class names can be an obstacle but: generated system lowers impact of schema –implementability: by generation –maintainability: through dynamics

30 September 2007An agile modelling process - Hans-Werner Sehring, Summary and outlook Summary: –Conceptual Content Management allows domain experts to provide and individually change domain models –domain experts are usually no modelling experts, and they prefer to start with samples describing observations –a process helps domain experts defining initial models to start the open dynamic CCM activity –as one novel approach a cluster-based schema inference process has been investigated Outlook: future work will include … –the inclusion of the cluster-based approach into the open modelling for extensional concept definitions –the employment of reasoning techniques (induction, abduction) to guide the schema construction process