Presentation is loading. Please wait.

Presentation is loading. Please wait.

MESMUSES methodology Lessons learned and open issues… Alain Michard Florence, June 2003.

Similar presentations


Presentation on theme: "MESMUSES methodology Lessons learned and open issues… Alain Michard Florence, June 2003."— Presentation transcript:

1 MESMUSES methodology Lessons learned and open issues… Alain Michard Florence, June 2003

2 MESMUSES broad vision Just like several other projects SW is all about semantic interoperability Sharing machine-readable terminologies and classification schemes Science and culture are collective and international Semantic Web methodology should be highly relevant for managing and sharing scientific and cultural information

3 Some key S&T issues in the Project Model : is RDFS / OWL-Lite adequate ? Schema authoring : method and tools needed ! Metadata : where does it come from ? Automatic Indexing : experiments with a categorizer

4 The basic SW model Dwelling Person Artefact House Artist Artwork Lives-in Owner Produces Create Type : texte imprimé, monographie Auteur(s) : Zola, Émile (1840-1902) Titre(s) : L'assommoir [Texte imprimé] / par Emile Zola Edition : 50e éd. Publication : Paris : G. Charpentier, 1878 Description matérielle : 111-569 p. Notice n° : FRBNF35963044 Creates Lives-in Surrogates Schema Real-world entities

5 Model and Schema Language Typed attributes are needed XML-Schema types Derived types (e.g.: Celsius temperature, Gregorian date, etc.) Enumerated types, thesauri Time-stamping Cardinality constraints Explicit transitivity of properties (e.g.: geographic inclusion)

6 Schema authoring issues (1) Find the right level of abstraction Is « Glucid » a class or an instance ? Or is it sometime a class and sometime an instance ? Avoid the « KR » attitude and practices ! It’s all about indexing resources with shared terminologies, not about representing human knowledge !

7 Schema authoring issues (2) est-régulé-par est-expliquée-par Processus élémentair e Processus complexe est-réalisé-par nécessite déclenche Structure Cellule Molécule Organisme Appareil Organe Tissus Système GTANS Grande Thématique est-documentée-par est-constitué-de consomme transforme produit implique est-constitué-de élimine ISA

8 Schema authoring issues (3)

9 Schema authoring issues (4) Authoring tools are badly needed Graphical representation of the schema Zooming on sub-graphs (hierarchies) Versioning Consider using UML authoring environment ? Established methodology and tutorials are needed

10 Creating Surrogates Data extraction and fusion from structured sources R-DB, XML-DB, LDAP Updating When ? Should not create duplicates ! Detect cross-references Authority lists Thesauri Lexical distance ???

11 Automatic Categorization Automatic indexing By extracting metadata from resources By automatic categorization Define hierarchies of « concepts » inside the schema Seeding with representative documents Machine learning to create categorizers Pros : enriched search functionality Cons : hierarchies of categories are static Adding a category may change the categorizers of the others

12 Bottom-line… RDFS schema authoring may be more difficult than E-R modelling Debates on syntactic features are irrelevant Should be grounded on real-world implementations and testbeds A new query language (e.g.: RQL) is not high priority We have not addressed the « logical rules » layer Semantic Web vs. Community Webs


Download ppt "MESMUSES methodology Lessons learned and open issues… Alain Michard Florence, June 2003."

Similar presentations


Ads by Google