TMF - a tutorial
Part 3: Designing (schemas and) filters
TMF - Terminological Markup Framework
Laurent Romary - Laboratoire Loria
General principles
• Terminological information interchange
  – Three components:
    » Source: TDB 1
    » Target: TDB 2
    » Terminological interchange format: a specific TML (DXLT, Geneter)
[Diagram: TDB 1, TDB 2, TML]
Important notice
– GMT is not a TML
• Too abstract a format
  – Uncontrolled recursion (struct element)
  – Uncontrolled content (feat and annot)
• A schema is needed to check interchanged data
  – Precise list of data categories
  – Precise definition of the format
  – GMT is here to provide conceptual simplicity
Designing filters: TML to GMT
General principles
• Just for your information
  – The creation of the filters can be automated
• Basic processes
  – Reduction of expansion trees
  – Mapping elements and attributes to the corresponding data categories
Reducing expansion trees
• Example: a DXLT (Martif) sub-tree and the corresponding GMT
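As a rough illustration of the reduction step, the sketch below assumes that a term section is written in the source format as the nested chain ntig/termGrp/term, and collapses that chain into a single GMT struct of type TS. The element names, the fragment and the function reduce_term_section are assumptions made for this example, not the tutorial's own filter.

```python
import xml.etree.ElementTree as ET

# Hypothetical DXLT-like fragment: the term section is "expanded" into
# three nested elements (ntig > termGrp > term).
DXLT_FRAGMENT = """
<ntig>
  <termGrp>
    <term>key money</term>
  </termGrp>
</ntig>
"""

def reduce_term_section(ntig):
    """Collapse the wrapper chain into one GMT struct (assumed type TS)."""
    struct = ET.Element("struct", {"type": "TS"})
    # Every non-empty leaf below the wrappers becomes a feat of the struct;
    # the wrapper elements themselves are dropped.
    for leaf in ntig.iter():
        if len(leaf) == 0 and leaf.text and leaf.text.strip():
            feat = ET.SubElement(struct, "feat", {"type": leaf.tag})
            feat.text = leaf.text.strip()
    return struct

gmt = reduce_term_section(ET.fromstring(DXLT_FRAGMENT))
print(ET.tostring(gmt, encoding="unicode"))
```

The three source levels end up as one struct holding one feat, which is the sense in which the expansion tree is "reduced".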
Element mapping
• Example: a DXLT (Martif) element with the content "Bla, bla, bla etc." and the corresponding GMT element with the same content
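Element mapping can be pictured as a lookup from a source element (and, where present, its type attribute) to a data category name. The sketch below is one possible way to express this. The table ELEMENT_TO_DATCAT, its entries and the function map_element are hypothetical and only serve the illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping table: (element name, value of its type attribute)
# -> data category name used on the GMT feat element.
ELEMENT_TO_DATCAT = {
    ("term", None): "term",
    ("descrip", "definition"): "definition",
    ("note", None): "note",
}

def map_element(elem):
    """Return a GMT feat for a mapped element, or None if no mapping is known."""
    datcat = ELEMENT_TO_DATCAT.get((elem.tag, elem.get("type")))
    if datcat is None:
        return None
    feat = ET.Element("feat", {"type": datcat})
    feat.text = elem.text  # the content is copied unchanged
    return feat

source = ET.fromstring('<descrip type="definition">Bla, bla, bla etc.</descrip>')
mapped = map_element(source)
print(ET.tostring(mapped, encoding="unicode"))
```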
Structural elements
• Generating a GMT struct element (example: TE)
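A minimal sketch of struct generation, assuming that the source format uses the elements termEntry, langSet and tig for the TE, LS and TS levels of the meta-model. These element names, the table STRUCT_LEVELS and the function to_struct are assumptions for illustration only.

```python
import xml.etree.ElementTree as ET

# Assumed correspondence between source elements and meta-model levels.
STRUCT_LEVELS = {"termEntry": "TE", "langSet": "LS", "tig": "TS"}

def to_struct(elem):
    """Build a GMT struct for a structural element, recursing into nested levels."""
    struct = ET.Element("struct", {"type": STRUCT_LEVELS[elem.tag]})
    for child in elem:
        # Only structural children are handled here; features are left out
        # to keep the sketch focused on the struct skeleton.
        if child.tag in STRUCT_LEVELS:
            struct.append(to_struct(child))
    return struct

entry = ET.fromstring("<termEntry><langSet><tig/></langSet></termEntry>")
skeleton = to_struct(entry)
print(ET.tostring(skeleton, encoding="unicode"))
```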
Features
• Generating a GMT feat element
  » (style=Attribute): iso identifier
Features
• Generating a GMT feat element
  » (style=Element): iso term
Features
• Generating a GMT feat element
  » (style=TypedElement): SubjectField
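The three styles differ only in where the source format stores the value and how the data category is recognized. The sketch below shows one hypothetical reading of each style. The source fragment, the attribute name id, the subject field value "agriculture" and all function names are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

def feat(datcat, value):
    """Create a GMT feat element carrying one data category value."""
    elem = ET.Element("feat", {"type": datcat})
    elem.text = value
    return elem

def from_attribute(elem, attr_name, datcat):
    """style=Attribute: the value is stored in an attribute of the source element."""
    return feat(datcat, elem.get(attr_name))

def from_element(elem, datcat):
    """style=Element: the element name alone identifies the data category."""
    return feat(datcat, elem.text)

def from_typed_element(elem):
    """style=TypedElement: a generic element whose type attribute names the data category."""
    return feat(elem.get("type"), elem.text)

# Hypothetical source fragment; the subject field value is made up.
entry = ET.fromstring(
    '<termEntry id="BTB-TY-398">'
    "<term>key money</term>"
    '<descrip type="subjectField">agriculture</descrip>'
    "</termEntry>"
)
feats = [
    from_attribute(entry, "id", "identifier"),
    from_element(entry.find("term"), "term"),
    from_typed_element(entry.find("descrip")),
]
for f in feats:
    print(ET.tostring(f, encoding="unicode"))
```

In all three cases the target is the same kind of GMT feat element; only the extraction step on the source side changes.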
XML Schemas for TMLs …work ahead…
Analysing existing TDBs: towards a generic methodology
General Architecture
[Diagram: TDB → (simple DB dumper) → Flat XML → (format-specific XSL stylesheet) → GMT → (automatic GMT2TML stylesheet) → TML]
A two-phase process
• List the various data categories used in the TDB
  – Relate them to existing registries (e.g. ISO 12620), cf.
• Identify the underlying organization of the TDB
  – Relate it to the meta-model
  – Anchor the data categories where they actually occur
Analysis of an existing TDB: going through an example
Eurodicautom sample
• Record fields:
  – BTB
  – DAG AG1 JUA
  – key money
  – CILF,Dict.Agriculture,ACCT,1977
  – pas-de-porte
  – prix payé au précédent occupant pour le droit d'entrer dans une exploitation agricole
  – TNC(1997)
  – CILF,Dict.Agriculture,ACCT,1977
  – droit rural;pratique prohibée par la loi
• Data category annotations (ISO 12620 reference, anchoring level):
  – definition: 12620A.5.1 (TS)
  – term: 12620A.1 (TS)
  – language: 12620A.10.7 (LS)
  – note: 12620A.8 (TS)
  – classificationCode: 12620A.4.2 (TE)
Result in GMT (1/2)
  BTB-TY-398
  BTB
  DAG AG1 JUA
  EN
  key money
  CILF,Dict.Agriculture,ACCT,1977
Result in GMT (2/2)
  fr
  pas-de-porte
  prix payé au précédent occupant pour le droit d'entrer dans une exploitation agricole
  TNC(1997)
  CILF,Dict.Agriculture,ACCT,1977
  droit rural;pratique prohibée par la loi
Simple rules
• Using XSL locality
Introducing specific levels
• Necessity to combine structure and content
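A flat record has no explicit language or term sections, so the filter has to create them from the content it meets. The sketch below illustrates the idea in Python rather than XSL. It assumes that each language field opens a new language section holding exactly one term section, that term-level fields always follow a language field, and that "DAG AG1 JUA" is the classification code of the entry. The names LEVEL, add_feat and build_entry are hypothetical.

```python
import xml.etree.ElementTree as ET

# Level at which each data category is anchored (TE, LS or TS), following
# the annotations of the Eurodicautom sample.
LEVEL = {
    "classificationCode": "TE",
    "language": "LS",
    "term": "TS",
    "definition": "TS",
    "note": "TS",
}

def add_feat(parent, datcat, value):
    """Append a GMT feat holding one data category value."""
    elem = ET.SubElement(parent, "feat", {"type": datcat})
    elem.text = value

def build_entry(flat_fields):
    """Turn a flat list of (data category, value) pairs into nested GMT structs."""
    te = ET.Element("struct", {"type": "TE"})
    ts = None
    for datcat, value in flat_fields:
        level = LEVEL[datcat]
        if level == "TE":
            add_feat(te, datcat, value)
        elif level == "LS":
            # Assumption: a language field opens a new LS with one TS inside.
            ls = ET.SubElement(te, "struct", {"type": "LS"})
            add_feat(ls, datcat, value)
            ts = ET.SubElement(ls, "struct", {"type": "TS"})
        else:
            add_feat(ts, datcat, value)
    return te

flat = [
    ("classificationCode", "DAG AG1 JUA"),  # assumed to be the classification code
    ("language", "en"),
    ("term", "key money"),
    ("language", "fr"),
    ("term", "pas-de-porte"),
    ("definition", "prix payé au précédent occupant pour le droit d'entrer "
                   "dans une exploitation agricole"),
    ("note", "droit rural;pratique prohibée par la loi"),
]
entry = build_entry(flat)
print(ET.tostring(entry, encoding="unicode"))
```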
Default rule
• Useful for keeping track of unmapped data categories
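The idea of a default rule can be sketched as a fallback branch that still emits output for any field without an explicit mapping, so that nothing is silently dropped. The field codes VE, DF, NT and XX, the type value "unmapped" and the source attribute are all hypothetical choices for this example, not part of GMT or of the tutorial's stylesheet.

```python
import xml.etree.ElementTree as ET

# Hypothetical flat field codes mapped to data category names.
RULES = {"VE": "term", "DF": "definition", "NT": "note"}

def convert_field(code, value):
    """Map one flat field to a GMT feat; unknown codes fall through to the default rule."""
    if code in RULES:
        elem = ET.Element("feat", {"type": RULES[code]})
    else:
        # Default rule: keep the value and record the unmapped source code.
        elem = ET.Element("feat", {"type": "unmapped", "source": code})
    elem.text = value
    return elem

fields = [("VE", "pas-de-porte"), ("XX", "TNC(1997)")]
converted = [convert_field(code, value) for code, value in fields]

# Collect the codes that still need a mapping.
unmapped = sorted({f.get("source") for f in converted if f.get("type") == "unmapped"})
print(unmapped)
```

Listing the unmapped codes after a run gives a simple to-do list of data categories that still need to be related to a registry.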
Useful pointers
• TMF page
• HLT/Salt project page
• Data category query tool