ArchiWordNet Integrating WordNet with Domain-Specific Knowledge Luisa Bentivogli 1, Andrea Bocco 2, Emanuele Pianta 1 1 ITC-irst Trento, Italy 2 Politecnico.

Slides:



Advertisements
Similar presentations
ISDSI 2009 Francesco Guerra– Università di Modena e Reggio Emilia 1 DB unimo Searching for data and services F. Guerra 1, A. Maurino 2, M. Palmonari.
Advertisements

Building Wordnets Piek Vossen, Irion Technologies.
Ontological Resources and Top-Level Ontologies Nicola Guarino LADSEB-CNR, Padova, Italy
1 An innovative Policy-based Cross Certification methodology for Public Key Infrastructures V.Casola, A.Mazzeo, N.Mazzocca, M. Rak University of Naples.
Functional and non-functional requirements for building Service-oriented assessment model Adelina Aleksieva-Petrova Milen Petrov 5th TENCompetence Open.
Advanced Information Systems Laboratory Department of Computer Science and Systems Engineering GI-DAYS MÜNSTER A software tool.
what is VA? advantages tools a b c what is VA? advantages tools fully integrated architectural plug-in for Rhino 4 powerful feature-based editor to create.
Department of Software and Computing Systems Physical Modeling of Data Warehouses using UML Sergio Luján-Mora Juan Trujillo DOLAP 2004.
Prentice Hall, Database Systems Week 1 Introduction By Zekrullah Popal.
DEVELOPING LANGUAGES IN GENETICA A general and effective approach to domain-specific problem-solving is to use a high-level language specialized on the.
Multilingual multimedia thesaurus for conservation and restoration collaborative networked model of construction Lucijana Leoni University of Dubrovnik.
The Role of the UMLS in Vocabulary Control CENDI Conference “Controlled Vocabulary and the Internet” Stuart J. Nelson, MD.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
SE 555 Software Requirements & Specification1 Use-Case Modeling: Overview and Context.
Aligning Thesauri for an integrated Access to Cultural Heritage Collections Antoine ISAAC (including slides by Frank van Harmelen) STITCH Project UDC Conference.
UML CASE Tool. ABSTRACT Domain analysis enables identifying families of applications and capturing their terminology in order to assist and guide system.
CAD/CAM Design Process and the role of CAD. Design Process Engineering and manufacturing together form largest single economic activity of western civilization.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Software Issues Derived from Dr. Fawcett’s Slides Phil Pratt-Szeliga Fall 2009.
Knowledge organisation and information architecture, Nils Pharo Knowledge organisation and the Web Nils Pharo, 6th November 2002.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
FRE 2672 Urban Ontologies : the Towntology prototype towards case studies Chantal BERDIER (EDU), Catherine ROUSSEY (LIRIS)
Building a UI with Zen Pat McGibbon –Sales Engineer.
POLITECNICO DI TORINO DIPARTIMENTO CASA-CITTA’ ARCHIWORDNET, A BILINGUAL THESAURUS FOR ARCHITECTURE AND BUILDING: COMPILATION AND APPLICATION TO HYBRID.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Using WordNet Predicates for Multilingual Named Entity Recognition Matteo Negri and Bernardo Magnini ITC-irst Centro per la Ricerca Scientifica e Tecnologica,
Improving Design Workflow in Architectural Design Applications Presentation Doctoral Seminar 16/06/2006 Leuven (Belgium)
Methodology - Conceptual Database Design Transparencies
Methodology Conceptual Databases Design
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Odyssey A Reuse Environment based on Domain Models Prepared By: Mahmud Gabareen Eliad Cohen.
Semantic Enrichment of Ontology Mappings: A Linguistic-based Approach Patrick Arnold, Erhard Rahm University of Leipzig, Germany 17th East-European Conference.
Hyper/J and Concern Manipulation Environment. The need for AOSD tools and development environment AOSD requires a variety of tools Life cycle – support.
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
Nadir Saghar, Tony Pan, Ashish Sharma REST for Data Services.
Methodology - Conceptual Database Design
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Project Overview Vangelis Karkaletsis NCSR “Demokritos” Frascati, July 17, 2002 (IST )
Andreas Abecker Knowledge Management Research Group From Hypermedia Information Retrieval to Knowledge Management in Enterprises Andreas Abecker, Michael.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Working with Ontologies Introduction to DOGMA and related research.
Shawn Jones INDUS Corporation January 18, 2000 Open Forum on Metadata Registries Santa Fe, NM SDC JE-2029.
1 Aligning the Parasite Experiment Ontology and the Ontology for Biomedical Investigations Using AgreementMaker Valerie Cross, Cosmin Stroe Xueheng Hu,
SYNTHESIS An information system for administration documentation and promotion of cultural instances Center for Cultural Informatics Foundation for Research.
1 STO A Lexical Database of Danish for Language Technology Applications Anna Braasch Center for Sprogteknologi Copenhagen SPINN Seminar, October 27, 2001.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
Jemerson Pedernal IT 2.1 FUNDAMENTALS OF DATABASE APPLICATIONS by PEDERNAL, JEMERSON G. [BS-Computer Science] Palawan State University Computer Network.
Annotation of Multimedia Documents. Approaches to Cooperation and Personalization. Annotation System January 1998
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Oct Need for spatial hierarchy (what relevance do spatial concepts have in your domain and do they align with the current IFC spatial hierarchy?)
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Ontology Evaluation Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Learning Objectives • Add various components to a building model.
CCNT Lab of Zhejiang University
ece 627 intelligent web: ontology and beyond
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Data, Databases, and DBMSs
From Knowledge Organization (KO) to Knowledge Representation (KR)
Managing data Resources:
Methodology Conceptual Databases Design
Presentation transcript:

ArchiWordNet Integrating WordNet with Domain-Specific Knowledge Luisa Bentivogli 1, Andrea Bocco 2, Emanuele Pianta 1 1 ITC-irst Trento, Italy 2 Politecnico di Torino, Italy

GWC Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

GWC Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

GWC Brno, January 20-23, 2004 ArchiWordNet: a WordNet-like thesaurus A bilingual English/Italian thesaurus for the “Architecture and Construction” domain –structured according to the WordNet model –fully integrated with MultiWordNet MultiWordNet A multilingual lexical database in which the Italian WordNet is strictly aligned with Princeton’s English WordNet.

GWC Brno, January 20-23, 2004 Motivation Still Image Server, an architecture image archive available at the Polytechnic of Turin –need for a thesaurus: Image cataloguing (minimize subjectivity) Image retrieval (minimize ambiguity) No exhaustive thesauri for the architecture domain are available

GWC Brno, January 20-23, 2004 Why (Multi)WordNet model? A rich and rigorous structure –synonyms –many relations explicitly and homogeneously encoded Allows for a more powerful and expressive retrieval mechanism –no ambiguities –extended search with related concepts Is more suitable for educational purposes

GWC Brno, January 20-23, 2004 Why integrated with MultiWN? General and multilingual framework for the specialized knowledge Integrated access allowing for a more flexible retrieval of the information Information already existing in the generic (Multi)WordNet can be exploited in the creation of the specialized one

GWC Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

GWC Brno, January 20-23, 2004 Adopting MultiWN model Sources: –Specialized sources Art and Architecture Thesaurus (AAT) Construction Indexing Manual of CI|SfB International and National standards (ISO, CEN, UNI) Architecture and Building Dictionaries Domain literature –MultiWN itself Issues: –Reorganize specialized sources to make them compatible with the MultiWN model –Modify MultiWN synsets to make them suitable for representing the specialized domain

GWC Brno, January 20-23, 2004 Reorganizing domain-specific sources AAT hierarchy ArchiWN hierarchy

GWC Brno, January 20-23, 2004 Tailoring MultiWN synsets MultiWN synsets considered appropriate by the domain experts are included into ArchiWN Several options are available: –add or delete synonyms to MultiWN synsets –modify MultiWN definitions of the synsets –delete and add relations between synsets

GWC Brno, January 20-23, 2004 New relations for ArchiWN HAS FORM (n/n) –{tympanum} HAS-FORM {triangle, trigon, …} HAS ROLE (n/n) –{metal section} HAS-ROLE {upright, vertical} HAS FUNCTION (n/v) –{beam} HAS-FUNCTION {to hold, to support,…}

GWC Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

GWC Brno, January 20-23, 2004 Integrating ArchiWN with MultiWN 5,000 terms grouped in 13 semantic areas => the main ArchiWN hierarchies Architectural styles Materials Construction products Techniques Tools Components of buildings Single buildings and building complexes Physical properties Conditions Disciplines People Documents Drawings and representations

GWC Brno, January 20-23, 2004 Integration issues Identify the MultiWN nodes where to insert the ArchiWN hierarchies Include ArchiWN hierarchies in MultiWN Handle the overlaps between terms present in both MultiWN and ArchiWN Handle the possible inconsistencies in the hierarchies

GWC Brno, January 20-23, 2004 The integration methodology Basic operations –performed on single MultiWN synsets Complex procedures (plug-in) –apply to entire hierarchies

GWC Brno, January 20-23, 2004 Basic operations eclipse a synset tag a synset with the “architecture and construction” domain label add or delete relations to a synset add or delete synonyms in a synset modify the synset definition

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN MWN

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN MWN

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in MWN AWN

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN

GWC Brno, January 20-23, 2004 Complex procedures Substitutive plug-in Integrative plug-in Hyponymic plug-in Inverse plug-in AWN MWN

GWC Brno, January 20-23, 2004 Results 13 ArchiWN semantic areas plugged in 18 MultiWN synsets –11 ArchiWN semantic areas (12 hierarchies) directly plugged in MultiWN11 4 substitutive plug-ins 8 integrative plug-ins –2 ArchiWN semantic areas (6 hierarchies) required a reorganization of some MultiWN sub-hierarchies2 4 hyponymic plug-ins 2 inverse plug-ins large synset eclipsing

GWC Brno, January 20-23, 2004 ArchiWN up to now “Single buildings and building complexes” sub- hierarchy –900 synsets –Italian and English synonyms –accurate definition Work done manually using the MultiWN graphical interface which allows the user –to modify existing synsets and relations –to create new synsets

GWC Brno, January 20-23, 2004 Outline ArchiWordNet: a WordNet-like thesaurus Adopting and adapting the MultiWordNet model Integrating ArchiWordNet with MultiWordNet Conclusion and future work

GWC Brno, January 20-23, 2004 Conclusions It is possible to integrate ArchiWN with MultiWN MultiWN itself can be widely exploited in the creation of ArchiWN hierarchies Advantages of interdisciplinary cooperation –wrt specialized thesauri formalized structure inheritance of linguistic-oriented information from the generic WordNet –wrt lexical resources many synsets will be associated with images

GWC Brno, January 20-23, 2004 Future work Go on enriching the “Single buildings and building complexes” hierarchy and populating the remaining hierarchies Industrial applications: multilingual specialized lexicon of approximately 1,000 synsets for the window and curtain wall industry Agreement for the future usage of ArchiWN by the Piemonte region in the cataloguing of its architectural cultural heritage

GWC Brno, January 20-23, 2004 Details

GWC Brno, January 20-23, 2004 Direct plug-ins Architectural stylesarchitectural style/1Sub Materialsmaterial/1, substance/1Sub Construction productsbuilding material/1Sub Techniquestechnique/1Int Toolstool/1Int Physical propertiesphysical property/1Int Conditionscondition/1Int Disciplinesdiscipline/1Int Peopleperson/Int Documentsdocument/1Int Drawings and representationsdrawing/2,representation/2Int back

GWC Brno, January 20-23, 2004 Reorganizations back Components of buildingsstructure/1 component/3 region/1 Hypo Single buildings and building complexes structure/AWN building/1 building complex/1 Hypo Inverse

GWC Brno, January 20-23, 2004 Term overlapping ITC-irst provides the Polythecnic with lists of terms: -synsets tagged with the “architecture” label in WN-Domains -hyponyms of WordNet plug-in synsets WN-Domains: 2,595 Architecture = 155 synsets –Town planning = 444 synsets –Building industry = 1,541 synsets –Furniture = 455 synsets

GWC Brno, January 20-23, 2004 Hyponyms of Plug-in synsets Architectural stylesarchitectural style/1S12 hyponyms Materials material/1 substance/1 S 1,266 hyponyms 6,054 hyponyms Construction productsbuilding material/1S 95 hyponyms Techniquestechnique/1I 3 hyponyms Toolstool/1I301 hyponyms Physical propertiesphysical property/1I 103 hyponyms Conditionscondition/1I 1,721 hyponyms Disciplinesdiscipline/1I464 hyponyms Peopleperson/I6,068 hyponyms Documentsdocument/1I328 hyponyms Drawings and representations drawing/2, representation/2 IIII 26 hyponyms 159 hyponyms back

GWC Brno, January 20-23, 2004 building complex/1 room, area, building space building element open space entity/1 object/1 artifact/1 structure/1 architectural component part/4 location/1 structure (AWN) region/1 component/3 architectural space building/1 hypo inverse eclipsing Reorganization of: -Components of buildings -Single buildings and building complexes

GWC Brno, January 20-23, 2004 Modifying MultiWN definition structural_wall bearing_wall ISA an architectural partition with a height and length greater than its thickness; used to divide or enclose an area support wall partition divider any wall supporting a floor or the roof of a building WordNet: {wall – “an architectural partition with a height and length greater than its thickness; used to divide or enclose an area or to support another structure”}