A Common Ontology for Linguistic Concepts Scott Farrar University of Arizona.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

KR-2002 Panel/Debate Are Upper-Level Ontologies worth the effort? Chris Welty, IBM Research.
TU e technische universiteit eindhoven / department of mathematics and computer science Modeling User Input and Hypermedia Dynamics in Hera Databases and.
Copyright © 2002 Cycorp Introduction Fundamental Expression Types Top Level Collections Time and Dates Spatial Properties and Relations Event Types Information.
Computational Paradigms in the Humanities – eHumanities and their role and impact in transdisciplinary research Gerhard Budin University of Vienna.
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
The Wichita lexicon in LEXUS Armik Mirzayan University of Colorado at Boulder Jacquelijn Ringersma Max Planck Institute for Psycholinguistics RELISH Workshop.
General architecture of Functional Discourse Grammar.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
Searching the Semantic Web. Introduction  Research Focuses: IE Ontologies (creating, languages, merging, storing, querying)  Next Sep: Using the Semantic.
Introduction to Computational Linguistics Lecture 2.
A New Web Semantic Annotator Enabling A Machine Understandable Web BYU Spring Research Conference 2005 Yihong Ding Sponsored by NSF.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Information Modeling: The process and the required competencies of its participants Paul Frederiks Theo van der Weide.
LREC 2000 Athens, Greece An XML-based Encoding Standard for Language Corpora Nancy Ide Vassar College Patrice Bonhomme LORIA/CNRS Laurent Romary LORIA/CNRS.
What Linguists Want (we think) Helen Aristar Dry & Anthony Aristar LINGUIST List & E-MELD.
The Rosetta Project Digital Language Archive Laura Buszard-Welcher The Long Now Foundation / University of California, Berkeley.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
M.Hosseinzadeh EDC Translation Art or Skill Session.
Current Trends in Language Documentation and the Hans Rausing Endangered Languages Project Lenore A. Grenoble Dartmouth College Lenore A. Grenoble Linguistics.
July 11, 2003E-MELD 2003 E-MELD “School” of Best Practice Helen Aristar-Dry & Gayathri Sriram The LINGUIST List Eastern Michigan University.
An ontology of computing. What is an ontology? An ontology is a specification of a conceptualization. A specification of a representational vocabulary.
Sharing and Browsing Linguistic Data EMELD Arizona: Terry Langendoen Scott Farrar.
Principles of the GOLD Ontology & Conversion of GOLD to DCIF Presenters: Anthony Aristar, Evelyn Richter.
E-Meld Workshop on Digitization of lexical Information 3-5 August 2002, EMU, Ypsilanti Working Group on Lexicon Macrostructures Chairman’s Report Dafydd.
Survey of Semantic Annotation Platforms
Overview of technologies for translators and language service providers Belinda Maia University of Porto.
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
24 Jan 2005 Kick off meeting (Luxembourg) 1 LIRICS Linguistic Infrastructure for Interoperable Resources and Systems ►Kick off meeting presentation ►Proposal.
Chapter 6. Semantics is the study of the meaning of words, phrases and sentences. In semantic analysis, there is always an attempt to focus on what the.
INFRASTRUCTURE FOR GIS INTEROPERABLITY APPLICATION FACULTY OF INFORMATION AND COMMUNICATION TECHNOLOGY (FTMK) THE TECHNICAL UNIVERSITY OF MALAYSIA MELAKA.
Nov 21, 2005University of Texas at Austin The E-MELD Project Helen Aristar Dry & Anthony Aristar The LINGUIST List Eastern Michigan U & Wayne State U.
“D.A.I. & S.M. for KM” a synergy of complementary domains and challenges  the semantic web addicted people “please, raise your hands !”
Christopher Wellen M.Sc. Candidate McGill University On Cognition and Computation: An Introduction to Spatial Ontologies.
1 What is an Ontology? n No exact definition n A tool to help organize knowledge n Or a way to convey a theory on how to represent a class of things n.
An Ontology for Linguistic Representation Scott Farrar, Terry Langendoen, William Lewis University of Arizona.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Aug 2-5, 2002 EMELD Workshop Overview & Update Helen Aristar Dry The LINGUIST List & Eastern Michigan University EMELD Workshop on The Digitization.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Technology – Broad View Aspects that play a role when integrating archives leave the details of some core topics to the 2. day Bernhard Neumair:Base Technologies.
DenK and iCat Two Projects on Cooperative Electronic Assistants (CEA’s) Robbert-Jan Beun, Rogier van Eijk & Huub Prüst Department of Information and Computing.
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
Summary and Questions for Psycholinguistics. Psycholinguistics as cognitive study Stimuli (makeup of information) processing (functions & operations)
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
ReSeTrus Development of a digital library technology based on redundancy elimination and semantic elevation, with special emphasis on trust management.
ISO/TC37/SC4/N377 secretary report
Slide 1 SDTSSDTS FGDC CWG SDTS Revision Project ANSI INCITS L1 Project to Update SDTS FGDC CWG September 2, 2003.
Towards a roadmap for standardization in language technology Laurent Romary & Nancy Ide Loria-INRIA — Vassar College.
Semi-Automated Elicitation Corpus Generation The elicitation tool provides a simple interface for bilingual informants with no linguistic training and.
Language Language - a system for combining symbols (such as words) so that an unlimited number of meaningful statements can be made for the purpose of.
July 1-3, 2005 E-MELD 2005 Ontologies in Linguistic Annotation 1 The GOLD Effort So Far Terry Langendoen Brian Fitzsimons Emily Kidder Department of Linguistics.
Concept mining for programming automation. Problem ➲ A lot of trivial tasks that could be automated – Add field Patronim on Customer page. – Remove field.
Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
Constructing A Yami Language Lexicon Database from Yami Archiving Projects Meng-Chien Yang(Providence University, Taiwan) D. Victoria Rau(National Chung.
INTRODUCTION TO APPLIED LINGUISTICS
16 April 2011 Alan, Edison, etc, Saturday.. Knowledge, Planning and Robotics 1.Knowledge 2.Types of knowledge 3.Representation of knowledge 4.Planning.
NLP Midterm Solution #1 bilingual corpora –parallel corpus (document-aligned, sentence-aligned, word-aligned) (4) –comparable corpus (4) Source.
Find the Best Software Companies in Toronto
The Semantic Web By: Maulik Parikh.
Grammar Grammar analysis.
Language and Culture.
ece 627 intelligent web: ontology and beyond
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Introduction to Linguistics
Semantic Web - Ontologies
Introduction to Linguistics
Morphoogle - A Multilingual Interface to a Web Search Engine
Translation: key concepts
Discourse Analysis.
Presentation transcript:

A Common Ontology for Linguistic Concepts Scott Farrar University of Arizona

Endangered Languages As many as half of the world’s languages are in danger of disappearing LaPolla (1998) Including: Many languages in the Americas (Hopi), Africa, Australia (), and Southeast Asia (Biao Min).

EMELD EMELD (Electronic Metastructure for Endangered Languages Data) One of Application of EMELD: Make endangered languages available on the Semantic Web

Linguistic Field Work Linguists collect data Datasets (grammars, dictionaries, or glossed corpora) Hopi example of kachina: sivu-’ikwiw-ta-qa [vessel-carry: on: back-DUR-REL]

Problems Concerning Data Interoperability Dataset can vary according to: –markup –theoretical style –natural language semantics Az épület-be mégy-ek. the building-IllativeCase go-1P/SING I am going into the building.

Problems Concerning Data Interoperability Linguistic Data is Dynamic New data is collected. Datasets are revised. Theory changes.

Standardization is not Viable Text Encoding Initiative (TEI) (Sperberg- McQueen and Burnard 1994) Corpus Encoding Standard (CES) (Ide and Romary 2000)

Towards a Solution Data Storage and Distribution—local or distributed? Data model for linguistic datasets Linguistic ontology

EMELD Architecture EMELD Search Engine GUI HopiMocoviBiao Min Linguistic Ontology Semantic Web

Linguistic Ontology Conceptual Model for the Linguistics domain (special focus on morpho-syntax) Built on top of the Standard Upper Merged Ontology (SUMO) (Niles and Pease 2001) –already includes a number of concepts relating to semiotics and linguistics –incorporates concepts from a number of top-level ontologies –peer-reviewed and freely available

Backbone Taxonomy Entity Physical Object ContentBearingObject Icon SymbolicString LinguisticExpression WrittenLinguisticExpression Text Sentence Phrase Word Morpheme SpokenLinguisticExpression Dialogue Sentence Phrase Word Morpheme

Backbone Taxonomy (continued) Abstract Class Relation Predicate GrammaticalRelation Aspect Tense Case Agreement Attribute GrammaticalAttribute Gender Person Number

Morphosyntactic Case Case InherentCase Spatio-KineticCase PositionalCase InessiveCase DirectionalCase IllativeCase ExistentialCase AbessiveCase PartitiveCase InstrumentalCase StructuralCase GenitiveCase ErgativeCase NominativeCase

Future directions Include the domains of phonology and discourse analysis. The linguistics ontology has applications beyond the immediate EMELD project: –as part of an expert system for reasoning about language data –as part of an interlingua designed for machine translation systems

Contact Info Scott Farrar Will Lewis Terry Langendoen {farrar, wlewis,