SIL FieldWorks Language Explorer: The lexicon component Gary Simons SIL International Lexicon Tools and Lexicon Standards Nijmegen, 4–5 August 2010.



2 SIL FieldWorks
- FieldWorks is:
  - a suite of integrated software tools to help field workers manage language and cultural data, with support for complex scripts.
- The Language Explorer tool is designed to:
  - manage a lexical database
  - produce dictionaries
  - interlinearize texts
  - analyze morphology

3 Quick Tour
- A short quick tour screen movie demonstrates the look and feel
- It is the first of 55 narrated screen movies available at:
  - /brief demo menu.html

4 Integration among areas
- The Lexicon, Texts, and Grammar areas all operate over the same database.
- In the Lexicon area, users enter lexical entries directly.
- In the Texts area, as new morphemes are glossed in text, new lexical entries are created behind the scenes.
- In the Grammar area, users describe the categories and features used in lexical description, plus the inflectional templates that guide automatic parsing in Texts.

5 Conceptual-modeling approach
- Lexicon, texts, and grammar are all stored in a single, normalized relational database.
- We began by working with domain experts to build a conceptual model of the areas and how they integrate.
- That was modeled in UML and transformed to a SQL relational database schema.
- See the full model with over 100 classes at:
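The idea of one normalized database shared by the Lexicon and Texts areas can be sketched in a few lines. This is only an illustration under stated assumptions: the three tables, their columns, the `gloss_morpheme` helper, and the sample form `kata` are all hypothetical and are not taken from the FieldWorks model, which has over 100 classes. The sketch shows how glossing a morpheme in a text can create a lexical entry behind the scenes, and how a second occurrence reuses that entry.

```python
import sqlite3

# Hypothetical, heavily simplified schema (not the FieldWorks schema).
SCHEMA = """
CREATE TABLE lex_entry (id INTEGER PRIMARY KEY, form TEXT NOT NULL UNIQUE);
CREATE TABLE text_doc  (id INTEGER PRIMARY KEY, title TEXT NOT NULL);
CREATE TABLE morph_token (
    id       INTEGER PRIMARY KEY,
    text_id  INTEGER NOT NULL REFERENCES text_doc(id),
    entry_id INTEGER NOT NULL REFERENCES lex_entry(id),
    gloss    TEXT
);
"""

def gloss_morpheme(db, text_id, form, gloss):
    """Record a glossed morpheme in a text, creating its lexical entry if new."""
    row = db.execute("SELECT id FROM lex_entry WHERE form = ?", (form,)).fetchone()
    if row is None:
        # First time this form is glossed: add it to the lexicon.
        entry_id = db.execute(
            "INSERT INTO lex_entry (form) VALUES (?)", (form,)
        ).lastrowid
    else:
        entry_id = row[0]
    db.execute(
        "INSERT INTO morph_token (text_id, entry_id, gloss) VALUES (?, ?, ?)",
        (text_id, entry_id, gloss),
    )
    return entry_id

db = sqlite3.connect(":memory:")
db.executescript(SCHEMA)
text_id = db.execute(
    "INSERT INTO text_doc (title) VALUES (?)", ("Sample text",)
).lastrowid

# "kata" is an invented placeholder form, glossed twice in the same text.
first = gloss_morpheme(db, text_id, "kata", "speak")
second = gloss_morpheme(db, text_id, "kata", "speak")
entry_count = db.execute("SELECT COUNT(*) FROM lex_entry").fetchone()[0]
token_count = db.execute("SELECT COUNT(*) FROM morph_token").fetchone()[0]
```

Because each token points at an entry rather than copying it, a correction made in the lexicon is visible from every text that uses the entry.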

6 Some key features
- Use automatic parsing to empirically verify morphological description within lexicon
- Build the word net via lexical relations
- Build richness into the lexicon by eliciting through semantic domains
- Use “bulk edit” for global clean up
- Repurpose content by developing multiple presentation views
- Clean separation between stored data and presentation (see example in next 2 slides)

7 Root-based dictionary (Cherokee)
- Stem entries just cross-refer to root
- Root entries list stems as subentries
- Subentries give full description

8 Stem-based dictionary (Cherokee)
- Stem entries give full description
- Root entries cross-refer to stems
- No subentries
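The two Cherokee slides describe two presentations of the same stored data. A minimal sketch of that separation follows, assuming a toy data layout in which each stem records its root once. The forms and glosses are placeholders, not Cherokee data, and the functions `root_based` and `stem_based` are hypothetical names, not FLEx features.

```python
# Hypothetical stored data: placeholders only, not real Cherokee forms.
ROOTS = {"ROOT1": "root gloss"}
STEMS = [
    {"form": "stem-a", "root": "ROOT1", "gloss": "gloss a"},
    {"form": "stem-b", "root": "ROOT1", "gloss": "gloss b"},
]

def _flatten(entries):
    """Sort entries by headword and return their lines in order."""
    ordered = sorted(entries, key=lambda e: e[0].lower())
    return [line for _, body in ordered for line in body]

def root_based(roots, stems):
    """Root entries carry full subentries; stem entries only cross-refer."""
    entries = []
    for root, gloss in roots.items():
        body = [f"{root}  {gloss}"]
        body += [f"    {s['form']}  {s['gloss']}" for s in stems if s["root"] == root]
        entries.append((root, body))
    for s in stems:
        entries.append((s["form"], [f"{s['form']}  see {s['root']}"]))
    return _flatten(entries)

def stem_based(roots, stems):
    """Stem entries carry the description; root entries only cross-refer."""
    entries = []
    for root in roots:
        refs = ", ".join(s["form"] for s in stems if s["root"] == root)
        entries.append((root, [f"{root}  see {refs}"]))
    for s in stems:
        entries.append((s["form"], [f"{s['form']}  {s['gloss']}"]))
    return _flatten(entries)

root_view = root_based(ROOTS, STEMS)
stem_view = stem_based(ROOTS, STEMS)
```

Neither function changes `ROOTS` or `STEMS`; switching dictionary style is a matter of choosing a different view over unchanged data.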

9 Pathways to publishing
- First create a “configured view” to display the lexical entries as desired
- Then use the Pathway plug-in to take this stream of configured content and lay it out onto pages for a publishable dictionary
- Publishing tools supported so far:
  - Prince XML (to PDF)
  - Open Office (to ODF)
  - Adobe InDesign

10 Lexical interchange
- Supports two import formats:
  - From Shoebox / Toolbox via SFM
    - “Standard Format Markers” = backslash codes
    - User configures the mapping of markers to conceptual equivalents in FLEx database
    - The default mapping is for MDF SFM
  - From WeSay / Lexique Pro via LIFT
    - Lexicon Interchange FormaT: an XML application for interchange of lexicons
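To make the marker-mapping step concrete, here is a small sketch of reading backslash-coded records and mapping markers to named fields. It is not the FLEx importer. The markers `\lx`, `\ps`, `\ge`, and `\de` are common MDF markers (lexeme, part of speech, English gloss, English definition), while the target field names, the sample records, and the `parse_sfm` function are assumptions made for illustration.

```python
import re

# Assumed mapping from MDF markers to field names; the field names are hypothetical.
MARKER_MAP = {
    "lx": "lexeme",
    "ps": "part_of_speech",
    "ge": "gloss_en",
    "de": "definition_en",
}

# Invented sample records; \xx stands in for a marker with no mapping.
SAMPLE_SFM = r"""
\lx kata
\ps v
\ge speak
\de to speak or talk

\lx mala
\ps n
\ge house
\xx unmapped field
"""

FIELD_LINE = re.compile(r"^\\(\w+)\s*(.*)$")

def parse_sfm(text, marker_map, record_marker="lx"):
    """Split SFM text into records and map each marker to a field name.

    Returns (records, unmapped_markers). Lines that do not start with a
    backslash marker are ignored in this sketch.
    """
    records, unmapped = [], set()
    current = None
    for raw in text.splitlines():
        match = FIELD_LINE.match(raw.strip())
        if not match:
            continue
        marker, value = match.group(1), match.group(2).strip()
        if marker == record_marker:
            # The record marker starts a new entry.
            current = {}
            records.append(current)
        if current is None:
            continue  # skip any fields that precede the first record
        field = marker_map.get(marker)
        if field is None:
            unmapped.add(marker)
        else:
            current.setdefault(field, []).append(value)
    return records, unmapped

records, unmapped = parse_sfm(SAMPLE_SFM, MARKER_MAP)
```

Collecting unmapped markers instead of discarding them silently lets a user see which fields still need a mapping before import.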

11 Lexicon export
- The entire database for a language project can be dumped to FieldWorks XML
  - XML model.doc
- The complete lexical database (a subset of the whole project) can be exported to:
  - LIFT XML
  - MDF-based SFM (either root- or stem-based)
  - options in Flex.doc
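As a rough picture of what a LIFT export contains, the sketch below serializes two invented entries to LIFT-style XML. The element and attribute names (`lift`, `entry`, `lexical-unit`, `form`, `text`, `sense`, `grammatical-info`, `gloss`) and the version value `0.13` reflect the LIFT format as commonly documented, but they should be checked against the LIFT specification. The `to_lift` function, the input dictionary keys, and the language code `und` are assumptions, and a real export carries far more detail.

```python
import xml.etree.ElementTree as ET

def to_lift(entries, vernacular="und", analysis="en"):
    """Serialize simple entry dicts to a LIFT-style XML string (illustrative only)."""
    root = ET.Element("lift", version="0.13")
    for e in entries:
        entry = ET.SubElement(root, "entry", id=e["id"])
        # Headword, tagged with the vernacular language code.
        unit = ET.SubElement(entry, "lexical-unit")
        form = ET.SubElement(unit, "form", lang=vernacular)
        ET.SubElement(form, "text").text = e["lexeme"]
        # One sense with part of speech and a gloss in the analysis language.
        sense = ET.SubElement(entry, "sense")
        ET.SubElement(sense, "grammatical-info", value=e["pos"])
        gloss = ET.SubElement(sense, "gloss", lang=analysis)
        ET.SubElement(gloss, "text").text = e["gloss"]
    return ET.tostring(root, encoding="unicode")

# Invented sample entries.
ENTRIES = [
    {"id": "kata_1", "lexeme": "kata", "pos": "Verb", "gloss": "speak"},
    {"id": "mala_1", "lexeme": "mala", "pos": "Noun", "gloss": "house"},
]

lift_xml = to_lift(ENTRIES)
parsed = ET.fromstring(lift_xml)
```

Parsing the string back, as in the last line, is a quick check that the output is well-formed XML before handing it to another tool.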

12 More lexicon export
- Any configured view can be exported to:
  - A streamlined version of FieldWorks XML
  - MDF-based SFM
  - XHTML + CSS for presentation
- Furthermore, one can create a FieldWorks XML Template (FXT) to define a custom export format (XML, SFM, plain text)
  - export options.doc

13 Interoperation with GOLD
- FLEx is preloaded with a grammatical categories catalog that is based on an early GOLD
- Similarly, a Morphosyntactic Gloss Assistant is preloaded with morphosyntactic properties from an early GOLD; see p. 10 of:
  - Preprint.pdf
- Thus morphosyntactic information in lexicon and texts is implicitly aligned with GOLD
- The remaining step is for us to map to GOLD ids when they are standardized; then we can easily export GOLD ids in LIFT and other XML

Uptake
- October 2009: FLEx 3.0 released in FieldWorks 6.0. Free download from:
- 323 members of a reasonably active Google Group (~3,000 messages)
- 185 language projects have registered as users
- Over 30 did a 4-day FLEx workshop led by Beth Bryson at InField
- Beth will also do a one-day FLEx workshop at ICLDC, Feb