>fre(;fre1) fres1(;fres1) -- ((cons fre1 fres1); fres). fres1(;fres) -> "," >fre(;fre1) fres1(;fres1) -- ((cons fre1 fres1); fres). fres1(;fres) -> "." -- (nil; fres). #start-symbol engfre-entry #lexical-rules eng -> >>/" :". fre -> >[,.]. #lexical-order (":" "," "." eng fre) #working-code (sia-defclass engfre () (eng fre)) RECUPDIC (ENGFRE (ENG. "a") (FRE LIST "un" "une")) (ENGFRE (ENG. "black") (FRE LIST "noir")) (ENGFRE (ENG. "love") (FRE LIST "aimer" "amour")) Example of textual dictionary AA h-grammar of textual dictionary AA Structured format called LISPO A Practical Guide to Lexical Data Acquisition with RECUPDIC : Part III How does RECUPDIC work ? � Take a textual dictionary and its h-grammar as input files. � Parsing and analyzing textual dictionary. � Transform into a structured form (LISPO). Advantages of LISPO format � The structured form easier to manage by computer. � Easy for using as input files to construct a new dictionary which is more complex. RECUPDIC"> >fre(;fre1) fres1(;fres1) -- ((cons fre1 fres1); fres). fres1(;fres) -> "," >fre(;fre1) fres1(;fres1) -- ((cons fre1 fres1); fres). fres1(;fres) -> "." -- (nil; fres). #start-symbol engfre-entry #lexical-rules eng -> >>/" :". fre -> >[,.]. #lexical-order (":" "," "." eng fre) #working-code (sia-defclass engfre () (eng fre)) RECUPDIC (ENGFRE (ENG. "a") (FRE LIST "un" "une")) (ENGFRE (ENG. "black") (FRE LIST "noir")) (ENGFRE (ENG. "love") (FRE LIST "aimer" "amour")) Example of textual dictionary AA h-grammar of textual dictionary AA Structured format called LISPO A Practical Guide to Lexical Data Acquisition with RECUPDIC : Part III How does RECUPDIC work ? � Take a textual dictionary and its h-grammar as input files. � Parsing and analyzing textual dictionary. � Transform into a structured form (LISPO). Advantages of LISPO format � The structured form easier to manage by computer. � Easy for using as input files to construct a new dictionary which is more complex. RECUPDIC">
Download presentation
Presentation is loading. Please wait.
Published byMatthew O’Neal’ Modified over 9 years ago
1
Motivation Facts Idea Problem ð Translation need dictionaries in large quantities and of good quality. ð Construction a new dictionary is expensive. ð Many digital dictionaries available and some are free. Reusing free digital dictionaries to create a new dictionary which is composed of lexical information in large quantities and qualities. Digital dictionaries are varied in format, quantity, quality, available linguistic information, etc. Acquisition of Dictionary Resources � Analyzing source text � Extracting necessary data � Building output structures General method unstructured form structured form Objective A Practical Guide to Lexical Data Acquisition with RECUPDIC
2
A Practical Guide to Lexical Data Acquisition with RECUPDIC : Part II Approach in RECUPDIC Text modification techniques + Syntax directed translator Text modification techniques + Syntax directed translator Grammar formalism called h-grammar + running engine How to do text modification ? � Search/replace with string pattern- machine mechanism (regular expression) � Scripting mechanismAdvantages � Facilitate parsing process. � Obtain the result without parsing. � Retain necessary information & strip unnecessary information. � Explicit unanalyzed structures � Pre-edit & post-edit h - grammar � Context-free grammar + variables, actions, conditions � Describing the text of an entry and the construction of the result format #grammar grammars name #syntax-rules all the syntax rules #start-symbol a start symbol #lexical-rules all the lexical rules #lexical-order the lexical orders #working-code CLOS-Code
3
A : un, une. Black : noir. Love : aimer, amour #grammar ENGFRE-TEST #syntax-rules engfre-entry(;entry) -> >eng(;eng) ":" fres(;fres) -- (#!engfre(eng fres); entry). fres(;fres) -> >fre(;fre1) fres1(;fres1) -- ((cons fre1 fres1); fres). fres1(;fres) -> "," >fre(;fre1) fres1(;fres1) -- ((cons fre1 fres1); fres). fres1(;fres) -> "." -- (nil; fres). #start-symbol engfre-entry #lexical-rules eng -> >>/" :". fre -> >[,.]. #lexical-order (":" "," "." eng fre) #working-code (sia-defclass engfre () (eng fre)) RECUPDIC (ENGFRE (ENG. "a") (FRE LIST "un" "une")) (ENGFRE (ENG. "black") (FRE LIST "noir")) (ENGFRE (ENG. "love") (FRE LIST "aimer" "amour")) Example of textual dictionary AA h-grammar of textual dictionary AA Structured format called LISPO A Practical Guide to Lexical Data Acquisition with RECUPDIC : Part III How does RECUPDIC work ? � Take a textual dictionary and its h-grammar as input files. � Parsing and analyzing textual dictionary. � Transform into a structured form (LISPO). Advantages of LISPO format � The structured form easier to manage by computer. � Easy for using as input files to construct a new dictionary which is more complex. RECUPDIC
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.