A method to restrict the blow-up of hypotheses... A method to restrict the blow-up of hypotheses of a non-disambiguated shallow machine translation system Jernej Vičič
A method to restrict the blow-up of hypotheses... Outline 1. MT 2. RBMT 3. Apertium 4. Original Architecture 5. Problem 6. Changed Architecture 7. New problems
A method to restrict the blow-up of hypotheses... 1 MT Machine translation (MT) is the application (ANY) (by Jernej) of computers to the task of translating texts from one natural language to another (EAMT) Full-fledged translation of natural languages with no user support
A method to restrict the blow-up of hypotheses... 1 MT, continued
A method to restrict the blow-up of hypotheses... RBMT Rule-Based Machine Translation Main disadvantages: Production process for the resources Automatic production of resources: Morphologically annotated data Bilingual translation dictionaries Translation rules
A method to restrict the blow-up of hypotheses...
1 Original architecture /home/jernej/clanki/novo/RANLP2009/Original_design.ep s
A method to restrict the blow-up of hypotheses... POS tagger 1. POS – Part Of Speech 2. Morphology and (some) Syntax 3. Example: drevesadrevoNcnsg drevesidrevoNcnda 4. Disambiguation
A method to restrict the blow-up of hypotheses... 2 Changed architecture
A method to restrict the blow-up of hypotheses... New Problems ambiguities words in a sentence 3. BUUUM
A method to restrict the blow-up of hypotheses... Multiple candidate selector 1. All possible ambiguous candidates 2. Combinatorial explosion
A method to restrict the blow-up of hypotheses... Ranking module 1. Stochastic ranker 2. Based on 2/3-gram Statistical Language Module
A method to restrict the blow-up of hypotheses... Multiple candidate selector 1. All possible ambiguous candidates 2. Combinatorial explosion
A method to restrict the blow-up of hypotheses... Results 1. Better than the original system 2. Numbers are irrelevant:)