Download presentation
Presentation is loading. Please wait.
Published byWhitney Bridges Modified over 9 years ago
1
Software Applications for Processing Romanian Texts. Demonstration and Comparison Sanda Cherata Babeş-Bolyai University Faculty of Letters
2
2 Software Applications The Romanian Morphological Dictionary (DMR) – Software ITC SA – RoLingva www.rolingva.ro LEXICON – for updating attributes in lexical entries SIASTRO-AM – phrase analysis of noun, adjective, adverb, verb and prepositional phrases ETR – term extractor for Romanian specialised texts
3
3 DMR Paradigm of a given lemma classic form stem + termination Accents Syllabification Morphological analysis of a given word
4
4 Software Applications The Romanian Morphological Dictionary (DMR) – Software ITC SA – RoLingva www.rolingva.ro LEXICON – for updating attributes in lexical entries SIASTRO-AM – phrase analysis of noun, adjective, adverb, verb and prepositional phrases ETR – term extractor for Romanian specialised texts
5
5 LEXICON Specifying attributes for lexico-morphological classes Designed to collect data from multiple users Friendly interface
6
6 Software Applications The Romanian Morphological Dictionary (DMR) – Software ITC SA – RoLingva www.rolingva.ro LEXICON – for updating attributes in lexical entries SIASTRO-AM – phrase analysis of noun, adjective, adverb, verb and prepositional phrases ETR – term extractor for Romanian specialised texts
7
7 SIASTRO-AM Lexico-morphological analysis Parsing of noun, adjective, adverb, verb and prepositional phrases Uses a lexicon based on DMR, enriched with new lexical and syntactic attributes added with the LEXICON application Outputs an annotated text
8
8 SIASTRO-AM Tags for text elements sentence {F – Start sentence sentence sentence F} – End sentence word {C – Start word word word C} – End word unknown word {N – Start unknown word unknown word unknown word N} – End unknown word number {D – Start number number number D} – End number punctuation sign {S – Start punctuation sign punctuation sign punctuation sign S} – End punctuation sign hyphen {L – Start hyphen - hyphen L} – End hyphen ignored sequence {I – Start ignored sequence sequence ignored sequence I} – End ignored sequence
9
9 SIASTRO-AM Tags for words {C word ( part of speech + grammatical category +......, separates parts of speech + grammatical category +...... ) syllabification+accent position:, separates homographs (.......),....... (......) syllabification+ accent position:+ lemma +:...... C} {C date (vrb+p_fp+, (vrb+p_fp+, sbt+fdpn+fisn+fipn+fvpa+, sbt+fdpn+fisn+fipn+fvpa+, adj+fdpn+fisn+fipn+fvpa+ adj+fdpn+fisn+fipn+fvpa+ ) da-te+2:+da+:+dată+:+dat+: da-te+2:+da+:+dată+:+dat+:C}
10
10 Software Applications The Romanian Morphological Dictionary (DMR) – Software ITC SA – RoLingva www.rolingva.ro LEXICON – for updating attributes in lexical entries SIASTRO-AM – phrase analysis of noun, adjective, adverb, verb and prepositional phrases ETR – term extractor for Romanian specialised texts
11
11 ETR Desk top
12
12 ETR Menu bar
13
13 ETR Files menu
14
14 Files Menu – New Project
15
15 Files Files Menu – New Project - Files
16
16 Files Files Menu – New Project - Files
17
17 Subject Fields Files Menu – New Project Subject Fields
18
18 Abbreviations Files Menu – New Project - Abbreviations
19
19 Initialisms Files Menu – New Project - Initialisms
20
20 File File Menu – Open Project
21
21 Contexts File menu – Contexts
22
22 File menu – Terms
23
23 File menu – Terminological forms
24
24 View menu
25
25 View menu
26
26 Export menu
27
27 ETR – Term Extraction
28
28 ETR – Contexts
29
29 ETR – Move term in Terminological form
30
30 ETR – Terminological Forms – contexts
31
31 Source text
32
32 ETR – Terminological Form
33
33 ETR – Future Developments Syntactical analysis Enriching the terminological form by adding new terminological features
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.