1 / 25Sat. 31 Aug. 2002SEMANET Workshop Frameworks, Implementation & Open Problems for the Collaborative Building of a Multilingual Lexical Database Mathieu.

Slides:



Advertisements
Similar presentations
Automatically Populating Acception Lexical Database through Bilingual Dictionaries and Conceptual Vectors PAPILLON 2002 Mathieu Lafourcade LIRMM - France.
Advertisements

アドベンチャーにほんご1 L.1か4 CLASSROOM EXPRESSIONS. たんご Let’s begin. はじめましょう。 Stand. きりつ。 Bow. れい。 Sit. ちゃくせき。 ~is absent. (お)やすみです。 (the use of お O adds politeness.
あどべんちゃーにほんご L. 2か にほんごのきょうしつ /Japanese Classroom General goals of the lessons: You will be able to communicate the information below in the given situations.
Bellwork: Which is correct? 1) 2) 3) 4) 5) 6) o so yu ka me sa へいせい 二十五ねん 十がつ二十一にち・げつようび Assignments: -Get Hiragana test signed -Nan desu ka 1 Vocab. practice.
1 © S. Hamano and W. Kikuchi Visualizing Japanese Grammar Appendix Shoko Hamano George Washington University.
© S. Hamano and W. Kikuchi 1 Visualizing Japanese Grammar Appendix Shoko Hamano George Washington University.
Excelによる積分.
JPN 311: Conversation and Composition 確 ( たし ) かな情報 ( じょうほ う ) ・ 不確 ( ふたし ) かな情報 ( じょう ほう )
JPN 312 (Fall 2007): Conversation and Composition Contraction (2); 意見を言う (to express your opinion)
SUPJ2010 Japanese Ⅱ( A ) Elementary Japanes e ‐ in twenty hours- Chapter 7.
JPN 311: Conversation and Composition 勧誘 (invitation)
第十四課 単語 元気 II pp Nouns あに(兄) (my) older brother.
JPN 311: Conversation and Composition 伝言 (relaying a message)
JPN 311: Conversation and Composition 許可 (permission)
地図に親しむ 「しゅくしゃくのちがう 地図を 使ってきょりを調べよ う1」 小学4年 社会. 山口駅裁判所 県立 美術館 サビエル 記念聖堂 山口市役所 地図で探そう 市民会館 県立 図書館.
Created by Maki Meyer, AJSL KANJI 漢字ノート -1.
JPN 312 (Fall 2007): Conversation and Composition 文句 ( もんく ) を言う.
Elementary Japanese ‐in twenty hours- Chapter 9
漢字練習 第7課第7課第7課第7課. しる 知る まつ 待つ もつ 持つ まつ 待つ しる 知る.
「ネット社会の歩き方」レッスンキット プレゼンテーション資料集 15. チャットで個人情報は 言わない プレゼンテーション資料 著作権は独立行政法人情報処理推進機構( IPA )及び経済産業省に帰属します。
SUPJ1020 Japanese Ⅰ( B ) Elementary Japanes e ‐ in twenty hours- Chapter 1.
NODA Ken, KAJITA Satoshi and SASA Yoshinobu Truss Contest 2004 idea 男がこう言っ た。 「デザインにこだわろう。」 現場が一瞬凍りついた。 男はこう続けた。 「記録に残るだけではだめだ、記憶に残さなくて は。」 現場がひとつになった。
SUPJ2010 Japanese Ⅱ( A ) Elementary Japanes e ‐ in twenty hours- Review of Japanese Ⅰ (B)
Hiragana dan Katakana Hiragana dan Katakana Pertemuan 1 Matakuliah: N Bahasa Jepang l (Nihongo I) Tahun: 2008.
LANG3910 Japanese Ⅲ Lesson 14 依頼・現在進行形. 学習項目 1. 「て -form 」 2. 依頼表現 An expression of request 3. 相手の意向を尋ねる Ask someone’s mind 4. 現在進行形 Actions in Progress.
携帯電話でのコミュニ ケーションについて 1班真田 出水 佐伯 堺. 仮説  女性のほうが携帯電話を使ったコミュニ ケーションを重要視する。
Exercise IV-A p.164. What did they say? 何と言ってましたか。 1.I’m busy this month. 2.I’m busy next month, too. 3.I’m going shopping tomorrow. 4.I live in Kyoto.
JPN 312 (Fall 2007): Conversation and Composition 面接 ( めんせつ )
 Objectives 1. How do I say good morning? Afternoon? and other basic greetings? 挨拶 2. What is Japanese writing like? 3. Survey and Godzilla.
1/24 17/7/2002 (Papillon-02) Translation in Papillon (Ch. Boitet) The translation of examples, citations, definitions and glosses in the Papillon project.
GOOD, MULTILINGUAL interpretation, translation, resources What can we do for the OG-08? Christian BOITET GETA, CLIPS, IMAG-campus UJF & CNRS, Grenoble,
The “S” Line. Revision あ い う え お か く け.
B 02 Writing in Japanese: The 3 Character Sets How do you write in Japanese?
ASKING FOR CLASSROOM objectS
おはようございます! Ohayoo gozaimasu Good morning- (polite)
To join sentences in English we use ‘and’. To join sentences in Japanese we use the ‘ て form’. Example adjectives: It is big. It is fun. おおきいです。たのしいです。
Household Items (Unit 2 たんご) *House and Rooms *Locations *Saying/asking where something is.
A roadmap for MT : four « keys » to handle more languages, for all kinds of tasks, while making it possible to improve quality (on demand) International.
Joining Adjectives て form for い adjectives and な adjectives.
日本語一 1月 7 日 New Year’s Greetings : E b0.
たくさんの人がいっしょに乗れる乗り物を 「公共交通」といいます バスや電車 と 自動車 の よいところ と よくない ところ よいところ と よくない ところ を考えてみよう!
2006/12/081 Large Scale Crawling the Web for Parallel Texts Chikayama Taura lab. M1 Dai Saito.
日本語きほん文法の復習 Basic Japanese Grammar Review
と. What does it mean???? ビールを 飲むと ねむく なりま す 四月に なると さくらが さきま す たばこを すうと からだに わる いです.
Ho w to write ひらがな Left click the mouse to move through each of the slides. Place your mouse on each symbol to hear how it is said. When you see this.
2007/4/201 Extracting Parallel Texts from Massive Web Documents Chikayama Taura lab. M2 Dai Saito.
Useful daily expressions
The Climate as the Major Determinant Shaping Japanese National Character : True or False? B11567 Saki Yokomuro.
Self introduction From PowerPoint to interactive quiz.
 HOW TO SAY THAT YOU WANT TO DO SOMETHING  TALKING ABOUT FUTURE PLANS たい Form ほしいです.
12 月 1 6日 火曜日 チャプター5の文法(ぶんぽう)5. おしらせ announcement チャプターテストが、木曜日と金曜日に あります。 – 木曜日(もくようび) : all sections except writing. – 金曜日(きんようび) : writing Study guide.
PARTICLES The words that hold a Japanese sentence together.
By Bethany Haddon, Abbie Ulyatt. Hiragana (a – ro) あ、い、う、え、お。 か、き、く、け、こ。 さ、し、す、せ、そ。 た、ち、つ、て、と。 な、に、ぬ、ね、の。 は、ひ、ふ、へ、ほ。 ま、み、む、め、も。 ら、り、る、れ、ろ。
The Game Begins Background information screen (this may need to be presented some other way & need more detail.): Jicchan is your grandfather’s good friend.
B 04 How to Type in Japanese How do you TYPE in Japanese?
Japanese Jeopardy Answer the answers with questions. Or question the answers… Rules:
い 日本の どこに 行きたい です か。 Where do you want to go in Japan?
Event 8: The Old Lady スタート Start. ばあちゃんの質問に答えましょう ( Answer Baachan’s questions ) When the user clicks the help button, this instruction will appear in.
Bellwork: Write in English 1) きいて ください。 2) おそく なりました。 3) かいて ください。 4) もう いちど おねがい し ます。 へいせい 二十五ねん 九がつ十六にち – げつようび Assignments: -Get Test signed by your.
かぞく 家族. Today… Review family members vocabulary and kanji characters Enhance knowledge and understanding of connecting adjectives Answer questions in.
Chapter 6 Grammar. Japanese Adjectives There are two kinds of adjectives in Japanese; い adjectives and な adjectives. Both adjectives describe nouns, but.
Rethinking Retirement Unit 15. Paragraph 1  You probably haven’t thought much about retirement yet.  After all 要するに、つまり、結局は  Ahead of you 貴方の前には 
RELATIVE CLAUSES Adjectival Clauses/Modifiers. RELATIVE CLAUSES A relative clause is the part of a sentence which describes a noun Eg. The cake (which)
Plain Form Verb Box The first step to everyday Japanese.
Introduction to Japanese Language USE / Spring - Konya / TÜRKİYE.
雪 ゆき. 雪や こんこ ゆき.
Japanese I: Integrated Activities
Answer the answers with questions. Or question the answers…
Automatically Populating Acception Lexical Database through Bilingual Dictionaries and Conceptual Vectors PAPILLON 2002 Mathieu Lafourcade LIRMM -
Presentation transcript:

1 / 25Sat. 31 Aug. 2002SEMANET Workshop Frameworks, Implementation & Open Problems for the Collaborative Building of a Multilingual Lexical Database Mathieu Mangeot & Gilles Sérasset NII, Tokyo, Japan GETA-CLIPS, Grenoble, France

2 / 25Sat. 31 Aug. 2002SEMANET Workshop Outline  Presentation of Papillon Project  Macrostructure of the Dictionary  Microstructure of the Entries  Bootstrapping & Contribution Process  Limbo, Purgatory & Paradise  Bootstrapping with Conceptual Vectors  Contributions & Validation Process  Lexico-Semantical Network  Monolingual with Lexical Functions  Multilingual with Axies (Interlingual Links)  Conclusion & References

3 / 25Sat. 31 Aug. 2002SEMANET Workshop Motivations  Initial Goal  Build a French-Japanese electronic dictionary for humans  Very Few Existing Resources  French-Japanese, Free, Electronic  Construction Costs Too High  EDR English-Japanese Dictionary  1200 human-year; entries; price: 14,3 Mo ¥  On Going Collaborative Construction Projects  Edict Japanese->English, SAIKAM Japanese-Thai  Lack of Information  Numerical Specifiers, kanji+kana+romaji

4 / 25Sat. 31 Aug. 2002SEMANET Workshop Extended Goals  Build a More Complete Dictionary  Multilingual (English, French, German, Japanese, Lao, Malay, Thai, Vietnamese)  Multiusers (beginners, experts, applications)  Community Development  LINUX Construction Paradigm  Voluntary Contributors  Mutualization of the Resources  User Preferences & Profiles

5 / 25Sat. 31 Aug. 2002SEMANET Workshop Outline  Presentation of Papillon Project  Macrostructure of the Dictionary  Microstructure of the Entries  Bootstrapping & Contribution Process  Limbo, Purgatory & Paradise  Bootstrapping with Conceptual Vectors  Contributions & Validation Process  Lexico-Semantical Network  Monolingual with Lexical Functions  Multilingual with Axies (Interlingual Links)  Conclusion & References

6 / 25Sat. 31 Aug. 2002SEMANET Workshop Bilingual Dictionaries French English ThaiJapanese Malay Vietnamese Lao

7 / 25Sat. 31 Aug. 2002SEMANET Workshop Pivot Dictionary French English ThaiJapanese Malay Vietnamese Lao Int

8 / 25Sat. 31 Aug. 2002SEMANET Workshop Detailed Pivot Structure French DiCo Vocable affection n.f. lexie affection.1 (tendresse) lexie affection.2 (médecine) Interlingual Links (Axies) lexie maladie Vocable maladie n.f. Refinement Links English DiCo Vocable disease N lexie disease lexie affection Vocable affection N 病気 byouki 【びょうき】 Japanese DiCo Ref: Work done by Gilles Sérasset

9 / 25Sat. 31 Aug. 2002SEMANET Workshop Outline  Presentation of Papillon Project  Macrostructure of the Dictionary  Microstructure of the Entries  Bootstrapping & Contribution Process  Limbo, Purgatory & Paradise  Bootstrapping with Conceptual Vectors  Contributions & Validation Process  Lexico-Semantical Network  Monolingual with Lexical Functions  Multilingual with Axies (Interlingual Links)  Conclusion & References

10 / 25Sat. 31 Aug. 2002SEMANET Workshop Combinatorial Lexicography  From Meaning-Text Theory  Alain Polguère & Igor Mel’tchuk (U. de Montréal)  Gives the necessary information to go from an idea (the meaning) to its realisation in a given language (the text).  Existing Dictionaries: DEC, DiCo database & LAF  Same Structure for Every Language  56 Basic Lexical Functions

11 / 25Sat. 31 Aug. 2002SEMANET Workshop French Lexie (DiCo Entry)  Name of the Lexical Unit: MEURTRE  Grammatical Properties: nom, masc  Semantical Formula: action de tuer: ~ PAR L'individu X DE L'individu Y  Government Pattern: X =I = de N, A-poss Y= II = de N, A-poss  Lexical Functions:  {QSyn} assassinat,homicide#1;crime /*Quasi synonyms*/  {Oper 1 } accomplir, commettre, perpétrer [ART ~]; tremper [dans ART ~] /*Causes that X does a M.*/  {S 1 } auteur [de ART Ø]//meurtrier-n /*Name for X*/  {S 2 } victime [de ART Ø] /*Name for Y*/  Example: La mésentente pourrait être le mobile du meurtre.  Idioms:  _appel au meurtre_  _crier au meurtre_

12 / 25Sat. 31 Aug. 2002SEMANET Workshop Japanese Lexie  Name of the Lexical Unit: 殺人 【さつじん】  Reading: satsujin  Grammatical Properties: 名詞 【めいし】  Semantical Formula: どうさ : 人 Y の 人 X の ~  Government Pattern: X = I = N, Y = II = N の  Lexical Functions:  {QSyn} 殺戮【さつりく】, 殺害【さつがい】 /*Quasi synonyms*/  {Oper 1 } [~ を ] する; [~ を ] 犯す /* Causes that X does a M. */  {S 1 } 殺人者 【さつじんしゃ】, 殺人鬼 【さつじんき】 /*Name for X*/  {S 2 } 被害者【ひがいしゃ】 /*Name for Y*/  Example: 喧嘩【けんか】は殺人【さつじん】の動機【どうき】になり得【 え】るだろう。  Idioms:  _ 殺人剣 【さつじんけん】 _  _ 嘱託殺人 【しくたくさつじん】 _

13 / 25Sat. 31 Aug. 2002SEMANET Workshop Interlingual Links (Axies)  Links to other Axies  Synonyms, Refinement, Generalizations  Motivated by existing translation links.  Not like concepts  Links to External References  To be independent from any existing theory  Wordnet synsets, NTT Semantic category,  ONTOS or LexiGuide ontologies,  UNL Uws & Graphs etc.  Linking Monolingual Lexies

14 / 25Sat. 31 Aug. 2002SEMANET Workshop Structure of an Axie  Unique ID: a  Semantic Tag (entity, process, state, result): process  Links to lexies: fra: meurtre.1 eng: murder.1 jpn: satsujin.1  Links to other axies  synonym axies: a (assassination)  generic axies: a00002  refined axies: a  References to External Resources:  WordNet Synset: unlawful premeditated killing of a human being  UNL UW: murder(icl>action,agt>human,obj>human)  NTT Semantic Category  ONTOS Concept  LexiGuide concept

15 / 25Sat. 31 Aug. 2002SEMANET Workshop Outline  Presentation of Papillon Project  Macrostructure of the Dictionary  Microstructure of the Entries  Bootstrapping & Contribution Process  Limbo, Purgatory & Paradise  Bootstrapping with Conceptual Vectors  Contributions & Validation Process  Lexico-Semantical Network  Monolingual with Lexical Functions  Multilingual with Axies (Interlingual Links)  Conclusion & References

16 / 25Sat. 31 Aug. 2002SEMANET Workshop Preparation of the Existing Data Local Resources Export Recuperation DicDist DicOrig DicGen Contrib1 Contrib2 EDR FeM JMDict SAIKAM DiCo Contrib3 ELRA Limbo Original Format Purgatory XML Format Paradise Papillon Format Import Integration S pap Consultation

17 / 25Sat. 31 Aug. 2002SEMANET Workshop Introduction to Conceptual Vectors  An idea = a concept = a conceptual vector  The vector space is of K dimensions  K = nb of concepts in a thesaurus hierarchy  Eg: for French, Thesaurus Larousse = 873 concepts  One independent vector space for each language  Distance between 2 vectors = angular distance  D A (x, y) = acos(sim(x,y))  D A (x, y) = acos(x.y/|x||y|))  Ref: Work done by Mathieu Lafourcade 

18 / 25Sat. 31 Aug. 2002SEMANET Workshop demand English Vectorized monolingual dictionary English-French Bilingual dictionary v v v v Left over meaning Association demand.3 v defdemand.2 v defdemand.1 v def demand.1equivalents demand Slide from Mathieu Lafourcade Vector space for English demand.2equivalents demand.3equivalents demand.4equivalents Linking Word Senses with Vectors

19 / 25Sat. 31 Aug. 2002SEMANET Workshop Contributions & Validation Papillon Server Paradise Papillon Format S pap Voluntary Contributors Contributions User Space User Space Payed Specialists Validation Integration

20 / 25Sat. 31 Aug. 2002SEMANET Workshop Outline  Presentation of Papillon Project  Macrostructure of the Dictionary  Microstructure of the Entries  Bootstrapping & Contribution Process  Limbo, Purgatory & Paradise  Bootstrapping with Conceptual Vectors  Contributions & Validation Process  Lexico-Semantical Network  Monolingual with Lexical Functions  Multilingual with Axies (Interlingual Links)  Conclusion & References

21 / 25Sat. 31 Aug. 2002SEMANET Workshop Lexico Semantical Multilingual Network (1) DiCo French Assassinat Meurtre Assassination Murder _Lancer un appel au meurtre_ _To call for sb's assassination_ {Qsyn} {Qsyn} ? DiCo English Interlingual Links

22 / 25Sat. 31 Aug. 2002SEMANET Workshop Lexico Semantical Multilingual Network (2) DiCo French Meurtrier Meurtre 殺人者 【さつじんしゃ】 殺人 【さつじん】 {S0} {S0} ? DiCo Japanese Interlingual Links Satsujin (Murder) Satsujinsha (Murderer) _Lancer un appel au meurtre_

23 / 25Sat. 31 Aug. 2002SEMANET Workshop Outline  Presentation of Papillon Project  Macrostructure of the Dictionary  Microstructure of the Entries  Bootstrapping & Contribution Process  Limbo, Purgatory & Paradise  Bootstrapping with Conceptual Vectors  Contributions & Validation Process  Lexico-Semantical Network  Monolingual with Lexical Functions  Multilingual with Axies (Interlingual Links)  Conclusion & References

24 / 25Sat. 31 Aug. 2002SEMANET Workshop Conclusion  Framework for Experimenting Networks  Research Issues Remaining  Social issues: how to motivate people?  Contribution Interfaces  Checking Interfaces  The Project Cannot Succeed without the Help of the Public People (Voluntary Contributors)

25 / 25Sat. 31 Aug. 2002SEMANET Workshop References & Contacts  Web Site (information & consultation)   Steering Committee President  Gilles Sérasset  Technical Responsible in Japan  Mathieu Mangeot