Download presentation
Presentation is loading. Please wait.
Published byArabella McGee Modified over 9 years ago
1
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Language Resource and Language Technology Virach Sornlertlamvanich NECTEC, Thailand TCL, NICT ALRC, AFNLP 1
2
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ALRC, AFNLP ASIAN LANGUAGE RESOURCES COMMITTEE, ASIAN FEDERATION OF NATURAL LANGUAGE PROCESSING 2
3
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR AFNLP Jun’ichi Tsujii President Key-Sun ChoiVice President Keh-Yih Su Secretary General Kam-Fai Wong Honorary Treasurer Yuji Matsumoto Chair of CCC (Conference Coordinating Committee) Haizhou Li Chair of CLC (Communications and Liaison Committee) Virach Sornlertlamvanich Chair of ALRC (Asian Language Resources Committee) Benjamin TsouChair of NCAC (Nominations and Constitutional Affairs Committee) Mark Steedman ACL liaison member to AFNLP Rajeev Sangal Chengqing Zong 3
4
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR 4
5
Role of ALRC, AFNLP 1. ALR Workshop Take initiative in setting up ALR Workshop in every other year. This is to consider as an attaching workshop to a major conference such as IJCNLP. It involves setting up the workshop and program chairs. The process should start at the latest as soon as the call for workshop proposal has been announced, so that the workshop and program chairs can be announced at the appropriate time. The Chair must interact with the workshop chair to ensure that the workshop preparations are proceeding smoothly. 2. LR catalogue Throughout the year, monitor and maintain the LR catalogue up to the date. 5
6
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ALR Workshop in the Past 1.Tokyo, Japan, under the name of Symposium on Language Resources in Asia, 2001 2.Tokyo, Japan, in conjunction with the 6th Natural Language Processing Pacific Rim Symposium, National Center of Sciences, 2001 3.Taipei, Taiwan, in conjunction with Coling2002 4.Sanya City, Hainan Island, China, in conjunction with IJCNLP2004 5.Jeju Island, Korea, in conjunction with IJCNLP2005 6.Hyderabad, India, in conjunction with IJCNLP2008 6
7
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR The 7th Workshop on Asian Language Resources Co-Chair: –Hammam Riza - IPTEKnet-BPPT, Indonesia –Virach Sornlertlamvanich - NECTEC, Thailand Venue: –Aug 7, 2009 –ACL-IJCNLP 2009, Singapore, Aug 2-7, 2009 –http://www.acl-ijcnlp-2009.org/main/workshops.htmlhttp://www.acl-ijcnlp-2009.org/main/workshops.html Important Date: –Paper submission due May 1, 2009 –Demo session requests due May 8, 2009 –Notification of acceptance July 1, 2009 –Camera-ready papers due June 7, 2009 7
8
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR LR Catalogue http://www.tcllab.org/add http://www.shachi.org/ 8
9
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR 9
10
10
11
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ADD ASIAN APPLIED NATURAL LANGUAGE PROCESSING FOR LINGUISTICS DIVERSITY AND LANGUAGE RESOURCE DEVELOPMENT 11
12
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Asian Applied Natural Language Processing for Linguistics Diversity and Language Resource Development (ADD) Objective:- –Build experts in NLP –Build a human network of NLP expert for sharing the experience, expertise, and collaboration in studying and applying NLP –Support the development of language resources for studying and evaluating the technology –Support the development of standards for language resource development –Support the research and development of NLP common utilities –Support the implementation of the existing NLP utilities 12
13
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Asian Applied Natural Language Processing for Linguistics Diversity and Language Resource Development (ADD) Organizer and Supporter:- –NICT Asia Research Center –Asian Language Resources Network Project (ALRN) –National Electronics Computer and Technology Center (NECTEC) –Sirindhorn International Institute of Technology (SIIT) –Asia-Pacific Association for Machine Translation (AAMT) –Asian Federation of Natural Language Processing (AFNLP) –PAN Localization Project, CRULP 13
14
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ADD School and Workshop ADD-1: Introduction to NLP –August 21–September 1, 2006 SIIT, Bangkok, Thailand ADD-2: Advanced NLP (Special Topic on Morpho-Syntactic Anaysis) –March 6-14, 2007 Thammasart University, Bangkok, Thailand ADD-3: Advanced NLP (Special Topic on Image and Speech processing) –February 25–March 1, 2008 SIIT, Bangkok, Thailand 14
15
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ADD-1 27 from 34 applications of 12 countries –Bhutan1 –Cambodia2 –Indonesia2 –Lao3 –Mongolia1 –Myanmar3 –Nepal3 –Pakistan3 –Sri Lanka1 –Thailandopen –US1 –Vietnam7 ADD-2 36 from 42 applications of 13 countries –Bangladesh2 –Bhutan1 –Cambodia2 –India1 –Indonesia3 –Lao5 (7) –Mongolia1 –Myanmar1 (3) –Nepal3 (5) –Pakistan1 –Philippines1 –Thailand4 –Vietnam11 * Figures inside the bracket () are the number of applications ADD Applications (1) 15
16
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ADD-3 37 from 39 applications of 12 countries –Bangladesh3 –Bhutan3 (4) –Indonesia7 –Lao3 –Mongolia2 –Myanmar4 –Nepal2 (3) –Pakistan2 –Philippines1 –Sri Lanka2 –Thailand1 [+18] –Vietnam7 * Figures inside the bracket () are the number of applications Figure inside the bracket [] is the number of sit-in participants ADD Applications (2) 16
17
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR CFP of ADD-4 Theme: –Language Resource Technology POS, tagging, word segmentation, terminology, Asian WordNet, tools for corpus development, tools for text mining, text summarization, categorization, approaches for morphological analysis Date: –Feb 23-27, 2009 Venue: –NECTEC Academy, Bangkok Application: –www.tcllab.org/add 17
18
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR http://www.tcllab.org/add 18
19
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ALR SUMMIT March 2009, Phuket 19
20
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR ALR Summit March 2009, Phuket Discuss on Asian Language Resource in terms of developing, sharing, licensing, etc. Corpus, Terminology, WordNet, Language tools, etc. 20
21
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR POLICY CONSIDERATIONS FOR DEVELOPMENT AND DEPLOYMENT OF LOCAL LANGUAGE COMPUTING AND CONTENT 21
22
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Asian WordNet Use English equivalents to link the existing dictionary to WordNet POS (n, v, adv, adj), English equivalent, and English equivalent of synonym of the target language are used to pinpoint the link Number of matched English equivalents in the Synset confirms the appropriate link Experiment on Thai-English, Indonesian-English and Mongolian-English dictionaries http://asianwordnet.org/ 22
23
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Asian WordNet Development 23 GWN AWN Applications Dictionary Ontology CL-Search MT Summarization IE/IR …. KUI Correction Voting Lookup Translation Discussion Addition WN merged-WN X-English Thai-English X-English Indonesian -English
24
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR English-English 24
25
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Thai-English 25
26
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Thai-Indonesian 26
27
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Thai-Lao Phoneme-based MT Sharing of character set (similar but different encoding scheme) Sharing of phrase structure Sharing of vocabulary http://www.tcllab.org/th2lao 27 Phoneme mapping with a table of word exception
28
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Phoneme Mapping 28 Thai input text เครื่องร่อน G2P Thai phonetics Khr-vv-ng^-2|r-@-n^-2| Phonetic conversion rule Lao phonetics Kh-vv-ng^-2|l-@-n^-2| Surface generation Lao text Phoneme mapping Word mapping ເຄື່ອງລັ່ອນ khr -> kh r -> l
29
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Sample of Consonant Phoneme Mapping 29 Thai mid SymLao mid ก จ ด, ฎ ต, ฏ บ ป อ kcdtbpzkcdtbpz ກຈດຕບປອກຈດຕບປອ Thai lowhigh Sym Lao lowhigh ค ฆข ช ฌฉ ซส ศ ษ งหง ญ ยหญ หย ฑ ฒ ธฐ ถ ณ นหน พ ภผ kh ch ng j th n ph ຄຂ ຊສ ງຫງ ຍຫຍ ທຖ ນຫນ ພຜ
30
PAN Localization, Jan 12-16, 2009, Novotel, Vientiane, Lao PDR Language Grid Lead by Prof Toru Ishida, Kyoto University and NICT Service of language resource and language computing Participation –Language resource provider –Computational resource provider –Language service user NECTEC as a node of Langrid Operation http://www.langrid.org 30
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.