Hiroshi NAKAGAWA Information Technology Center, University of Tokyo,Japan Postal:

Slides:



Advertisements
Similar presentations
European Masters Program in Language and Communication Technologies Free University.
Advertisements

Language Technologies Reality and Promise in AKT Yorick Wilks and Fabio Ciravegna Department of Computer Science, University of Sheffield.
Pisa Research Area National Research Council Computer Science Institutes ERCIM Italian Partners Norma Lijtmaer.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Talk and Look: Tools for Ambient Linguistic Knowledge A Project funded by the European Community under the Sixth Framework Programme for Research and Technological.
Computational Paradigms in the Humanities – eHumanities and their role and impact in transdisciplinary research Gerhard Budin University of Vienna.
New Technologies Supporting Technical Intelligence Anthony Trippe, 221 st ACS National Meeting.
GSK: Development and Distribution of Resources Hitoshi ISAHARA GSK: Gengo Shigen Kyokai (Language Resource Association) National Institute of Information.
Jing-Shin Chang National Chi Nan University, IJCNLP-2013, Nagoya 2013/10/15 ACLCLP – Activities ( ) & Text Corpora.
The HIT Innovation Hub WP5 of Tempus IDEA. Basic Requirements Wide concept Interdisciplinary Serves Engineering, Design, Sciences and MOT disciplines.
Improving Machine Translation Quality via Hybrid Systems and Refined Evaluation Methods Andreas Eisele DFKI GmbH and Saarland University Helsinki, November.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
Language Resources in Indonesia Language Technology & Applied Information Laboratory Directorate for Information Technology and Electronics Agency for.
The Unreasonable Effectiveness of Data Alon Halevy, Peter Norvig, and Fernando Pereira Kristine Monteith May 1, 2009 CS 652.
1 © Franz J. Kurfess Constrained Access Franz J. Kurfess Cal Poly SLO Computer Science Department.
C SC 620 Advanced Topics in Natural Language Processing Sandiway Fong.
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
1/26 Applications of NLP. 2/26 Applications What uses of the computer involve language? What language use is involved? What are the main problems? How.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
What is “Biomedical Informatics”?. Biomedical Informatics Biomedical informatics (BMI) is the interdisciplinary field that studies and pursues.
1/23 Applications of NLP. 2/23 Applications Text-to-speech, speech-to-text Dialogues sytems / conversation machines NL interfaces to –QA systems –IR systems.
Machine Translation History of Machine Translation Difficulties in Machine Translation Structure of Machine Translation System Research methods for Machine.
Korea Terminology Research Center for Language and Knowledge Engineering Infrastructures in Korea and for the Korean Language Key-Sun Choi.
June ELSNET Review1 A Roadmap for Computational Linguistics Steven Krauwer ELSNET / Utrecht University (NL)
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Introduction to NLP.
UNISA MY RESEARCH AND HOW IT RELATES TO UNISA’S IOP RESEARCH OBJECTIVES Presented by Mtholeni N. Ngcobo Young Academics Programme 15 May 2009.
LEFIS W2 Posgraduate Workshop 1 LEFIS, WG 2 Postgraduate studies Meeting, Rotterdam.
DFKI GmbH, , R. Karger Indo-German Workshop on Language Technologies Reinhard Karger, M.A. Deutsches Forschungszentrum für Künstliche Intelligenz.
Location of JSI EuropeSlovenia Micro-location of JSI Department of Knowledge Technologies Jožef Stefan Institute Ljubljana.
Improving Administrative Sciences Worldwide IIAS Perspectives on the Main Trends and Critical Governance Issues in the Public Sector.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Recent Activities of Speech Corpora and Assessment in Korea Yong-Ju Lee Wonkwang University Korea.
Syllabus and curriculum design From LETRAC to Bologna Belinda Maia University of Porto.
Machine Translation, Digital Libraries, and the Computing Research Laboratory Indo-US Workshop on Digital Libraries June 23, 2003.
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
TRANSLATION & THE HIGH TECH INDUSTRY. INTRODUCTION Translation has been existing ever since mythology began, passed the prophethood, and now in modern.
© Copyright 2013 ABBYY NLP PLATFORM FOR EU-LINGUAL DIGITAL SINGLE MARKET Alexander Rylov LTi Summit 2013 Confidential.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Suléne Pilon & Danie Prinsloo Overview: Teaching and Training in South Africa 25 November 2008;
Incorporating Extra-linguistic Information into Reference Resolution in Collaborative Task Dialogue Ryu Iida Shumpei Kobayashi Takenobu Tokunaga Tokyo.
Language Technology I © 2005 Hans Uszkoreit Language Technology I 2005/06 Hans Uszkoreit Universität des Saarlandes and German Research Center for Artificial.
EVikings II WP3: Language Technologies. HLT Human Language Technologies (HLT) play a crucial role in the Information Society For small languages it is.
1 SMEs – a priority for FP6 Barend Verachtert DG Research Unit B3 - Research and SMEs.
1 Direction scientifique Networks of Excellence objectives  Reinforce or strengthen scientific and technological excellence on a given research topic.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Computational Linguistics. The Subject Computational Linguistics is a branch of linguistics that concerns with the statistical and rule-based natural.
1 Viewing Vision-Language Integration as a Double-Grounding case Katerina Pastra Department of Computer Science, Natural Language Processing Group, University.
Translingual Information Management Stephan Busemann Language Technology Lab German Research Center for Artificial Intelligence.
DFKI GmbH, , R. Karger Perspectives for the Indo German Scientific and Technological Cooperation in the Field of Language Technology Reinhard.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
Digital University of Pisa Alessandro Lenci CoLing Lab – Laboratorio di Linguistica Computazionale Università di Pisa Aix-Marseille Université.
Your caption here POLYPHONET: An Advanced Social Network Extraction System from the Web Yutaka Matsuo Junichiro Mori Masahiro Hamasaki National Institute.
EDLproject WP3 “Developing the European Digital Library” LIBER – EBLIDA workshop Digitisation of Library Material in Europe Copenhagen, October.
IR&NLP Coursework P1 Text Analysis Within The Fields Of Information Retrieval and Natural Language Processing By Ben Addley Academic Year 2004.
Human-Assisted Machine Annotation Sergei Nirenburg, Marjorie McShane, Stephen Beale Institute for Language and Information Technologies University of Maryland.
Hitoshi ISAHARA National Institute of Information and Communications Technology (NICT) Sustainability of the work and PAN L10n network: Vision Beyond 2010.
Finding Social Network for Trust Calculation Yutaka Matsuo, Hironori Tomobe, Koiti Hasida and Mitsuru Ishizuka National Institute of Advance Industrial.
LE:NOTRE Spring Workshop The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
A Brief Summary of MISS Project Weiquan Liu Feiyu Xu the multilingual world before MISO.
INTRODUCTION TO APPLIED LINGUISTICS
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
NLP Midterm Solution #1 bilingual corpora –parallel corpus (document-aligned, sentence-aligned, word-aligned) (4) –comparable corpus (4) Source.
WP3: Supporting RTD in Language Technologies
The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
Translingual Knowledge Projection and Statistical Machine Translation
Computational Linguistics: New Vistas
European Masters Program Language & Communication Technologies
Presentation transcript:

Hiroshi NAKAGAWA Information Technology Center, University of Tokyo,Japan Postal: Hongo, Bunkyo, Tokyo, , JAPAN The Situation in Japan - A Personal View-

Structure of Academic Societies in JAPAN  Basic data of NLP research activities in Japan  At least 700 NLP researchers ( 700 is the number of the members of the Association of Natural Language Processing(ANLP) Japan)  Several major universities and Lab’s in computer related companies (NTT, Fujitsu, NEC, Hitachi, Toshiba, ….) and more.  Several major governmental laboratories (Communication Research Lab., Electro-Technical Lab., National Institute of Informatics)

Academic Societies in Japan -1  ANLP:Association of NLP (700 members): the core of NLP research activities in Japan  SIG NLP of Information Processing Society Japan:IPSJ (more than 500 members mostly overlapped with ANLP members)  SIG NLC of Institute of Electronics and Communication Engineering:IECE (also overlapped with ANLP)

Academic Societies in Japan -2  Japanese Society of AI :JSAI: NLP is a not so big part  SIG FI of IPSJ, SIG DD of IPSJ : rather small SIG  These are mainly IR people.  The trend of the 90s was that NLP people were flowing into IR applications.

Academic Society in Japan -3  Linguistics related societies  Cognitive Linguistics Society (new)  Social Linguistics Society (2 years old)  Japan Cognitive Science Society: NL of this society:JCSS is one of its main part but rather linguistics oriented research group.

 Active researchers commit two, three academic societies above mentioned.  Our worry is that very few linguists are interested in NLP.  The reason is that they feel some kind of gaps or barriers between them and corpus based statistic oriented NLP approaches of these days. Researchers really are:

The 80s: Big projects with MT-related issues, supported by MITI-related agencies –[The 5 th generation computer project] –Mu machine translation project –EDR concept dictionary project –CICC Translation project among Asian languages The 90s - Now: Smaller projects with diverse issues including NLP as their part, supported by diverse funding agencies Historical Perspectives

Exploratory Integration The 80s Industrial Application MT in Network Browsers MT for Abstracts Automatic Caption Systems Further Investigation Generic NLP software and Tuning Equivalence of different formalisms KA from comparable corpora Dialogue in less restricted domains Statistical learning theories Ontology Building Exploratory Integration NLP in IR IE, CLIR, Speech and Language Language in Multimedia From big project of the 80s towards sharply focused the 90s The 90s

JSPS project : NLP in IE and IR, Basic research of generic NLP technologies TIT-CRL-NLL project: Speech and Language, Summarization of spontaneous speeches, Speech corpus collection RWC: Corpus collection and annotation as one of the research activities ATR Speech Translation: less restricted subject domains, adding new languages like Chinese, Example-based Paradigm Projects currently going on

KA Matching fund project: Knowledge acquisition from comparable corpora Center for excellence on theoretical linguistics: Language teaching as one of its application fields, Corpus collection GDA: Text annotation in multimedia environments, now extending to multimedia, multi-modal presentations (in preparation) Language and Action Usability: Language in the network era

[Bottom-Up Initiatives] IREX, NTCIR: TREC,MUC-type workshop GSK: LDC, ELRA-type institution [Development Projects] MT for National Patent Office: MT in an integrated Information System JST: MT for Abstracts, Example-based MT Automatic Caption System at NHK

 IREX is a competition type workshop about Information retrieval and extraction, MUC style conference/ Evaluation, Training and test corpus collection  NTCIR-1(1999) is also a competition type workshop about IR, CLIR, IE(named entity recognition), and ATR. Up to 40 groups were participated.  NTCIR-2(2000) takes IR, CLIR, Automatic Summarization as its tasks  i.e. about 20 groups have been participated to CLIR task so far. TREC,MUC type bottom up activities in Japan

The 80s’ big projects have brought up core NLP researchers of the 90s. Funding agencies started to think “Big fund is inefficient.” They prefer small, clear target project. NLP researchers inclined to do realistic applications. (Good or bad?) This NLP research trend is timely fitting well today’s IT revolution. For instance, NLP for cell-phone like gears is the promising. Real effects of the 80s’ big projects

International Co-operation: The current state Co-operation based on individual projects JSPS project: DFKI, U-Penn, Stanford Univ., KAIST UMIST/Salford, etc ATR : C-Star Consortium Co-operation based on private companies Co-operation in broader communities, supported by Government agencies No since CICC, MT among Asian languages Human Genome, Physics, Space Science, Brain Science Language-based technology: Highly sensitive to the priority as a nation Too closely related with industrial interests

Needs for Co-operation Validation of research results –Language universal vs. Language specific –NE, IR, Learning methods, grammar formalisms Resource gathering –Simply many languages to treat Integration of resources –Ontology for multi-lingual applications Independent funding Sponsoring/Independent funding Standardization Co-ordinated/Joint funding

Exploratory Integration The 80s Industrial Application MT in Network Browsers MT for Abstracts Automatic Caption Systems Further Investigation Generic NLP software and Tuning Equivalence of different formalisms KA from comparable corpora Dialogue in less restricted domains Statistical learning theories Ontology Building Exploratory Integration NLP in IR IE, CLIR, Speech and Language Independent Funding Sponsoring/Independent Funding Standardization Coordinated Funding From the 80s big project towards the sharply focused 90s: Targets The 90s