Translingual Information Management Stephan Busemann Language Technology Lab German Research Center for Artificial Intelligence.

Slides:



Advertisements
Similar presentations
Warszawa, Jakub Piskorski SProUT Shallow Processing with Unification and Typed Feature Structures Jakub Piskorski Language Technology Lab DFKI.
Advertisements

European Masters Program in Language and Communication Technologies Free University.
Hans Uszkoreit German Research Center for Artificial Intelligence and Saarland University at Saarbruecken Hans Uszkoreit German Research Center for Artificial.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
The Semantic Web and Language Technology BT Exact, Martlesham Hamish Cunningham Department of Computer Science, University of Sheffield Friday October.
Introduction to Computational Linguistics
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
Towards an NLP `module’ The role of an utterance-level interface.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Shallow Processing: Summary Shallow Processing Techniques for NLP Ling570 December 7, 2011.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
1/7 INFO60021 Natural Language Processing Harold Somers Professor of Language Engineering.
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
Natural Language Processing Ellen Back, LIS489, Spring 2015.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Korea Terminology Research Center for Language and Knowledge Engineering Infrastructures in Korea and for the Korean Language Key-Sun Choi.
Language Technology 2005/06 Hans Uszkoreit Universität des Saarlandes
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Introduction to NLP.
WP5.4 - Introduction  Knowledge Extraction from Complementary Sources  This activity is concerned with augmenting the semantic multimedia metadata basis.
Computational Linguistics Yoad Winter *General overview *Examples: Transducers; Stanford Parser; Google Translate; Word-Sense Disambiguation * Finite State.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
DFKI GmbH, , R. Karger Indo-German Workshop on Language Technologies Reinhard Karger, M.A. Deutsches Forschungszentrum für Künstliche Intelligenz.
February 2007MCST - FP7 Launch1 Michael Rosner Department of Computer Science and Artificial Intelligence University of Malta.
Survey of Semantic Annotation Platforms
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Machine Translation, Digital Libraries, and the Computing Research Laboratory Indo-US Workshop on Digital Libraries June 23, 2003.
Next Generation Speech Science and Technologies - A Cross-Country Joint Project for Collaboration between Speech Research Labs in Taiwan and in Japan Lin-shan.
1 Computational Linguistics Ling 200 Spring 2006.
© Copyright 2013 ABBYY NLP PLATFORM FOR EU-LINGUAL DIGITAL SINGLE MARKET Alexander Rylov LTi Summit 2013 Confidential.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Natural Language Processing Guangyan Song. What is NLP  Natural Language processing (NLP) is a field of computer science and linguistics concerned with.
MinorThird 서울시립대학교 인공지능연구실 곽별샘
Ngoc Minh Le - ePi Technology Bich Ngoc Do – ePi Technology
Language Technology I © 2005 Hans Uszkoreit Language Technology I 2005/06 Hans Uszkoreit Universität des Saarlandes and German Research Center for Artificial.
Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC background and know-how and expectations from CROSSMARC CROSSMARC Project IST Kick-off.
EVikings II WP3: Language Technologies. HLT Human Language Technologies (HLT) play a crucial role in the Information Society For small languages it is.
Reinhard Karger German Research Center for Artificial Intelligence, DFKI GmbH Stuhlsatzenhausweg Saarbruecken, Germany phone: ( )
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Prof. Thomas Sikora Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Integration Activities in “Tools for Tag Generation“
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
INSTITUTE FOR SIGNAL AND INFORMATION PROCESSING Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi State.
CSE467/567 Computational Linguistics Carl Alphonce Computer Science & Engineering University at Buffalo.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
DFKI GmbH, , R. Karger Perspectives for the Indo German Scientific and Technological Cooperation in the Field of Language Technology Reinhard.
Toward an Open Source Textual Entailment Platform (Excitement Project) Bernardo Magnini (on behalf of the Excitement consortium) 1 STS workshop, NYC March.
© 2003 DFKI Language Technology Lab Language Technology Information Extraction Retrieving relevant concepts and structured relations in unrestricted free.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
Basics of Natural Language Processing Introduction to Computational Linguistics.
Natural Language Processing Tasneem Ghnaimat Spring 2013.
ELRC Training Workshop in Belgium, April 13, 2016 Walter Daelemans Universiteit Antwerpen, CLiPS Language Technology in Belgium 1.
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
© W. Wahlster, DFKI IST ´98 Workshop „The Language of Business - the Business of Language“ Vienna, 2 December 1998 German Research Center for Artificial.
Computational UIUC Lane Schwartz Student Orientation August 23, 2017.
PRESENTED BY: PEAR A BHUIYAN
Tools for Natural Language Processing Applications
Thai AGROVOC Ontology Base for Agricultural Information Retrieval
Computational UIUC Lane Schwartz Student Orientation August 18, 2016.
GATE and the Semantic Web
Natural Language Processing (NLP)
3.0 Map of Subject Areas.
ITS 2.0 Enriched Terminology Annotation Showcase
Natural Language Processing (NLP)
Natural Language Processing (NLP)
Presentation transcript:

Translingual Information Management Stephan Busemann Language Technology Lab German Research Center for Artificial Intelligence

© 2004 DFKI Language Technology Lab Language Technology Lab D ATA Management Lab Director : Prof. Dr. Hans Uszkoreit Associate Lab Director: Dr. Stephan Busemann Projects: BMBF, EU, Saarland and Industry Turnover: > 2 Mio € per annum

© 2004 DFKI Language Technology Lab LT L ab S TAFF The lab employs 20 researchers and software engineers from 8 countries They are supported by 24 research assistants and guest scientists

© 2004 DFKI Language Technology Lab C OOPERATIONS Many of tasks we carry out in joint projects with partners from industry, academia and other contract research centers. We collaborate closely with: the Department of Computational Linguistics, the Department of Computer Science and other institutes at Saarland University.

© 2004 DFKI Language Technology Lab LT L AB O VERVIEW DFKI‘s Language Technology Lab has 20 researchers and software engineers from 8 countries Language resources for German, English, French, Chinese, Japanese, Spanish, Italian, Portuguese, Dutch, Slavic Languages,... Three-stage approach Develop and maintain reusable base technologies Configure complex systems Adapt or extend to build application systems

© 2004 DFKI Language Technology Lab B ASE T ECHNOLOGIES Preprocessing (tokenization, POS tagging, morphology) Shallow Parsing (statistical chunk parsing, FST grammars) Several Techniques for Categorization (machine learning) Deep Syntactic and Semantic Analysis (efficient HPSG parsing) Shallow and In-Depth Generation Several Techniques for Text Summarization (e.g. query-dependent) Text-to-Speech for German, English

© 2004 DFKI Language Technology Lab T HREE L INES OF C OMPLEX S YSTEMS Natural Communication response management, speech interpretation and production, emotion in synthesis Multilingual Authoring Support terminology and grammar checking for controlled language, tools for annotation by metadata (and other structuring information), linguistic lookup Information and Knowledge Management multilingual retrieval, information extraction, semantic-web infrastructure, automatic hyperlinking, open-domain question answering, report generation

© 2004 DFKI Language Technology Lab S OME P ROJECTS TEMSIS PARA DIME PARA DIME Multi-lingual Extraction of Travel Warning Information Cross-lingual Navigation Cross-lingual Tourism Information Multi-lingual Question Answering Indexing of Commented Video Material (soccer games) Multi-lingual Generation of Air Quality Reports

© 2004 DFKI Language Technology Lab S AMPLE A PPLICATIONS Deutsche Telekom question answering on tariff information Dresdner Bank automatic hyperlinking for structuring program listings and documentation SAP AG controlled language checking Interprice Technologies dialogue system for e-commerce product search

© 2004 DFKI Language Technology Lab I NFORMATION E XTRACTION Requirements  Must adapt to shallow or more deep tasks  Must be multi-lingual  Must efficiently process large sets of text Sample Applications  Named Entity Recognition  Opinion Extraction  Extraction of Travel Warning Information  Hyperlinking SProUT, a Java and C++ based IE framework

© 2004 DFKI Language Technology Lab S PROUT S YSTEM Configurable Linguistic Components  Tokenizers  Morphological analysis components (e.g. MMorph)  Feature-Enhanced Gazetteers  Finite State Grammars integrating and extending the above resources Core Tools  JTFS -- Type Unification and Subsumption  FSM Toolkit for processing FS transducers  Grammar Development Environment