Natural Language and Text Processing Laboratory
Projects and Research Directions
Head: Alexander Gelbukh


Contents
– General info
– Main research directions
– Projects in Computational linguistics
– Projects in Text processing applications
– Projects in Mathematical modeling
– International contacts
– Possible future projects

NLP&TP Lab
– Since 1996
– 4 doctors
– 4 doctoral students
– 3 master students
– 23 projects
– 100+ publications
– 1 International Conference

Projects
– 5 CONACyT (1 Joint w/ Moscow City Government)
– 2 REDII-CONACyT
– 13 CGEPI-IPN (1 Joint w/ Mexican Oil Institute – IMP)
– 1 Senate of Mexican Republic
– 1 Joint w/ Moscow City Government

Main Research Directions
Theoretical Computational Linguistics
– Syntax analysis
– Semantic analysis
– Dictionaries and resources
Intelligent Text Processing Applications
– Information retrieval
– Classification and text mining
– Topical summarization
Applied Mathematics
– Modeling of diffusion processes

Projects in Computational Linguistics
– CONACyT A: Advanced syntactic analyzer for Spanish
– REDII-CONACyT: Subcategorization dictionary for Spanish
– CONACyT A: Compilation of text corpus of a new type through Internet
– CONACyT J31989: Extraction of semantic info from bilingual dictionaries

Without our tool

With our tool

Projects in Applied Text Processing 1
– REDII-CONACyT (joint with ITESM): Agent-based search engine for digital libraries
– Senate of Mexican Republic: Concept-based search engine for the DB of the Senate

Projects in Applied Text Processing 2
– CONACyT & Moscow City Government, No : Investigation of large ecological document bases
– Joint with Moscow City Government: Knowledge discovery in the flows of the letters of city dwellers

Topical summary

Topical comparison
French version of the same English doc

Dictionary used

Projects in Mathematical Modeling
– CONACyT A: Modeling of air pollution in Mexico City
– Joint with IMP, No by CGEPI-IPN: Detection of local pollution sources

We are adding functionality to this feature

International Contacts
– CYTED and RITOS2 (Euro-American Research Organizations): Member group
– City Government, Moscow: Common Projects
– UPC, Barcelona: Student exchange & research contacts
– U. Montreal: Research in Meaning-Text Theory

Possible Future Joint Projects
– Heymans Institute, Groningen, Holland: Classification of complex interdisciplinary information in very large databases
– City Government, Moscow: Decision making on the basis of laws and normative deeds
– U.P.C., Barcelona: Spanish in HPSG

Thank you!