Lecture 12 Applications and demos

Building applications
Previous lectures have discussed stages in processing: algorithms have addressed aspects of language modelling.
All but the simplest applications combine multiple components.
Issues: suitability of application, interoperability, evaluation, etc.
Avoiding error multiplication: robustness to imperfections in prior modules.
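A rough worked example of error multiplication, as a sketch only. It assumes that module errors are independent and that any upstream error spoils the final output. Both are simplifications. The per-module accuracies are invented for illustration and do not come from any system in this lecture.

```python
from math import prod

def pipeline_accuracy(module_accuracies):
    """Estimate end-to-end accuracy of a pipeline.

    Assumes independent errors and no recovery downstream, so the
    estimate is simply the product of the per-module accuracies.
    """
    return prod(module_accuracies)

# Hypothetical accuracies, chosen only to illustrate the effect.
stages = {
    "tokenisation": 0.99,
    "POS tagging": 0.97,
    "parsing": 0.90,
    "semantics": 0.85,
}

overall = pipeline_accuracy(stages.values())
print(f"estimated end-to-end accuracy: {overall:.3f}")
```

Under these assumptions four individually reasonable modules give an end-to-end figure well below the weakest module, which is why later modules need to be robust to imperfect input.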

Demos
Limited domain systems
  – CHAT-80
  – BusTUC
OSCAR: Named entity recognition for Chemistry
DELPH-IN: Parsing and generation
Blogging birds
Rhetorical structure: Argumentative Zoning of scientific text
Note also: demo systems mentioned in exercises.

CHAT-80
CHAT-80: a micro-world system implemented in Prolog in 1980
CHAT-80 demo
  – What is the population of India?
  – which(X:exists(X:(isa(X,population) and of(X,india))))
  – have(india,(population=574))
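The question-to-answer flow of a micro-world system can be sketched in a few lines of Python. This is not how CHAT-80 works internally: CHAT-80 is a Prolog system with a real grammar and logical forms, whereas the sketch below uses a single regular-expression pattern and a dictionary. The names FACTS, PATTERN, parse and answer are hypothetical; only the India figure is taken from the demo output above.

```python
import re

# Hypothetical micro-world fact base; the value is the one shown in the demo.
FACTS = {("india", "population"): 574}

# One hand-written question pattern stands in for a real grammar.
PATTERN = re.compile(r"what is the (\w+) of (\w+)\?", re.IGNORECASE)

def parse(question):
    """Map a question to an (entity, attribute) query, or None if it does not parse."""
    match = PATTERN.fullmatch(question.strip())
    if match is None:
        return None
    attribute = match.group(1).lower()
    entity = match.group(2).lower()
    return entity, attribute

def answer(question):
    """Look the parsed query up in the fact base and format a reply."""
    query = parse(question)
    if query is None:
        return "I do not understand."
    if query not in FACTS:
        return "I do not know."
    entity, attribute = query
    value = FACTS[query]
    return f"have({entity},({attribute}={value}))"

print(answer("What is the population of India?"))
```

Anything outside the pattern or the fact base fails immediately, which is the characteristic limitation of limited domain systems.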

Bus Route Oracle
Answers queries about bus departures in Trondheim, Norway; built by students and faculty at NTNU.
  – 42 bus lines, 590 stops, 60,000 entries in database
  – Norwegian and English
  – in daily use: half a million logged queries
Prolog-based: the parser analyses input into a query language, which is mapped to the bus timetable database.
BusTUC demo
  – When is the earliest bus to Dragvoll?
  – When is the next bus from Dragvoll to the centre?

Chemistry named entity recognition
SciBorg: OSCAR 3 system: recognises chemistry named entities in documents
  – (e.g. 2,4-dinitrotoluene; citric acid)
Series of classifiers using n-grams, affixes, context plus external dictionaries
Used in RSC ProjectProspect
Also used as preprocessor for full parsing
Precision/recall balance for different uses
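To make "n-grams and affixes" concrete, here is a minimal sketch of the kind of surface features a token classifier might use. It is not OSCAR's actual feature set: the function name token_features, the boundary markers and the default lengths are all assumptions chosen for illustration.

```python
def token_features(token, n=3, affix_len=3):
    """Return a set of simple surface features for one token.

    Character n-grams are taken over the lower-cased token padded with
    '^' and '$' so that word-initial and word-final n-grams are distinct.
    """
    lowered = token.lower()
    padded = f"^{lowered}$"
    features = {
        f"ngram={padded[i:i + n]}" for i in range(len(padded) - n + 1)
    }
    # Prefix and suffix features capture chemical affixes such as "-ene".
    features.add(f"prefix={lowered[:affix_len]}")
    features.add(f"suffix={lowered[-affix_len:]}")
    # Digits and hyphens are common in systematic chemical names.
    if any(ch.isdigit() for ch in token):
        features.add("has_digit")
    if "-" in token:
        features.add("has_hyphen")
    return features

for name in ["2,4-dinitrotoluene", "acid"]:
    print(name, sorted(token_features(name)))
```

Features like these would be fed to a classifier together with context and dictionary lookups; the classifier itself is omitted here.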

Enhanced browsing of chemistry documents: RSC using OSCAR

Precision and recall in OSCAR: from Corbett and Copestake (2008)
Modest precision, high recall: text preprocessing
High precision, modest recall: text viewing
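The trade-off can be illustrated by moving a confidence threshold over scored candidates. The sketch below uses made-up scores and gold labels, not OSCAR output, and the function name precision_recall is hypothetical.

```python
def precision_recall(scored, threshold):
    """Compute precision and recall when accepting candidates scoring >= threshold.

    `scored` is a list of (confidence, is_really_an_entity) pairs.
    """
    tp = sum(1 for score, gold in scored if score >= threshold and gold)
    fp = sum(1 for score, gold in scored if score >= threshold and not gold)
    fn = sum(1 for score, gold in scored if score < threshold and gold)
    precision = tp / (tp + fp) if tp + fp else 1.0
    recall = tp / (tp + fn) if tp + fn else 1.0
    return precision, recall

# Invented candidates: (classifier confidence, gold label).
SCORED = [
    (0.95, True), (0.90, True), (0.80, False), (0.70, True),
    (0.40, True), (0.30, False), (0.20, False), (0.10, True),
]

for threshold in (0.85, 0.25):
    p, r = precision_recall(SCORED, threshold)
    print(f"threshold={threshold}: precision={p:.2f} recall={r:.2f}")
```

A high threshold suits text viewing, where spurious highlights are distracting; a low threshold suits preprocessing, where a missed entity can break later parsing.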

DELPH-IN
DELPH-IN: informal consortium of 18 groups (EU, Asia, US) that develops multilingual resources for deep language processing
  – hand-written grammars in feature structure formalism, plus statistical ranking
  – English Resource Grammar (ERG): approx 90% coverage of edited text
ERG demo
  – Metal reagents are compounds often utilized in synthesis.

Some uses of the ERG
Automatic response (YY Corp, commercial use)
Machine Translation
  – LOGON research project: Norwegian to English
  – smaller-scale MT with other language pairs
Semantic search
  – SciBorg (chemistry, research)
  – WeSearch (Wikipedia, University of Oslo, research)
English teaching (EPGY, Stanford: 20,000 users a week)
Smaller-scale projects in question answering, information extraction, paraphrase...

DELPH-IN Tools
[Diagram labels: application- and domain-independent; application- (and maybe domain-) specific]

Blogging birds: redkite.abdn.ac.uk

Argumentative Zoning
Finding rhetorical structure in scientific texts automatically
  – Research goals
  – Criticism and contrast
  – Intellectual ancestry
Robust Argumentative Zoning demo
  – input text (ASCII via Acrobat)
Usages: search, bibliometrics, reviewing support, training new researchers

NLP Course conclusions
Theme: ambiguity
  – levels: morphology, syntax, semantics, lexical, discourse
  – resolution: local ambiguity, syntax as filter for morphology, selectional restrictions
  – ranking: parse ranking, WSD, anaphora resolution
  – processing efficiency: chart parsing

Theme: evaluation
  – training data and test data
  – reproducibility
  – baseline
  – ceiling
  – module evaluation vs application evaluation
  – nothing is perfect!
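As a reminder of what a baseline is, the sketch below computes a most-frequent-label baseline: pick the commonest label in the training data and score it on held-out test data. The sense labels and counts are invented for illustration, and majority_baseline is a hypothetical helper name.

```python
from collections import Counter

def majority_baseline(train_labels, test_labels):
    """Return the most frequent training label and its accuracy on the test labels."""
    most_common_label = Counter(train_labels).most_common(1)[0][0]
    correct = sum(1 for label in test_labels if label == most_common_label)
    return most_common_label, correct / len(test_labels)

# Invented word sense labels for occurrences of "bank".
train = ["bank/finance"] * 7 + ["bank/river"] * 3
test = ["bank/finance"] * 3 + ["bank/river"] * 2

label, accuracy = majority_baseline(train, test)
print(f"baseline label: {label}, test accuracy: {accuracy:.2f}")
```

A system is only interesting if it beats a figure like this on the same test data; the ceiling is usually estimated from agreement between human annotators.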

Modules and algorithms
different processing modules
different applications blend modules differently
many different styles of algorithm:
  – FSAs and FSTs
  – Markov models and HMMs
  – CFG (and probabilistic CFGs)
  – constraint-based frameworks
  – logic and compositional semantics
  – inheritance hierarchies (WordNet), decision trees (WSD)
  – vector space models (distributional semantics)
  – classifiers (anaphora resolution, content selection, …)

More about language and speech processing...
Information Retrieval course
Part III (or MPhil in Advanced Computer Science):
  – language and speech modules
  – in collaboration with speech group from Engineering