1/26 Applications of NLP. 2/26 Applications What uses of the computer involve language? What language use is involved? What are the main problems? How.

Slides:



Advertisements
Similar presentations
Configuration management
Advertisements

What is Word Study? PD Presentation: Union 61 Revised ELA guide Supplement (and beyond)
Introduction to Computational Linguistics
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Natural Language Processing AI - Weeks 19 & 20 Natural Language Processing Lee McCluskey, room 2/07
Basi di dati distribuite Prof. M.T. PAZIENZA a.a
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
1/16 LELA Language and Computers Harold Somers Professor of Language Engineering.
CALL: Computer-Assisted Language Learning. 2/14 Computer-Assisted (Language) Learning “Little” programs Purpose-built learning programs (courseware) Using.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
1/23 Applications of NLP. 2/23 Applications Text-to-speech, speech-to-text Dialogues sytems / conversation machines NL interfaces to –QA systems –IR systems.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Natural Language Processing and Speech Enabled Applications by Pavlovic Nenad.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Copyright R. Weber INFO 629 Concepts in Artificial Intelligence Fall 2004 Professor: Dr. Rosina Weber.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Introduction to NLP.
Communicative Language Teaching Vocabulary
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
1 Computational Linguistics Ling 200 Spring 2006.
Defining Text Mining Preprocessing Transforming unstructured data stored in document collections into a more explicitly structured intermediate format.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Teaching language means teaching the components of language Content (also called semantics) refers to the ideas or concepts being communicated. Form refers.
Natural Language Processing Rogelio Dávila Pérez Profesor – Investigador
Language Technology I © 2005 Hans Uszkoreit Language Technology I 2005/06 Hans Uszkoreit Universität des Saarlandes and German Research Center for Artificial.
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Research Topics CSC Parallel Computing & Compilers CSC 3990.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Introduction to Computational Linguistics
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
Dirk Van CompernolleAtranos Workshop, Leuven 12 April 2002 Automatic Transcription of Natural Speech - A Broader Perspective – Dirk Van Compernolle ESAT.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Topic #1: Introduction EE 456 – Compiling Techniques Prof. Carl Sable Fall 2003.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Natural Language Processing Chapter 1 : Introduction.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Listening comprehension is at the core of second language acquisition. Therefore demands a much greater prominence in language teaching.
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
Natural Language and Speech (parts of Chapters 8 & 9)
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Basics of Natural Language Processing Introduction to Computational Linguistics.
1 Applying Principles To Reading Presented By Anne Davidson Michelle Diamond.
INTRODUCTION TO APPLIED LINGUISTICS
HIGH SCHOOL TEACHER TRAINING WORKSHOP
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Approaches to Machine Translation
Artificial Intelligence for Speech Recognition
Information Retrieval and Web Search
Information Retrieval and Web Search
Approaches to Machine Translation
CSE 635 Multimedia Information Retrieval
Artificial Intelligence 2004 Speech & Natural Language Processing
Information Retrieval
Presentation transcript:

1/26 Applications of NLP

2/26 Applications What uses of the computer involve language? What language use is involved? What are the main problems? How successful are they?

3/26 Speech applications Speech recognition (Speech-to-text) –Uses As a general interface to any text-based application Text dictation Speech understanding –Not the same: computer must understand intention, not necessarily exact words –Uses As a general interface to any application where meaning is important rather than text As part of speech translation Difficulties –Separating speech from background noise –Filtering of performance errors (disfluencies) –Recognizing individual sound distinctions (similar phonemes) –Variability in human speech –Ambiguity in language (homophones)

4/26 Speech applications Voice recognition –Not really a linguistic issue –But shares some of the techniques and problems Text-to-speech (Speech synthesis) –Uses: Computer can speak to you Useful where user cannot look at (or see) screen –Difficulties Homograph disambiguation Prosody determination (pitch, loudness, rhythm) Naturalness (pauses, disfluencies?)

5/26 Word processing Check and correct spelling, grammar and style Types of spelling errors –Non-existent words Easy to identify But suggested correction not always appropriate –Accidental homographs Deliberate ‘errors’ –Foreign words –Proper names, neologisms –Illustrations of spelling errors!

6/26 Better word processing Spell checking for homonyms Grammar checking Tuned to the user –You can (already) add your own auto-corrections –Non-native users (‘Interference checking’) –Dyslexics and other special needs users Intelligent word processing –Find/replace that knows about morphology, syntax

7/26 Text prediction Speed up word processing Facilitate text dictation At lexical level, already seen in SMS More sophisticated, might be based on corpus of previously seen texts Especially useful in repeated tasks –Translation memory –Authoring memory

8/26 Dialogue systems Computer enters a dialogue with user –Usually specific cooperative task-oriented dialogue –Often over the phone –Examples? Usually speech-driven, but text also appropriate Modern application is automatic transaction processing Limited domain may simplify language aspect Domain ‘model’ will play a big part Simplest case: choose closest match from (hidden) menu of expected answers More realistic versions involve significant problems

9/26 Dialogue systems Apart from speech recognition and synthesis issues, NL components include … Topic tracking Anaphora resolution –Use of pronouns, ellipsis Reply generation –Cooperative responses –Appropriate use of anaphora

10/26 (also know as) Conversation machines Another old AI goal (cf. Turing test) Also (amazingly) for amusement Mainly speech, but also text based Early famous approaches include ELIZA, which showed what you could do by cheating Modern versions have a lot of NLP, especially discourse modelling, and focus on the language generation component

11/26 QA systems NL interface to knowledge database Handling queries in a natural way Must understand the domain Even if typed, dialogue must be natural Handling of anaphora e.g. When is the next flight to Sydney? And the one after? What about Melbourne then? OK I’ll take the last one.

12/26 IR systems Like QA systems, but the aim is to retrieve information from textual sources that contain the info, rather than from a structured data base Two aspects –Understanding the query (cf Google, Ask Jeeves)Google –Processing text to find the answer Named Entity Recognition

13/26

14/26

15/26

16/26 Named entity recognition Typical textual sources involve names (people, places, corporations), dates, amounts, etc. NER seeks to identify these strings and label them Clues are often linguistic Also involves recognizing synonyms, and processing anaphora

17/26 Automatic summarization Renewed interest since mid 1990s, probably due to growth of WWW Different types of summary –indicative vs. informative –abstract vs. extract –generic vs. query-oriented –background vs. just-the-news –single-document vs. multi-document

18/26 Automatic summarization topic identification stereotypical text structure cue words high-frequency indicator phrases intratext connectivity discourse structure centrality topic fusion concept generalization semantic association summary generation sentence planning to achieve information compaction

19/26 Text mining Discovery by computer of new, previously unknown information, by automatically extracting information from different written resources (typically Internet) Cf data mining (e.g. using consumer purchasing patterns to predict which products to place close together on shelves), but based on textual information Big application area is biosciences

20/26 Text mining preprocessing of document collections (text categorization, term extraction) storage of the intermediate representations techniques to analyze these intermediate representations (distribution analysis, clustering, trend analysis, association rules, etc.) visualization of the results.

21/26 Story understanding An old AI application Involves … –Inference –Ability to paraphrase (to demonstrate understanding) Requires access to real-world knowledge Often coded in “scripts” and “frames”

22/26 Machine Translation Oldest non-numerical application of computers Involves processing of source-language as in other applications, plus … –Choice of target-language words and structures –Generation of appropriate target-language strings Main difficulty is source-language analysis and/or cross-lingual transfer implies varying levels of “understanding”, depending on similarities between the two languages MT ≠ tools for translators, but some overlap

23/26 Machine Translation First approaches perhaps most intuitive: look up words and then do local rearrangement “Second generation” took linguistic approach: grammars, rule systems, elements of AI Recent (since 1990) trend to use empirical (statistical) approach based on large corpora of parallel text –Use existing translations to “learn” translation models, either a priori (Statistical MT ≈ machine learning) or on the fly (Example-based MT ≈ case-based reasoning) –Convergence of empirical and rationalist (rule-based) approaches: learn models based on treebanks or similar.

24/26 Language teaching CALL Grammar checking but linked to models of –The topic –The learner –The teaching strategy Grammars (etc) can be used to create language-learning exercises and drills

25/26 Assistive computing Interfaces for disabled Many devices involve language issues, e.g. –Text simplification or summarization for users with low literacy (partially sighted, dyslexic, non-native speaker, illiterate, etc.) –Text completion (predictive or retrospective) Works on basis of probabilities or previous examples

26/26 Conclusion Many different applications But also many common elements –Basic tools (lexicons, grammars) –Ambiguity resolution –Need (but impossibility of having) for real-world knowledge Humans are really very good at language –Can understand noisy or incomplete messages –Good at guessing and inferring