1 Why do CPA? Patrick Hanks Research Institute for Information and Language Processing, University of Wolverhampton; Bristol Centre for Linguistics, University.

Slides:



Advertisements
Similar presentations
1 Lexical Semantics for the Semantic Web Patrick Hanks Masaryk University, Brno Czech Republic UFAL, Mathematics Faculty, Charles University.
Advertisements

Introduction to Computational Linguistics
ENG 626 CORPUS APPROACHES TO LANGUAGE STUDIES language teaching (3) Bambang Kaswanti Purwo
Introduction to Computational Linguistics
Language Assessment What it measures and how Jill Kerper Mora, Ed.D.
1 The Mechanics of Metaphor: Discussing Austerity and Political Crisis Patrick Hanks Research Institute for Information and Language Processing, University.
Semantic Resonance Patrick Hanks Research Institute for Information and Language Processing, University of Wolverhampton; Bristol Centre for Linguistics,
1 The Generative Lexicon (GL) meets Corpus Pattern Analysis (CPA) Patrick Hanks Institute of Formal and Applied Linguistics, Charles University in Prague,
Mapping meaning onto use: a Pattern Dictionary of English Verbs Patrick Hanks Faculty of Informatics, Masaryk University, Brno, Czech Republic
CL Research ACL Pattern Dictionary of English Prepositions (PDEP) Ken Litkowski CL Research 9208 Gue Road Damascus,
Interface metaphors & analogies pp pp
1 Computing Real Language Meaning for the Semantic Web Patrick Hanks Masaryk University, Brno Czech Republic UFAL, Mathematics Faculty,
CS 355 – Programming Languages
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Chapter 6: Design of Expert Systems
The Semantic Web Week 13 Module Website: Lecture: Knowledge Acquisition / Engineering Practical: Getting to know.
Formal Methods. Importance of high quality software ● Software has increasingly significant in our everyday activities - manages our bank accounts - pays.
Christian Metz ( ) “A film is difficult to explain because it is easy to understand . . .”
Topic: Theoretical Bases for Cognitive Method Objectives Trainees will be able to give reasons for the design and procedures of the Cognitive Method.
1 SIMS 290-2: Applied Natural Language Processing Marti Hearst Sept 22, 2004.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Article by: Feiyu Xu, Daniela Kurz, Jakub Piskorski, Sven Schmeier Article Summary by Mark Vickers.
Regular Expressions and Automata Chapter 2. Regular Expressions Standard notation for characterizing text sequences Used in all kinds of text processing.
Presented by Jennifer Robison TexTESOL II March 12, 2010 San Antonio, TX.
Research methods in corpus linguistics Xiaofei Lu.
Chapter 3: An Introduction to Corpus Linguistics Compiled by: Sajjad Ghadamyari Farhad Ghiasvand Presentation Date: Dec. 8, Monday.
UNIT 9. CLIL THINKING SKILLS
Main Branches of Linguistics
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Why teach Grammar? Attitues to Grammar.
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
The DVC project: Disambiguation of Verbs by Collocation ____ an introduction to the linguistic theory of norms and exploitations Patrick Hanks Research.
1 How to Compute the Meaning of Natural Language Utterances Patrick Hanks, Research Institute of Information and Language Processing, University of Wolverhampton.
Linguistics and Language
Computational Linguistics WTLAB ( Web Technology Laboratory ) Mohsen Kamyar.
For Linguists who Want to be Linguists, not Computer Scientists Launching a Career in Tech Jeff Lilly, Research Scientist at Amazon.
1 Computational Linguistics Ling 200 Spring 2006.
Copyright 2004 Compsim LLC The Right Brain Architecture of a Holonic Manufacturing System Application of KEEL ® Technology to Holonic Manufacturing Systems.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Without data, nothing Adam Kilgarriff Lexical Computing Ltd University of Leeds.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
ARTIFICIAL INTELLIGENCE Human like intelligence Definitions: 1. Focus on intelligent Behaviour “Behaviour by a machine that, if performed by a human.
Introduction to Linguistics Class # 1. What is Linguistics? Linguistics is NOT: Linguistics is NOT:  learning to speak many languages  evaluating different.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Chapter 10 Basic Skills Count Chapter 10 Basic Skills Count Lesson 10.2 Applying Other Skills Lesson 10.2 Applying Other Skills.
 There must be a coherent set of links between techniques and principles.  The actions are the techniques and the thoughts are the principles.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
EEL 5937 Agent communication EEL 5937 Multi Agent Systems Lotzi Bölöni.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
FIDELITY IN TRANSLATION AND INTERPRETATION PLAN 1.Fidelity as a phenomenon in translation 2.Verbalizing a simple idea 3.Principles of fidelity 3.1. Primary.
1 CPA: Where do we go from here? Research Institute for Information and Language Processing, University of Wolverhampton; UPF Barcelona; University of.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.
1 Types of Programming Language (1) Three types of programming languages 1.Machine languages Strings of numbers giving machine specific instructions Example:
Grounded theory, discourse analysis and hermeneutics Part Two – Discourse Analysis ERPM001 Interpretive Methodologies Dr Alexandra Allan.
In this lecture, we will learn about: Translation.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Use of Concordancers A corpus (plural corpora) – a large collection of texts, written or spoken, stored on a computer. A concordancer – a computer programme.
Aristotel‘s concept to language studies was to study true or false sentences - propositions; Thomas Reid described utterances of promising, warning, forgiving.
A common-sense paradigm for linguistic research
Natural Language Processing
Information Retrieval
Presentation transcript:

1 Why do CPA? Patrick Hanks Research Institute for Information and Language Processing, University of Wolverhampton; Bristol Centre for Linguistics, University of the West of England

Meaning; collocation Analysing meaning; understanding meaning –A set of unresolved problems in linguistics –ONE CONTRIBUTION: analyse phraseology “Many meanings depend on the presence of more than word for their realization” – J.M. Sinclair Dictionaries (and WordNet) are a disappointment for software engineers and language learners –They list many “meanings” –But they don’t say how words are used 2

Aims To build an inventory of the phraseological patterns associated with (each sense of) each verb (“norms”) To relate unusual and imaginative uses to the normal uses of each word (“explotations”) –e.g. build a collection of newly created metaphors and similes 3

A discovery of corpus linguistics 85%-90% of everyday speech and writing is phraseologically “normal”. –The phraseological norms can be described and stored as models for future use. 9%-14% is unusual (creative) in some way. About 1% is uninterpretable. 4

Users (target market 1) Applications in computational linguistics –Natural language engineers –Machine translation? – Idiomatic language generation? –Message understanding? –Information extraction? Escaping the tyranny of text matching 5

Users (target market 2) Language teaching –Natural phraseology –Error correction –Prioritization and choices 6

Logical and analogical A natural language consists of a puzzling mixture of logical and analogical procedures Neglect of the analogical aspect has led to serious errors –E.g. the quest for precise definition in ontologies currently being designed for the Semantic Web In ordinary language people make new meanings by comparing one thing with another and by creating ad-hoc sets –Not merely by asserting identity –Nor by conforming exactly to conventional phraseology –Vagueness is an important principle of natural language Danger of confusing language with mathematical logic 7

We need to re-examine the relationship between language and logic The theory of norms and exploitations (TNE) argues that: Talk of an "underlying logical form" of an utterance is pernicious. What "underlies" linguistic behaviour is a set of behavioural regularities -- phraseological patterns. One of the many things that people do with these patterns is to make logics. They do other things too – notably, use language patterns for social interaction. 8

A usage-based, corpus-driven theory of language TNE research indicates that language is indeed a rule-governed system BUT: There are two sets of rules, not just one: 1.Rules for using words normally, “correctly”, boringly 2.Rules for exploiting normal patterns of word use. Exploitations include not only metaphors and similes, but ellipsis, anomalous arguments, irony, etc. etc. The two rule systems interact. Today’s exploitation may become tomorrow’s norm. –Compare Bowdle and Gentner (2005): ‘the Career of Metaphor’. The rules are probabilistic, not deterministic. Hanks, P. (2013): Lexical Analysis: Norms and Exploitations. MIT Press. 9