Introduction to Computational Linguistics (LIN3060) Lecture 1 Computers and Language.

Slides:



Advertisements
Similar presentations
CS 345: Chapter 9 Algorithmic Universality and Its Robustness
Advertisements

Introducing Formal Methods, Module 1, Version 1.1, Oct., Formal Specification and Analytical Verification L 5.
Intro to Linguistics Class # 2 Chapter 1: What is Language?
May 2006CLINT-LN Parsing1 Computational Linguistics Introduction Approaches to Parsing.
Introduction to Linguistics and Basic Terms
C. Varela; Adapted w/permission from S. Haridi and P. Van Roy1 Declarative Computation Model Defining practical programming languages Carlos Varela RPI.
PZ02A - Language translation
Topic: Theoretical Bases for Cognitive Method Objectives Trainees will be able to give reasons for the design and procedures of the Cognitive Method.
Artificial Intelligence 2004 Natural Language Processing - Syntax and Parsing - Language Syntax Parsing.
Models of Generative Grammar Smriti Singh. Generative Grammar  A Generative Grammar is a set of formal rules that can generate an infinite set of sentences.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
Linguistic Theory Lecture 3 Movement. A brief history of movement Movements as ‘special rules’ proposed to capture facts that phrase structure rules cannot.
Three Generative grammars
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Introduction to Programming Lecture Number:. What is Programming Programming is to instruct the computer on what it has to do in a language that the computer.
9/8/20151 Natural Language Processing Lecture Notes 1.
CS 2104 Prog. Lang. Concepts Dr. Abhik Roychoudhury School of Computing Introduction.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Computational Linguistics INTroduction
Computational Linguistics Yoad Winter *General overview *Examples: Transducers; Stanford Parser; Google Translate; Word-Sense Disambiguation * Finite State.
Linguistics and Language
TMF1013 : Introduction To Computing Lecture 1 : Fundamental of Computer ComputerFoudamentals.
Winter 2007SEG2101 Chapter 71 Chapter 7 Introduction to Languages and Compiler.
THE BIG PICTURE Basic Assumptions Linguistics is the empirical science that studies language (or linguistic behavior) Linguistics proposes theories (models)
CS 326 Programming Languages, Concepts and Implementation Instructor: Mircea Nicolescu Lecture 2.
Postgraduate Diploma in Translation Lecture 1 Computers and Language.
TextBook Concepts of Programming Languages, Robert W. Sebesta, (10th edition), Addison-Wesley Publishing Company CSCI18 - Concepts of Programming languages.
Problem Solving Techniques. Compiler n Is a computer program whose purpose is to take a description of a desired program coded in a programming language.
Lecture 1 Introduction Figures from Lewis, “C# Software Solutions”, Addison Wesley Richard Gesick.
For Wednesday Read chapter 23 Homework: –Chapter 22, exercises 1,4, 7, and 14.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
CSA2050 Introduction to Computational Linguistics Lecture 1 Overview.
CSA2050 Introduction to Computational Linguistics Lecture 1 What is Computational Linguistics?
PSY270 Michaela Porubanova. Language  a system of communication using sounds or symbols that enables us to express our feelings, thoughts, ideas, and.
CSA2050 Introduction to Computational Linguistics Parsing I.
CS 127 Introduction to Computer Science. What is a computer?  “A machine that stores and manipulates information under the control of a changeable program”
1 Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Data Structures and Algorithms Dr. Tehseen Zia Assistant Professor Dept. Computer Science and IT University of Sargodha Lecture 1.
ISBN Chapter 3 Describing Syntax and Semantics.
Computer Programming CONTENTS Introduction to Operating Systems Introduction to programming languages Introduction to perl programming language Programming.
SYNTAX.
Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.
CS223: Software Engineering
NATURAL LANGUAGE PROCESSING
Chapter 1 Introduction PHONOLOGY (Lane 335). Phonetics & Phonology Phonetics: deals with speech sounds, how they are made (articulatory phonetics), how.
INTRODUCTION TO COMPUTER PROGRAMMING(IT-303) Basics.
MENTAL GRAMMAR Language and mind. First half of 20 th cent. – What the main goal of linguistics should be? Behaviorism – Bloomfield: goal of linguistics.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
Victoria Ibarra Mat:  Generally, Computer hardware is divided into four main functional areas. These are:  Input devices Input devices  Output.
10/31/00 1 Introduction to Cognitive Science Linguistics Component Topic: Formal Grammars: Generating and Parsing Lecturer: Dr Bodomo.
An Introduction to Linguistics
Algorithms, Part 1 of 3 The First step in the programming process
CSCI-235 Micro-Computer Applications
Theory of Computation Theory of computation is mainly concerned with the study of how problems can be solved using algorithms.  Therefore, we can infer.
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Introduction to Linguistics
Algorithm and Ambiguity
COMS W1004 Introduction to Computer Science and Programming in Java
Natural Language - General
Introduction to Linguistics
Algorithm and Ambiguity
Natural Language Processing
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Presentation transcript:

Introduction to Computational Linguistics (LIN3060) Lecture 1 Computers and Language

Feb MRCLINT - Lecture 12 Course Information Web Lecturers Books Speech and Language Processing, Jurafsky and Martin, Prentice Hall 2000 Algorithmics, David Harel, Addison Wesley, 2004

Feb MRCLINT - Lecture 13 Computers and Language Computational Linguistics Emphasis on mechanised linguistic theories. Grew out of early Machine Translation efforts Natural Language Processing Computational models of language analysis, interpretation, and generation. Language Engineering emphasis on large-scale performance example: Google

Feb MRCLINT - Lecture 14 CL: Two Main Disciplines COMP SCILINGUISTICS

Feb MRCLINT - Lecture 15 Linguistics is Multi Layered Phonetics: The study of speech sounds Phonology: The study of sound systems Morphology: The study of word structure Syntax: The study of sentence structure Semantics: The study of meaning Pragmatics: The study of language use

Feb MRCLINT - Lecture 16 Noam Chomsky Noam Chomsky’s work in the 1950s radically changed linguistics, making syntax central. Chomsky has been the dominant figure in linguistics ever since. Chomsky invented the generative approach to grammar.

Feb MRCLINT - Lecture 17 Generative Grammar is Prescriptive Prescriptive Grammar Rules for and against certain uses Proscribed forms that are in current use “don’t end a sentence with a preposition” Subjective Descriptive Grammar Rules characterizing what people actually say Goal to characterize all and only that which speakers find acceptable Objective

Feb MRCLINT - Lecture 18 Generative Grammar: Key Points A language is a (possibly infinite) set of sentences. Grammar is finite. Grammar of a particular language expresses linguistic knowledge of that language Theory of Grammar includes mathematical definition of what a grammar is. The “Theory of Grammar” is a theory of human linguistic abilities. [source: Sag & Wasow]

Feb MRCLINT - Lecture 19 Theories of Sentence and Word Structure: Rewrite Rules Rules can be used to specify the sentences of a language. Rules have the form LHS  RHS LHS may be a sequence of symbols RHS may be a sequence of symbols or words. Lexicon specifies words and their categories

Feb MRCLINT - Lecture 110 A Simple Grammar/Lexicon grammar: S  NP VP NP  N VP  V NP lexicon: V  kicks N  John N  Bill S NP N Johnkicks NPV VP N Bill

Feb MRCLINT - Lecture 111 Formal v. Natural Languages Formal Languages Arithmetic Logic  x man(x)  mortal(x) URL Natural Languages English John saw the dog German Johann hat den hund gesehen Maltese Ġianni ra kelb

Feb MRCLINT - Lecture 112 Points of Similarity A language is considered to be a (possibly infinite) set of sentences. Sentences are sequences of words. Rules determine which sequences are valid sentences. Sentences have a definite structure. Sentence structure related to meaning.

Feb MRCLINT - Lecture 113 Points of Difference Formal Languages The grammar defines the language Restricted application Non ambiguous Natural Languages The language defines the grammar Universal application Highly ambiguous

Feb MRCLINT - Lecture 114 Ambiguity Morphological Ambiguity en-large-ment Lexical Ambiguity the sheep is in the pen Syntactic Ambiguity small animals and children laugh Semantic Ambiguity every girl loves a sailor Pragmatic Ambiguity can you pass the salt? The management of ambiguity is central to the success of CL in general and MT in particular.

Feb MRCLINT - Lecture 115 Computer Science The study of basic concepts Information Data Algorithm Program The application of these concepts to practical tasks. Implementation of computational models.

Feb MRCLINT - Lecture 116 Information Information is an theoretical concept invented by Shannon in 1948 to measure uncertainty. The units of this measure are called bits. Length – metres Weight – kilos Information – bits 1 bit is the amount of uncertainty inherent to a situation when there are exactly two possible outcomes. Example: for breakfast I will have coffee or I will have tea (nothing else). When I tell you that I have tea, I have conveyed one bit of information. The greater the number of possible outcomes, the more bits of infomation involved in the statement that indicates the actual outcome.

Feb MRCLINT - Lecture 117 Data A formalized representation of facts or concepts suitable for communication, interpretation, or processing by people or automated means. Example: a telephone directory Unlike information, which is abstract, data is concrete Data has a certain level of structure. In the telephone directory, for example, we have the structure of a list of entries, each of which has a name, an address, and a number.

Feb MRCLINT - Lecture 118 Algorithm

Feb MRCLINT - Lecture 119 Algorithm Input: ingredients Output: delicious chocolate cake Method: Algorithm Hardware: oven, pan, chef Software: recipe

Feb MRCLINT - Lecture 120 Algorithm to Add X and Y subtract 1 from X add 1 to Y X = 0? Read X and Y X = 2, Y = 3 yesno Output Y

Feb MRCLINT - Lecture 121 Algorithm A well defined procedure for the solution of a given problem in a finite number of steps Abstract Designed to perform a well-defined task. Finite description length. Guaranteed to terminate.

Feb MRCLINT - Lecture 122 Levels of Detail Every algorithm assumes the existence of elementary instructions, e.g. spread the ingredients in the pan add 1 to Y The idea is that these can be executed by the hardware directly. There is nothing necessary about the particular instruction set. We could imagine greater or lesser amounts of detail. We need to agree about the instruction set before describing an algorithm.

Feb MRCLINT - Lecture 123 Abstraction Every algorithm could be described at the lowest level of detail. However, the process of abstracting away from the elementary details is central to efficient description – for computers as well as humans. Prepare a sauce bordelaise and pour over the meat. Computer programming languages embody higher levels of abstraction and allow more efficient descriptions

Feb MRCLINT - Lecture 124 Computer Program A set of instructions, written in a specific programming language, which a computer follows in processing data, performing an operation, or solving a logical problem.

Feb MRCLINT - Lecture 125 Instructions vs. Execution Steps 1.Read X 2.Read Y 3.X = X-1 4.Y = Y+1 5.If X = 0 then Print(X) else goto 3 How many instructions? How many execution steps?

Feb MRCLINT - Lecture 126 Computer Program Finite Length Concrete (can be written down) Implements an algorithm. More than one program may implement the same algorithm. Not all programs express good algorithms!

Feb MRCLINT - Lecture 127 Algorithms and Linguistics Linguistic theory provides linguistic knowledge in the form of grammar rules theories about grammar rules Putting knowledge to some use involves processing, e.g.: parsing generation

Feb MRCLINT - Lecture 128 Computational Linguistics – Issues How are a grammar and a lexicon represented? By what algorithm can we actually discover the structure of a sentence? actually generate a sentence to express a particular meaning? How can we actually test a linguistic theory? Could an artificial system acquire a grammar with limited exposure to grammatical sentences.

Feb MRCLINT - Lecture 129 Computers and Language Twin Goals Scientific Goal: Contribute to Linguistics by adding a computational dimension. Technological Goal: Develop machinery capable of handling human language that can support “language engineering”

Feb MRCLINT - Lecture 130 Computers and Language: Applications Information Retrieval/Extraction Document Classification Question Answering Style and Spell Checking Integrated Multimodal Tasks Machine Translation