Introduction to Cognitive Science Linguistics Component Lecture 2 September 22, 2005. (2.00 p.m. – 3.50 p.m.) Venue: Meng Wah Complex Room 324 Lecturer:

Slides:



Advertisements
Similar presentations
Computational language: week 10 Lexical Knowledge Representation concluded Syntax-based computational language Sentence structure: syntax Context free.
Advertisements

Intro to Linguistics Class # 2 Chapter 1: What is Language?
Why study grammar? Knowledge of grammar facilitates language learning
Lecture #4 Psycholinguistics.
Introduction: The Chomskian Perspective on Language Study.
* Cognition: mental processes involved in acquiring knowledge.
GRAMMAR & PARSING (Syntactic Analysis) NLP- WEEK 4.
Module 14 Thought & Language. INTRODUCTION Definitions –Cognitive approach method of studying how we process, store, and use information and how this.
1 Words and the Lexicon September 10th 2009 Lecture #3.
Introduction to Linguistics and Basic Terms
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Spring 2006-Lecture 4.
Chapter Nine The Linguistic Approach: Language and Cognitive Science.
Topic: Theoretical Bases for Cognitive Method Objectives Trainees will be able to give reasons for the design and procedures of the Cognitive Method.
Introduction to Cognitive Science: Linguistics components
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
Psycholinguistics 12 Language Acquisition. Three variables of language acquisition Environmental Cognitive Innate.
Artificial Intelligence 2004 Natural Language Processing - Syntax and Parsing - Language Syntax Parsing.
Syntax and Semantics Dr. Walid Amer, Associate Professor of linguistics The Islamic university of Gaza February, 2009.
Models of Generative Grammar Smriti Singh. Generative Grammar  A Generative Grammar is a set of formal rules that can generate an infinite set of sentences.
Lecture 1 Introduction: Linguistic Theory and Theories
1. Introduction Which rules to describe Form and Function Type versus Token 2 Discourse Grammar Appreciation.
Language: Form, Meanings and Functions
Language Acquisition GST 400 Bethany Gilliam 04/10/06.
Introduction to Cognitive Science Linguistics Component Syntax and Semantics Date: 19th October 2000 Lecturer: Dr Bodomo Department of Linguistics.
14: THE TEACHING OF GRAMMAR  Should grammar be taught?  When? How? Why?  Grammar teaching: Any strategies conducted in order to help learners understand,
Main Branches of Linguistics
University of St. Thomas EDUC5355 Chapter 10—Cognitive, Language, and Literacy Development, 1-3 Dr. Ann Weiss The University of St. Thomas The Young Child.
EFL Anthony’s model: Approach Method Technique
The Grammar – Translation Method
X Language Acquisition
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
For Friday Finish chapter 23 Homework: –Chapter 22, exercise 9.
Literacy is...  the quality or state of being literate, esp. the ability to read and write  An individual’s ability to construct, create, and communicate.
What is linguistics  It is the science of language.  Linguistics is the systematic study of language.  The field of linguistics is concerned with the.
THE BIG PICTURE Basic Assumptions Linguistics is the empirical science that studies language (or linguistic behavior) Linguistics proposes theories (models)
SYNTAX Lecture -1 SMRITI SINGH.
Psycholinguistic Theory
Theories of First Language Acquisition
© 2008 The McGraw-Hill Companies, Inc. Chapter 8: Cognition and Language.
Cognitive and Language Development Pertemuan 4 Matakuliah: E Psikologi Pendidikan Tahun: 2010.
Copyright © Curt Hill Languages and Grammars This is not English Class. But there is a resemblance.
Universal Grammar Noam Chomsky.
For Wednesday Read chapter 23 Homework: –Chapter 22, exercises 1,4, 7, and 14.
PSY270 Michaela Porubanova. Language  a system of communication using sounds or symbols that enables us to express our feelings, thoughts, ideas, and.
Rules, Movement, Ambiguity
The Minimalist Program
Linguistic Anthropology Bringing Back the Brain. What Bloomfield Got “Right” Emphasized spoken language rather than written language The role of the linguist.
Second Language Acquisition
The Develop ment of Thought and Languag e Chapter 11 Thought & Language Chapter 10.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
1 Syntax 1. 2 In your free time Look at the diagram again, and try to understand it. Phonetics Phonology Sounds of language Linguistics Grammar MorphologySyntax.
Levels of Linguistic Analysis
Language and Cognition Colombo, June 2011 Day 2 Introduction to Linguistic Theory, Part 3.
NATURAL LANGUAGE PROCESSING
Psychology of Human Learning Edfd 302 mgmsantos. Language:  the words, their pronunciation, and the methods of combining them, used and understood by.
Introduction : describing and explaining L2 acquisition Ellis, R Second Language Acquisition (3 – 14)
Chapter 11 Language. Some Questions to Consider How do we understand individual words, and how are words combined to create sentences? How can we understand.
Language: our spoken, written, or signed words & the ways we combine them to communicate meaning! “When we study language, we are approaching what some.
Language learning Approaches & key theorists. Historical language approaches 1 Grammar/translation Formalised end 19 th C. Mind consisting of separate.
Chapter 10 Language acquisition Language acquisition----refers to the child’s acquisition of his mother tongue, i.e. how the child comes to understand.
10/31/00 1 Introduction to Cognitive Science Linguistics Component Topic: Formal Grammars: Generating and Parsing Lecturer: Dr Bodomo.
Text Linguistics. Definition of linguistics Linguistics can be defined as the scientific or systematic study of language. It is a science in the sense.
PSYC 206 Lifespan Development Bilge Yagmurlu.
An Introduction to Linguistics
An Introduction to the Government and Binding Theory
What is Language Acquisition?
Natural Language - General
Traditional Grammar VS. Generative Grammar
Linguistic aspects of interlanguage
COMPILER CONSTRUCTION
Presentation transcript:

Introduction to Cognitive Science Linguistics Component Lecture 2 September 22, (2.00 p.m. – 3.50 p.m.) Venue: Meng Wah Complex Room 324 Lecturer: Dr. A. B. Bodomo Department of Linguistics

Topic 3: Formal Grammar: Parsing and Generation

3 Introduction In my previous lectures, we discussed how tacit linguistic knowledge can be represented at various levels of phonology, morphology, syntax, semantics, pragmatics, and their interfaces, including morphophonology, morphosyntax, and the syntax-semantics interrelationships. In this lecture, we shall look closely at how these linguistic knowledge representations can be formalised into an algorithm, a computational procedure for processing this linguistic knowledge.

4 Keywords Constituent structure rules initial symbol terminal symbol non-terminal symbol generative grammar formal grammar

5 Formal devices and notation The symbol ‘  ’ –indicates that a node is ‘rewritten as…’ or ‘consists of ’, or ‘has the constituents…’ This is used in rewrite rules of the type: –S  NP + VP a sentence, S, has the constituents: noun phrase (NP) and verb phrase (VP) Optionality in the grammar is expressed as {X, Y}. –This means apply either X or Y but not both

6 Formal devices and notation (cont’d) #The symbol # is used to indicate constituent boundary # _ _# –e.g. # _ is word initial while _# is word final X (Y)The notation X (Y) implies that X is obligatory and may be followed by Y Initial symbolInitial symbol: the symbol from which a rewrite rule begins (e.g. S) Terminal symbolTerminal symbol: the end symbols from which no constituent structure can be further developed (N, V, Art). All others are non- terminal symbols (e.g. NP, VP).

7 Generating and Parsing sentences Two main aspects of grammatical information processing: Generating and Parsing sentences Before we begin let us illustrate with a simple grammar and lexicon, using the following sentence: –The students greeted the teacher –The students greeted the teacher.

8 The students greeted the teacher. Grammar: –S  NP +VP –VP  V + NP –NP  Art + N Lexicon 1: –Greeted: V, - NP –Students: N –The: Art –Teacher: N But you have to augment i.e. increase the lexicon as follows: Lexicon2: an: Art teacher: N greeted: V, -NP the: Art students: N scared: V, -NP apple: N ate: V, -NP child: N This grammar can also generate (i.e. produce) the following sentences: The teacher greeted the students The teacher scared the students The child ate an apple

9 Sentence Generation:the algorithm To produce a sentence we need three things: A set of phrase structure rules (as illustrated above) A lexicon (as illustrated above), and A lexical insertion rule (as explained below) an instruction to select the right word from a lexicon A lexical insertion rule is an instruction to select the right word from a lexicon The following is an example of a lexical rule:

10 Lexical insertion rule For each terminal symbol of a phrase structure rule, select a word from the lexicon that satisfies the following conditions: –terminal symbol (e.g. N, V) It is a member of the class of –its subcategorization frame matches that of the terminal symbol (e.g. V, _NP). Attach this word as the daughter of this terminal symbol. sentence generatorThe set of rules above constitutes what is known as a sentence generator.

11 The whole procedure of beginning with an initial symbol and then working through phrase structure rules to adding the lexical items via lexical insertions rules is driven by an algorithm or a set of instructions. Let us set out an algorithm for the generation (production) of the sentence: The students greeted the teacher, a grammar and a lexicon as follows:

12 The students greeted the teacher Grammar: PS Rule (a): S  NP +VP PS Rule (b): VP  V + NP PS Rule (c): NP  Art + N Lexicon1: Greeted: V, - NP Students: N The: Art Teacher: N Rule 1 Start with the initial symbol, S. Rule 2 For every non-terminal symbol, X, find a phrase structure rule with X as left-hand symbol and others as the right hand symbol(s), and develop a rewrite rule with X as the mother and the right hand symbols as ordered daughters. Rule 3 Apply rule 2 until all branches end in terminal symbols. Rule 4 Apply lexical rule iteratively until every terminal symbol is replaced by a lexical item.

Illustrating the algorithm The professorgreetedthe students N NArt V NP VP NP S Applying Rule 1 Applying Rule 2,3 Applying Rule 3 Applying Rule 4

14 From the above we can see that we have started from an initial string and have ended with terminal strings with lexical items as their daughters. A sentence has thus been generated (produced), telling us how this sentence is built up. Now, let us see how we can begin with an existing sentence and then break it down into its component parts by applying rules.

15 Sentence parsing: the algorithm To parse a sentence means to analyse it into its constituent parts by the systematic application of lexical insertion rules and some phrase structure rules. It is like the reverse process of generation.

16 Types of Parsing Top-down: Begin with the symbol S. Bottom-up: Begin with terminal symbols (words). Possible research: Which types of parsing in natural languages provide the most cognitively realistic and efficient parser ?

17 Some sentence parsing rules which constitute a PARSER For a sentence, S – Rule 1 : Determine from the lexicon the word class of every item and develop a partial tree for each word where the word class label dominates the word. – Rule 2 : Find a PS rule of the type X  Y, Z and where the right hand symbols match some sequence of categories in the structure so far, and develop a partial tree with X as the mother and the right hand symbols as ordered daughters. – Rule 3 : Continue rule 2 until the root, S, is reached and there are no unattached strings.

18 The man drank the tea. Grammar: PS Rule1: S  NP +VP PS Rule2: VP  V + NP PS Rule3: NP  Art + N Lexicon1: drank: V, - NP man: N the: Art tea: N Art N V Art N Art N V Art N The man drank the tea The man drank the tea NP NP NP NP Art N V Art N Art N V Art N The man drank the tea The man drank the tea Applying Rule 1 Applying Rule 2

VP NP NP NP NP Art N V Art N Art N V Art N The man drank the tea The man drank the tea S NP VP NP VP NP NP Art N V Art N Art N V Art N The man drank the tea The man drank the tea Applying Rule 3

20 Conclusion Parsing and generation of natural language data is a very important area of linguistics, especially in computer applications of natural languages which has become an important aspect of the computer or information processing industry.

Topic 4: Language and Literacy Acquisition

22 Keywords language acquisition innateness hypothesis language faculty / Language Acquisition Device (LAD) literacy levels of literacy literacy acquisition

23 Introduction ThemeTheme –A survey of how linguistic knowledge is acquired/learnt by speakers of a language, from the point of view of spoken language and from the point of view of literacy (reading and writing). ObjectiveObjective –an understanding of the basic terms and issues in language and literacy acquisition –an interface approach: rather than rigidly discussing these issues from language acquisition as separate and different from literacy acquisition, we will look at how language acquisition relates to literacy acquisition.

24 What is language acquisition? Gleitman and Bloom 1999:434 –‘refers to the process of attaining a specific variant of human language…the fundamental puzzle in understanding this process has to do with the open- ended nature of what is learned: children appropriately use words acquired in one context to make reference in the next, and they construct novel sentences to make known their changing thoughts and desires’ (in MIT Encyclopedia of the Cognitive Sciences). Crystal 1997: 430 –The process of learning a first language in children. –The analogous process of gaining a foreign or second language.

25 Explaining how languages are acquired In previous lectures we have tried to account for how all and only the grammatical sentences of a language are produced and represented in the brain of the speakers of a language. However, a complete account of linguistic knowledge representation must address the issue of how we acquire a language as children and how we learn foreign languages as adults. We will mainly be concerned with first language acquisition and not foreign language learning.

26 Stages of language development the single word stage (12-18 months) –the language of the child consists of just a few isolated words of the target language, e.g. ‘mamma’, ‘daddy’,etc. –very little grammatical development the grammar stage (19-29 months) –marked by the emergence of a few nominal and verbal inflections in languages that have these. –a few phrases and word utterances apparently strung together: ‘mammy, milk’; ‘daddy bye bye’, etc. 30 months –can produce more adult-like speech: ‘Where's daddy ?’ ‘Daddy, I want to go with you.’

27 Explaining language acquisition: The reason for the uniformity and rapidity in child language acquisition is contained in the innateness hypothesis. This is, at least, the position of Chomsky and most cognitive approaches to linguistic explanation. In this hypothesis, language acquisition is determined by a biologically endowed innate language faculty (also called Language Acquisition Device (LAD)). LAD or language learning ‘ program ’ in children ’ s brains provides them with a set of procedures (let us call it an ‘ algorithm ’ since we are computer/cognitive science inclined) for developing a grammar. –Input: linguistic experience they get from the parents and teachers.

28 The nature of the language faculty Children can acquire any language as their native tongue. –e.g. a child of Cantonese speaking parents growing up in England can learn to speak perfect English as her native tongue. Those aspects of language innately determined are universal –language faculty does not vary significantly from human to human An important aspect in the language faculty is the search for principles of Universal Grammar!

29 Universal Grammar (UG) A theory of the human language faculty, i.e. a module of the mind/brain involved in the basic design of language (Noam Chomsky) It is part of an innate biologically endowed language faculty, an innate mental organ specific to the human species It allows us to perceive and interpret information governed by certain formal constraints These formal constraints refer to a system of rules and representations and one of its operations (its grammar) by which the acceptable sentences of a language can be generated –Examples of formal universals, linguistic constraints of an abstract nature: the binding principles determining what can or cannot be the antecedent of an anaphoric, pronominal, or fully referential nominal element, etc.

30 Literacy Acquisition Literacy: the ability to read, write and calculate basic numbers Difficult to define: –can mean different things to different people in different areas: computer literacy, investment literacy, etc. Is literacy part of our mental, cognitive faculty? –Yes, because any human can acquire literacy i.e. learn how to read, write and calculate basic numbers given the right environment

31 Levels of Literacy (cf. Stages of language acquisition) 6 stages of reading (Daswani 1999) –Stages 1-3: Pre-reading, decoding, fluency (approx. grades 1 – 3) –Stage 4: Acquiring new knowledge (approx.grades 4 – 8) –Stage 5: Reading a range of complex materials critically (grades 9 – 12) –Stage 6: Mature reader: able to read for various purposes: professional, personal, civic (university and beyond)

32 The relationship between language and literacy acquisition Traditional/historical view of child language acquisition: –learning to speak happens up to the age of five years, while learning to read happens after five. Now they are seen as very intertwined i.e. very related: learning to speak and learning to be literate both deal with learning to use language the basis of learning to speak has been outlined to provide an ecology for literacy. The most important lesson is that learning to speak and learning to read are very much interwoven.

33 Evidence of the interface of language and literacy acquisition They are both part of learning to USE language. Both need input from the environment. –can be compared with Vygotsky's idea of ZOPED, zone of proximal development, i.e. the distance between child initiative and ability of child to do things under the influence of parental support. –The learning environment: participants, situation, activity and a mechanism Literacy acquisition is like language acquisition (cf. Givon's idea of literacy acquisition as a weak reflex of language acquisition). Literacy is best acquired in a language one has acquired.

34 Conclusion Literacy (reading and writing) is then another level/kind of linguistic knowledge representation. Spoken and written linguistic knowledge representation interface with each other and are very intertwined. Language and literacy acquisition have very important social, educational and cognitive implications. Language and Literacy acquisition should therefore form an integral part of cognitive science.

35 References David Barton The roots of literacy. Literacy: An Introduction to the Ecology of Written Language. Oxford UK and Cambridge USA: Blackwell. Chapter 9, p C. J. Daswani Literacy. In Bernard Spolsky (ed) Concise Encyclopedia of Educational Linguistics. Oxford: Elsevier Science Ltd.. Viv Edwards and David Corson (eds.) Encyclopedia of Language and Education, Volume 2: Literacy. Netherlands: Kluwer Academic Publishers. Talmy Givon The grammar of Literacy. In Syntaxis, 1, 1998: Elfrieda Hierbert Literacy in preschool programs. In Alan C. Purves et al.(eds.) Encyclopedia of English Studies and Language Arts. New York: Scholastic Ernest Lepore and Zenon Pylyshyn (eds) What Is Cognitive Science. Blackwell Publishers. (especially chapters 10, 11, 12, and 13) Neil Stillings and others Cognitive Science: An Introduction. MIT Press. (especially chapters 6, 9, 10, and 11) Daniel A. Wagner Literacy: definitions. In Alan C. Purves et al.(eds.) Encyclopedia of English Studies and Language Arts. New York: Scholastic R. Wilson and Frank C. Neil (eds.) The MIT Encyclopedia of the Cognitive Sciences. MIT Press. –Lila Gleitman and Paul Bloom. Language Acquisition. p –David Olson. Literacy. p

36 Tentative List of research topics for Cognitive Science Students Supervisor: Dr. Adams BODOMO Topics in Syntax: Theory, Description and Application –Building human language components in Computational Systems –The LFG treatment of serial verbs, Complex Predicates, and other verbal constructions in various languages: French, Norwegian, Japanese, Chinese, Dagaare, etc Topics in Language and Literacy as cognitive processes –Chinese writing and computer technology: Survey and evaluation of various inputting systems. –New forms and functions of language and literacy in the age of Information technology ( s, ICQ, bulletin boards, mobile phone texting,etc).:A survey of SMS texting as a cognitive and communicative process in HK The grammar of aphasic patients

37 Further studies - courses by Dr Bodomo LING Language.com: Language in the Contemporary World (1st year undergraduate, co-taught with other staff members) LING Language and Literacy in the Information Age LING Syntactic Theory LING Lexical-Functional Grammar LING Language and Information Technology LING2050 – Grammatical Description LING2051 – French Syntax and Universal Grammar Also consider B.A. in Human Language Technology (HLT) as an option for a minor

38 Take-home Quiz Please submit your answers to your tutor on or before September 22, 2005.

39 - The End -