CS 4705 Natural Language Processing Julia Hirschberg COMS 4705 Fall 2010.

What is Natural Language Processing?
Software that can recognize, analyze and generate text and speech
AKA computational linguistics
At Columbia:
– Michael Collins, CS, parsing, machine translation
– Mona Diab, CCLS, semantics
– Nizar Habash, CCLS, morphology, machine translation
– Julia Hirschberg, CS, spoken language processing
– Kathy McKeown, CS, summarization, generation
– Becky Passonneau, CCLS, dialogue systems, reference resolution
– Owen Rambow, CCLS, syntax, parsing

Why is NLP hard? Some Headlines…
– Something Went Wrong In Jet Crash, Expert Says
– Police Begin Campaign To Run Down Jaywalkers
– Drunk Gets Nine Months In Violin Case
– Farmer Bill Dies In House
– Iraqi Head Seeks Arms
– Enraged Cow Injures Farmer With Ax
– Stud Tires Out
– Eye Drops Off Shelf
– Teacher Strikes Idle Kids
– Squad Helps Dog Bite Victim

What will we learn about in this course?
– Morphology: the way words are formed
– Syntax: the way words are grouped together into larger constituents and phrases and the way these phrases can be ordered
– Semantics: the context-independent ‘meaning’ of utterances
– Pragmatics: the context-dependent ‘meaning’ of utterances
Goal: What is a speaker/writer meaning to convey?

Morphology
Stud tires out: Is ‘stud’ an adjective or a noun? Is ‘tires’ a noun or a verb?
Internet search: ‘union activities in New York’
– What to look for?
– Union/unions; activities/activity
– Active? Action? Actor? Actual? Academic?
– New vs. New York, York vs. yorkie
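The search example can be made concrete with a toy normalizer. This is a minimal sketch, not a real stemmer: the functions `naive_stem` and `expand_query` and their suffix rules are hypothetical, chosen only to show that matching union/unions and activities/activity takes some morphological knowledge, and that naive tokenization loses the fact that ‘New York’ is one unit.

```python
def naive_stem(word: str) -> str:
    """Strip two common English plural suffixes (toy rules, assumed for illustration)."""
    w = word.lower()
    if w.endswith("ies") and len(w) > 4:
        return w[:-3] + "y"          # activities -> activity
    if w.endswith("s") and not w.endswith("ss") and len(w) > 3:
        return w[:-1]                # unions -> union
    return w


def expand_query(query: str) -> list[str]:
    """Normalize each whitespace-separated token of a search query independently."""
    return [naive_stem(tok) for tok in query.split()]


if __name__ == "__main__":
    print(expand_query("union activities in New York"))
    print(naive_stem("unions"), naive_stem("active"))
```

Note that ‘active’ is left alone (it is not conflated with ‘activity’), while ‘New’ and ‘York’ come out as two unrelated tokens; handling those cases needs more than suffix stripping.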

Syntax
Constituent Structure:
– Teacher Strikes Idle Kids
– Enraged Cow Injures Farmer With Ax
Word Order and Position and Meaning
– John hit Bill.
– Bill was hit by John.
– Bill, John hit.
– Who John hit was Bill.
– I said John hit Bill.
– John hits Bill.
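The constituent-structure ambiguity in ‘Teacher Strikes Idle Kids’ can be counted mechanically. The sketch below uses a CKY-style chart over a tiny hand-written grammar; the lexicon, the rules, and the function name `count_parses` are all assumptions made for this illustration, not a grammar of English.

```python
from collections import defaultdict

# Hypothetical toy lexicon: each word may belong to several categories.
LEXICON = {
    "teacher": {"N", "NP"},
    "strikes": {"N", "V"},
    "idle": {"Adj", "V"},
    "kids": {"N", "NP"},
}

# Hypothetical binary rules: (parent, left child, right child).
BINARY_RULES = [
    ("S", "NP", "VP"),
    ("NP", "N", "N"),      # noun compound: "teacher strikes"
    ("NP", "Adj", "N"),    # "idle kids"
    ("VP", "V", "NP"),
]


def count_parses(words, start="S"):
    """Return how many distinct parse trees rooted in `start` cover all the words."""
    n = len(words)
    chart = defaultdict(lambda: defaultdict(int))   # (i, j) -> category -> tree count
    for i, word in enumerate(words):
        for cat in LEXICON[word]:
            chart[(i, i + 1)][cat] += 1
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length
            for k in range(i + 1, j):               # split point
                for parent, left, right in BINARY_RULES:
                    chart[(i, j)][parent] += chart[(i, k)][left] * chart[(k, j)][right]
    return chart[(0, n)][start]


if __name__ == "__main__":
    print(count_parses("teacher strikes idle kids".split()))
```

Under this grammar the headline gets two analyses: one where the teacher strikes the idle kids, and one where the teacher strikes (a noun compound) idle the kids.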

Semantics
Word meaning – semantic roles
– John picked up a bad cold.
– John picked up a large rock.
– John picked up Radio Netherlands on his radio.
Is meaning compositional?
– Squad helps dog bite victim
– Enraged cow injures farmer with ax

Pragmatics
Going Home, a play in one act (thanks to Bonnie Dorr)
– Scene 1: Pennsylvania Station, NY
  Bonnie: Long Beach?
  Passerby: Downstairs, LIRR Station.
– Scene 2: Ticket Counter, LIRR Station
  Bonnie: Long Beach?
  Clerk: $4.50.
– Scene 3: Information Booth, LIRR Station
  Bonnie: Long Beach?
  Clerk: 4:19, Track 17.
– Scene 4: On the train, vicinity of Forest Hills
  Bonnie: Long Beach?
  Conductor: Change at Jamaica.
– Scene 5: On the next train, vicinity of Lynbrook
  Bonnie: Long Beach?
  Conductor: Right after Island Park.

Algorithms
Rule-based
– Symbolic parsers and morphological analyzers
– Finite state automata
Probabilistic/statistical
– Learned from observation of (labeled) data
– Predicting new data based on old
– Machine learning
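To illustrate the probabilistic/statistical side, here is a minimal sketch of a most-frequent-tag baseline: it "learns" from a few labeled sentences and predicts tags for new text. The training sentences, the tag names, and the functions `train` and `tag` are invented for this example; it is one simple way to learn from labeled data, not a method prescribed by the course.

```python
from collections import Counter, defaultdict

# Hypothetical labeled data: (word, part-of-speech tag) pairs.
TRAIN = [
    [("the", "DET"), ("teacher", "NOUN"), ("strikes", "VERB"), ("the", "DET"), ("desk", "NOUN")],
    [("the", "DET"), ("strikes", "NOUN"), ("ended", "VERB")],
    [("the", "DET"), ("teacher", "NOUN"), ("strikes", "VERB"), ("back", "ADV")],
]


def train(tagged_sentences):
    """Map each word to the tag it carried most often in the labeled data."""
    counts = defaultdict(Counter)
    for sentence in tagged_sentences:
        for word, tag_label in sentence:
            counts[word][tag_label] += 1
    return {word: c.most_common(1)[0][0] for word, c in counts.items()}


def tag(model, words, default="NOUN"):
    """Tag each word independently; unseen words get the default tag."""
    return [(w, model.get(w, default)) for w in words]


if __name__ == "__main__":
    model = train(TRAIN)
    print(tag(model, ["the", "strikes", "ended"]))
```

Because each word is tagged without looking at its neighbors, ‘strikes’ in ‘the strikes ended’ is labeled a verb, the tag it had most often in training. Errors like this are why statistical models usually take context into account.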

Current Real-World Applications
– Search: very large corpora, e.g. Google
– Question answering: e.g. IBM’s Jeopardy!, DARPA who/what/where…, Ask Jeeves
– Translating between one language and another: e.g. Google Translate, Babelfish
– Summarizing very large amounts of text or speech: e.g. your email, the news, voicemail
– Sentiment analysis: restaurant or movie reviews
– Dialogue systems: e.g. Amtrak’s ‘Julie’

Instructor
Julia Hirschberg
– CEPSR 705
– Focus: Spoken Language Processing
– Lab: The Speech Lab, CEPSR 7LW3-A
– Research:
  Deceptive speech
  Charismatic speech
  Emotional speech: anger, uncertainty
  Speech summarization: Broadcast News
  Spoken Dialogue Systems: Games Corpus
  ‘Translating Prosody’: English – Mandarin
  Text2Scene Synthesis

Course Details
Teaching Assistants:
– Mohamed Altantawy
  Office Hours: CEPSR 7LW1 (Speech Lab), W 5-6, Th 5:30-6:30
  Will manage CVN course
– Wei Yun Ma
  Office Hours: CEPSR 725, Tu
Syllabus: http://www1.cs.columbia.edu/~julia/courses/CS4705/syllabus10.htm

Text: Daniel Jurafsky and James H. Martin, Speech and Language Processing, second edition
– Note errata available on website
Check courseworks for additional information on class, homework assignments, posting questions
Assignments:
– 3 homework assignments: Question-answering, text classification, delightful surprise
– Midterm and final exams
– Five ‘free’ late days for homeworks (not usable on HW1); after that, 10% off per late day
– You will need a CS account

Recorded Lecture Availability
For on-campus students
– On CVN website

Grading
– HW1: 10%
– HW2: 20%
– HW3: 20%
– Midterm: 15%
– Final: 25%
– Class participation: 10%

Academic Integrity
Copying or paraphrasing someone's work (code included), or permitting your own work to be copied or paraphrased, even if only in part, is forbidden, and will result in an automatic grade of 0 for the entire assignment or exam in which the copying or paraphrasing was done. Your grade should reflect your own work. If you are going to have trouble completing an assignment, please talk to the instructor or TA in advance of the due date.
Everyone: Read/write protect your homework files at all times.

For Next Class
– Look at syllabus – ask questions about anything you don’t understand
– Read Chapters 1-2 of J&M