Introduction to NLP What is Natural Language Processing?

Slides:



Advertisements
Similar presentations
Introduction to NLP What is Natural Language Processing?
Advertisements

LING NLP 1 Introduction to Computational Linguistics Martha Palmer April 19, 2006.
1 Natural Language Processing for the Web Prof. Kathleen McKeown 722 CEPSR, Office Hours: Wed, 1-2; Mon 3-4 TA: Fadi Biadsy 702 CEPSR,
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
Machine Learning in Natural Language Processing Noriko Tomuro November 16, 2006.
Natural Language Processing Prof: Jason Eisner Webpage: syllabus, announcements, slides, homeworks.
NATURAL LANGUAGE PROCESSING AN AI TASK DOMAIN. TASK DOMAINS OF AI  Games Playing  Expert Systems  Natural Language  Neural Networks  Robotics.
Natural Language Processing Ellen Back, LIS489, Spring 2015.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
Siemens Big Data Analysis GROUP 3: MARIO MASSAD, MATTHEW TOSCHI, TYLER TRUONG.
SI485i : NLP Day 1 Intro to NLP. Assumptions about You You know… how to program Java basic UNIX usage basic probability and statistics (we’ll also review)
ELN – Natural Language Processing Giuseppe Attardi
1 *Textbooks you need Manning, C. D., Sch ü tze, H.: Foundations of Statistical Natural Language Processing. The MIT Press ISBN [required]
9/8/20151 Natural Language Processing Lecture Notes 1.
1 Ling 569: Introduction to Computational Linguistics Jason Eisner Johns Hopkins University Tu/Th 1:30-3:20 (also this Fri 1-5)
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
For Friday Finish chapter 23 Homework: –Chapter 22, exercise 9.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
Introduction to NLP ch1 What is Natural Language Processing?
CHAPTER 13 NATURAL LANGUAGE PROCESSING. Machine Translation.
Natural Language Processing Artificial Intelligence CMSC February 28, 2002.
11 Chapter 14 Part 1 Statistical Parsing Based on slides by Ray Mooney.
CS 124/LINGUIST 180 From Languages to Information
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
CS 188: Artificial Intelligence Spring 2009 Natural Language Processing Dan Klein – UC Berkeley 1.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
Jeff Howbert Introduction to Machine Learning Winter Machine Learning Natural Language Processing.
Natural Language Processing Slides adapted from Pedro Domingos
AUTONOMOUS REQUIREMENTS SPECIFICATION PROCESSING USING NATURAL LANGUAGE PROCESSING - Vivek Punjabi.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Context Free Grammars. Slide 1 Syntax Syntax = rules describing how words can connect to each other * that and after year last I saw you yesterday colorless.
Natural Language Processing Vasile Rus
COSC 6336 Natural Language Processing
/665 Natural Language Processing
Machine Learning for Computer Security
Taking a Tour of Text Analytics
Sentiment analysis algorithms and applications: A survey
Natural Language Processing
Sentence Modeling Representation of sentences is the heart of Natural Language Processing A sentence model is a representation and analysis of semantic.
Modeling Grammaticality
Lantern Festival Riddles
School of Computer Science & Engineering
Statistical NLP: Lecture 3
Natural Language Processing
Insight Ahmad Jabi | Yazan Shakhshir | Saleem Abu Dhair
Text Analytics / NLP Charan Puvvala, Mirabel Technologies Inc.
Are End-to-end Systems the Ultimate Solutions for NLP?
Introduction to Machine Learning and NLP
Text Analytics Giuseppe Attardi Università di Pisa
Probabilistic and Lexicalized Parsing
Machine Learning in Natural Language Processing
Probabilistic and Lexicalized Parsing
Parsing and More Parsing
Text Mining & Natural Language Processing
Syntax strikes Back Students, I am your answer.
CS246: Information Retrieval
CS224N Section 3: Corpora, etc.
Modeling Grammaticality
Introduction to Python programming for KS3
Semantics Going beyond syntax.
David Kauchak CS159 – Spring 2019
Anything and Everything You Would
Artificial Intelligence 2004 Speech & Natural Language Processing
From Language to Information
Information Retrieval
Extracting Why Text Segment from Web Based on Grammar-gram
CS249 Advanced Seminar: Learning From Text
CS249: Neural Language Model
Presentation transcript:

Introduction to NLP What is Natural Language Processing? Many slides reused from Dan Jurafsky/Christopher Manning (Stanford)

Question Answering: IBM’s Watson Won Jeopardy on February 16, 2011! WILLIAM WILKINSON’S “AN ACCOUNT OF THE PRINCIPALITIES OF WALLACHIA AND MOLDOVIA” INSPIRED THIS AUTHOR’S MOST FAMOUS NOVEL Bram Stoker

Information Extraction Event: Curriculum mtg Date: Aug. 23, 2017 Start: 10:00am End: 11:30am Where: JBHT 532 Subject: curriculum meeting Date: Aug. 22, 2017 To: Susan Gauch Hi Susan, we’ve now scheduled the curriculum meeting. It will be in JBHT 532 tomorrow from 10:00-11:30. -Dave Create new Calendar entry

Information Extraction & Sentiment Analysis Attributes: zoom affordability size and weight flash ease of use Size and weight ✓ nice and compact to carry! since the camera is small and light, I won't need to carry around those heavy, bulky professional cameras either! the camera feels flimsy, is plastic and very light in weight you have to be very delicate in the handling of this camera ✓ ✗

Machine Translation Helping human translators Fully automatic Enter Source Text:  这 不过 是 一 个 时间 的 问题 . Translation from Stanford’s Phrasal: This is only a matter of time.

Making progress on this problem… The task is difficult! What tools do we need? Knowledge about language Knowledge about the world A way to combine knowledge sources How we generally do this: probabilistic models built from language data P(“maison”  “house”) high P(“L’avocat général”  “the general avocado”) low Luckily, rough text features can often do half the job.

Ambiguity makes NLP hard: “Crash blossoms” 100% REAL Violinist Linked to JAL Crash Blossoms Teacher Strikes Idle Kids Red Tape Holds Up New Bridges Hospitals Are Sued by 7 Foot Doctors Juvenile Court to Try Shooting Defendant Local High School Dropouts Cut in Half

A few of the 83+ parses for The post office will hold out discounts and service concessions as incentives. [Shortened WSJ sentence.] • S NP Aux VP The post office will V NP hold out NP Conj NP discounts and N N PP service concessions as incentives 8

• S NP Aux VP The post office will V NP PP hold out NP Conj NP as incentives discounts and service concessions 9

• 1 S NP Aux VP The post office will VP Conj VP V NP and V NP PP hold out discounts service concessions as incentives

• 1 S NP Aux VP The post office will VP Conj VP V NP and V NP PP hold out discounts service concessions as incentives

• S NP Aux VP The post office will V PP PP hold P NP as incentives out Conj NP discounts and service concessions 14

• 1 S NP VP The post office will hold VP Conj VP V NP and V NP PP out discounts service concessions as incentives

Famous Ambiguity Examples Time flies like an arrow Translation English -> Russian -> English: The spirit is willing but the flesh is weak Became “The vodka is good but the meat is rotten” Punctuation matters Czarina saved a man’s life by changing: Pardon impossible, to be sent to Siberia Became “Pardon, impossible to be sent to Siberia”

Where do problems come in? Syntax Part of speech ambiguities Attachment ambiguities Semantics Word sense ambiguities (Semantic interpretation and scope ambigui- ties) 15

Why else is natural language understanding difficult? non-standard English segmentation issues idioms Great job @justinbieber! Were SOO PROUD of what youve accomplished! U taught us 2 #neversaynever & you yourself should never give up either♥ dark horse get cold feet lose face throw in the towel the New York-New Haven Railroad neologisms world knowledge tricky entity names unfriend Retweet bromance Mary and Sue are sisters. Mary and Sue are mothers. Where is A Bug’s Life playing … Let It Be was recorded … … a mutation on the for gene … But that’s what makes it fun!

Language Technology making good progress still really hard Sentiment analysis still really hard mostly solved Best roast chicken in San Francisco! Question answering (QA) The waiter ignored us for 20 minutes. Q. How effective is ibuprofen in reducing fever in patients with acute febrile illness? Spam detection Coreference resolution ✓ Let’s go to Agra! ✗ Paraphrase Carter told Mubarak he shouldn’t run again. Buy V1AGRA … Word sense disambiguation (WSD) XYZ acquired ABC yesterday Part-of-speech (POS) tagging I need new batteries for my mouse. ABC has been taken over by XYZ ADJ ADJ NOUN VERB ADV Summarization Parsing Colorless green ideas sleep furiously. The Dow Jones is up Economy is good I can see Alcatraz from the window! The S&P500 jumped Named entity recognition (NER) Housing prices rose Machine translation (MT) PERSON ORG LOC Dialog 第13届上海国际电影节开幕… Where is Citizen Kane playing in SF? Einstein met with UN officials in Princeton The 13th Shanghai International Film Festival… Castro Theatre at 7:30. Do you want a ticket? Information extraction (IE) Party May 27 add You’re invited to our dinner party, Friday May 27 at 8:30

Statistical NLP methods Involve deriving numerical data from text Are usually but not always probabilistic Many techniques are used: – n-grams, history-based models, decision trees / de- cision lists, memory-based learning, loglinear mod- els, HMMs, neural networks, vector spaces, graphi- cal models, …

This class Teaches key theory and methods for statistical NLP: Probability and information theory classifiers N-gram language modeling Statistical Parsing Inverted index, tf-idf, vector models of meaning For practical, robust real-world applications Information extraction Spelling correction Information retrieval Sentiment analysis

Skills you’ll need Simple linear algebra (vectors, matrices) Basic probability theory C++ or Java or Python programming Knowledge of data structures User-level knowledge of linux

What is Natural Language Processing? Introduction to NLP What is Natural Language Processing?