LING/C SC 581: Advanced Computational Linguistics


LING/C SC 581: Advanced Computational Linguistics Lecture 3 Jan 17th

2019 HLT Lecture Series
Speaker; Title; Date
Tatjana Scheffler; Analyzing Discourse Structure on Social Media; Friday Feb 15th, 3pm, Comm 311
Marcos Zampieri; Language Variation and Automatic Language Identification. The Case of Dialects and Similar Languages; Wednesday Feb 20th, noon, room TBA
Adriana Picoral; TBA; Wednesday Feb 27th, noon, room TBA
Gus Hahn-Powell; (no title listed); Wednesday Mar 13th, noon, room TBA
Miikka Silfverberg; (no title listed); Wednesday Mar 20th, noon, room TBA

Named Entity Recognition
Demo used in my other class: University of Illinois, https://cogcomp.org/page/demo_view/NERextended
Unfortunately, it has been down so far this week…

Named Entity Recognition
Google Cloud Natural Language: https://cloud.google.com/natural-language/
It also supplies sentiment/magnitude scores for the identified entities.
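To make the sentiment/magnitude scores concrete, here is a minimal sketch of how an entity sentiment request and response might be handled in Python. It assumes the public REST method documents:analyzeEntitySentiment and its JSON field names (name, type, sentiment.score, sentiment.magnitude). The helper names and the SAMPLE response are hypothetical and hand-written, the numbers are placeholders rather than real output, and no network call is made.

```python
def entity_sentiment_request(text):
    """Build the JSON body for a documents:analyzeEntitySentiment call (assumed shape)."""
    return {
        "document": {"type": "PLAIN_TEXT", "content": text},
        "encodingType": "UTF8",
    }


def summarize_entities(response):
    """Return (name, type, score, magnitude) for each entity in a response dict."""
    rows = []
    for ent in response.get("entities", []):
        sentiment = ent.get("sentiment", {})
        rows.append(
            (ent["name"], ent["type"], sentiment.get("score"), sentiment.get("magnitude"))
        )
    return rows


# Hand-written stand-in for a response; the scores are placeholders, not real output.
SAMPLE = {
    "entities": [
        {"name": "MetLife Stadium", "type": "LOCATION",
         "sentiment": {"score": 0.0, "magnitude": 0.0}},
        {"name": "Lisa Farbstein", "type": "PERSON",
         "sentiment": {"score": 0.0, "magnitude": 0.0}},
    ]
}

for row in summarize_entities(SAMPLE):
    print(row)
```

Score is the signed polarity and magnitude is the overall strength of the sentiment, so the two are best read together for each entity.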

Named Entity Recognition

Named Entity Recognition Illinois Named Entity Recognizer example: Helicopters will patrol the temporary no-fly zone around New Jersey's MetLife Stadium Sunday, with F-16s based in Atlantic City ready to be scrambled if an unauthorized aircraft does enter the restricted airspace. Down below, bomb-sniffing dogs will patrol the trains and buses that are expected to take approximately 30,000 of the 80,000-plus spectators to Sunday's Super Bowl between the Denver Broncos and Seattle Seahawks. The Transportation Security Administration said it has added about two dozen dogs to monitor passengers coming in and out of the airport around the Super Bowl. On Saturday, TSA agents demonstrated how the dogs can sniff out many different types of explosives. Once they do, they're trained to sit rather than attack, so as not to raise suspicion or create a panic. TSA spokeswoman Lisa Farbstein said the dogs undergo 12 weeks of training, which costs about $200,000, factoring in food, vehicles and salaries for trainers. Dogs have been used in cargo areas for some time, but have just been introduced recently in passenger areas at Newark and JFK airports. JFK has one dog and Newark has a handful, Farbstein said.

Dependency-Based Parsing

Universal Dependencies (UD)
http://universaldependencies.org/
100 treebanks in over 70 languages
Some relations involving dependent clauses:
ccomp: connects a higher verb with the verbal head of a sentential complement that has an overt subject
xcomp: connects a higher verb with the verbal head of a non-finite sentential complement without a subject
csubj: connects a higher verb with the verbal head of a sentential subject
vmod ➤ advcl/acl: connects a word to the verbal head of a reduced non-finite verbal modifier (vmod is deprecated in UD but still emitted by SyntaxNet)
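The clausal relations listed above can be picked out of a CoNLL-U file with a few lines of Python. The sketch below is illustrative only: the sentence "He says that you like to swim" is annotated by hand in CoNLL-U column order (not taken from a treebank), and the function name clausal_relations is hypothetical.

```python
# Hand-annotated rows in CoNLL-U column order:
# ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
ROWS = [
    "1 He he PRON _ _ 2 nsubj _ _",
    "2 says say VERB _ _ 0 root _ _",
    "3 that that SCONJ _ _ 5 mark _ _",
    "4 you you PRON _ _ 5 nsubj _ _",
    "5 like like VERB _ _ 2 ccomp _ _",
    "6 to to PART _ _ 7 mark _ _",
    "7 swim swim VERB _ _ 5 xcomp _ _",
]
# Real CoNLL-U is tab-separated, so join the fields with tabs.
CONLLU = "\n".join("\t".join(row.split()) for row in ROWS)


def clausal_relations(conllu, wanted=("ccomp", "xcomp", "csubj", "advcl", "acl")):
    """Return (relation, head word, dependent word) for the wanted relations."""
    tokens = {}
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        tokens[int(cols[0])] = (cols[1], int(cols[6]), cols[7])
    found = []
    for _, (form, head, rel) in sorted(tokens.items()):
        if rel in wanted:
            head_form = tokens[head][0] if head in tokens else "ROOT"
            found.append((rel, head_form, form))
    return found


print(clausal_relations(CONLLU))
# [('ccomp', 'says', 'like'), ('xcomp', 'like', 'swim')]
```

Here "like" has its own overt subject "you", so it attaches to "says" as ccomp, while "swim" has no subject of its own and attaches to "like" as xcomp.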

Google Cloud Natural Language
RRS Sir David Attenborough, "Boaty McBoatface"
Parsey McParseface (Andor et al., 2016)
Free: DragNN (Kong et al., 2017), the follow-on to SyntaxNet (2016)
Free sampling at https://cloud.google.com/natural-language/
The for-pay Google Cloud version is trained on additional proprietary corpora

Google Cloud Natural Language

Google Cloud Natural Language
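For comparing the Google dependency parse against a gold tree, it can help to flatten the parser output into (label, head, dependent) triples. The sketch below assumes the JSON shape returned by the REST method documents:analyzeSyntax, where each token carries text.content and a dependencyEdge with headTokenIndex and label, and where the root token points at itself. The SAMPLE response is a hand-written, trimmed stand-in, not real parser output, and dependency_edges is a hypothetical helper.

```python
def dependency_edges(response):
    """Flatten an analyzeSyntax-style response into (label, head, dependent) triples."""
    tokens = response["tokens"]
    edges = []
    for i, tok in enumerate(tokens):
        edge = tok["dependencyEdge"]
        head = edge["headTokenIndex"]
        # The root token is its own head in this response format.
        head_word = "ROOT" if head == i else tokens[head]["text"]["content"]
        edges.append((edge["label"], head_word, tok["text"]["content"]))
    return edges


# Hand-written stand-in for a parser response (illustrative only).
SAMPLE = {
    "tokens": [
        {"text": {"content": "Dogs"},
         "dependencyEdge": {"headTokenIndex": 1, "label": "NSUBJ"}},
        {"text": {"content": "patrol"},
         "dependencyEdge": {"headTokenIndex": 1, "label": "ROOT"}},
        {"text": {"content": "trains"},
         "dependencyEdge": {"headTokenIndex": 1, "label": "DOBJ"}},
    ]
}

for label, head, dependent in dependency_edges(SAMPLE):
    print(f"{label}({head}, {dependent})")
```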

Quick Homework 3
The Penn Treebank is partially installed as a corpus in NLTK Data (Sections 00 and 01: wsj_0001.mrg to wsj_0199.mrg)
    from nltk.corpus import treebank
Methods: .words() .sents() .parsed_sents() .fileids() on the corpus, and .draw() on an individual parse tree
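A minimal sketch of how these methods fit together, assuming NLTK and its treebank sample are installed. The helper names gold_parse and surface_words are hypothetical. The NLTK import sits inside the function so the definitions load even where NLTK is missing.

```python
def gold_parse(index):
    """Return the gold-standard tree for sentence number `index`."""
    # Imported here so this module still loads without NLTK installed.
    from nltk.corpus import treebank
    return treebank.parsed_sents()[index]


def surface_words(tree):
    """Keep the real words of a parse, dropping empty elements tagged -NONE-."""
    return [word for word, tag in tree.pos() if tag != "-NONE-"]
```

With the corpus available, tree = gold_parse(0) followed by " ".join(surface_words(tree)) gives text suitable for pasting into a parser demo, and tree.draw() or tree.pretty_print() displays the gold tree. Filtering on -NONE- matters because Penn Treebank leaves include traces and other empty elements that are not part of the original sentence.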

Quick Homework 3
Pick a random parse from treebank (see the session below)
Run it through the Google Cloud Parser
Analyze and comment on how it compares to the gold standard parse
Include the gold tree and the Google dependency parse
One PDF file
Due next Wednesday (by midnight)
>>> import random
>>> random.seed()
>>> random.randrange(0,3914)
1462
>>> len(treebank.sents())
3914
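If you want to be able to reproduce your pick later, one option is to use a fixed seed instead of the time-based random.seed(). This is only a sketch: pick_index is a hypothetical helper and the seed value is arbitrary. Note that randrange(0, n) returns an index from 0 to n-1, which matches Python list indexing into the 3914 sentences.

```python
import random


def pick_index(n_sents, seed=None):
    """Pick a sentence index in [0, n_sents); a fixed seed makes the pick repeatable."""
    rng = random.Random(seed)  # private generator, leaves the global one untouched
    return rng.randrange(0, n_sents)


# 3914 is len(treebank.sents()) for the NLTK sample, as shown in the session above.
index = pick_index(3914, seed=581)
print(index)
```

Calling pick_index(3914) with no seed behaves like the session above and gives a different sentence on each run.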