LING/C SC 581: Advanced Computational Linguistics

Slides:



Advertisements
Similar presentations
LING 581: Advanced Computational Linguistics Lecture Notes January 30th.
Advertisements

LING 581: Advanced Computational Linguistics Lecture Notes January 19th.
LING 581: Advanced Computational Linguistics Lecture Notes January 19th.
LING/C SC/PSYC 438/538 Lecture 8 Sandiway Fong. Adminstrivia.
HW7 Extracting Arguments for % Ang Sun March 25, 2012.
NLP. Introduction to NLP Background –From the early ‘90s –Developed at the University of Pennsylvania –(Marcus, Santorini, and Marcinkiewicz 1993) Size.
LING/C SC/PSYC 438/538 Lecture 18 Sandiway Fong. Adminstrivia Homework 7 out today – due Saturday by midnight.
LING/C SC/PSYC 438/538 Lecture 9 Sandiway Fong. Adminstrivia Homework 4 graded Homework 5 out today – Due Saturday night by midnight – (Gives me Sunday.
LING/C SC 581: Advanced Computational Linguistics Lecture Notes Feb 3 rd.
LING/C SC 581: Advanced Computational Linguistics Lecture Notes Feb 17 th.
CULTURE What is culture? The way of life of a group of people who share similar beliefs and customs –T–These are called “cultural traits” –L–Language,
Coping with Problems in Grammars Automatically Extracted from Treebanks Carlos A. Prolo Computer and Info. Science Dept. University of Pennsylvania.
CJA 444 Week 4 Learning Team Communications Paper or Presentation NEW Check this A+ tutorial guideline at
BSHS 441 Week 5 DQ 2 In your opinion, what are the 10 most critical issues challenging the human services agencies today? Rank them from 1-10 in order.
BSHS 442 Week 5 DQ 2 Check this A+ tutorial guideline at 442/BSHS-442-Week-5-DQ-2 In your opinion, what are the 10.
Lecture 9: Part of Speech
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
How to read, understand, and analyze real world data
CSE 662 – Languages and Databases Class Overview

CS 430: Information Discovery
LING/C SC/PSYC 438/538 Lecture 2 Sandiway Fong.
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 15
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
LING/C SC/PSYC 438/538 Lecture 20 Sandiway Fong.
ENG 380 Education for Service/tutorialrank.com
LING/C SC 581: Advanced Computational Linguistics
LING 388: Computers and Language
How to read, understand, and analyze real world data
LING 388: Computers and Language
LING 388: Computers and Language
LING/C SC 581: Advanced Computational Linguistics
LING/C SC/PSYC 438/538 Lecture 3 Sandiway Fong.
LING/C SC 581: Advanced Computational Linguistics
Topics in Linguistics ENG 331
LING/C SC 581: Advanced Computational Linguistics
LING 581: Advanced Computational Linguistics
Graphing Linear Equations
LING/C SC 581: Advanced Computational Linguistics
LING/C SC/PSYC 438/538 Lecture 12 Sandiway Fong.
LING/C SC/PSYC 438/538 Lecture 23 Sandiway Fong.
LING/C SC 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
Lecture 7 HMMs – the 3 Problems Forward Algorithm
Lecture 7 HMMs – the 3 Problems Forward Algorithm
LING 408/508: Computational Techniques for Linguists
Change over time: Working with diachronic data
C SC 620 Advanced Topics in Natural Language Processing
English I Lesson Plans Second Nine Weeks Fahrenheit 451 Guidebook Unit
WHS: AMERICAN LITERATURE & COMPOSITION A
LING 388: Computers and Language
LING/C SC/PSYC 438/538 Lecture 13 Sandiway Fong.
Natural Language Processing
LING/C SC 581: Advanced Computational Linguistics
Happy monday! You need: Your READIng comp. book. AND your PINK reading log. Your planner open to this week. A book to read/annotate.
Take control of your learning: … learn wherever, whenever, - even if you haven’t got your textbooks with you! …browse for resources if you didn’t.
LING/C SC 581: Advanced Computational Linguistics
Wrt 205: critical research
L8-3 Obj: Students will use definition of log
Graphing Linear Equations
LING/C SC 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
English 1 - October 20th Agenda Warm-Up: Important Element
LING/C SC 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
Part-of-Speech Tagging Using Hidden Markov Models
Graphing Linear Equations
LING/C SC/PSYC 438/538 Lecture 3 Sandiway Fong.
LING/C SC 581: Advanced Computational Linguistics
Presentation transcript:

LING/C SC 581: Advanced Computational Linguistics Lecture Notes Feb 8th

Adminstrivia The Homework Pipeline: No classes next week: Homework 4 + Homework 5 No classes next week: I'm out of town on business No new homework assigned this week I sorta lied: optional homework 6 (if you choose to do, due Monday 19th midnight: > 1 week)

Today's Topics Homework 5 review Zipf's Distribution Read the review paper: Zipf’s word frequency law in natural language: A critical review and future directions, Piantadosi, S. Psychon Bull Rev. 2014 October ; 21(5): 1112–1130. Optional Homework 6 Using Python and our ptb WSJ corpus, compute and graph frequency against rank Use pyplot In your opinion, does our WSJ corpus obey Zipf's Law?

Zipf's Law Let r be the rank of some unit, e.g. a word, in a corpus Let freq(r) be the frequency associated with the r-th ranked word Zipf: freq(r) ∝ Φ1/r^α Φ=1/10;α≈1 Zipf-Mandelbrot: freq(r)∝1/(r+β)^α α≈1; β≈2.7 There "should be" a power law relation between r and frequency read cited paper for discussion on "should be"

Zipf's Law Typically plotted as log-log http://hyperphysics.phy- astr.gsu.edu/hbase/Math/logplot.html

Zipf's Law: ANC ANC = American National Corpus

Zipf's Law: PTB Brown corpus Frequency of PTB POS tags: NN IN DT JJ NNP NNS RB PRP

Zipf's Law: PTB Brown corpus Most common tokens for each POS tag

Free Relatives Taken from Caponigro, I. (proceedings of WECOL 2002). Some background:

Free Relatives Free vs. Headed relatives:

Free Relatives

Homework 5 Review /^SBAR-NOM/ 775

Homework 5 Review /^SBAR-NOM/ 775 FR what(ever) 542

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 FR where(ver) 21

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 FR where* 21 FR when(ever) 14

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 FR where* 21 FR when* 14 FR who(ever) 26 whose/whom

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 FR where* 21 FR when* 14 FR who* 26 FR how 104

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 FR where* 21 FR when* 14 FR who* 26 FR how 104 FR whether 15

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 FR where* 21 FR when* 14 FR who* 26 FR how 104 FR whether 15 FR which(ever) 13 "three of which"

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 FR where* 21 FR when* 14 FR who* 26 FR how 104 FR whether 15 FR which* 13 FR why 8

Homework 5 Review

Homework 5 Review /^SBAR-NOM/ 775 FR what* 542 ADVP 4 FR where* 21 0 FR when* 14 0 FR who* 26 0 FR how 104 0 FR whether 15 0 FR which* 13 0 FR why 8 0

Homework 5 Review /^SBAR-NOM/ 775 FR what* 553 ADVP 4 QP 3 FR where* 21 0 FR when* 14 0 FR who* 26 0 FR how 104 0 FR whether 15 0 FR which* 13 0 FR why 8 0

Homework 5 Review Inconsistent annotation

Homework 5 Review 782 total due to overlaps, e.g. coordination /^SBAR-NOM/ 775 /^SBAR-NOM/ <<, (/^WHNP/ << /[Ww]hat/) 554 ADVP/QP 7 /^SBAR-NOM/ <<, (/^WH/ << /[Ww]here/) 21 /^SBAR-NOM/ <<, (/^WH/ << /[Ww]hen/) 14 /^SBAR-NOM/ <<, (/^WH/ << /[Ww]ho/) 26 /^SBAR-NOM/ <<, (/^WH/ << /[Hh]ow/) 118 /^SBAR-NOM/ <<, (/^IN/ << /[Wh]ether/) 15 /^SBAR-NOM/ <<, (/^WH/ << /[Ww]hich/) 19 /^SBAR-NOM/ <<, (/^WH/ << /[Ww]hy/) 8 782 total due to overlaps, e.g. coordination

Homework 5 Review Mistake?

Homework 5 Review