LING 581: Advanced Computational Linguistics Lecture Notes February 2nd
tregex Assuming corpus wsj tregex.mrg and Java runtime memory setting –mx1000m
tregex
TREEBANK_3/docs/prsguid1.pdf
Homework Task – Systematically tregex search patterns for selected constructions from the Bracketing Guidelines, e.g. Gapping – Report on how many constructions are found in the Wall Street Journal text – Present your results next time in class
Example: looking for passives Pattern: using variable names and regex group numbering for coindexation matching for passives (NP-SBJ-i and object of VP [NP [ –NONE- [ -*-I ]]])