LING/C SC 581: Advanced Computational Linguistics

Slides:



Advertisements
Similar presentations
LING 388: Language and Computers
Advertisements

LING 581: Advanced Computational Linguistics Lecture Notes February 2nd.
Learning Accurate, Compact, and Interpretable Tree Annotation Recent Advances in Parsing Technology WS 2011/2012 Saarland University in Saarbrücken Miloš.
Learning and Inference for Hierarchically Split PCFGs Slav Petrov and Dan Klein.
LING 581: Advanced Computational Linguistics Lecture Notes February 9th.
LING 581: Advanced Computational Linguistics Lecture Notes January 19th.
LING 581: Advanced Computational Linguistics Lecture Notes February 16th.
LING 581: Advanced Computational Linguistics Lecture Notes January 26th.
LING 581: Advanced Computational Linguistics Lecture Notes January 19th.
IERG4180 Tutorial 4 Jim.
Geant4 Documentation and User Support Geant4 Users Workshop February 2002 Dennis Wright (SLAC)
CS0007: Introduction to Computer Programming Setting Up Java.
Marlene Galea.  The JDK (Java Development Kit)  An IDE (Integrated Development Environment) ◦ Different IDEs include:  JCreator  NetBeans  BlueJ.
Reproducible Environment for Scientific Applications (Lab session) Tak-Lon (Stephen) Wu.
Installing Tomcat on Windows  You may find the Tomcat install shield has some problems recognizing JSDK 1.4 beta installations.  You.
LING/C SC/PSYC 438/538 Lecture 27 Sandiway Fong. Administrivia 2 nd Reminder – 538 Presentations – Send me your choices if you haven’t already.
SharePoint 2010 Development Environment A Guide to Setup SharePoint 2010 Development Environment on Windows 7 Machine.
LING 581: Advanced Computational Linguistics Lecture Notes February 12th.
LING 388: Language and Computers Sandiway Fong Lecture 3.
Chapter 1© copyright Janson Industries Java ▮ Java programming concepts ▮ Downloading the JDK ▮ Java coding and executing example This presentation.
Java and C# [this is a bonus – it is not a required lesson] ACO101: Introduction to Computer Science.
( ) 1 ISYE 7210—Fall 2005 Design of Real-Time Interactive Simulations (in Java) Initial Notes Christine M. Mitchell Director & Professor Center for.
Java Programming Presented by Daniel Rosenthal Friday, November 30 th, 2007.
Copyright © 2015 Curt Hill Environment Establishing one for Minecraft Modding.
Agenda Link of the week Use of Virtual Machine Review week one lab assignment This week’s expected outcomes Review next lab assignments Break Out Problems.
PARSING David Kauchak CS159 – Spring 2011 some slides adapted from Ray Mooney.
LING 581: Advanced Computational Linguistics Lecture Notes February 19th.
Setting Up Eclipse. What is Eclipse? Eclipse is a free, downloadable software that allows us to create, compile, and run JAVA programs.
Welcome to java installation. After download java software, assuming you downloaded jdk1.7.0_11 Follow the procedure bellow to install java.
LING 388: Language and Computers Sandiway Fong Lecture 12.
Installing JDK Vijayan Sugumaran Department of DIS Oakland University.
How to Install the JDK (Java Development Kit) Click hereClick here to download the JDK.
LING/C SC 581: Advanced Computational Linguistics Lecture Notes Feb 5 th.
NLP. Introduction to NLP Time flies like an arrow –Many parses –Some (clearly) more likely than others –Need for a probabilistic ranking method.
A Brief Overview of Unix Brandon Bohrer. Topics What is Unix? – Quick introduction Documentation – Where to get it, how to use it Text Editors – Know.
PARSING David Kauchak CS159 – Fall Admin Assignment 3 Quiz #1  High: 36  Average: 33 (92%)  Median: 33.5 (93%)
LING/C SC 581: Advanced Computational Linguistics Lecture Notes Feb 3 rd.
LING/C SC 581: Advanced Computational Linguistics Lecture Notes Feb 17 th.
LING 581: Advanced Computational Linguistics Lecture Notes February 24th.
LING 581: Advanced Computational Linguistics Lecture Notes March 2nd.
Introduction to Algorithm. What is Algorithm? an algorithm is any well-defined computational procedure that takes some value, or set of values, as input.
CSCE 742 Software Architectures
Web Interface for Formatter
1. Environment Setting Minhaeng Lee.
The Linux Operating System
Installing OpenRefine
LING 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
LING/C SC/PSYC 438/538 Lecture 3 Sandiway Fong.
CSE 390a Lecture 1 introduction to Linux/Unix environment
LING/C SC 581: Advanced Computational Linguistics
LING 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
LING 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
Constraining Chart Parsing with Partial Tree Bracketing
Devops Jenkins as CI/CD tool Created By: Amrit Choudhary
CSE 390a Lecture 1 introduction to Linux/Unix environment
LING/C SC 581: Advanced Computational Linguistics
Run Java file with Window cmd
LING/C SC 581: Advanced Computational Linguistics
Yung-Hsiang Lu Purdue University
LING/C SC 581: Advanced Computational Linguistics
introduction to Linux/Unix environment
LING/C SC 581: Advanced Computational Linguistics
David Kauchak CS159 – Spring 2019
LING/C SC 581: Advanced Computational Linguistics
LING/C SC 581: Advanced Computational Linguistics
LING/C SC/PSYC 438/538 Lecture 3 Sandiway Fong.
LING/C SC 581: Advanced Computational Linguistics
Presentation transcript:

LING/C SC 581: Advanced Computational Linguistics Lecture 23 April 9th

Today's Topics continuing from last time…

Bikel Collins Parser Java re-implementation of Collins’ parser (originally in C) easy to train (computationally inexpensive) Paper Daniel M. Bikel. 2004. Intricacies of Collins’ Parsing Model. (PS) (PDF) 
in Computational Linguistics, 30(4), pp. 479-511. Software http://www.cis.upenn.edu/~dbikel/software.html#stat-parser (page no longer exists)

Bikel Collins Download and install Dan Bikel’s parser dbp.zip (on course homepage)

Bikel Collins Training the parser with the WSJ PTB See guide userguide/guide.pdf directory: TREEBANK_3/parsed/mrg/wsj chapters 02-21: create one single .mrg file events: wsj-02-21.obj.gz

Bikel Collins Settings:

Bikel Collins Parsing Command Input file format (sentences)

Java Runtime (JRE) Notes: JDK: Java Development Kit (superset of..) java -version java version "1.8.0_191" Java(TM) SE Runtime Environment (build 1.8.0_191-b12) Java HotSpot(TM) 64-Bit Server VM (build 25.191-b12, mixed mode) Notes: JDK: Java Development Kit (superset of..) JRE: Java Runtime Environment

Bikel Collins Verify the trainer and parser work on your machine: must have Java installed Let's test it: cd dbp dbp$ ls LICENSE dbparser.jar scorer telescope.lisp README doc settings userguide bin policy-files src dbp$ more telescope.lisp  (I saw a man with a telescope) (I saw a man with a sword) bin/parse 500 settings/collins.properties ../wsj-02-21.obj.gz telescope.lisp  Executing command \tjava -server -Xms500m -Xmx500m -cp /Users/sandiway/courses/581/ling581- 19/dbp/dbparser.jar -Dparser.settingsFile=settings/collins.properties danbikel.parser.Parser - is ../wsj-02-21.obj.gz -sa telescope.lisp

Bikel Collins processing sentence No. 1: (I saw a man with a telescope) danbikel.parser.Decoder: current sentence length: 7 words danbikel.parser.Decoder: cummulative average length: 7.0 words danbikel.parser.Decoder: trying with prune factor of 4.0 danbikel.parser.Decoder: highest probability item for sentence-length span (0,6): -35.89487016064518 (S (NP-A (NPB (PRP I))) (VP (VBD saw) (NP-A (NPB (DT a) (NN man)) (PP (IN with) (NP-A (NPB (DT a) (NN telescope))))))) danbikel.parser.Decoder: top-ranked +TOP+ item: (+TOP+ (S (NP-A (NPB (PRP I))) (VP (VBD saw) (NP-A (NPB (DT a) (NN man)) (PP (IN with) (NP-A (NPB (DT a) (NN telescope))))))))

Bikel Collins processing sentence No. 2: (I saw a man with a sword) danbikel.parser.Decoder: current sentence length: 7 words danbikel.parser.Decoder: cummulative average length: 7.0 words danbikel.parser.Decoder: trying with prune factor of 4.0 danbikel.parser.Decoder: highest probability item for sentence-length span (0,6): -35.625191959838 (S (NP-A (NPB (PRP I))) (VP (VBD saw) (NP-A (NPB (DT a) (NN man)) (PP (IN with) (NP-A (NPB (DT a) (NN sword))))))) danbikel.parser.Decoder: top-ranked +TOP+ item: (+TOP+ (S (NP-A (NPB (PRP I))) (VP (VBD saw) (NP-A (NPB (DT a) (NN man)) (PP (IN with) (NP-A (NPB (DT a) (NN sword))))))))

Bikel Collins File: bin/parse is a shell script that sets up program parameters and calls java

Bikel Collins

Bikel Collins File: bin/train is another shell script

Bikel Collins Relevant WSJ PTB files