1 I256: Applied Natural Language Processing Marti Hearst October 18, 2006.

Slides:



Advertisements
Similar presentations
Using Wikispaces for Collaborative Knowledge Construction in a Masters-Level Course Cheryl A. Stevens, Professor Recreation & Leisure Studies College of.
Advertisements

Request Management Mirror-. A random three day sample of Incidents revealed that about 86% of the registered Incidents were legitimate Requests Many other.
Easy, like an attachment. But can your doc stand on its own? Yes. Only teachers can upload files to course site. So definitely a push- tool. Maybe.
Curriculum 2.0 Reading / Language Arts By the end of third grade, students: Create and follow rules for collaborative conversations. Acquire and use.
Zhang Hongyi CSCI2100B Data Structures Tutorial 2
Monitoring Security With Standard SAP Tools Session Code 805 Sandi McKinney.
EGR 106 – Truss Design Project (cont.) Truss design programs Graphical interface tools in Matlab Saving and loading data Formatted output Project Assignment.
I256 Applied Natural Language Processing Fall 2009 Lecture 12 Projects Barbara Rosario.
1 Course Schedule Spring 2007 Exam 1 October 2007 Project Introduction(5pages)Oct – Nov Exam 2 November 2007 Project Testing Nov 26 - Dec4 Last Homework.
CIS101 Introduction to Computing Week 11. Agenda Your questions Copy and Paste Assignment Practice Test JavaScript: Functions and Selection Lesson 06,
Blogging in the Classroom Blogging Assignment and Expectations MSTI 131 Introduction to Educational Technology Fall 2010 Prof. Nichole Heinsler What is.
Principles of Procedural Programming
Collecting and Compiling Data G/T Research Program Collecting and Compiling Data G/T Research Program.
Introduction to Collecting and Compiling Data G/T Research Program Introduction to Collecting and Compiling Data G/T Research Program.
Web Information Retrieval Projects Ida Mele. Rules Students can work in teams (max 3 people) The project must be delivered by the deadline that will be.
Project Lifecycle Section 6 - Closeout. Project Manager’s Role During Project Close-Out  Ensure that all project deliverables have been completed and.
Build Your Own Website Introduction Introduction Phil Stone Phil Stone ICT tutor for Dudley Adult Community Learning ICT tutor for Dudley Adult Community.
The Information Component: Help Desk Performance Measures
Resources available in the conference hall: Remote access to the handout materials from mobile devices via Wi-Fi in the conference hall Copying.
Hook, Housekeeping & Homework Monday-Thursday Have out your choice book and begin quietly reading! If you present today, you may spend the reading minutes.
Introduction to the Nevada SharePoint Site
Computer Technology 7 Group/Individual Work on:. Instructions for Research  Work by groups or individually on your assigned topic.  Do not put in too.
Drupal Features by Lois Delcambre with much assistance from Payal Agrawal and from Yinlin and Potluri and the ensemble team.
Glogster EETT Training Mathew Swerdloff November 30, 2010.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Fundamental Programming: Fundamental Programming K.Chinnasarn, Ph.D.
Online Journalling for online mentoring and coaching.
Edmodo Training A Guide to Getting Started. 2 Free social learning network for teachers, students, schools and districts Safe and easy way to connect.
How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.
Building Collaborative Learning Communities a division of Computer Strategies, LLC Your professional development solution.
Community Board Orientation 2- Community Board Orientation 7-1.
Integrating Web-based Blackboard™ into ESL Discussion and Writing Activities Presenter: Professor Lyra Riabov Southern New Hampshire University Presentation.
Microsoft Access is a database program to manage sort retrieve group filter for certain records.
Quick Response Codes Betsy Sanford Palmer Middle School.
Moodle for Your Noodle. What in the world is Moodle? An open source Content Management System (or Course Management System) built around a sound educational.
Exploring in the Weblog Space by Detecting Informative and Affective Articles Xiaochuan Ni, Gui-Rong Xue, Xiao Ling, Yong Yu Shanghai Jiao-Tong University.
Welcome to Edmodo Professional Development Workshop American School of Guatemala Tuesday, October 23, 2012 Thank you to Mrs. Vanita Vance – College Station.
21CLD Learning Activity Rubrics Collaboration 2 nd Workshop on Telecollaboration and Project-based Learning to Reorient Teacher Education towards EFA.
Evaluation Assistant Research Projects EAs are required to lead an evaluation research project for the academic year.
Agenda Debrief on past module development Tools for online content development Module development template Timeline Suggested guidelines for developing.
Biology Front End. Abushnag’s Tickets Establish Communication document the team work and how the site works Job page Adding the new features to the add.
The Development Process Compilation. Compilation - Dr. Craig A. Struble 2 Programming Process Problem Solving Phase We will spend significant time on.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
I really loved the idea of having a class blog. A place where students can get online accessibility to course information and updates and a place where.
College Career Ready Conference Participants will  Review components of the Grade 3 and the Grades 4 and 5 Condensed Scoring Rubric for Prose Constructed.
Text2PTO: Modernizing Patent Application Filing A Proposal for Submitting Text Applications to the USPTO.
Medicine, Nursing and Health Sciences Learning Management Systems: Best Practice Presented by Geoffrey Hooke Senior Instructional Designer Faculty of Medicine,
Running Records Feedback… What is a running record?
Easy, like an attachment. But can your doc stand on its own? Yes. Only teachers can upload files to course site. So definitely a push- tool. Maybe.
Theresa Gabor, CCCOE Web 2.0 What You Need to Know.
Making the Most of Search in SharePoint 2013 Christina Wheeler.
Proposal for Term Project
AIT Design Project- App Creation
Green Entrepreneurship Programme: Development of the Incubation Phase Ramon Farreny, Fern Marie Bowles, Anna Ibáñez, Nicola Cerantola.
Coding Defensively Coding Defensively
Discussion Section 3 HW1 comments HW2 questions
MTM Tools key to walking
Unit 4:L02 Creating an Interactive Media Product
Software Programming J. Holvikivi 2014.
New Teacher Orientation OnCourse Training
Working with Google Maps
Audio Visual Promo LO2 Learning Outcome: To plan and create concept documentation for your projects.
IOF Event Overview Tool – status Feb 2018
Presentation and project
AP Research The second course in College Board’s Capstone Program
Google Apps and a little Bit of eCLASS
Genome 540: Discussion Section Week 3
SeeSoft A Visualization Tool..
Presentation transcript:

1 I256: Applied Natural Language Processing Marti Hearst October 18, 2006

2 Community-based Summarizer Results on training data with cross-validation?

3 Community-based Summarizer Results on test data:

4 Problems with Community Code Not reading the instructions: Hardcoding directory paths Hardcoding filenames of testing files Here is an easy way to do it generally: import os files = os.listdir(“dirname”) So the code should take two parameters: –Directory name containing the documents –Filename in which to write the output

5 Problems with Community Code Not reading the instructions: Hardcoding directory paths within the code Hardcoding filenames of testing files Here is an easy way to do it generally: import os files = os.listdir(“dirname”) So the code should take two parameters: –Directory name containing the documents –Filename in which to write the output

6 Problems with Community Code What I did wrong: Had said in class that the files should be self- contained but didn’t put that into the assignment description. Should have said explicitly that you should take as input a directory name and an output filename. Should have made an easy way to indicate if external files were needed, and what they were. Should have added another task: analyze the individual features contribution.

7 Final Projects I’d like proposals in two weeks (Nov 1) Gives me a week to give you feedback We’ll spend about 5 weeks on the projects I want to give you one or two more homeworks Class presentations the week of Dec 5, but projects due the following week You can work in teams of 2 (maybe 3, depends on the project)

8 Final Project Ideas Blog analysis Categorize blog topics (maybe including link analysis) Segment blogs into pieces based on topics Do blog author analysis Summarize blog reaction to some event, e.g., what did people think of “An Inconvenient Truth” There is a contest on this: Do analysis as input for an interesting viz:

9 Final Project Ideas Analyze the accuracy of best-paper awards* Often given out for conferences How prescient are these awards?

10 Final Project Ideas Create a Negativity/Emotion/Flame Recognizer There is some related work, but this is somewhat under-explored

11 Final Project Ideas Improve an Automatic Faceted Hierarchy Creation Tool* Students used this two years ago for making a hierarchy for photo text Sample output on two collections: – bin/flamenco.cgi/recipes-automated/Flamencohttp://orange.sims.berkeley.edu/cgi- bin/flamenco.cgi/recipes-automated/Flamenco – bin/flamenco.cgi/recipes-automated/Flamencohttp://orange.sims.berkeley.edu/cgi- bin/flamenco.cgi/recipes-automated/Flamenco

12 Final Project Ideas Analyze profiles for online dating* Use characteristics from social psychology to score them Use other metrics as well.

13 Final Project Ideas Work on a timeline comparison project One idea: use output of the new Google news archive Create input for a visualizer built by students last semester: