Headline Generation Based on Statistical Translation Michele Banko Computer Science Department Johns Hopkins University Vibhu O.Mittal Just Research 報告人.

Slides:



Advertisements
Similar presentations
ThemeInformation Extraction for World Wide Web PaperUnsupervised Learning of Soft Patterns for Generating Definitions from Online News Author Cui, H.,
Advertisements

1 CS 388: Natural Language Processing: N-Gram Language Models Raymond J. Mooney University of Texas at Austin.
Ani Nenkova Lucy Vanderwende Kathleen McKeown SIGIR 2006.
Text Categorization Moshe Koppel Lecture 1: Introduction Slides based on Manning, Raghavan and Schutze and odds and ends from here and there.
Israel and Palestine Today Unresolved Hatred of the Arab-Israeli Conflict.
1 I256: Applied Natural Language Processing Marti Hearst Sept 13, 2006.
A Maximum Coherence Model for Dictionary-based Cross-language Information Retrieval Yi Liu, Rong Jin, Joyce Y. Chai Dept. of Computer Science and Engineering.
REDUCED N-GRAM MODELS FOR IRISH, CHINESE AND ENGLISH CORPORA Nguyen Anh Huy, Le Trong Ngoc and Le Quan Ha Hochiminh City University of Industry Ministry.
A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts 04 10, 2014 Hyun Geun Soo Bo Pang and Lillian Lee (2004)
Learning Within-Sentence Semantic Coherence Elena Eneva Rose Hoberman Lucian Lita Carnegie Mellon University.
Language Model based Information Retrieval: University of Saarland 1 A Hidden Markov Model Information Retrieval System Mahboob Alam Khalid.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Business and Management Research WELCOME. Lecture 9.
Inducing Information Extraction Systems for New Languages via Cross-Language Projection Ellen Riloff University of Utah Charles Schafer, David Yarowksy.
Approaches to automatic summarization Lecture 5. Types of summaries Extracts – Sentences from the original document are displayed together to form a summary.
Basic Scientific Writing in English Lecture 3 Professor Ralph Kirby Faculty of Life Sciences Extension 7323 Room B322.
1 Lecture 8 Measures of association: chi square test, mutual information, binomial distribution and log likelihood ratio.
Introduction to Language Models Evaluation in information retrieval Lecture 4.
Carnegie Mellon © Copyright 2000 Michael G. Christel and Alexander G. Hauptmann 1 Informedia 03/12/97.
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Natural Language Processing Expectation Maximization.
An Automatic Segmentation Method Combined with Length Descending and String Frequency Statistics for Chinese Shaohua Jiang, Yanzhong Dang Institute of.
Query Rewriting Using Monolingual Statistical Machine Translation Stefan Riezler Yi Liu Google 2010 Association for Computational Linguistics.
13 Miscellaneous.  Computer languages ranking  ll&lang=all&lang2=sbcl
Homing in on the Text- Initial Cluster Mike Scott School of English University of Liverpool Aston Corpus Symposium Friday May 4th 2007 This presentation.
Conclusion Abstract Article analysis.  Talk about conclusion part of your research,  See what is “abstract”, how to write one.  Analyze a paper.
1 A study on automatically extracted keywords in text categorization Authors:Anette Hulth and Be´ata B. Megyesi From:ACL 2006 Reporter: 陳永祥 Date:2007/10/16.
6. N-GRAMs 부산대학교 인공지능연구실 최성자. 2 Word prediction “I’d like to make a collect …” Call, telephone, or person-to-person -Spelling error detection -Augmentative.
A Compositional Context Sensitive Multi-document Summarizer: Exploring the Factors That Influence Summarization Ani Nenkova, Stanford University Lucy Vanderwende,
Key players and organizations in Israel and Palestine Directions: Your group will be responsible for doing a little research on your assigned topic, preparing.
Learning Phonetic Similarity for Matching Named Entity Translation and Mining New Translations Wai Lam, Ruizhang Huang, Pik-Shan Cheung ACM SIGIR 2004.
1 Text Summarization: News and Beyond Kathleen McKeown Department of Computer Science Columbia University.
Chapter 6: Statistical Inference: n-gram Models over Sparse Data
Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.
Topic Modelling: Beyond Bag of Words By Hanna M. Wallach ICML 2006 Presented by Eric Wang, April 25 th 2008.
Synopsis Writing.
COMPARISON OF A BIGRAM PLSA AND A NOVEL CONTEXT-BASED PLSA LANGUAGE MODEL FOR SPEECH RECOGNITION Md. Akmal Haidar and Douglas O’Shaughnessy INRS-EMT,
NLP. Introduction to NLP Extrinsic –Use in an application Intrinsic –Cheaper Correlate the two for validation purposes.
Chapter6. Statistical Inference : n-gram Model over Sparse Data 이 동 훈 Foundations of Statistic Natural Language Processing.
Experiments of Opinion Analysis On MPQA and NTCIR-6 Yaoyong Li, Kalina Bontcheva, Hamish Cunningham Department of Computer Science University of Sheffield.
CS 6998 NLP for the Web Columbia University 04/22/2010 Analyzing Wikipedia and Gold-Standard Corpora for NER Training William Y. Wang Computer Science.
1 Sentence Extraction-based Presentation Summarization Techniques and Evaluation Metrics Makoto Hirohata, Yousuke Shinnaka, Koji Iwano and Sadaoki Furui.
Summarisation Work at Sheffield Robert Gaizauskas Natural Language Processing Group Department of Computer Science University of Sheffield.
Processing of large document collections Part 5 (Text summarization) Helena Ahonen-Myka Spring 2005.
1 Modeling Long Distance Dependence in Language: Topic Mixtures Versus Dynamic Cache Models Rukmini.M Iyer, Mari Ostendorf.
Methods for Automatic Evaluation of Sentence Extract Summaries * G.Ravindra +, N.Balakrishnan +, K.R.Ramakrishnan * Supercomputer Education & Research.
Yuya Akita , Tatsuya Kawahara
Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Estimating N-gram Probabilities Language Modeling.
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Retrieval of Relevant Opinion Sentences for New Products
FILTERED RANKING FOR BOOTSTRAPPING IN EVENT EXTRACTION Shasha Liao Ralph York University.
Query Suggestions in the Absence of Query Logs Sumit Bhatia, Debapriyo Majumdar,Prasenjit Mitra SIGIR’11, July 24–28, 2011, Beijing, China.
A Statistical Approach to Machine Translation ( Brown et al CL ) POSTECH, NLP lab 김 지 협.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
An evolutionary approach for improving the quality of automatic summaries Constantin Orasan Research Group in Computational Linguistics School of Humanities,
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
The P YTHY Summarization System: Microsoft Research at DUC 2007 Kristina Toutanova, Chris Brockett, Michael Gamon, Jagadeesh Jagarlamudi, Hisami Suzuki,
DISSECTING A RESEARCH PAPER. WHAT IS A PERIODICAL? Period: amount of time –Magazines (every week or month) –Newspapers (every day) –Journals (every month.
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Correcting Misuse of Verb Forms John Lee, Stephanie Seneff Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge ACL 2008.
Maximum Entropy techniques for exploiting syntactic, semantic and collocational dependencies in Language Modeling Sanjeev Khudanpur, Jun Wu Center for.
N-Gram Model Formulas Word sequences Chain rule of probability Bigram approximation N-gram approximation.
Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
STUDENTS’ FAVOURITE ERRORS
Tuesday, May 21st HW: Aim: What efforts have been made to achieve peace between the Arabs and Israelis? Do Now: Why is the historic region of Palestine.
Presentation transcript:

Headline Generation Based on Statistical Translation Michele Banko Computer Science Department Johns Hopkins University Vibhu O.Mittal Just Research 報告人 : 翁鴻加 ACL2000 Michael J. Witbrock Lycos Inc.

Abstract  Extractive approach can’t generate document summaries shorter than one sentence  Non-extractive approach : statistical models of term selection  Actual headline often ungrammatical and incomplete phrase

Introduction  Generating effective summaries requires the ability to select, evaluate, order and aggregate item of information according to subject  Previous work has focused on extractive summarization drawback: 1.inability to generate coherent summaries shorter than considering context-span 2.most important information scattered across multiple sentences 3.tend to select long sentence

The System  Content Selection  Generate summary 1.Length of summaries:fixed length based on document genre 2.Coherently ordered summary from content selected

The System  Assumption : likelihood of a word in summary is independent of other words in the summary =>initial modeling choice

The System  Bigram instead of n-gram  Model : zero-level, Cross-validation is used to learn weight

Experiments 3 Clinton netanyahu arafat 4 Clinton to mideast peace 5 Clinton to meet netanyahu arafat 6 Clinton to meet Netanyahu Arafat Israel U.S. Pushes for Mideast Peace President Clinton met with his top Mideast advisers, including Secretary of State Madeleine Albright and U.S. peace envoy Dennis Ross, in preparation for a session with Israel Prime Minister Benjamin Netanyahu tomorrow. Palestinian leader Yasser Arafat is to meet with Clinton later this week. Published reports in Israel say Netanyahu will warn Clinton that Israel can’t withdraw from more than nine percent of the West Bank in its next scheduled pullback, although Clinton wants a percent pullback.

Experiment  Corpus : news articles from Reuters between 1/1/1997 ~ 1/6/1997  Strip punctuation except apostrophes  unique tokens in the article  tokens in the headline  All pairwise conditional probability added complexity : limited vocabulary

Experiments  Lack of sufficient training data  Lexical model 1000 unseen documents Gen.Headline length Word Overlap Percentage of complete matches % % % % %

Multiple Selection Models : POS and Position  Part of speech information : learn which word-senses are more likely to be part of headline and coherently order  Position information : estimating the probability of a token appea- ring in the headline given that it appeared in the 1 st, 2 st, 3 st, 4 st quartile of the body of the article

LLex+position+POS+position +POS Experiments Overlap with headline

Some “equally good” generated headlines count as error Original term Generated term Nations Top JudgeRehnquist KaczynskiUnabomber Suspect ERTop-Rated Hospital Drama Wall Street Stocks DeclineDow Jones index lower 49ers Roll Over Vikings ers to nfc title game Corn, Wheat prices FallSoybean grain prices lower

Conclusion and Future Work  This paper has presented an approach to make it possible to generate coherent summaries shorter than a single sentence  With slight generalization of the system, the summaries need not contain any of the words in original document  Given good corpora, this approach used in Japanese documents and English headline