The ICSI Summarization System Dan Gillick, Benoit Favre, and Dilek Hakkani-Tür {dgillick, favre, International Computer Science.

Slides:

Advertisements

Similar presentations

Information Retrieval (IR) on the Internet. Contents  Definition of IR  Performance Indicators of IR systems  Basics of an IR system  Some IR Techniques.

Advertisements

Ani Nenkova Lucy Vanderwende Kathleen McKeown SIGIR 2006.

Overview of the TAC2013 Knowledge Base Population Evaluation: Temporal Slot Filling Mihai Surdeanu with a lot help from: Hoa Dang, Joe Ellis, Heng Ji,

Page 1 SRL via Generalized Inference Vasin Punyakanok, Dan Roth, Wen-tau Yih, Dav Zimak, Yuancheng Tu Department of Computer Science University of Illinois.

Fast Algorithms For Hierarchical Range Histogram Constructions

WWW 2014 Seoul, April 8 th SNOW 2014 Data Challenge Two-level message clustering for topic detection in Twitter Georgios Petkos, Symeon Papadopoulos, Yiannis.

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.

1 Multi-topic based Query-oriented Summarization Jie Tang *, Limin Yao #, and Dewei Chen * * Dept. of Computer Science and Technology Tsinghua University.

Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.

UNC-CH at DUC2007: Query Expansion, Lexical Simplification, and Sentence Selection Strategies for Multi-Document Summarization Catherine Blake Julia Kampov.

Information Retrieval in Practice

Text Specificity and Impact on Quality of News Summaries Annie Louis & Ani Nenkova University of Pennsylvania June 24, 2011.

Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.

A Markov Random Field Model for Term Dependencies Donald Metzler and W. Bruce Croft University of Massachusetts, Amherst Center for Intelligent Information.

T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.

Dimensional reduction, PCA

Page 1 Generalized Inference with Multiple Semantic Role Labeling Systems Peter Koomen, Vasin Punyakanok, Dan Roth, (Scott) Wen-tau Yih Department of Computer.

Vector Space Model Any text object can be represented by a term vector Examples: Documents, queries, sentences, …. A query is viewed as a short document.

Overview of Search Engines

Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.

Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.

Keyphrase Extraction in Scientific Documents Thuy Dung Nguyen and Min-Yen Kan School of Computing National University of Singapore Slides available at.

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.

1 The BT Digital Library A case study in intelligent content management Paul Warren

A Compositional Context Sensitive Multi-document Summarizer: Exploring the Factors That Influence Summarization Ani Nenkova, Stanford University Lucy Vanderwende,

Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.

METEOR-Ranking & M-BLEU: Flexible Matching & Parameter Tuning for MT Evaluation Alon Lavie and Abhaya Agarwal Language Technologies Institute Carnegie.

The CoNLL-2013 Shared Task on Grammatical Error Correction Hwee Tou Ng, Yuanbin Wu, and Christian Hadiwinoto 1 Siew.

A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.

Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.

When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.

Processing of large document collections Part 7 (Text summarization: multi- document summarization, knowledge- rich approaches, current topics) Helena.

Review from before Christmas Break. Sampling Distributions Properties of a sampling distribution of means:

LexRank: Graph-based Centrality as Salience in Text Summarization

Improving Suffix Tree Clustering Base cluster ranking s(B) = |B| * f(|P|) |B| is the number of documents in base cluster B |P| is the number of words in.

Effective Query Formulation with Multiple Information Sources

1 Automating Slot Filling Validation to Assist Human Assessment Suzanne Tamang and Heng Ji Computer Science Department and Linguistics Department, Queens.

Generic text summarization using relevance measure and latent semantic analysis Gong Yihong and Xin Liu SIGIR, April 2015 Yubin Lim.

Statistical NLP Spring 2011 Lecture 25: Summarization Dan Klein – UC Berkeley TexPoint fonts used in EMF. Read the TexPoint manual before you delete this.

LexPageRank: Prestige in Multi- Document Text Summarization Gunes Erkan and Dragomir R. Radev Department of EECS, School of Information University of Michigan.

Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science ＆ Information Engineering.

Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.

Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.

1 Web-Page Summarization Using Clickthrough Data* JianTao Sun, Yuchang Lu Dept. of Computer Science TsingHua University Beijing , China Dou Shen,

BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.

1 FollowMyLink Individual APT Presentation Third Talk February 2006.

1 Sentence Extraction-based Presentation Summarization Techniques and Evaluation Metrics Makoto Hirohata, Yousuke Shinnaka, Koji Iwano and Sadaoki Furui.

Pairwise Local Alignment and Database Search Csc 487/687 Computing for Bioinformatics.

August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.

August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.

Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick.

Do Summaries Help? A Task-Based Evaluation of Multi-Document Summarization Kathleen McKeown, Rebecca J. Passonneau David K. Elson, Ani Nenkova, Julia Hirschberg.

Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)

The P YTHY Summarization System: Microsoft Research at DUC 2007 Kristina Toutanova, Chris Brockett, Michael Gamon, Jagadeesh Jagarlamudi, Hisami Suzuki,

Naïve Bayes Classifier April 25 th, Classification Methods (1) Manual classification Used by Yahoo!, Looksmart, about.com, ODP Very accurate when.

哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.

A Survey on Automatic Text Summarization Dipanjan Das André F. T. Martins Tolga Çekiç

Towards an Extractive Summarization System Using Sentence Vectors and Clustering John Cadigan, David Ellison, Ethan Roday.

1 Query Directed Web Page Clustering Daniel Crabtree Peter Andreae, Xiaoying Gao Victoria University of Wellington.

Amy Wagaman Amherst College Mathematics and Statistics.

An Effective Statistical Approach to Blog Post Opinion Retrieval Ben He, Craig Macdonald, Jiyin He, Iadh Ounis (CIKM 2008)

Information Retrieval in Practice

Plan for Today’s Lecture(s)

Simone Paolo Ponzetto University of Heidelberg Massimo Poesio

Chinese Academy of Sciences, Beijing, China

Entity- & Topic-Based Information Ordering

Built by Schools for Schools

Compact Query Term Selection Using Topically Related Text

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

Minwise Hashing and Efficient Search

Presentation transcript:

The ICSI Summarization System Dan Gillick, Benoit Favre, and Dilek Hakkani-Tür {dgillick, favre, International Computer Science Institute Berkeley, CA

Dan Gillick (2)November 18, 2008ICSI at TAC 2008 Who Are We? Graduate student at UC Berkeley Postdoc at ICSI, PhD from Avignon Senior Researcher at ICSI Benoit Favre Dilek Hakkani-Tür Dan Gillick

Dan Gillick (3)November 18, 2008ICSI at TAC 2008 Summarization Assumptions Information is conveyed by discrete, independent concepts. The content value of a summary can be measured by the total value of the unique concepts it contains. Linguistic quality is enforced primarily by units of selection (e.g. sentences).

Dan Gillick (4)November 18, 2008ICSI at TAC 2008 What are Concepts? Christians make up just 3 percent of Iraq's population of about 25 million. (1) Christians make up 3 percent of Iraq’s population (2) The population of Iraq is 25 million (1) Christians make (2) 3 percent (3) Iraq’s population (4) 25 million Original sentence Pyramid concepts Word bigram concepts

Dan Gillick (5)November 18, 2008ICSI at TAC 2008 ILP Formulation Maximize a single linear objective function: i : concept index c i : indicator for concept i in summary w i : weight (value) of concept i Image: chilton- computing.org.uk

Dan Gillick (6)November 18, 2008ICSI at TAC 2008 ILP Formulation Maximize a single linear objective function: Subject to linear constraints: i : concept index j : sentence index c i : indicator for concept i in summary s j : indicator for sentence j in summary w i : weight (value) of concept i l j : length of s j o ij : indicator for c i in s j L : maximum summary length Image: chilton- computing.org.uk

Dan Gillick (7)November 18, 2008ICSI at TAC 2008 Building Systems (1) ICSI-1 –Concepts: word bigrams –Mapping Function: document frequency only include sentences with some query overlap prune concepts appearing in fewer than 3 documents –Units of Selection: sentences ICSI-2 –Units of Selection: compressed sentence candidates

Dan Gillick (8)November 18, 2008ICSI at TAC 2008 Building Systems (2) MRO (Maximum ROUGE Oracle) –Concepts: word bigrams –Mapping Function: document frequency in human “gold” summaries –Units of Selection: sentences

Dan Gillick (9)November 18, 2008ICSI at TAC 2008 Pre/post - processing Sentence segmentation, tokenization, stop- words, Porter stemming – NLTK Simple rules for removing newswire headers and formatting markup ICSI-1, MRO: ordering first by source date, then by sentence number ICSI-2: dendrogram ordering (not clear this is better)

Dan Gillick (10)November 18, 2008ICSI at TAC 2008 Only the Most Related Work Assigning value to words based on frequency (Nenkova and Vanderwende, 2005) Global optimization with learned word values using a beam search (Yih, et al., 2007) Set cover formalism for summarization (Filatova and Hatzivassiloglou, 2004) ILP for summarization (McDonald, 2007) Approximate ROUGE-1 oracle results (Conroy et al., 2006)

Dan Gillick (11)November 18, 2008ICSI at TAC 2008 TAC Results (1) Excellent performance on non-update problems, t-test shows no significant difference between ICSI-1 and the best system in every category No specific update task processing

Dan Gillick (12)November 18, 2008ICSI at TAC 2008 TAC Results (2) Overall best ROUGE scores Relatively poor linguistic quality

Dan Gillick (13)November 18, 2008ICSI at TAC 2008 Linguistic Quality Analysis Among summaries receiving linguistic quality scores of 1 or 2, we counted how many contained each type of error: ICSI-1 could be drastically improved by better sentence segmentation and rules for removing a few words. ICSI-2 is too aggressive with sentence compression. Co-reference resolution is a major problem.

Dan Gillick (14)November 18, 2008ICSI at TAC 2008 An Oracle Experiment (1) Data: DUC 2007 update task set A (10 topics) Note: “Content responsiveness” evaluation does not include linguistic quality as in TAC Systems Evaluated: B1: Returns all leading sentences up to the length limit from the most recent document B2: NIST’s “high performance generic summarizer” (Conroy, et al., 2004) ICSI-1: Our submitted system MRO: The oracle system H: Each of 4 human summaries written by NIST’s IR experts.

Dan Gillick (15)November 18, 2008ICSI at TAC 2008 An Oracle Experiment (2) MRO gets better content scores than ICSI-1, but worse than humans All differences significant at 95% confidence interval, using Tukey’s “Honestly Significant Differences” Suggests there is room for improvement in sentence extraction

Dan Gillick (16)November 18, 2008ICSI at TAC 2008 Not Randomly Selected Example Summaries for D0808-A: “Describe the events related to Christian minorities in Iraq and their current status.” MRO: Iraq's Christians, increasingly targeted by insurgents, are fleeing Baghdad for the safety of the Kurdish north or neighboring Syria and Jordan. But the exodus is temporary, insist many, because they are not selling their homes and property. Christians make up just 3 percent of Iraq's population of about 25 million. Officials estimate that as many as 15,000 of Iraq's nearly one million Christians have left the country since August, when four churches in Baghdad and one in Mosul were attacked in a coordinated series of car bombings. Insurgents abducted Syrian Catholic Archbishop Basile Casmoussa apparently to frighten Iraqi Christians. ICSI-1: In an interview, Yonadem Kana, the leader of the Assyrian Democratic Movement in Iraq and a member of the Iraqi National Council, said the fighters have been deployed in Baghdida near the northern city of Mosul. Christians make up just 3 percent of Iraq's population of about 25 million. Officials estimate that as many as 15,000 of Iraq's nearly one million Christians have left the country since August, when four churches in Baghdad and one in Mosul were attacked in a coordinated series of car bombings. Most of Christians in Iraq are in Baghdad and northern cities. ICSI-2: Officials estimate that as many as of Iraq's nearly one million Christians have left the country since August, when four churches in Baghdad and one in Mosul were attacked in a coordinated series of car bombings. Most of Christians in Iraq are in Baghdad and northern cities. Christians make up just 3 percent of Iraq's population of about 25 million. Armed men kidnapped a Catholic archbishop in Iraq's main northern city of Mosul Monday. In an interview, Yonadem Kana, the leader of the Assyrian Democratic Movement in Iraq and a member, said the fighters have been deployed in Baghdida. Responsiveness: ? Linguistic Quality: ? Pyramid: ? ROUGE-2: Responsiveness: 3 Linguistic Quality: 3 Pyramid: ROUGE-2: Responsiveness: 4 Linguistic Quality: 4 Pyramid: ROUGE-2: 0.119

Dan Gillick (17)November 18, 2008ICSI at TAC 2008 Conclusion ICSI system is simple, fast, and performs well. Linguistic quality needs work but a set of rules for cleaning sentences will help a lot. Oracle system suggests: –room for improvement in sentence selection –more is likely needed to match human performance Source code available soon (