articleJun 19, 2011Closed access

A Class of Submodular Functions for Document Summarization

University of Washington

Abstract

We design a class of submodular functions meant for document summarization tasks. These functions each combine two terms, one which encourages the summary to be representative of the corpus, and the other which positively rewards diversity. Critically, our functions are monotone nondecreasing and submodular, which means that an efficient scalable greedy optimization scheme has a constant factor guarantee of optimality. When evaluated on DUC 2004-2007 corpora, we obtain better than existing state-of-art results in both generic and query-focused document summarization. Lastly, we show that several well-established methods for document summarization correspond, in fact, to submodular function optimization, adding…

Citation impact

637
total citations
FWCI
38.69
Percentile
100%
References
37
Citations per year

Authors

2

Topics & keywords

Keywords
  • Submodular set function
  • Automatic summarization
  • Computer science
  • Class (philosophy)
  • Scalability
  • Monotone polygon
  • Function (biology)
  • Greedy algorithm
No related works found for this paper.