
SUMMAC: a text summarization evaluation

Published online by Cambridge University Press:  17 June 2002

INDERJEET MANI
Affiliation:
The MITRE Corporation, 11493 Sunset Hills Rd., Reston, VA 22090, USA
GARY KLEIN
Affiliation:
The MITRE Corporation, 11493 Sunset Hills Rd., Reston, VA 22090, USA
DAVID HOUSE
Affiliation:
The MITRE Corporation, 11493 Sunset Hills Rd., Reston, VA 22090, USA
LYNETTE HIRSCHMAN
Affiliation:
The MITRE Corporation, 11493 Sunset Hills Rd., Reston, VA 22090, USA
THERESE FIRMIN
Affiliation:
Department of Defense, 9800 Savage Rd., Ft. Meade, MD 20755, USA
BETH SUNDHEIM
Affiliation:
SPAWAR Systems Center, Code D44208, 53140 Gatchell Rd., San Diego, CA 92152, USA

Abstract

The TIPSTER Text Summarization Evaluation (SUMMAC) has developed several new extrinsic and intrinsic methods for evaluating summaries. It has established definitively that automatic text summarization is very effective in relevance assessment tasks on news articles. Summaries as short as 17% of full text length sped up decision-making by almost a factor of 2 with no statistically significant degradation in accuracy. Analysis of feedback forms filled in after each decision indicated that the intelligibility of present-day machine-generated summaries is high. Systems that performed most accurately in the production of indicative and informative topic-related summaries used term frequency and co-occurrence statistics, and vocabulary overlap comparisons between text passages. However, in the absence of a topic, these statistical methods do not appear to provide any additional leverage: in the case of generic summaries, the systems were indistinguishable in accuracy. The paper discusses some of the tradeoffs and challenges faced by the evaluation, and also lists some of the lessons learned, impacts, and possible future directions. The evaluation methods used in the SUMMAC evaluation are of interest both for summarization evaluation and for the evaluation of other ‘output-related’ NLP technologies, where there may be many potentially acceptable outputs, with no automatic way to compare them.
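The statistical techniques the abstract credits to the best-performing systems — term frequency statistics and vocabulary overlap with a topic — can be illustrated with a minimal extractive-summarization sketch. The function names, the naive sentence splitter, and the particular scoring formula below are illustrative assumptions, not a reconstruction of any SUMMAC participant's actual system:

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase word tokens (naive, for illustration only)."""
    return re.findall(r"[a-z]+", text.lower())

def summarize(text, topic, ratio=0.17):
    """Extract the highest-scoring sentences, keeping roughly `ratio`
    of the document's sentences (17% echoes the compression level
    reported in the abstract)."""
    # Naive sentence split on sentence-ending punctuation.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    # Document-wide term frequencies.
    tf = Counter(tokenize(text))
    topic_vocab = set(tokenize(topic))

    def score(sent):
        words = tokenize(sent)
        if not words:
            return 0.0
        # Average term frequency rewards sentences built from common terms.
        freq = sum(tf[w] for w in words) / len(words)
        # Vocabulary overlap with the topic boosts topic-related sentences.
        overlap = len(set(words) & topic_vocab) / len(set(words))
        return freq * (1.0 + overlap)

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    k = max(1, round(len(sentences) * ratio))
    keep = sorted(ranked[:k])  # restore original document order
    return " ".join(sentences[i] for i in keep)
```

For a generic summary (no topic), the overlap term contributes nothing and the scorer falls back to pure term-frequency ranking, which mirrors the abstract's observation that topic-independent statistics gave the systems no extra leverage.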

Type
Research Article
Copyright
© 2002 Cambridge University Press
