Hostname: page-component-745bb68f8f-lrblm Total loading time: 0 Render date: 2025-01-26T11:48:43.097Z Has data issue: false hasContentIssue false

Choosing the content of textual summaries of large time-series data sets

Published online by Cambridge University Press:  15 February 2006

JIN YU
Affiliation:
Department of Computing Science, University of Aberdeen Aberdeen AB24 3UE, UK e-mail: [email protected], [email protected], [email protected], [email protected]
EHUD REITER
Affiliation:
Department of Computing Science, University of Aberdeen Aberdeen AB24 3UE, UK e-mail: [email protected], [email protected], [email protected], [email protected]
JIM HUNTER
Affiliation:
Department of Computing Science, University of Aberdeen Aberdeen AB24 3UE, UK e-mail: [email protected], [email protected], [email protected], [email protected]
CHRIS MELLISH
Affiliation:
Department of Computing Science, University of Aberdeen Aberdeen AB24 3UE, UK e-mail: [email protected], [email protected], [email protected], [email protected]

Abstract

Natural Language Generation (NLG) can be used to generate textual summaries of numeric data sets. In this paper we develop an architecture for generating short (a few sentences) summaries of large (100KB or more) time-series data sets. The architecture integrates pattern recognition, pattern abstraction, selection of the most significant patterns, microplanning (especially aggregation), and realisation. We also describe and evaluate SumTime-Turbine, a prototype system which uses this architecture to generate textualsummaries of sensor data from gas turbines.

Type
Papers
Copyright
2006 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)