Book contents
- Frontmatter
- Contents
- List of Contributors
- Introduction and Overview
- Part One Foundational Components of Storylines
- Part Two Connecting the Dots: Resources, Tools, and Representations
- 7 The Richer Event Description Corpus for Event–Event Relations
- 8 Low-Resource Event Extraction via Share-and-Transfer and Remaining Challenges
- 9 Reading Certainty across Sources
- 10 Narrative Homogeneity and Heterogeneity in Document Categories
- 11 Exploring Machine Learning Techniques for Linking Event Templates
- 12 Semantic Storytelling: From Experiments and Prototypes to a Technical Solution
- Author Index
10 - Narrative Homogeneity and Heterogeneity in Document Categories
from Part Two - Connecting the Dots: Resources, Tools, and Representations
Published online by Cambridge University Press: 06 November 2021
- Frontmatter
- Contents
- List of Contributors
- Introduction and Overview
- Part One Foundational Components of Storylines
- Part Two Connecting the Dots: Resources, Tools, and Representations
- 7 The Richer Event Description Corpus for Event–Event Relations
- 8 Low-Resource Event Extraction via Share-and-Transfer and Remaining Challenges
- 9 Reading Certainty across Sources
- 10 Narrative Homogeneity and Heterogeneity in Document Categories
- 11 Exploring Machine Learning Techniques for Linking Event Templates
- 12 Semantic Storytelling: From Experiments and Prototypes to a Technical Solution
- Author Index
Summary
This chapter presents techniques for examining the distributional properties of narrative schemas in a subset of the New York Times (NYT) Corpus. In one technique, the narrative argument salience through entities annotated (NASTEA) task, we use the event participants indicated by narrative schemas to replicate salient entity annotations from the NYT Corpus. In another technique, we measure narrative schema stability by generating schemas with various permutations of input documents. Both of these techniques show differences between homogeneous and heterogeneous document categories. Homogeneous categories tend to perform better on the NASTEA task using fewer schemas and exhibit more stability, whereas heterogeneous categories require more schemas applied on average to peak in performance at the NASTEA task and exhibit less stability. This suggests that narrative schemas succeed at detecting and modeling the repetitive nature of template-written text, whereas more sophisticated models are required to understand and interpret the complex novelty found in heterogeneous categories.
- Type
- Chapter
- Information
- Computational Analysis of StorylinesMaking Sense of Events, pp. 203 - 220Publisher: Cambridge University PressPrint publication year: 2021