Data Segmentation

Kiyong Lee

doi:10.1017/9781108884532.005

2 - Data Segmentation

from Part I - Fundamentals

Published online by Cambridge University Press: 05 August 2023


Affiliation: Korea University, Seoul


Summary

Data can be segmented into minimal units; this process is called base segmentation. In this chapter, I discuss three types of base segmentation of language data, one for each of its three media types: phoneme segmentation for spoken data, image segmentation for visual data, and text segmentation for written data. The resulting minimal units can then be grouped into larger units. Base-segmented text, for instance, undergoes tokenization, annotated segmentation such as word segmentation, and chunking with POS-tagging. The semantic annotation of language data, whether written, spoken, or visualized, requires the target data to be segmented and preferably annotated with appropriate morpho-syntactic information.
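
As a rough illustration of the step from base segmentation to tokenization for written data, the following Python sketch treats characters as the minimal units and groups them into tokens identified by character offsets. The function names `base_segment` and `tokenize`, the regular expression, and the sample sentence are assumptions made for this example only; they are not taken from the chapter and may differ from the scheme it describes.

```python
import re


def base_segment(text):
    """Assumed base segmentation: one (offset, character) pair per character."""
    return list(enumerate(text))


def tokenize(text):
    """Group characters into tokens: runs of word characters, or single
    punctuation marks. Each token is (start, end, string), end exclusive."""
    pattern = re.compile(r"\w+|[^\w\s]")
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(text)]


text = "Mia visited Seoul."   # hypothetical sample sentence
units = base_segment(text)    # minimal units
tokens = tokenize(text)       # larger units built over the minimal units

for start, end, form in tokens:
    print(f"{start:2d}-{end:2d}  {form}")
```

Keeping the character offsets with each token means that later layers, such as word segmentation or POS-tagged chunks, could refer back to the same base units instead of copying the text.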

Type: Chapter
Information: Annotation-Based Semantics for Space and Time in Language, pp. 23-45

DOI: https://doi.org/10.1017/9781108884532.005

Publisher: Cambridge University Press

Print publication year: 2023
