Applications of term identification technology: domain description and content characterisation

BRANIMIR BOGURAEV; CHRISTOPHER KENNEDY

doi:10.1017/S1351324999002090

Published online by Cambridge University Press: 01 March 1999

BRANIMIR BOGURAEV: IBM T.J. Watson Research Center, IBM Corporation, NY, USA
CHRISTOPHER KENNEDY: Department of Linguistics, Northwestern University, IL, USA

Abstract

The identification and extraction of technical terms is one of the better understood and most robust Natural Language Processing (NLP) technologies within the current state of the art of language engineering. In generic information management contexts, terms have been used primarily by procedures seeking to identify a set of phrases that is useful for tasks such as text indexing, computational lexicology, and machine-assisted translation; such tasks rely on the assumption that terminology is representative of a given domain. This paper discusses an extension of basic terminology identification technology for application to two higher-level semantic tasks: domain description, the specification of the technical domain of a document, and content characterisation, the construction of a compact, coherent and useful representation of the topical content of a text. With these extensions, terminology identification becomes the foundation of an operational environment for document processing and content abstraction.

Type: Research Article
Information: Natural Language Engineering, Volume 5, Issue 1, March 1999, pp. 17-44
