Hostname: page-component-78c5997874-j824f Total loading time: 0 Render date: 2024-11-05T04:10:06.750Z Has data issue: false hasContentIssue false

Word sense disambiguation with pattern learning and automatic feature selection

Published online by Cambridge University Press:  22 January 2003

RADA F. MIHALCEA
Affiliation:
Department of Computer Science, University of North Texas, Denton, TX 76203-1366, USA e-mail: [email protected]

Abstract

This paper presents a novel approach for word sense disambiguation. The underlying algorithm has two main components: (1) pattern learning from available sense-tagged corpora (SemCor), from dictionary definitions (WordNet) and from a generated corpus (GenCor); and (2) instance based learning with automatic feature selection, when training data is available for a particular word. The ideas described in this paper were implemented in a system that achieves excellent performance on the data provided during the SENSEVAL-2 evaluation exercise, for both English all words and English lexical sample tasks.

Type
Research Article
Copyright
2002 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)