Engineering “word experts” for word disambiguation

Daniel Berleant

doi:10.1017/S1351324900000255

Published online by Cambridge University Press: 12 September 2008

Daniel Berleant: Affiliation:
Department of Computer Systems Engineering, University of Arkansas, Fayetteville, AR 72701, USA Email: [email protected]

Abstract

Every word in the lexicon of a natural language is used distinctly from all the other words. A word expert is a small, expert-system-like module for processing a particular word based on other words in its vicinity. A word expert exploits the idiosyncratic nature of a word by using a set of context-testing decision rules that test the identity and placement of context words to infer the word's role in the passage.
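
As a rough illustration of the idea described above, a word expert might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the design the paper proposes: the names `ContextRule` and `WordExpert`, the offset-based rule format, and the example rules for "bank" are all hypothetical and invented for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ContextRule:
    """Hypothetical context-testing rule: if `context_word` occurs at
    `offset` positions from the target word, conclude `sense`."""
    context_word: str
    offset: int   # -1 = word immediately before the target, +1 = immediately after
    sense: str


class WordExpert:
    """Hypothetical module that handles exactly one target word."""

    def __init__(self, target: str, rules: List[ContextRule],
                 default: Optional[str] = None) -> None:
        self.target = target
        self.rules = rules
        self.default = default

    def disambiguate(self, tokens: List[str], position: int) -> Optional[str]:
        """Return the sense of the first rule whose context test succeeds,
        or the default sense if no rule fires."""
        if tokens[position].lower() != self.target:
            raise ValueError(f"token at {position} is not {self.target!r}")
        for rule in self.rules:
            i = position + rule.offset
            # Test both the identity and the placement of the context word.
            if 0 <= i < len(tokens) and tokens[i].lower() == rule.context_word:
                return rule.sense
        return self.default


# Illustrative rules only; they are assumptions, not taken from the paper.
bank_expert = WordExpert(
    target="bank",
    rules=[
        ContextRule(context_word="river", offset=-1, sense="riverside"),
        ContextRule(context_word="account", offset=1, sense="financial institution"),
    ],
)

print(bank_expert.disambiguate("she sat on the river bank".split(), 5))
print(bank_expert.disambiguate("he opened a bank account".split(), 3))
```

Rules are tried in order and the first match wins, which is one simple way to resolve conflicts between rules; other orderings or weighting schemes are equally conceivable.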

The main application of word experts is disambiguating words. Work on word experts has never fully recognized previous related work, and a comprehensive review of that work would therefore contribute to the field. This paper provides such a review and describes guidelines and considerations useful in the design and construction of word-expert-based systems.

Type: Articles
Information: Natural Language Engineering, Volume 1, Issue 4, December 1995, pp. 339–362

DOI: https://doi.org/10.1017/S1351324900000255
Copyright: © Cambridge University Press 1995


