Hostname: page-component-cd9895bd7-gbm5v Total loading time: 0 Render date: 2024-12-24T04:01:20.008Z Has data issue: false hasContentIssue false

Engineering “word experts” for word disambiguation

Published online by Cambridge University Press:  12 September 2008

Daniel Berleant
Affiliation:
Department of Computer Systems EngineeringUniversity of Arkansas, Fayetteville, AR 72701, USA Email: [email protected]

Abstract

Every word in the lexicon of a natural language is used distinctly from all the other words. A word expert is a small expert system-like module for processing a particular word based on other words in its vicinity. A word expert exploits the idiosyncratic nature of a word by using a set of context testing decision rules that test the identity and placement of context words to infer the word's role in the passage.

The main application of word experts is disambiguating words. Work on word experts has never fully recognized previous related work, and a comprehensive review of that work would therefore contribute to the field. This paper both provides such a review, and describes guidelines and considerations useful in the design and construction of word expert based systems.

Type
Articles
Copyright
Copyright © Cambridge University Press 1995

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Adriaens, G., (1986) Word expert parsing: A natural language analysis program revised and applied to Dutch. Leuvensche Bijdragen 75(1):73154.Google Scholar
Adriaens, G., (1987) WEP (word expert parsing) revised and applied to Dutch. ECAI '86: Proceedings 7th European Conference on Artificial Intelligence,Brighton, UK,21–25 July 1986, pp. 222235. (Reprinted in Du Boulay, B., Hogg, D. and Steels, L., eds., Advances in Artificial Intelligence II, Elsevier, 1987, pp. 403416.)Google Scholar
Adriaens, G.. (1989) The parallel expert parser: a meaning-oriented, lexically-guided, parallel- interactive model of natural language understanding. International Workshop on Parsing Technologies, Carnegie-Mellon University, pp. 309319.Google Scholar
Adriaens, G., and Small, S. L.. (1988) Word expert parsing revisited in a cognitive science perspective. In Small, S. L., Cottrell, G. W., and Tanenhaus, MK (eds), Lexical Ambiguity Resolution: Perspectives from Psycholinguistics, Neuropsychology and Artificial Intelligence, San Mateo, CA: Morgan Kaufmann, pp.1343.Google Scholar
Allen, J.. (1995) Statistical word sense disambiguation. Chapter 10 in Natural Language Understanding, 2nd ed. New York, NY: Benjamin/Cummings.Google Scholar
Armstrong, S.. (1994) Using Large Corpora. Cambridge, MA: MIT Press.Google Scholar
Barnhart, C. L.. (ed.) (1967) Thorndike-Barnhart Comprehensive Desk Dictionary. Doubleday, pp. XI, 5.Google Scholar
ben Jehiel, N.. (1969) Sefer Arukh Ha-Shalem. Shiloh.Google Scholar
Berleant, J. D.. (1982) Subliminal stimuli and the interpretation of ambiguous sentences. B.S. thesis, MIT.Google Scholar
Berleant, D., Lovelady, S., and Viswanathan, K.. (1994) A foreign vocabulary learning aid for the networked world of tomorrow: The LEARN Project. SIGICE Bulletin 19(3):2229.CrossRefGoogle Scholar
Brown, P. F., Delia Pietra, S. A., Delia Pietra, V. J., and Mercer, R. L., (1991) Word-sense disambiguation using statistical methods. Proceedings 29th Annual Meeting of the Association for Computational Linguistics, pp. 264270.CrossRefGoogle Scholar
Brown, P. E., Delia Pietra, S. A., Delia Pietra, V. J., and Mercer, R. L., (1993) The Mathematics of statistical machine translation: parameter estimation. Computational Linguistics 19(2):263311.Google Scholar
Cerri, S. A.. (1989) ALICE: Acquisition of Linguistic Items in the Context of Examples. Instructional Science 18:6392.CrossRefGoogle Scholar
Choueka, Y., and Lusignan, S., (1985) Disambiguation by Short Contexts. Computers and the Humanities 19.Google Scholar
Cottrell, G. W., (1989) A Connectionist Approach to Word Sense Disambiguation. San Mateo, CA: Morgan Kaufmann.Google Scholar
Cottrell, G., and Small, S., (1983) A connectionist scheme for modelling word sense disambiguation. Cognition and Brain Theory 6:89120.Google Scholar
Devos, M., and Adriaens, G., (1994) The parallel expert parser. In Adriaens, G. and Hahn, U., (eds), Parallel Natural Language Processing. NJ: Ablex.Google Scholar
Devos, M., Adriaens, G., and Willems, Y. D., (1988) The parallel expert parser (PEP): a thoroughly revised descendent of the word expert parser (WEP). Proceedings of the 12th International Conference on Computational Linguistics (COLING '88), 1:142147, Budapest, Hungary, 22–27 August.CrossRefGoogle Scholar
Gale, W. A., and Church, K. W.. (1993) A program for aligning sentences in bilingual corpora. Computational Linguistics 19 (1): 75102.Google Scholar
Gale, W. A., Church, K. W., and Yarowsky, D.. (1993) A method for disambiguating word senses in a large corpus. Computers and the Humanities 26: 415439.Google Scholar
Hahn, U., (1989) Making understanders out of parsers: Semantically driven parsing as a key concept for realistic text understanding applications. Int J Intelligent Systems 4 (3):345393.Google Scholar
Hahn, U.. (1994) An actor model of distributed natural language parsing. In Adriaens, G. and Hahn, U (eds), Parallel Natural Language Processing. NJ: Ablex.Google Scholar
Harris, B., (1988) Bi-text, a new concept in translation theory. Language Monthly 54:810.Google Scholar
Hearst, M., (1991) Noun homograph disambiguation using local context in large text corpora. In Using Corpora, Waterloo, Ontario, Canada: University of Waterloo.Google Scholar
Hirst, G.. (1987) Semantic Interpretation and the Resolution of Ambiguity. Cambridge, UK: Cambridge University Press.CrossRefGoogle Scholar
Kay, M. and Röscheisen, M.. (1993) Text-translation alignment. Computational Linguistics 19 (1): 121142.Google Scholar
Kelly, E. and Stone, P.. (1975) Computer Recognition of English Word Senses. Amsterdam, The Netherlands: North-Holland.Google Scholar
Kolodner, J. L., (1993) Case-Based Reasoning. San Mateo, CA: Morgan Kaufmann.CrossRefGoogle Scholar
Lorge, I., (1937) The English semantic count. Teacher's College Record 39 (1): 6577.Google Scholar
Lorge, I., (1938) A Semantic Count of English Words. 3 vols. Institute of Educational Research, Teachers College, Columbia University, New York. (Reprinted 1979.)Google Scholar
Lorge, I., (1949) The Semantic Count of the 570 Commonest English Words. Institute of Educational Research, Teachers College, Columbia University, New York. (Reprinted 1949.)Google Scholar
Mathew, J., Conrad, J. M. and Berleant, D.. (1993) Word sense disambiguation by constraint satisfaction: a feasibility study. Proceedings of the Arkansas Computer Conference,Google Scholar
The Natural Language Software Registry, on the world wide web at http://cl-wwwdfki.uni-sb.de/ci/registry/draft.html.Google Scholar
The Oxford English Dictionary (1933) Vols. I–XI1 and Supplement (corrected re-issue). Oxford, UK: Oxford University Press.Google Scholar
Oxford Text Archive, Word Wide Web location ftp://ota.ox.ac.uk/pub/ota/public/dicts/info.Google Scholar
Rieger, C., (1978) Computational linguistics. In Dingwall, W., (ed), A Survey of Linguistic Science (2nd ed only), pp. 97134. Greylock, 1978.Google Scholar
Rieger, C., (1978 a) GRIND-1: First Report on the Magic Grinder Story Comprehension Project. Discourse Processes 1 (3): 267303.Google Scholar
Rieger, C., (1979) Five aspects of a full-scale story comprehension model. In Findler, N., (ed), Associative Networks: Representation and Use of Knowledge by Computers. New York, NY: Academic Press.Google Scholar
Rieger, C., and Small, S., (1979) Word expert parsing. Proceedings International Joint Conference on Artificial Intelligence (IJCAI-79), pp. 723728.Google Scholar
Rieger, C., and Small, S., (1981) Toward a theory of distributed word expert natural language parsing. IEEE Trans Systems, Man and Cybernetics 11 (1): 4351.CrossRefGoogle Scholar
Sadler, V., (1989) Working with Analogical Semantics: Disambiguation Techniques in DLT. Dordrecht, The Netherlands: Foris Publications.CrossRefGoogle Scholar
Schaller, N., (1993) Massively parallel approach aids speech translation. IEEE Computer 26 (11):78.Google Scholar
Small, S., (1979) Word expert parsing. Proceedings of the 17th Annual Meeting of the Association for Computational Linguistics.Google Scholar
Small, S., (1980) Word Expert Parsing: A Theory of Distributed Word-Based Natural Language Understanding. Dissertation, Technical report TR-954, Department of Computer Science, University of Maryland.Google Scholar
Small, S., (1981) Viewing word expert parsing as linguistic theory. Proceedings International Joint Conference on Artificial Intelligence (IJCAI-81), pp. 7076.Google Scholar
Small, S., (1983) Parsing as cooperative distributed inference: understanding through memory interactions. In King, M., (ed), Parsing Natural Language. London, UK: Academic Press.Google Scholar
Small, S., Cottrell, G., and Shastri, L., (1982) Toward connectionist parsing. Proceedings National Conference on Artificial Intelligence (AAAI-82),, pp. 247250.Google Scholar
Small, S., and Rieger, C., (1982) Parsing and comprehending with word experts (A theory and its realization). In Lehnert, W., and Ringle, M., (eds), Strategies for Natural Language Processing, pp. 89147. Hillsdale, NJ: Lawrence Erlbaum.Google Scholar
Sproat, R., Hirschberg, J., and Yarowsky, D., (1992) A corpus-based synthesizer. Proceedings International Conference on Spoken Language Processing.CrossRefGoogle Scholar
Stone, P. J.. (1969) Improved quality of content analysis categories: computerized-disambiguation rules for high-frequency English words. In Gerbner, G., Hoisti, O. R., Krippendorff, K., Paisley, W. J., and Stone, P. J., (eds), The Analysis of Communication Content New York, NY: Wiley.Google Scholar
Stone, P. J.. (1995) Thematic text analysis: new agendas for analyzing text content. In Roberts, C., (ed), Text Analysis for the Social Sciences: Methods for Drawing Statistical Inferences from Texts and Transcripts. Hilisdale, NJ: Lawrence Erlbaum.Google Scholar
Stone, P. J., Dunphy, D. C., Smith, M. S., Ogilvie, D. M. et al. , (1966) The General Inquirer: A Computer Approach to Content Analysis. Cambridge, MA: MIT Press.Google Scholar
Thorndike, E. L., and Barnhart, C. L., (eds), Thorndike-Barnhart Student Dictionary (Updated Edition), p. 13. London, UK: HarperCollins.Google Scholar
University of Birmingham, Corpus Linguistics research group's email tagging service at taggerlc1g.bham.ac.uk (information at http://clgl.bham.ac.uk/tagger.html).Google Scholar
Weiss, S. F., (1973) Learning to disambiguate. Information Storage and Retrieval 9:3341.Google Scholar
Wieger, L., (1965) Chinese Characters: Their Origin, Etymology, History, Classfication and Signification: A Thorough Study from Chinese Documents New York, NY: Dover.Google Scholar
Yarowsky, D., (1994) Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French. Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (ACL-94).http://xxx.lani.gov/abs/cmp-lg/9406034.Google Scholar
Yarowsky, D., (1994 a) A comparison of corpus-based techniques for restoring accents in Spanish and French text. Proceedings of the 2nd Annual Workshop on Very Large Text Corpora.Google Scholar
Yarowsky, D., (1993) One sense per collocation. Proceedings of the 1993 ARPA Human Language Technology Workshop, pp. 266271. San Mateo, CA: Morgan Kaufmann.Google Scholar