On some applications of finite-state automata theory to natural language processing

MEHRYAR MOHRI

doi:10.1017/S135132499600126X

Abstract

We describe new applications of the theory of automata to natural language processing: the representation of very large scale dictionaries and the indexation of natural language texts. They are based on new algorithms that we introduce and describe in detail. In particular, we give pseudocodes for the determinisation of string to string transducers, the deterministic union of p-subsequential string to string transducers, and the indexation by automata. We report on several experiments illustrating the applications.

Footnotes

This work was done while the author was an associate professor of computer science and computational linguistics at the Institut Gaspard Monge-LADL in Paris, France.

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Mohri, M. Riley, M. Hindle, D. Ljolje, A. and Pereira, F. 1998. Full expansion of context-dependent networks in large vocabulary speech recognition. Vol. 2, Issue. , p. 665.

Mohri, Mehryar Pereira, Fernando and Riley, Michael 1998. Automata Implementation. Vol. 1436, Issue. , p. 144.

Mohri, Mehryar and Riley, Michael 1999. Network optimizations for large-vocabulary speech recognition. Speech Communication, Vol. 28, Issue. 1, p. 1.

Karttunen, Lauri and Oflazer, Kemal 2000. Introduction to the Special Issue on Finite-State Methods in NLP. Computational Linguistics, Vol. 26, Issue. 1, p. 1.

Sojka, Petr 2000. Text, Speech and Dialogue. Vol. 1902, Issue. , p. 157.

Mihov, Stoyan and Maurel, Denis 2001. Implementation and Application of Automata. Vol. 2088, Issue. , p. 217.

Mohri, Mehryar Pereira, Fernando and Riley, Michael 2002. Weighted finite-state transducers in speech recognition. Computer Speech & Language, Vol. 16, Issue. 1, p. 69.

Piskorski, Jakub Jäger, Tilman and Xu, Feiyu 2002. Databases and Information Systems II. p. 311.

GUINGNE, FRANCK NICART, FLORENT CHAMPARNAUD, JEAN-MARC KARTTUNEN, LAURI GAÁL, TAMÁS and KEMPE, ANDRÉ 2003. VIRTUAL OPERATIONS ON VIRTUAL NETWORKS: THE PRIORITY UNION. International Journal of Foundations of Computer Science, Vol. 14, Issue. 06, p. 1055.

Fatholahzadeh, Abolfazl 2003. Implementation and Application of Automata. Vol. 2608, Issue. , p. 95.

Allauzen, C. and Mohri, M. 2003. Generalized optimization algorithm for speech recognition transducers. Vol. 1, Issue. , p. I-352.

Maurel, Denis 2003. Grammars and Automata for String Processing. Vol. 20032543, Issue. , p. 177.

Yi-Cheng Pan Chia-Hsing Yu and Lin-Shan Lee 2004. Large vocabulary continuous Mandarin speech recognition using finite state machine. p. 5.

Tounsi, L. Maurel, D. and Bouchou, B. 2005. Basic search of sub automata Application to electronic dictionaries. p. 543.

Galvez, Carmen 2006. Aplicación de transductores de estado-finito a los procesos de unificación de términos. Ciência da Informação, Vol. 35, Issue. 3, p. 67.

Cohen-Sygal, Yael and Wintner, Shuly 2006. Finite-State Registered Automata for Non-Concatenative Morphology. Computational Linguistics, Vol. 32, Issue. 1, p. 49.

Cohen-Sygal, Yael and Wintner, Shuly 2006. Finite-State Methods and Natural Language Processing. Vol. 4002, Issue. , p. 43.

Rojc, Matej Rotovnik, Tomaž Brus, Mišo Jan, Dušan and Kačič, Zdravko 2007. Verbal and Nonverbal Communication Behaviours. Vol. 4775, Issue. , p. 294.

Rojc, Matej and Kačič, Zdravko 2007. Time and space-efficient architecture for a corpus-based text-to-speech synthesis system. Speech Communication, Vol. 49, Issue. 3, p. 230.

Troncoso-Pastoriza, Juan Ramón Katzenbeisser, Stefan and Celik, Mehmet 2007. Privacy preserving error resilient dna searching through oblivious automata. p. 519.

Download full list

Article contents

On some applications of finite-state automata theory to natural language processing

Abstract

Access options

Article purchase

Temporarily unavailable

Footnotes

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

On some applications of finite-state automata theory to natural language processing

Abstract

Access options

Article purchase

Temporarily unavailable

Footnotes

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests