Sintéiseoir 1.0: a multidialectical TTS application for Irish

Mícheál Mac Lochlainn

doi:10.1017/S0958344010000054

Sintéiseoir 1.0: a multidialectical TTS application for Irish

Published online by Cambridge University Press: 01 May 2010

Mícheál Mac Lochlainn

Show author details

Mícheál Mac Lochlainn*: Affiliation:
Acadamh na hOllscolaíochta Gaeilge, Ollscoil na hÉireann Gaillimh, Roisín na Mainiach, Carna, Cúndae na Gaillimhe, Éire (email: [email protected])

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

This paper details the development of a multidialectical text-to-speech (TTS) application, Sintéiseoir, for the Irish language. This work is being carried out in the context of Irish as a lesser-used language, where learners and other L2 speakers have limited direct exposure to L1 speakers and speech communities, and where native sound systems and vocabularies can be seen to be receding even among L1 speakers – particularly the young.

Sintéiseoir essentially implements the diphone concatenation model, albeit augmented to include phones, half-phones and, potentially, other phonic units. It is based on a platform-independent framework comprising a user interface, a set of dialect-specific tokenisation engines, a concatenation engine and a playback device.

The tokenisation strategy is entirely rule-based and does not refer to dictionary look-ups. Provision has been made for prosodic processing in the framework but has not yet been implemented. Concatenation units are stored in the form of WAV files on the local file system.

Sintéiseoir’s user interface (UI) provides a text field that allows the user to submit a grapheme string for synthesis and a prompt to select a dialect. It also filters input to reject graphotactically invalid strings, restrict input to alphabetic and certain punctuation marks found in Irish orthography, and ensure that a dialect has, indeed, been selected.

The UI forwards the filtered grapheme string to the appropriate tokenisation engine. This searches for specified substrings and maps them to corresponding tokens that themselves correspond to concatenation units.

The resultant token string is then forwarded to the concatenation engine, which retrieves the relevant concatenation units, extracts their audio data and combines them in a new unit. This is then forwarded to the playback device.

The terms of reference for the initial development of Sintéiseoir specified that it should be capable of uttering, individually, the 99 most common Irish lemmata in the dialects of An Spidéal, Músgraí Uí Fhloínn and Gort a’ Choirce, which are internally consistent dialects within the Connacht, Munster and Ulster regions, respectively, of the dialect continuum. Audio assets to satisfy this requirement have already been prepared, and have been found to produce reasonably accurate output. The tokenisation engine is, however, capable of processing a wider range of input strings and when required concatenation units are found to be unavailable, returns a report via the user interface.

Keywords

Irish TTS speech synthesis dialect

Type: Research Article
Information: ReCALL , Volume 22 , Issue 2 , May 2010 , pp. 152 - 171

DOI: https://doi.org/10.1017/S0958344010000054 [Opens in a new window]
Copyright: Copyright © European Association for Computer Assisted Language Learning 2010

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Buntús Gaeilge (1966), Baile Átha Cliath: An Roinn Oideachais.Google Scholar

Coláiste na Tríonóide, Baile Átha Cliath, Centre for Language and Communication Studies (2008) abair.ie http://www.abair.ie/.Google Scholar

Lenzo, K. A.Black, A. W. (2000) Diphone Collection and Synthesis http://www.cs.cmu.edu/~awb/papers/ICSLP2000_diphone/.CrossRef Google Scholar

Mac Lochlainn, M. (2008) Forbairt Chreat Ílchanúnach Téacs-go-hUrlabhrtha don Ghaelainn, unpublished thesis (M.Sc.), University of Limerick.Google Scholar

Niagra College, Canada, Technology Division (2009) CTEC1631 Course Notes: WAV File Format http://technology.niagarac.on.ca/courses/ctec1631/WavFileFormat.html.Google Scholar

Ó Baoill, D. P. (ed.) (1986a) Lárchanúint don Ghaeilge (Téip Léirithe). Baile Átha Cliath: Institiúid Teangeolaíochta Éireann.Google Scholar

Ó Baoill, D. P. (1986b) Lárchanúint don Ghaeilge (Tuarascáil Taighde). Baile Átha Cliath: Institiúid Teangeolaíochta Éireann.Google Scholar

Ó Cuív, B. (1944) The Irish of West Muskerry, Co. Cork. Baile Átha Cliath: Institiúid Ardléinn Bhaile Átha Cliath.Google Scholar

Ó Domhnaill, N. (1977) Foclóir Gaeilge-Béarla. Baile Átha Cliath: Rialtas na hÉireann.Google Scholar

Ó Duilearga, S. (ed.) (1981) Leabhar Stiofán Uí Ealaoire. Baile Átha Cliath: Comhairle Bhaloideas Éireann.Google Scholar

Ó Sé, D. (2000) Gaeilge Chorca Dhuibhne. Baile Átha Cliath: Institiúid Teangeolaíochta Éireann.Google Scholar

Schmitt, S. R. (2009) Permutation generator http://home.att.net/~srschmitt/script_permutations.html.Google Scholar

Article contents

Sintéiseoir 1.0: a multidialectical TTS application for Irish

Abstract

Keywords

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests