
The articulatory properties of apical vowels in Hefei Mandarin

Published online by Cambridge University Press:  23 June 2023

Huifang Kong
Affiliation:
Institute of Linguistics and Applied Linguistics, School of Foreign Languages, Anhui Jianzhu University [email protected]
Shengyi Wu*
Affiliation:
Department of Chinese Language and Literature, Shaoxing University [email protected]
Mingxing Li
Affiliation:
Department of English Language and Literature, Hong Kong Baptist University [email protected]
Xiangrong Shen
Affiliation:
Key Innovation Group of Digital Humanities Resource and Research, Shanghai Normal University [email protected]
*
Correspondence should be addressed to: Shengyi Wu, Department of Chinese Language and Literature, Shaoxing University. Email: [email protected]

Abstract

Apical vowels are widely observed across Chinese dialects, such as the rime of [sɹ̩55] ‘think’ in Mandarin Chinese, which is a syllabic approximant homorganic to its preceding sibilant. The apical vowels in Hefei Mandarin differ from those in Mandarin Chinese and most other languages in three respects: (i) there are three phonetic apical vowels [ɹ̩], [ɹ̩ʷ], and [ɻ̩], while other languages usually have one or two; (ii) the alveolar apical [ɹ̩] appears after both homorganic and non-homorganic consonants, e.g. [sɹ̩] vs. [pɹ̩]; and (iii) there is a phonological contrast between an unrounded apical [ɹ̩] and a rounded apical [ɹ̩ʷ], e.g. [sɹ̩] vs. [sɹ̩ʷ]. The articulatory properties of the three apical vowels were examined in this study using ultrasound techniques, and the results revealed that: (i) the commonalities of tongue gestures for the apical vowels include a retracted tongue root, a lowered tongue dorsum or blade, or both, together with a coronal constriction implemented with the blade and/or the tip; (ii) lip gestures are involved in distinguishing the three apical segments; (iii) the three segments each have their own distinct articulatory gestures within a speaker, which cannot be simply attributed to the influence of their preceding consonants, with [ɹ̩] and [ɹ̩ʷ] involving a grooving in the front part of the tongue and [ɻ̩] involving a retraction of the tongue body in the back region of the vocal tract; (iv) the articulatory gesture of [ɹ̩] after a homorganic consonant, e.g. in [sɹ̩], is similar to that after a non-homorganic consonant, e.g. in [pɹ̩], suggesting an independent articulatory target for this segment.

Type
Research Article
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of The International Phonetic Association

1 Introduction

Apical vowels are syllabic segments such as the nuclei [ɹ̩] and [ɻ̩] of the syllables [sɹ̩55] ‘think’ and [ʂɻ̩55] ‘lion’ in Mandarin Chinese, which typically occur after the alveolar sibilants [s ts tsʰ] and the retroflex sibilants [ʂ tʂ tʂʰ] respectively, as illustrated in Table 1. In Mandarin Chinese, these apical segments share the places of articulation of their preceding alveolar and retroflex sibilants (Chao 1930, Ladefoged & Maddieson 1996) and can be regarded as the ‘vocalized prolongation’ of their preceding consonants (Chao 1934). Like vowels such as [i] and [a], the apical vowels [ɹ̩] and [ɻ̩] in Mandarin Chinese have formant patterns (Howie 1970), and there is no visible change in formant transitions between an alveolar/retroflex sibilant and a following apical vowel [ɹ̩/ɻ̩] (Lee & Li 2003), which corresponds to their homorganicity. There has been controversy in the literature about the phonetic status of the apical vowels [ɹ̩] and [ɻ̩] in Mandarin Chinese, which have been described as syllabic consonants (Hartman 1944, Hockett 1967, Duanmu 2007), syllabic approximants (Lee & Zee 2003, Zee & Lee 2004, Lee-Kim 2014), fricative vowels (Ladefoged & Maddieson 1996), etc., based on different aspects of their phonetic properties. In this study, we refer to apical vowels using the IPA symbols [ɹ̩] and [ɻ̩], following Kong, Wu & Li (published online 15 July 2022). In terms of their phonological properties, the two apical vowels in Mandarin Chinese are in complementary distribution with the phonetic vowel [i], as allophones of the phoneme /i/.
In the current study, the term ‘apical vowel’ is used to refer to these two segments in Mandarin Chinese and their counterparts in other Chinese dialects with similar phonetic properties. Apical vowels are reported to be widely distributed across Chinese dialects (Lee & Zee 2017, Li 2017, among others), e.g. in 85% of a sample of 170 Chinese dialects in Li’s (2017) typological survey.

Table 1 Phonotactics of the apical vowels [ɹ̩/ɻ̩] and [i] in Mandarin Chinese.

1.1 The articulation of apical vowels

The articulatory gestures of the apical vowels [ɹ̩]/[ɻ̩] in Mandarin Chinese are similar to those of their preceding alveolar/retroflex sibilants, and their production can be impressionistically recognized as a voiced extension of the sibilant onsets to carry the syllables (Chao 1968). Instrumental investigation of apical vowels in Mandarin Chinese and other Chinese dialects has revealed some of their general articulatory properties. First, there are slight differences between the tongue gestures of a preceding alveolar/retroflex sibilant and a following apical vowel, as observed in the articulatory studies of Mandarin Chinese (Chen 2011, Lee-Kim 2014, Chen et al. 2015, Faytak & Lin 2015), Jixi-Hui Chinese (Shao 2020, Shao & Ridouane, published online 19 January 2023) and Suzhou Chinese (Ling 2009, Faytak 2018, Hu & Ling 2019). Second, different apical vowels have their own articulatory gestures (Lee-Kim 2014, Faytak & Lin 2015), as observed in the X-ray study of the two apical vowels in Mandarin Chinese (Zhou & Wu 1963) showing that the alveolar [ɹ̩] involves a more front constriction and the retroflex [ɻ̩] a more back one. Bao (1984) further noted that the articulation of the two apical vowels in Mandarin Chinese is characterized by a concavity in the tongue shape. For apical vowels in Ningbo Chinese, Hu’s (2005) study showed that their articulation involves the tongue apex as well as the tongue dorsum. Third, inter-speaker variation has been observed in previous studies, in particular for the apical [ɹ̩] and [ɻ̩] in Mandarin Chinese, with a wide variety of lingual adjustments such as tongue dorsum lowering, tongue blade lowering, and tongue raising (Chen 2011, Faytak & Lin 2015, Huang, Hsieh & Chang 2021).

1.2 Apical vowels in Hefei Mandarin

This study focuses on apical vowels in Hefei Mandarin, which is a branch of Jianghuai Mandarin spoken in Anhui Province, China. In Mandarin Chinese (illustrated in Table 1) and many other Chinese dialects, the two apical vowels [ɹ̩] and [ɻ̩] are both in complementary distribution with the vowel [i] and occur after homorganic alveolar and retroflex sibilants respectively. In contrast to such cases, apical vowels in Hefei Mandarin have different phonological properties (Meng 1962, 1997; Li 1997), as illustrated in Table 2.

  • First, while many other Chinese dialects have one or two apical vowels (e.g. [ɹ̩] and [ɻ̩] in Mandarin Chinese), Hefei Mandarin has three phonetic apical vowels: the alveolar unrounded [ɹ̩], the alveolar rounded [ɹ̩ʷ], and the retroflex unrounded [ɻ̩]. With its three apical vowels, Hefei Mandarin was recognized as one of the Chinese dialects with the largest number of apical segments (Baron 1974). The rounded alveolar apical [ɹ̩ʷ], as in [sɹ̩ʷ213] ‘allow’, is relatively rare across Chinese dialects. It is observed in only two of the 88 Chinese dialects in the survey of Lee & Zee (2017), although its counterparts exist in a number of Wu dialects (Zhu 2004). For example, Suzhou Chinese has contrastive unrounded and rounded apical vowels, phonetically realized as syllabic fricatives [z] and [zʷ], with a loose degree of constriction (Faytak 2018).

  • Second, the alveolar apical [ɹ̩] in Hefei Mandarin appears after homorganic consonants (e.g. [sɹ̩213] ‘dead’) as well as after non-homorganic ones (e.g. [pɹ̩213] ‘to compare’) as seen in Table 2. It thus differs from the apical [ɹ̩] in Mandarin Chinese (see Table 1) which occurs only after a homorganic alveolar sibilant such as in [sɹ̩55] ‘think’.

  • Third, the unrounded [ɹ̩] and the rounded [ɹ̩ʷ] are phonologically contrastive in Hefei Mandarin (e.g. [sɹ̩213] ‘dead’ vs. [sɹ̩ʷ213] ‘allow’), as seen in Table 2, while apical vowels in other languages are usually in complementary distribution with each other, e.g. the two apical vowels [ɹ̩] and [ɻ̩] in Mandarin Chinese appear respectively after alveolar and retroflex sibilants such as in [sɹ̩55] ‘think’ and [ʂɻ̩55] ‘wet’.

In addition to these properties, the three apical segments in Hefei Mandarin are all in complementary distribution with the high front vowel [i], as seen in Table 2, although there is a phonological contrast between the unrounded [ɹ̩] and the rounded [ɹ̩ʷ].

Table 2 Phonotactics of apical vowels and [i] in Hefei Mandarin.

Previous studies have shown that the apical vowels in Hefei Mandarin have clear formant structure (Li 1997, Meng 1997, Hou 2007, Kong et al., published online 15 July 2022), similar to their counterparts in Mandarin Chinese, as well as frication noise, as observed in studies such as Hou (2007) and Kong, Wu & Li (2019). Figure 1 gives an illustration of the waveforms and spectrograms of the syllables [sɹ̩213] ‘to wash’, [sɹ̩ʷ213] ‘to allow’, [ʂɻ̩213] ‘arrow’, [pɹ̩213] ‘to compare’, [tsɹ̩213] ‘to squeeze’, and [zɹ̩213] ‘gift’, produced by a male speaker of Hefei Mandarin (M02). As illustrated in Figure 1f, the apical [ɹ̩] differs from the voiced alveolar fricative [z] by having a clear formant structure, which is relatively consistent when [ɹ̩] appears after different consonants, e.g. [s] [ts] [z] [p]. As shown in Figure 1a–c, [ɹ̩] [ɹ̩ʷ] [ɻ̩] have different formant patterns, with the F2 of [ɻ̩] slightly higher than those of [ɹ̩] and [ɹ̩ʷ]. Based on the data from three female and three male speakers, Hou (2007) reported F1 and F2 values of the three segments in Hefei Mandarin, as shown in Table 3. This table also includes mean F1 and F2 values of the two apical vowels [ɹ̩] and [ɻ̩] in Mandarin Chinese as reported in Lin & Wang (1992). As shown in Table 3, F1 and F2 values of [ɹ̩] and [ɻ̩] in Hefei Mandarin are similar to those of their counterparts in Mandarin Chinese across the male and female speakers; the mean F1 and F2 values of the rounded alveolar [ɹ̩ʷ] in Hefei Mandarin are slightly lower than the corresponding values of the unrounded alveolar [ɹ̩] for both female and male speakers. F3 of [ɻ̩] in Hefei Mandarin was reported to be lower than that of [ɹ̩] while being close to that of [ɹ̩ʷ] (Kong et al., published online 15 July 2022).

Figure 1 Waveforms and spectrograms of the syllables [sɹ̩213] 洗 ‘to wash’, [sɹ̩ʷ213] 许 ‘to allow’, [ʂɻ̩213] 矢 ‘arrow’, [pɹ̩213] 比 ‘to compare’, [tsɹ̩213] 挤 ‘to squeeze’, and [zɹ̩213] 礼 ‘gift’ in Hefei Mandarin produced by a male speaker (M02).

Table 3 Mean F1 and F2 values of the three apical vowels in Hefei Mandarin in comparison with the apical vowels in Mandarin Chinese (Lin & Wang 1992, Hou 2007).

Note: HF = Hefei Mandarin, data from Hou (2007) based on three female and three male speakers; MC = Mandarin Chinese, data from Lin & Wang (1992) based on four female and four male speakers.

In contrast to the many acoustic studies, there have been few articulatory studies on the apical vowels in Hefei Mandarin, which is representative of an under-studied group of languages differing drastically from Mandarin Chinese in the inventory and phonotactics of apical vowels. In particular, articulatory properties of the typologically rare rounded apical [ɹ̩ʷ] have not been investigated, with the exception of a handful of studies of other Chinese dialects, namely of Suzhou Chinese (Ling 2009) and Ningbo Chinese (Hu 2005). Both these dialects differ from Hefei Mandarin in not having the retroflex [ɻ̩] in their inventories. A case study on the apical vowels in Hefei Mandarin, therefore, is expected to contribute new material to the investigation of apical vowels and syllabic segments in general. Given the special properties of Hefei Mandarin as compared with many other languages, this study aims to address the following issues:

  1. What are the tongue gestures involved in the production of the apical [ɹ̩] [ɹ̩ʷ] [ɻ̩] relative to [i]?

  2. What are the lip gestures involved in the production of the apical vowels [ɹ̩] [ɹ̩ʷ] [ɻ̩]?

  3. Is there an articulatory difference between the consonant and the following vowel in the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩]?

  4. Is the apical [ɹ̩] articulated differently or similarly when it appears after a homorganic sibilant (e.g. [s]) and a non-homorganic consonant (e.g. [p])?

To answer these questions, we employed ultrasound imaging to examine the tongue gestures and video recordings to examine the lip gestures in the production of the apical vowels in Hefei Mandarin. Ultrasound is a non-invasive and low-cost technique used to observe the real-time movement of the tongue in speech production, and it has been applied in studying apical vowels in Mandarin Chinese (Chen 2011, Lee-Kim 2014, Chen et al. 2015, Faytak & Lin 2015, Huang et al. 2021) and the alveolar vs. retroflex consonant contrast in Mandarin Chinese (Luo 2020) and Jixi Chinese (Shao 2020, Shao & Ridouane, published online 19 January 2023).

2 Method

2.1 Participants

Native speakers of Hefei Mandarin were recruited as participants based on the following criteria. The speakers should have grown up in Hefei, with Hefei Mandarin being their mother tongue as well as that of their parents. They must not have resided in a place other than Hefei for more than three months in the past 12 months. Finally, they should have had no speaking or hearing impairments. Based on these criteria, three female speakers and three male speakers from the old city area of Hefei were identified as the participants, aged 25 to 39 (median = 26) at the time of the recording. After the data collection, it was found that the ultrasound images of the participant M03 were of inferior quality and that the tongue contours could not be extracted with EdgeTrak or other contour-based methods; therefore his data were excluded.

2.2 Materials

The stimuli for the ultrasound study included nine disyllabic sequences, as shown in Table 4. The first syllable of each word in (a), (b), and (c) contained one of the three apical vowels, while the first syllable of the word in (d) contained the vowel [i] as a baseline for comparison. Following Lee-Kim (2014), we selected stimuli containing [x] as the onset of the second syllable. This is because, among the consonants of Hefei Mandarin, the fricative [x] is expected to have a minimal influence on the articulation of its preceding vowel in the first syllable. In addition, Hefei Mandarin has a small number of syllables with the onset [x], and the second syllables [xɔ213] ‘good’ in (a)–(c) and [xu31] (an adjective affix) in (d) are two of the most frequently used morphemes, which are expected to facilitate the speakers’ natural articulation of the disyllabic sequences more than other less frequent [x]-initial syllables. In terms of tone, the target syllables all bear the same lexical tone /213/, except for [ɕi], which bears a /24/ tone. In Hefei Mandarin, a sequence of two /213/ tones triggers a tone sandhi, i.e. /213 + 213/ → [24 + 213], by which the first tone surfaces as [24] (Kong et al., published online 15 July 2022). Thus, across the stimulus words in (a)–(c) in Table 4, the first syllable bears a [24] tone in its real phonetic form, which is the same tone as for the first syllable [ɕi] in (d) in Table 4. This design aimed to filter out the potential laryngeal difference when producing different tones.
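The sandhi rule used in the stimulus design can be written out as a small function. This is only an illustrative sketch: the function name `surface_tones` and the representation of tones as strings are assumptions made here, and only the disyllabic rule /213 + 213/ → [24 + 213] stated above is modelled.

```python
def surface_tones(first, second):
    """Return the surface tones of a disyllabic sequence.

    Only the rule stated in the text is modelled: /213 + 213/ -> [24 + 213].
    Any other combination is returned unchanged (an assumption of this sketch,
    not a claim about the full tone sandhi system of Hefei Mandarin).
    """
    if first == "213" and second == "213":
        return ("24", "213")
    return (first, second)


# Words of types (a)-(c): target syllable /213/ followed by [xɔ213]
print(surface_tones("213", "213"))
# Word of type (d): [ɕi24] followed by [xu31]
print(surface_tones("24", "31"))
```

Under this rule, the first syllable surfaces with a [24] tone in both stimulus types, which is the property the design relies on.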

Table 4. The stimuli used in the ultrasound study of Hefei Mandarin.

2.3 Procedure

Following previous studies (Lee-Kim 2014, Westerberg 2016, among others), we focused on the midsagittal contour of the tongue in producing the target segments, which displays the relative backness, height, and slope of the tongue, as well as on the lip gestures. Before the recording, the participants were presented with the written forms of the stimuli in simplified Chinese to familiarize them with the stimulus words. Following the common practice of ultrasound studies (e.g. Epstein & Stone 2005), the participants were asked to swallow water before the production task so that the palate trace could be extracted. During the ultrasound data collection, the stimuli were displayed on a teleprompter one meter in front of the participant at eye level, and the disyllabic words were presented in a random order. The participants read each disyllabic word in the carrier sentence: 我读___第个词 ‘I read___this word’ [o213 tuəʔ4__ti5353 tsʰɹ̩24] as in Hefei Mandarin. For each of the nine target stimulus words in Table 4, five tokens were recorded with ultrasound imaging and audio recording. Thus, the five participants gave a total of 225 tokens (= 9 words × 5 repetitions × 5 participants).

When collecting the ultrasound data, the midsagittal sections were recorded at 40 fps using a SonixTablet ultrasound system in a sound-attenuated booth at Shanghai Normal University, Shanghai, China. The audio was recorded using a lavalier microphone through an Mbox mini audio interface, at the sampling rate of 44100 Hz, which was synchronized into an AVI file with the ultrasound video using an Epiphan USB capture card. Figure 2 illustrates the ultrasound probe and the ultrasound stabilization helmet. The probe was held in place under a participant’s chin using a stabilization helmet (Articulate Instruments Ltd. 2008), adjusted to maximize the freedom of movement of the mandible while maintaining full contact of the probe with the participant’s skin. This aims to avoid the movement of the probe relative to the head and to ensure that ultrasound videos are controlled with respect to the orientation of the probe to provide a positional reference in quantitative assessment.
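The relation between the two recording rates reported above can be made concrete with a little arithmetic. The following is a minimal sketch, assuming that the first video frame starts at time zero of the synchronized audio; the constant and function names are hypothetical and do not come from the recording software.

```python
import math

FPS = 40              # ultrasound frame rate reported in the text
SAMPLE_RATE = 44100   # audio sampling rate reported in the text (Hz)


def frame_duration(fps=FPS):
    """Duration of one ultrasound frame in seconds."""
    return 1.0 / fps


def samples_per_frame(sample_rate=SAMPLE_RATE, fps=FPS):
    """Number of audio samples spanned by one ultrasound frame."""
    return sample_rate / fps


def frame_at_time(t, fps=FPS):
    """Zero-based index of the frame being displayed at time t (seconds).

    Assumes frame 0 begins at t = 0, which is an assumption of this sketch.
    """
    return math.floor(t * fps)


print(frame_duration())      # 0.025 s, i.e. 25 ms per frame
print(samples_per_frame())   # 1102.5 audio samples per frame
print(frame_at_time(0.26))   # frame 10
```

At 40 fps each frame covers 25 ms, so an acoustic landmark can be located in the video only to within that interval.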

Figure 2 The ultrasound stabilization helmet and ultrasound probe as fit to a participant.

When recording the lip gestures in producing the apical vowels, the speakers were invited to read the same materials as used for the ultrasound recording. Each participant produced three tokens for each target word and the five participants gave a total of 135 tokens (= 9 words × 3 repetitions × 5 participants). A built-in camera in a Xiaomi smartphone was set up at 23 cm directly in front of a speaker’s mouth when recording the front view and at 30 cm away from the mouth when recording the side view. The recording was done at a resolution of 640 × 368 pixels at a sampling rate of 44,100 Hz. An audio recording was made simultaneously with the video recording of the lip gestures, which was used to track the time course correspondence between the lip gestures and the produced segments.

2.4 Data analysis

In processing the ultrasound data, the onset and offset of each onset consonant and each vocalic segment were identified following the practice in previous studies (e.g. Iskarous, Shadle & Proctor 2011, Lee-Kim 2014, among others). Frames from the midpoints of the target segments were selected using the software Adobe Premiere, referring to the acoustic landmarks in the time-aligned audio. When there was an even number of frames, the first frame after the middle was used (Tabain & Beare 2018). The tongue surface contours were extracted using EdgeTrak (Li, Kambhamettu & Stone 2005). These contours are assumed to represent the genuine tongue shapes for the relevant vocalic segments. Figure 3 shows a sample waveform and spectrogram of the syllable [sɹ̩] and the corresponding ultrasound imaging frames at which the vocalic articulation reached its maximal constriction, and Figure 4 shows an extracted tongue contour.
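The midpoint rule for frame selection can be sketched as follows. The function name `midpoint_frame` is hypothetical, and the sketch assumes that the frames of a segment are given as an ordered list of frame numbers; the selection in the study itself was done by hand in video-editing software.

```python
def midpoint_frame(frames):
    """Pick the frame at the temporal midpoint of a segment.

    For an odd number of frames this is the middle frame. For an even
    number, the middle falls between two frames and the first frame
    after the middle is taken, as described in the text.
    """
    if not frames:
        raise ValueError("segment contains no frames")
    # Integer division yields the middle index for odd lengths and the
    # first index after the middle for even lengths (zero-based).
    return frames[len(frames) // 2]


# Seven frames numbered 6-12 (odd count): the middle frame is selected
print(midpoint_frame(list(range(6, 13))))   # 9
# Six frames numbered 1-6 (even count): first frame after the middle
print(midpoint_frame([1, 2, 3, 4, 5, 6]))   # 4
```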

Figure 3 A sample waveform and spectrogram of [sɹ̩] in Hefei Mandarin, with five frames (Frames 1–5) during the consonant period and seven frames (Frames 6–12) during the vocalic period. Of the 12 frames, #1, #3, #5 are presented for the consonant part and #6, #8, #10, #12 for the vocalic part. The tongue dorsum is on the left, and the tongue front is on the right.

Figure 4 The third of the five frames extracted during the articulation of the apical segment [ɹ̩] after an alveolar fricative [s], when the vocalic articulation reached its maximal constriction.

To assess the tongue shapes of the apical vowels and the vowel /i/, a Smoothing Spline ANOVA (SSANOVA) in polar coordinates was adopted as the statistical procedure, using the R code tongue_ssanova.r by Mielke (2017). Polar coordinates were adopted as they are expected to be more appropriate than Cartesian coordinates for comparisons involving tongue retraction or advancement, especially in vowels (Mielke 2015). The horizontal coordinate x and the vertical coordinate y of each point in the traces were converted from Cartesian coordinates into polar coordinates with the angular coordinate θ and the radial coordinate r. The origin was determined according to the x and y values of the highest point and the lowest point of all the traces in the samples: the x coordinate of the origin was the x value of the highest point of the traces, while the y coordinate was 1% less than the y value of the lowest point of the traces. The x and y values used in the polar conversion were the differences between each point in the trace and the origin. The SSANOVA does not return an F value; instead, the smoothing parameters of the components of the equation are compared to determine their relative contributions (Gu 2002, Stone 2005, Davidson 2006). Using this method, smoothing splines that best fitted the five repetitions for the stimuli were obtained. Analyses were conducted with the R statistical package version 3.1.2 (R Core Team 2014). Figures were created using the ggplot2 package in R (Wickham 2009).
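The coordinate conversion described above can be illustrated with a short sketch. This is not the code of tongue_ssanova.r; it is written in Python for illustration, the function names are hypothetical, and it assumes that y values are positive and increase upward, so that "1% less than the lowest y value" is taken to mean 0.99 times that value.

```python
import math


def polar_origin(traces):
    """Compute the origin from all traces (each trace is a list of (x, y)).

    x of the origin: x value of the highest point across all traces.
    y of the origin: 1% less than the y value of the lowest point
    (interpreted here as 0.99 * y_min, an assumption of this sketch).
    """
    points = [p for trace in traces for p in trace]
    highest = max(points, key=lambda p: p[1])
    lowest = min(points, key=lambda p: p[1])
    return (highest[0], lowest[1] * 0.99)


def to_polar(point, origin):
    """Convert one (x, y) point to (theta, r) relative to the origin."""
    dx = point[0] - origin[0]   # difference between point and origin in x
    dy = point[1] - origin[1]   # difference between point and origin in y
    theta = math.atan2(dy, dx)  # angular coordinate in radians
    r = math.hypot(dx, dy)      # radial coordinate
    return (theta, r)


# A toy trace with made-up coordinates, for illustration only
traces = [[(10.0, 20.0), (30.0, 50.0), (60.0, 25.0)]]
origin = polar_origin(traces)
print(origin)
print([to_polar(p, origin) for p in traces[0]])
```

Placing the origin just below the lowest point keeps every trace point above it, so that the angle θ varies monotonically along a typical tongue contour from the root side to the tip side.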

In the analysis of lip gestures, the video recordings and the audio recordings were examined to find the time points corresponding to the midpoints of the apical vowels. With the limited number of speakers and the relatively small amount of video recording for lip gestures, visual inspection was adopted to generalize the qualitative property of the lip gestures when producing the apical vowels.

3 Results

In this section, we present the articulatory properties of the apical vowels and relevant comparisons following the order of the four research questions in Section 1.2: the tongue gestures involved in the production of the three apical vowels vs. the vowel [i] (Section 3.1), the lip gestures of the three apical vowels (Section 3.2), the tongue gestures involved in the production of the onset vs. vowel in the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩] (Section 3.3), and the tongue gestures of [ɹ̩] after homorganic vs. non-homorganic consonants (Section 3.4).

3.1 Tongue gestures of the apical vowels and the vowel [i]

Figure 5 presents the smoothing spline estimates of the tongue gestures associated with the production of the vowels [ɹ̩] (solid line), [ɹ̩ʷ] (dashed line), [ɻ̩] (dotted line) and those of the vowel [i] (dash-dotted line) by the five speakers, with 95% Bayesian confidence intervals, based on five repetitions of each syllable. Across the female and male speakers, the strategies to produce [i] appeared to be relatively invariant while those of the three apical vowels showed some commonalities as well as certain variations, especially in the tongue blade region. First, the most obvious difference between [i] and the apical vowels is that the latter three have a clearly more retracted tongue root or tongue dorsum, although to different degrees. For example, the most retracted tongue root occurred in [ɻ̩] (dotted line) in speakers F01, M02, F03, while it occurred in [ɹ̩ʷ] (dashed line) in speakers M01 and F02. Thus, tongue root retraction is likely to be a defining articulatory property of Hefei Mandarin apical vowels, in contrast to [i]. Second, as compared with [i], the three apical vowels had a lower tongue body and generally a lower tongue blade, with the exception of F01, for whom [ɹ̩] (solid line) and [ɹ̩ʷ] (dashed line) had a slightly more raised tongue blade, and of M02, for whom [ɻ̩] (dotted line) had a more raised tongue blade. Third, the vowel [i] (dash-dotted line) was articulated with an obvious front bunching while [ɹ̩] [ɹ̩ʷ] [ɻ̩] were generally not as front-bunched as [i] across the five speakers, as revealed by the separation of [ɹ̩] [ɹ̩ʷ] [ɻ̩] on the left and [i] on the right. Fourth, the constriction location of [i] was closer to the hard palate while those of the apical vowels were at a more anterior position. It is likely that the tongue tip, which was not visible in the images, was involved in making the constriction of apical vowels along with the tongue blade.

Figure 5 Smoothing spline estimates of the curves with 95% confidence intervals of the vowels [i] (purple), [ɹ̩] (red), [ɹ̩ʷ] (green), and [ɻ̩] (blue) across the five speakers, each line based on 15 tokens (three syllables × five repetitions).

The alveolar apical vowel [ɹ̩] (solid line) generally had a lowered tongue body and a more front tongue root as compared with the other two apical vowels, with the exception in F01 and M02. For F01, the tongue root of [ɹ̩] overlapped with that of [ɹ̩ʷ] (dashed line) and, for M02, it was slightly more retracted than [ɹ̩ʷ]. As a special case, the [ɹ̩] (solid line) of speaker F01 had a tongue concavity, which was not obvious in other speakers. The tongue blade of the vowel [ɹ̩] (solid line) was generally lower than [ɻ̩] (dotted line) with the exception of F01 and M01.

The alveolar apical [ɹ̩ʷ] is recognized in the literature as a rounded counterpart of the unrounded [ɹ̩], which usually concerns the gestures of lips (to be detailed in Section 3.2). As shown in Figure 5, [ɹ̩ʷ] (dashed line) differed from [ɹ̩] (solid line) in having a more retracted tongue root in speakers F02, F03, and M01; for speakers F01 and M02, [ɹ̩ʷ] and [ɹ̩] generally overlapped in the tongue root area with M02’s [ɹ̩ʷ] even a bit more front than [ɹ̩]. Close to the area of the alveolar ridge, the rounded [ɹ̩ʷ] involved a reduced degree of constriction, as compared with [ɹ̩], which is most obvious in F03 and M01.

The retroflex apical [ɻ̩] (dotted line) generally had a higher tongue body and a more retracted tongue root than [ɹ̩], with the exception of M01, whose [ɻ̩] had a more front tongue root than [ɹ̩]; for speakers F02, F03, and M02, the tongue root of [ɻ̩] had a similar degree of retraction to that of [ɹ̩ʷ]. Focusing on the tongue blade, a raising gesture in [ɻ̩] could be observed for speakers F01, F02, and M02, while no obvious raising was observed for speakers F03 and M01. Close to the alveolar ridge region, the retroflex apical [ɻ̩] showed a reduced degree of constriction across the five speakers, which was most obvious for M02 and similar to [ɹ̩ʷ] for F02.

3.2 Lip gestures of the apical vowels

To examine the lip gestures associated with the three apical vowels, the images corresponding to the temporal midpoints of the vowels were obtained from the video recording, referring to the corresponding audio recording. A visual inspection showed that the speakers were generally consistent in their lip gestures when producing the three apical vowels respectively. Below, Figure 6 gives an illustration of the lip gestures by a female speaker (F01) and sample lip gestures of the other speakers are provided in appendix Figure A1.

Figure 6 Lip gestures in producing the three apical vowels by a female speaker (F01) as in the syllables [sɹ̩213], [sɹ̩ʷ213], and [ʂɻ̩213].

When producing the unrounded alveolar [ɹ̩], the speakers consistently had unrounded lips with a moderate aperture; when producing the rounded alveolar [ɹ̩ʷ], in contrast, the speakers generally had a clear rounding of their lips with a small aperture. As shown in Figure A1, the lip gestures for [ɹ̩ʷ] of speakers F03 and M01 were similar to that of F01 as shown in Figure 6, and those of F02 and M02 involved an even stronger rounding than F01. This is consistent with the impressionistic description of the two apical vowels in the literature as an unrounded vowel and a rounded vowel respectively (Hou 2007).

When producing the retroflex apical vowel [ɻ̩], the speakers generally had a larger vertical aperture than for [ɹ̩]. A lip protrusion (out-rounding) was generally involved for [ɻ̩] across the speakers, which was the most obvious for F02 and the least obvious for M01, as shown in Figure A1. The out-rounding in the apical [ɻ̩] of Hefei Mandarin is reminiscent of similar gestures in English rhotic segments and postalveolar obstruents, which is presumably an enhancement to lower F3 or center of gravity (King & Ferragne 2019, Smith et al. 2019).

Across the five speakers, the lip gestures for the three apical vowels generally differed from each other the same way as in speaker F01, although some speakers had a stronger lip rounding when producing [ɹ̩ʷ] or a stronger lip protrusion when producing [ɻ̩].

3.3 Onset consonants vs. vowels in the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩]

To examine the potential articulatory difference between the onset consonants and vowels in the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩], the tongue gestures across the syllables were extracted, as illustrated in Figure 7 using the data of a female speaker (F01). On average, five to seven frames were obtained for a consonant and seven to nine frames for a vowel. For each speaker, Figure 8 presents the smoothing spline estimates of the curves of the onset consonants (dashed line) and the vowels (solid line) in the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩], each based on five tokens, with the palate traces imposed. The curves of the onset consonants and vowels were modeled respectively from the middle points of the segments.

Figure 7 Ultrasound images of [sɹ̩], [sɹ̩ʷ] and [ʂɻ̩] by a female speaker (F01). The tongue dorsum is on the left and the tongue blade is on the right. Frame #1 corresponds to the start of a consonant, Frame #7 to the onset of a vocalic segment, and Frame #13 to its offset.

Figure 8 Smoothing spline estimates of the curves of the onset consonants (blue line) and vowels (red line) in the syllables [sɹ̩], [sɹ̩ʷ] and [ʂɻ̩] for the five speakers, each line based on five tokens, modeled from the midpoints of the consonants and those of the vowels respectively. The grey lines indicate the palate traces. The tongue body is on the left and the tongue blade on the right. The traces for each speaker used in the SSANOVA splines are provided in appendix Figure A2.

As shown in Figure 8, across the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩], the onset consonants and the vowels differed in tongue position, to varying degrees across the speakers. Specifically, the transition from the onset /s/ and /ʂ/ to the apical vowels involved a lowering of the tongue dorsum/blade and a retraction of the tongue root across the speakers. The exceptions to this pattern were the [sɹ̩ʷ] and [ʂɻ̩] by M01, for which the smoothing spline estimates showed largely overlapping curves of the onset consonants and the vowels, and the [sɹ̩] by F03, for which the tongue root was less retracted in the vowel.

In the syllable [sɹ̩], the onset [s] had a slightly higher tongue blade than the apical vowel [ɹ̩] across the speakers, with the smallest difference in F03, whose [s] and [ɹ̩] overlapped in the tongue blade. This is consistent with the recognition that the onset sibilant [s] has a tighter constriction, and thus more frication noise, than its following vocalic [ɹ̩]. In terms of the tongue root, [s] was more front than [ɹ̩] in speakers F01 and F02, while the reverse seemed to be true for F03 and M02, indicating inter-speaker variation in this respect.

In the syllable [sɹ̩ʷ], the tongue body and tongue blade in the onset [s] were higher than those in the apical [ɹ̩ʷ], with the exception of M01, whose [s] and [ɹ̩ʷ] generally overlapped in the tongue blade. The tongue root of the vocalic [ɹ̩ʷ] was more retracted than that of the onset [s], which was also observed in the syllable [sɹ̩] as stated above. Across the five speakers, the tongue dorsum was generally more raised in [ɹ̩ʷ] than in [s]. In addition, [ɹ̩ʷ] also involved a concavity curve as compared with [s], more obvious in F01 and F03 than in M02, by which the tongue blade of [ɹ̩ʷ] was a bit more distant from the palate than the onset [s].

In the syllable [ʂɻ̩], the onset [ʂ] and the vocalic [ɻ̩] differed in their gestures across the five speakers with diverse patterns in tongue blade, tongue body, or tongue root. More specifically, [ʂ] had a higher tongue body than [ɻ̩], which was more obvious in F01 than in F02; [ʂ] had a more front tongue dorsum than [ɻ̩] in F01, F03 and M02; [ʂ] had a slightly more retracted tongue root than [ɻ̩] in M01, for whom the two almost overlapped in the tongue blade. The relative diversity in the tongue gestures of [ɻ̩] presumably indicates that, for speakers of Hefei Mandarin, with lip protrusion in position, the tongue gesture has more flexibility as long as it differs from the onset [ʂ] with a higher tongue blade or a less retracted tongue dorsum.

In general, the above shows different tongue gestures in the onset consonants and the vowels of the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩], with [s] vs. [ɹ̩] differing primarily in tongue blade and tongue body, [s] vs. [ɹ̩ʷ] mainly in tongue body and tongue root, and [ʂ] vs. [ɻ̩] with diverse patterns. This is reminiscent of the adjustment to tongue position observed in the apical vowels of other Chinese dialects, in terms of dorsum or blade lowering, relative to the homorganic onset sibilant (Lee-Kim 2014, Faytak & Lin 2015). Put differently, the three apical vowels in Hefei Mandarin seem to involve articulatory gestures that are not necessarily the same as in their preceding consonants. In addition, again, it is possible that the articulation of the apical vowels and the onset consonants in the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩] involves the tongue tip together with the tongue blade. Given the limitation of ultrasound imaging, such articulatory details are not clearly visible, and this may explain some of the inter-speaker variation discussed above.

Figure 9 SSANOVA splines for the apical [ɹ̩] across five speakers, each line based on 10 tokens. The red solid lines represent the [ɹ̩] after the bilabial onsets /m/ and /p/ (abbreviated as B) and the blue dashed lines represent the [ɹ̩] after the alveolar fricatives /s/ and /z/ (abbreviated as F). The tongue body is on the left and the tongue blade is on the right. The traces for each speaker used in the SSANOVA splines are provided in appendix Figure A3.

3.4 Apical /ɹ̩/ after the alveolar and bilabial consonants

The alveolar apical vowel [ɹ̩] in Hefei Mandarin may appear after a homorganic alveolar sibilant, as in [sɹ̩213] ‘wash’, as well as after a non-homorganic bilabial consonant, as in [pɹ̩213] ‘to compare’ and [mɹ̩213] ‘rice’. An SSANOVA analysis was performed for the tongue gestures of the apical vowel [ɹ̩] over the time course of the vowel when it appears after the bilabial consonants, as in [pɹ̩213] [mɹ̩213], as opposed to when it appears after alveolar consonants, as in [sɹ̩213]. The results of SSANOVA splines are presented in Figure 9.

As Figure 9 shows, within a single speaker, the tongue gestures of the apical vowel [ɹ̩] were generally similar whether the preceding consonant was homorganic or non-homorganic, with the exception of M01. In other words, for a given speaker, the gesture of the [ɹ̩] after an alveolar fricative /s/ or /z/ (blue line) largely overlapped with that of the [ɹ̩] after a bilabial consonant /m/ or /p/ (brown line), despite slight discrepancies in the tongue blade or tongue body. The discrepancies differed across the speakers. Relative to the [ɹ̩] after a bilabial /m/ or /p/, the [ɹ̩] after an alveolar /s/ or /z/ involved a tongue blade raising (M01 and M02) or a tongue body raising (F02 and F03); it also involved a lesser degree of tongue root retraction (M01 and F03). In general, the data seem to suggest a relative intra-speaker articulatory uniformity in producing the apical [ɹ̩] in Hefei Mandarin across different consonantal contexts, whether the onset consonants are homorganic or non-homorganic.
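The notion of two splines "largely overlapping" can be made concrete with a small sketch. In SSANOVA comparisons of tongue contours, two curves are conventionally treated as differing only in regions where their confidence bands do not overlap. The code below assumes that the lower and upper bounds of each band have already been evaluated at the same positions along the tongue; the function name `band_overlap_fraction` and the numbers in the example are hypothetical and are not taken from the study's data.

```python
def band_overlap_fraction(
    lower_a: list[float],
    upper_a: list[float],
    lower_b: list[float],
    upper_b: list[float],
) -> float:
    """Fraction of sampled positions at which two confidence bands overlap.

    Each list holds one bound of a confidence band, evaluated at the same
    positions for both contexts. Two intervals overlap when neither lies
    entirely above the other.
    """
    n = len(lower_a)
    if n == 0 or not (len(upper_a) == len(lower_b) == len(upper_b) == n):
        raise ValueError("all bounds must be non-empty and of equal length")
    overlapping = sum(
        1
        for la, ua, lb, ub in zip(lower_a, upper_a, lower_b, upper_b)
        if la <= ub and lb <= ua
    )
    return overlapping / n


# Made-up bands at three positions: the curves differ only at the second one.
after_alveolar = ([0.0, 0.0, 0.0], [1.0, 1.0, 1.0])
after_bilabial = ([0.5, 2.0, 1.0], [1.5, 3.0, 2.0])
fraction = band_overlap_fraction(*after_alveolar, *after_bilabial)
```

A fraction close to 1 would correspond to the "largely overlapped" pattern described above, while lower values would point to localized differences such as those in the tongue blade or tongue root.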

4 General discussion

Apical vowels in Chinese dialects are generally recognized as vocalic segments homorganic to their preceding sibilants, with controversies arising as to whether these vowels have their own intrinsic articulatory gestures independent of the preceding consonants. In terms of articulatory gestures, some studies showed virtually identical gestures of an apical vowel and its preceding consonant, e.g. the apical [ɹ̩] in Suzhou Chinese (Faytak 2018). Other studies observed that an apical vowel may have different gestures relative to its preceding consonant, e.g. a slight lowering of the tongue body in [ɻ̩] in Mandarin Chinese (Lee-Kim 2014), a raising of tongue blade in [ɻ̩] in Mandarin Chinese (Chen et al. 2015), or a lowering of tongue dorsum in [ɹ̩] in Jixi Chinese (Shao 2020, Shao & Ridouane 2023). The observation in the current study about apical vowels in Hefei Mandarin is consistent with the view that an apical vowel may differ from its preceding consonant in its articulatory gesture. As reported in Section 3, the apical [ɹ̩] [ɹ̩ʷ] [ɻ̩] involved different degrees of tongue root retraction and tongue body lowering relative to the vowel [i], and their constrictions occurred at a more anterior position than that of [i]. Within a speaker, they each had a relatively consistent tongue gesture that might not necessarily be attributed to the influence of a preceding consonant. In particular, the tongue gestures in producing [ɹ̩], in terms of tongue root retraction or tongue dorsum lowering across the speakers, showed a relative articulatory similarity whether the vowel appeared after a homorganic or a non-homorganic consonant.

The results reported in Section 3 suggested that, for apical vowels in Hefei Mandarin, the retraction of tongue root or tongue dorsum is likely to be a defining articulatory property. This finding is reminiscent of reports by Lee-Kim (2014), Faytak & Lin (2015) and Huang, Hsieh & Chang (2021). For example, a slightly retracted tongue root was observed for [ɹ̩] and a slightly lowered tongue body for [ɻ̩] in Mandarin Chinese (Lee-Kim 2014); similarly, the tongue dorsum when producing [ɹ̩] and [ɻ̩] was reported to be as low and retracted as [a] (Faytak & Lin 2015). The results about Hefei Mandarin showed that, within the syllables [sɹ̩] [sɹ̩ʷ] [ʂɻ̩], the onset sibilants and the apical vowels differ to varying degrees in their tongue gestures, with the apical vowels involving a retracted tongue root, lowered tongue dorsum or blade, or both, relative to the respective onset fricatives. This is reminiscent of the observation of apical vowels in Jixi-Hui Chinese, i.e. a lower tongue dorsum in the apical vowel than in [s] on the mid-sagittal plane (Shao 2020), as well as in Mandarin Chinese, i.e. an articulatory change from a homorganic onset to an apical vowel (Chen et al. 2015). That being the case, it needs to be noted that other studies have also reported little displacement between a homorganic consonant and an apical vowel in Mandarin Chinese (Chen 2011, Faytak & Lin 2015) and Suzhou Chinese (Ling 2009, Faytak 2018). The similarities and differences between the results in our study and those in the literature suggest a future direction of cross-linguistic investigation of apical vowels. It is possible that the tongue gestures involved in the articulation of apical vowels in a particular language involve commonalities as well as idiosyncrasies. In addition, ultrasound data cannot provide detailed information on gestures involving the tongue tip, which might be relevant to the tongue gestures involved in the production of apical vowels. Such details might be revealed in future research with the use of other experimental methods such as EMA.

The unrounded apical [ɹ̩] and the rounded apical [ɹ̩ʷ], which differ in their lip gestures, contrast with each other in Hefei Mandarin. In terms of tongue gestures, [ɹ̩] generally has a more retracted tongue root while [ɹ̩ʷ] involves a reduced degree of constriction close to the alveolar ridge. Such a difference echoes previous studies showing that unrounded vs. rounded vowels may differ in their tongue gestures (Raphael et al. 1979, Wood 1986, Radisic 2014). For example, Wood (1986) observed that the rounded [y] has a tongue blade raising and a slightly lower tongue body relative to its unrounded counterpart. Radisic (2014) observed that unrounded vowels in Turkish are articulated higher than rounded ones in the front region while lower in the back region. Raphael et al. (1979) observed that front rounded vowels may involve a lower tongue height relative to their unrounded counterparts. Our results for different speakers are consistent with these points respectively. For example, for speakers M01 and F03, the unrounded [ɹ̩] was more front than the rounded [ɹ̩ʷ]; for the same two speakers, the unrounded [ɹ̩] was higher than the rounded [ɹ̩ʷ] in the more front region while the reverse was true in the more back region, which also held true for speaker F02. On the other hand, the tongue gestures of [ɹ̩] vs. [ɹ̩ʷ] by speakers F02, F03, and M01 differed from the pattern observed in Wood (1986) in that the rounded [ɹ̩ʷ] involved a higher tongue body than its unrounded counterpart [ɹ̩]. Across different speakers, the rounded vowel [ɹ̩ʷ] had a lower tongue body relative to the unrounded vowel in some cases, while it had a higher tongue body in other cases. This echoes the observations in Perkell et al. (1993) and Chen (2011) that some speakers’ rounded vowels were associated with a tongue body raising relative to unrounded vowels while others’ were associated with a tongue body lowering. Furthermore, for speaker F01, there seemed to be no obvious difference between the unrounded [ɹ̩] and the rounded [ɹ̩ʷ], as shown in their overlap in Figure 5.

5 Conclusion

This study examined the articulatory properties of apical vowels in Hefei Mandarin, which represents an under-studied type of language in terms of its inventory of apical vowels and their phonological status. While the apical vowels in languages such as Mandarin Chinese follow a homorganic consonant and are in complementary distribution, the apical vowels in Hefei Mandarin may follow non-homorganic consonants, thus allowing us to dissociate an apical vowel from a homorganic consonant onset and to examine its own articulatory gesture, in particular in terms of tongue position. Our results showed that an apical vowel involves distinct articulatory targets that may not be simply attributed to the influence of its preceding consonants, consistent with some observations in the literature. The commonalities in producing apical vowels in Hefei Mandarin include a retracted tongue root, lowered tongue dorsum or blade, or both, in addition to a coronal constriction implemented with the blade and/or the tip; lip gestures are also involved in distinguishing the apical segments. We would like to acknowledge that the observations in the current research were based on a relatively limited number of speakers, although the speakers are representative of Hefei Mandarin. Finally, as the focus of this study was on articulatory properties of apical vowels, we did not explore the connection between their articulation and acoustics – something that would be important to address in future work.

Acknowledgements

The authors would like to thank Professor Zhongmin Chen for his generous help. We are greatly indebted to the anonymous reviewers and the editors of Journal of the International Phonetic Association, whose comments led to great improvement of this research. The research is partly supported by Philosophical and Social Science Grant of Anhui Province (Grant No. AHSKY2019D099), Hong Kong Baptist University Faculty Research Grant (FRG) Category II [Grant No. FRG2/17-18/076], and Hong Kong Baptist University Research Committee’s Start-up Grant for New Academics.

Appendix

Figure A1 Sample lip gestures of the three apical vowels of speakers F02, F03, M01, M02.

Figure A2 The traces for each speaker used in the SSANOVA splines in Figure 8: onset fricatives (blue lines) vs. apical vowels (brown lines). The x-axis and the y-axis correspond to the horizontal coordinate and the vertical coordinate, respectively, both in millimeters.

Figure A3 The traces for each speaker used in the SSANOVA splines in Figure 9: the bilabial /m/ and /p/ (brown lines) vs. the alveolar /s/ and /z/ (blue lines). The x-axis and the y-axis correspond to the horizontal coordinate and the vertical coordinate, respectively, both in millimeters.

Footnotes

1 The phonemic status of the segment [ɹ̩/ɻ̩], in Mandarin Chinese for example, has been controversial: It was analyzed as an allophone of /i/ (Xu 1980), a different phoneme from /i/ (Cheng 1968), or an underlying unspecified segment (Lin 1989). In this paper, we focus on the phonetic segments [ɹ̩] and [ɻ̩] and use square brackets for the apical vowels and the relevant consonants and vowels.

2 For the retroflex consonants, the actual place of constriction is close to the post-alveolar [ʃ tʃ tʃʰ] (Kong et al., published online 15 July 2022). Following the convention in the Chinese literature, we represent them with the retroflex symbols.

3 In the Chinese literature on Hefei Mandarin, the three apical vowels were usually represented with the symbols [ɿ ʮ ʅ]. In this paper, we represent them with the IPA symbols [ɹ̩ ɹ̩ʷ ɻ̩], respectively, in view of their articulatory properties as reported in previous studies.

4 The historical development of the apical vowels in Hefei Mandarin is also special in their fronting as compared with their counterparts in other Chinese dialects, for which detailed discussions were made in Wu (1995), Zhu (2004), and Zhao (2007).

5 Unlike their counterparts in Suzhou Chinese, the apical vowels in Hefei Chinese (e.g. [ɹ̩]) have less frication noise as compared with the relevant voiced fricatives ([z]), as can be seen in Figure 1f. In this study, therefore, we transcribe the apical vowels as syllabic approximants.

6 The apical vowels in Hefei Mandarin have been reported to include audible fricative noise (Hou 2007, Kong et al. 2019, among others). While frication noise is a typical characteristic of fricatives, it is also frequently found in high vowels such as [i]. Following de Krom (1993) and Maniwa, Jongman & Wade (2009), Kong et al. (published online 15 July 2022) measured the harmonic-to-noise ratios (HNR) of the three apical vowels in Hefei Mandarin, and observed that [ɹ̩] and [ɻ̩] have lower HNR values (i.e. more noise) than /i/ while [ɹ̩ʷ] and /i/ are similar in HNR values.

7 Previous acoustic studies of apical vowels in Mandarin Chinese (Howie 1976, Zee & Lee 2004, Lee-Kim 2014) indicated that the alveolar unrounded apical [ɹ̩] is characterized by a low F2 and a high F3, whereas the retroflex unrounded apical [ɻ̩] is characterized by a mid F2 and a low F3.

8 Previous studies such as Stone (2005) indicated that, for an ultrasound study, younger speakers might be preferable as there is more moisture and less fat in the tissue of their oral cavities.

9 Due to limits in data collection, we were not able to include some syllables in the stimuli in Table 4 such as [pʰɹ̩], [tsɹ̩ʷ], [tsʰɹ̩ʷ], [zɹ̩ʷ], [tɕi], [tɕʰi], [tʂɻ̩], and [tʂʰɻ̩]. As apical vowels after consonants at the same place of articulation (e.g. [ʂ tʂ tʂʰ]) are generally similar, the stimuli in Table 4 are expected to be representative of the syllables involving the relevant apical vowels.

10 Lee-Kim (2014) used [x] as the syllable onsets after the target vowels in her ultrasound study of Mandarin apical vowels and her results showed no tongue dorsum raising, e.g. in the apical [ɻ̩] before [x], which indicated relatively minimal anticipatory coarticulation from [x] to its preceding vowel.

11 While there would be a better control if all the first syllables bear a high-level tone, it turned out to be impossible with the limitation of available combinations of segments and tones in Hefei Mandarin.

12 In this study, we focus on the midsagittal data excluding the coronal plane of the tongue, which has been shown to be usually difficult to position in field studies (Stone 2005).

13 We thank an anonymous reviewer for suggesting this point and the use of the r-package.

14 Studies of rhotics in English suggested that the degree of lip protrusion may depend on tongue shape, with more protrusion accompanying bunched tongue shapes than retroflex ones (Tiede et al. 2010; King & Ferragne 2019, 2020). Given the limited amount of data in this study, we do not discuss whether this is also the case for the apical vowels in Hefei Mandarin.

References

Bao, Huaiqiao. 1984. Putonghua danyuanyin de shengli jieshi [Physiological explanations for the monophthongs in Putonghua]. Zhongguo Yuwen [Studies of the Chinese language] 2, 117127.Google Scholar
Baron, Stephten. 1974. On the tip of many tongues: Apical vowels across Sino-Tibetan. Proceedings of the 7th International Conference on Sino-Tibetan Language and Linguistic Studies, Atlanta, GA, 18–19 October 1974.Google Scholar
Chao, Yuan-Ren. 1930. A system of tone letters. Le Maître Phonétique 45, 2427.Google Scholar
Chao, Yuan-Ren. 1934. The non-uniqueness of phonemic solutions of phonetic system. Reprinted in Joos, Martin (ed.), Readings in linguistics, 3rd edn., 38–54. New York: American Council of Learned Societies, 1963.Google Scholar
Chao, Yuan-Ren. 1968. A grammar of spoken Chinese. Berkeley, CA: University of California Press.Google Scholar
Chen, Yu. 2011. Jiyu chaoshengbo jiance de Hanyu Putonghua jichu yuanyin fayin de sheti yundong yanjiu [An investigation of tongue movement in the production of Mandarin basic vowels using ultrasound]. Ph.D. dissertation, Nankai University.Google Scholar
Chen, Yu, Jin Zhang, Yanting Chen, Liu, Licheng, Wei, Jianguo & Dang, Jianwu. 2015. An articulatory analysis of apical syllables in Standard Chinese. Proceedings of 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 123–127. 28–30 October 2015, Shanghai, China.CrossRefGoogle Scholar
Cheng, Chin-chuan. 1968. Mandarin phonology. Ph.D. dissertation, University of Illinois at Urbana–Champaign.Google Scholar
Davidson, Lisa. 2006. Comparing tongue shapes from ultrasound imaging using smoothing spline analysis of variance. The Journal of the Acoustical Society of America 120, 407415.CrossRefGoogle ScholarPubMed
De Krom, Guss. 1993. A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. Journal of Speech and Hearing Research 36(2), 254266.CrossRefGoogle ScholarPubMed
Duanmu, San. 2007. The phonology of Standard Chinese. Oxford: Oxford University Press.CrossRefGoogle Scholar
Epstein, Melissa & Stone, Maureen. 2005. The tongue stops here: Ultrasound imaging of the palate. The Journal of the Acoustical Society of America 118, 21282131.CrossRefGoogle ScholarPubMed
Faytak, Matthew. 2018. Articulatory uniformity through articulatory reuse: Insights from an ultrasound study of Sūzhōu Chinese. Ph.D. dissertation, University of California, Berkeley.CrossRefGoogle Scholar
Faytak, Matthew & Lin, Susan. 2015. Articulatory variability and fricative noise in apical vowels. Proceedings of the 18th International Congress of Phonetic Sciences (ICPhS XVIII), Glasgow, 10–14 August 2015.Google Scholar
Gu, Chong. 2002. Smoothing spline ANOVA models. New York: Springer.CrossRefGoogle Scholar
Hartman, Lawton M. 1944. The segmental phonemes of the Peiping dialect. Language 20, 2842.CrossRefGoogle Scholar
Hockett, Charles F. 1967. The quantification of functional load. Word 23, 320339.CrossRefGoogle Scholar
Hou, Chao. 2007. Hefei fangyan gaoyuanyin shiyan yanjiu [An experimental study of high vowels in Hefei dialect]. MA thesis, Nanjing Normal University.Google Scholar
Howie, John Marshall. 1970. Acoustic studies of Mandarin vowels and tones. Ph.D. dissertation, Indiana University.Google Scholar
Hu, Fang. 2005. A phonetic study of the vowels in Ningbo Chinese. Ph.D. dissertation, City University of Hong Kong.Google Scholar
Hu, Fang & Ling, Feng. 2019. Fricative vowels as an intermediate stage of vowel apicalization. Language and Linguistics 20(1), 114.Google Scholar
Huang, Jing, Hsieh, Feng-Fan & Chang, Yueh-Chin. 2021. A cross-dialectal comparison of apical vowels in Beijing Mandarin, Northeastern Mandarin and Southwestern Mandarin: An EMA and ultrasound study. Proceedings of Interspeech 2021, 3989–3993, Brno, Czech Republic, 30 August – 3 September 2021.Google Scholar
Iskarous, Khalil, Shadle, Christine H. & Proctor, Michael I.. 2011. Articulatory–acoustic kinematics: The production of American English /s/. The Journal of the Acoustical Society of America 129, 944954.CrossRefGoogle ScholarPubMed
King, Hannah & Ferragne, Emmanuel. 2019. The contribution of lip protrusion to Anglo-English /r/: Evidence from hyper-and non-hyperarticulated speech. Proceedings of Interspeech 2019, 33223326, Graz, Austria, 15–19 September 2019.Google Scholar
King, Hannah & Ferragne, Emmanuel. 2020. Loose lips and tongue tips: The central role of the /r/-typical labial gesture in Anglo-English. Journal of Phonetics 80, 100978.CrossRefGoogle Scholar
Kong, Huifang, Wu, Shengyi & Li, Mingxing. 2019. Hefeihua shejian yuanyin de moca xingzhi ji yuyin zengqiang lilun jiedu [The frication property of apical vowels in Hefei Mandarin and its perceptual enhancement account]. Yuyan Yanjiu [Studies in language and linguistics] 39(1), 2333.Google Scholar
Kong, Huifang, Wu, Shengyi & Li, Mingxing. Hefei Mandarin. Journal of the International Phonetic Association, doi:https://doi.org/10.1017/S0025100322000081. Published online by Cambridge University Press, 15 July 2022.Google Scholar
Ladefoged, Peter & Maddieson, Ian. 1996. The sounds of the world’s languages. London: Wiley Blackwell.Google Scholar
Lee, Chao-Yang & Li, Zhiqiang. 2003. Enhancement of phonological contrast: Acoustics of apical and retroflex vowels in Mandarin Chinese. Poster presentation at 34th Annual Meeting of the North-Eastern Linguistic Society (NELS 34), Stony Brook, NY, 13 September 2003.Google Scholar
Lee, Wai-Sum & Zee, Eric. 2003. Standard Chinese. Journal of the International Phonetic Association 33(1), 109112.CrossRefGoogle Scholar
Lee, Wai-Sum & Zee, Eric. 2017. Apical vowels. In Sybesma, Rint (ed.), Encyclopedia of Chinese language and linguistics, 169172. Leiden: Brill.Google Scholar
Lee-Kim, Sang-Im. 2014. Revisiting Mandarin ‘apical vowels’: An articulatory and acoustic study. Journal of the International Phonetic Association 44(3), 261282.CrossRefGoogle Scholar
Li, Jingling. 1997. Hefeihua yindang [The phonetic files of Hefei dialect]. Shanghai: Shanghai Educational Press.Google Scholar
Li, Min, Kambhamettu, Chandra & Stone, Maureen. 2005. Automatic contour tracking in ultrasound images. Clinical Linguistics and Phonetics 19(6–7), 545554.CrossRefGoogle ScholarPubMed
Li, Mingxing. 2017. Sibilant contrast: Perception, production, and sound change. Ph.D. dissertation, The University of Kansas.Google Scholar
Lin, Tao & Wang, Lijia. 1992. Yuyinxue jiaocheng [An introduction to phonetics]. Beijing: Peking University Press.Google Scholar
Lin, Yen-Hwei. 1989. Autosegmental treatment of segmental processes in Chinese phonology. Ph.D. dissertation, The University of Texas at Austin.Google Scholar
Ling, Feng. 2009. A phonetic study of the vowel system in Suzhou Chinese. Ph.D. dissertation, City University of Hong Kong.Google Scholar
Luo, Shan. 2020. Articulatory tongue shape analysis of Mandarin alveolar–retroflex contrast. The Journal of the Acoustical Society of America 148(4), 19611977.CrossRefGoogle ScholarPubMed
Maniwa, Kazumi, Jongman, Allard & Wade, Travis. 2009. Acoustic characteristics of clearly spoken English fricatives. The Journal of the Acoustical Society of America 125(6), 39623973.CrossRefGoogle ScholarPubMed
Meng, Qinghui. 1962. Anhuisheng fangyan bianzheng [A guide to Anhui dialects]. Hefei: Anhui People Press.Google Scholar
Meng, Qinghui. 1997. Anhuisheng fangyan zhi [A record of Anhui dialects]. Beijing: Fangzhi Press.Google Scholar
Mielke, Jeff. 2015. An ultrasound study of Canadian French rhotic vowels with polar smoothing spline comparisons. The Journal of the Acoustical Society of America 137, 28582869.CrossRefGoogle ScholarPubMed
Mielke, Jeff. 2017. tongue_ssanova.r (r-code package for SSANOVA comparisons of tongue traces in polar coordinates using gss). https://phon.chass.ncsu.edu/manual/tongue_ssanova.r, 1 May 2021.Google Scholar
Perkell, Joseph, Matthies, Melanie L., Svirsky, Mario A. & Jordan, Michael I.. 1993. Trading relations between tongue-body raising and lip rounding in production of the vowel /u/: A pilot “motor equivalence” study. The Journal of the Acoustical Society of America 93, 29482961.CrossRefGoogle Scholar
R Core Team. 2014. R: A language and environment for statistical computing. http://www.r-project.org, 23 June 2018.Google Scholar
Radisic, Milica. 2014. An ultrasound and acoustic study of Turkish rounded/unrounded vowel pairs. Ph.D. dissertation, University of Toronto.Google Scholar
Raphael, Lawrence J., Bell-Berti, Fredericka, Collier, René & Baer, Thomas. 1979. Tongue position in rounded and unrounded sound pairs. Language and Speech 22, 3748.CrossRefGoogle Scholar
Shao, Bowei. 2020. The apical vowel in Jixi-Hui Chinese: Phonology and phonetics. Ph.D. dissertation, Université Sorbonne Nouvelle.Google Scholar
Shao, Bowei & Ridouane, Rachid. On the nature of apical vowel in Jixi-Hui Chinese: Acoustic and articulatory data. Journal of the International Phonetic Association, doi:10.1017/S0025100322000196. Published online by Cambridge University Press, 19 January 2023.Google Scholar
Smith, Bridget, Mielke, Jeff, Magloughlin, Lyra & Wilbanks, Eric. 2019. Sound change and coarticulatory variability involving English /ɹ/. Glossa: A Journal of General Linguistics 4(1), 63.Google Scholar
Stone, Maureen. 2005. A guide to analysing tongue motion from ultrasound images. Clinical Linguistics and Phonetics (19), 455501.CrossRefGoogle ScholarPubMed
Tabain, Marija & Beare, Richard. 2018. An ultrasound study of coronal places of articulation in Central Arrernte: Apicals, laminals and rhotics. Journal of Phonetics 66, 6381.CrossRefGoogle Scholar
Tiede, Mark K., Boyce, Suzanne E., Espy-Wilson, Carol & Gracco, Vincent L.. 2010. Variability of North American English /r/ production in response to palatal perturbation. In Maassen, Ben & van Lieshout, Pascal (eds.), Speech motor control: New developments in basic and applied research, 5368. Oxford Scholarship Online.CrossRefGoogle Scholar
Westerberg, Fabienne. 2016. An auditory, acoustic, articulatory and sociophonetic study of Swedish viby-i. MA thesis, University of Glasgow.Google Scholar
Wickham, Hadley. 2009. ggplot2: Elegant graphics for data analysis. New York: Springer.CrossRefGoogle Scholar
Wood, Sidney. 1986. The acoustical significance of tongue, lip, and larynx maneuvers in rounded palatal vowels. The Journal of the Acoustical Society of America 80(2), 391401.CrossRefGoogle ScholarPubMed
Wu, Wei. 1995. Hefeihua “-i” “-y” yinjie shengyunmu qianhua tantao “-i” “-y” [On the fronting of consonants and vowels in Hefei Mandarin ‘-i’, ‘-y’ syllables]. Yuwen Yanjiu [Language and literature study] 3, 5860+21.Google Scholar
Xu, Shirong 1980. Putonghua yuyin zhishi [Phonetics of Beijing Mandarin]. Beijing: Wenzi Gaige Press.Google Scholar
Zee, Eric & Lee, Wai-Sum. 2004. The apical sounds in Beijing Mandarin. In Lu, Jilun & Wang, Jialing (eds.), Xiandai yuyinxue yu yinxixue yanjiu [Studies on modern phonetics and phonology], 104–110. Tianjin: Tianjin Shehui Kexue Yuan Press.Google Scholar
Zhao, Rixin. 2007. Hanyu fangyan zhong de [i] > [ɿ] [i] > [ɿ] [The sound change of [i] > [ɿ] in Chinese dialects]. Zhongguo Yuwen [Studies of the Chinese language] 1, 4654.Google Scholar
Zhou, Dianfu & Wu, Zongji. 1963. Putonghua fayin tupu [Articulatory diagrams of Standard Chinese]. Beijing: Shangwu yinshuguan.
Zhu, Xiaonong. 2004. Hanyu yuanyin de gaoding chuwei [Sound changes of high vowels in Chinese dialects]. Zhongguo Yuwen [Studies of the Chinese language] 5, 440–451.
Table 1 Phonotactics of the apical vowels [ɹ̩/ɻ̩] and [i] in Mandarin Chinese.

Table 2 Phonotactics of apical vowels and [i] in Hefei Mandarin.

Figure 1 Waveforms and spectrograms of the syllables [sɹ̩213] 洗 ‘to wash’, [sɹ̩ʷ213] 许 ‘to allow’, [ʂɻ̩213] 矢 ‘arrow’, [pɹ̩213] 比 ‘to compare’, [tsɹ̩213] 挤 ‘to squeeze’, and [zɹ̩213] 礼 ‘gift’ in Hefei Mandarin produced by a male speaker (M02).

Table 3 Mean F1 and F2 values of the three apical vowels in Hefei Mandarin in comparison with the apical vowels in Mandarin Chinese (Lin & Wang 1992, Hou 2007).

Table 4 The stimuli used in the ultrasound study of Hefei Mandarin.

Figure 2 The ultrasound stabilization helmet and ultrasound probe as fit to a participant.

Figure 3 A sample waveform and spectrogram of [sɹ̩] in Hefei Mandarin, with five frames (Frames 1–5) during the consonant period and seven frames (Frames 6–12) during the vocalic period. Of the 12 frames, #1, #3, #5 are presented for the consonant part and #6, #8, #10, #12 for the vocalic part. The tongue dorsum is on the left, and the tongue front is on the right.

Figure 4 The third of the five frames extracted during the articulation of the apical segment [ɹ̩] after an alveolar fricative [s], when the vocalic articulation reached its maximal constriction.

Figure 5 Smoothing spline estimates of the curves with 95% confidence intervals of the vowels [i] (purple), [ɹ̩] (red), [ɹ̩ʷ] (green), and [ɻ̩] (blue) across the five speakers, each line based on 15 tokens (three syllables × five repetitions).

Figure 6 Lip gestures in producing the three apical vowels by a female speaker (F01) as in the syllables [sɹ̩213], [sɹ̩ʷ213], and [ʂɻ̩213].

Figure 7 Ultrasound images of [sɹ̩], [sɹ̩ʷ] and [ʂɻ̩] by a female speaker (F01). The tongue dorsum is on the left and the tongue blade is on the right. Frame #1 corresponds to the start of the consonant, Frame #7 to the onset of the vocalic segment, and Frame #13 to its offset.

Figure 8 Smoothing spline estimates of the curves of the onset consonants (blue line) and vowels (red line) in the syllables [sɹ̩], [sɹ̩ʷ] and [ʂɻ̩] for the five speakers, each line based on five tokens, modeled from the midpoints of the consonants and those of the vowels respectively. The grey lines indicate the palate traces. The tongue body is on the left and the tongue blade on the right. The traces for each speaker used in the SSANOVA splines are provided in appendix Figure A2.

Figure 9 SSANOVA splines for the apical [ɹ̩] across five speakers, each line based on 10 tokens. The red solid lines represent the [ɹ̩] after the bilabial onsets /m/ and /p/ (abbreviated as B) and the blue dashed lines represent the [ɹ̩] after the alveolar fricatives /s/ and /z/ (abbreviated as F). The tongue body is on the left and the tongue blade is on the right. The traces for each speaker used in the SSANOVA splines are provided in appendix Figure A3.

Figure A1 Sample lip gestures of the three apical vowels of speakers F02, F03, M01, M02.

Figure A2 The traces for each speaker used in the SSANOVA splines in Figure 8: onset fricatives (blue lines) vs. apical vowels (brown lines). The x-axis and the y-axis correspond to the horizontal coordinate and the vertical coordinate, respectively, both in millimeters.

Figure A3 The traces for each speaker used in the SSANOVA splines in Figure 9: the bilabial /m/ and /p/ (brown lines) vs. the alveolar /s/ and /z/ (blue lines). The x-axis and the y-axis correspond to the horizontal coordinate and the vertical coordinate, respectively, both in millimeters.