Hostname: page-component-586b7cd67f-r5fsc Total loading time: 0 Render date: 2024-11-23T19:24:35.110Z Has data issue: false hasContentIssue false

Effect of Crianza Positiva e-messaging program on adult–child language interactions

Published online by Cambridge University Press:  06 September 2021

Ana Balsa*
Affiliation:
Department of Economics, Universidad de Montevideo, Montevideo, Uruguay
Florencia López Boo
Affiliation:
Department of Social Protection, Inter-American Development Bank, Washington, DC, USA
Juanita Bloomfield
Affiliation:
Department of Economics, Universidad de Montevideo, Montevideo, Uruguay
Alejandrina Cristia
Affiliation:
Laboratoire de Sciences Cognitives et Psycholinguistique, Département d’études cognitives, ENS, EHESS, CNRS, PSL University, Paris, France
Alejandro Cid
Affiliation:
Department of Economics, Universidad de Montevideo, Montevideo, Uruguay
María de la Paz Ferro
Affiliation:
Department of Social Protection, Inter-American Development Bank, Washington, DC, USA
Rosario Valdés
Affiliation:
Fundación Fe y Alegría, Montevideo, Uruguay
María del Luján González
Affiliation:
Department of Psychology, Universidad Católica del Uruguay, Montevideo, Uruguay
*
*Correspondence to: E-mail: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

We assess the effects of the Crianza Positiva text and audio e-messaging program on caregiver–child language interaction patterns. The program is a six-month-long intervention for families with children aged 0–2 aimed at strengthening parental competences. Its design exploits behavioral tools such as reminders, suggestions of action, and messages of encouragement to reinforce and sustain positive parenting practices. Families in 24 early childhood centers in Uruguay that completed an eight-week workshop were randomized into receiving or not receiving mobile messages. After the program, we videotaped 10-minute sessions of free play between the caregiver and the child, and decoded language patterns using automated techniques. The intervention was successful at improving the quality of parental vocalizations, as measured by the parent's pitch range. We also found suggestive evidence of increases in the duration of adult vocalizations. The results are consistent with more frequent parental self-reported involvement in reading, telling stories, and describing things to the child. Regarding the child, we find a nonrobust decrease in the duration of vocalizations, which we attribute to a crowding-out effect by the caregiver in the context of a fixed 10-minute suggested activity and a more proactive parental role.

Type
Article
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press

Introduction

Wide variability in language development is present from very early in life, as early as eight months (Fenson et al., Reference Fenson, Dale, Reznick, Bates, Thal and Pethick1994). Early experiences are among the most important predictors of language development, along with maternal education. Research has found that parental interventions that target caregiver's responsiveness and communication patterns are effective in changing parental behavior and promoting language development, with some reports of long-term effects on academic achievement and economic outcomes in adulthood (Gertler et al., Reference Gertler, Heckman, Pinto, Zanolini, Vermeerch, Walker, Chang and Grantham-McGregor2014). However, these programs are, in many cases, resource-intensive and difficult to scale up.

In this article, we evaluate the effects of a low-cost e-messaging intervention directed at parents with children below the age of 2 on the caregiver–child dyad communication patterns. The Crianza Positiva e-messaging intervention consists of a series of text and audio messages delivered to the mobile phones of families three times a week for a period of six months. The messages were designed to complement a parenting workshop of eight weekly sessions. Both the workshop and the messages are organized around four parental competences: attachment bond, protection, stimulation, and reflective function. The messages include reminders, suggestions of simple actions, and messages of encouragement to reinforce and sustain positive parenting practices. Behavioral economics tools have demonstrated large benefits in diverse areas of public policy and are increasingly being used in policies targeting families with children (Gennetian et al., Reference Gennetian, Darling and Aber2016; Mayer et al., Reference Mayer, Kalil, Oreopoulos and Gallegos2018; Ajzenman & Lopez Boo, Reference Ajzenman and Lopez Boo2019).

The e-messaging intervention was randomized across 24 early childhood centers in Uruguay that had previously implemented the Crianza Positiva workshop (529 families). All treated and control families participated in the workshop, but only those in the treatment received mobile messages. Bloomfield et al. (Reference Bloomfield, Balsa and Cid2020) analyze the impact of the Crianza Positiva e-messaging program on parental self-reported investment and find that the intervention increased the frequency of parental involvement with the child in social, stimulating, and physical activities by 0.24 SD. It also increased parents’ investment quality as measured by a positive parenting index and by an index of outreach for social support.

In this article, we further contribute to the assessment of the program by relying on objective, rather than self-reported, outcomes to study parent–child language interactions. Language development is an important part of the Crianza Positiva curriculum: about 20% of the messages focus explicitly on promoting parents’ involvement in early literacy practices, while the rest seek to improve parental sensitivity and responsiveness, household environment, and the quality of the caregiver–child interactions, all features indirectly associated with better communication patterns and more successful language development (Aboud et al., Reference Aboud, Singla, Nahil and Borisova2013). Our analysis of language interactions relies on the automated decoding of a 10-minute session of free play between the caregiver and the child, videotaped around three months after the e-messaging intervention ended. The use of externally assessed outcomes mitigates usual problems of self-reports, such as desirability bias.

Our findings show that the intervention was successful at influencing parental language cues: it increased the duration of caregiver vocalizations and improved the quality of vocalizations with an increase of the adult's pitch range. We did not find clear evidence of positive benefits in child outcomes, which may suggest limitations in short free play sessions for assessing language and cognitive development.

Our work contributes to a nascent literature exploiting audio- and text-based communication and behavioral economic insights to boost early literacy development. We believe this is one of the first articles to assess the effectiveness of a large-scale e-messaging intervention using automatic decoding of videotapes to infer language and communication patterns. Our results demonstrate the potential of these very low-cost interventions, based on mobile technology and on the understanding of behavioral biases, for changing parental behavioral patterns and ultimately improving child development, particularly when interventions are conducted as a complement to an initial face-to-face intervention with the families.

The article is structured as follows. The section ‘Background’ provides background literature on interventions that promote language development. The section ‘The Crianza Positiva text and audio messaging intervention’ describes the Crianza Positiva messaging intervention. The section ‘Methodology’ provides an overview of the methodology used. The section ‘Results’ reports our results and in the section ‘Discussion and conclusions’, we discuss our findings.

Background

Language development in early childhood

The importance of language development during early childhood is well known. It predicts the latter results during youth, including literacy skills and school success (Zimmerman et al., Reference Zimmerman, Gilkerson, Richards, Christakis, Xu, Gray and Yapanel2009; Rowe et al., Reference Rowe, Raudenbush and Goldin-Meadow2012). It has also been associated with adulthood outcomes, with significant correlations between language difficulties early in life and lower literacy, poor mental health, and higher unemployment rates (Law et al., Reference Law, Rush, Schoon and Parsons2009). A majority of this research has been carried out in North America, but the little evidence that exists from Latin American samples seems to align with the global trends we discuss next (e.g., Law et al., Reference Law, Rush, Schoon and Parsons2009, in Jamaica; Shneidman & Goldin-Meadow, Reference Shneidman and Goldin-Meadow2012, in Mexico).

Language development gaps appear early in life. One of the most important predictors of language development is early experiences, along with maternal education. Differences in the rate of language development depend on access to language and gestures from caregivers, and on infant's speech processing skills (Weisleder & Fernald, Reference Weisleder and Fernald2013). Language inputs and social interactions support the child's language development through opportunities where the child can interpret language and practice his/her emerging skills.

Studies have found a positive association between the quantity of language input and the child's vocabulary and speed of word recognition (Hurtado et al., Reference Hurtado, Marchman and Fernald2008; Shneidman & Goldin-Meadow, Reference Shneidman and Goldin-Meadow2012; Fernald et al., Reference Fernald, Marchman and Weisleder2013; Weisleder & Fernald, Reference Weisleder and Fernald2013; Mendive et al., Reference Mendive, Mascareño Lara, Aldoney, Pérez and Pezoa2020). Furthermore, when studying how the two above-mentioned factors work together, Weisleder and Fernald (Reference Weisleder and Fernald2013) find that the effect of child-directed speech quantity (language input) on vocabulary is mediated by infants’ language processing efficiency. Regarding language input, the evidence points to the conclusion that what matters most for language learning is infant-directed speech and responses contingent on their communicative signals or turn-taking (Snow & Ferguson, Reference Snow and Ferguson1977; Weisleder & Fernald, Reference Weisleder and Fernald2013; Bornstein et al., Reference Bornstein, Putnick, Cote, Haynes and Suwalsky2015; Romeo et al., Reference Romeo, Leonard, Robinson, West, Mackey, Rowe and Gabrieli2018), rather than speech overheard by the child (Weisleder & Fernald, Reference Weisleder and Fernald2013) or the sheer number of words (Romeo et al., Reference Romeo, Leonard, Robinson, West, Mackey, Rowe and Gabrieli2018). Caregiver infant-directed speech has been associated with an increased depth of processing in brain imaging studies (Zangl & Mills, Reference Zangl and Mills2007) and improved early language skills (Weisleder & Fernald, Reference Weisleder and Fernald2013; McGillion et al., Reference McGillion, Pine, Herbert and Matthews2017). In addition, studies have reported a positive association between the number of conversational turns, child language development (Zimmerman et al., Reference Zimmerman, Gilkerson, Richards, Christakis, Xu, Gray and Yapanel2009), and brain activity (Romeo et al., Reference Romeo, Leonard, Robinson, West, Mackey, Rowe and Gabrieli2018). A recent study by Rubio-Codina and Grantham-McGregor (Reference Rubio-Codina and Grantham-McGregor2020) shows that the Family Care Indicator (FCI), an index measuring the quality of the home environment in the first years of life, which assesses, among several other resources and caregiving activities, reading, looking at picture books, telling stories, and singing to the child, can predict IQ and academic performance when the child is in elementary school.Footnote 1

Language input has been linked to household socioeconomic status (SES). For the USA, Hart and Risley (Reference Hart and Risley1995) found that children in low-income households not only heard fewer words per hour from their caregiver than children in high-income households, but the content also varied greatly between groups. The difference they found meant that by age 4, children in high-income families will have been exposed to 30 million more words than children from low-income families. A more recent study found that associations between vocabulary and language processing efficiency, and SES, appear as early as 18 months, and that this association is pronounced by the time the child turns two (Fernald et al., Reference Fernald, Marchman and Weisleder2013). In Latin America, language disparities widen with age and are also associated with SES (Berlinski & Schady, Reference Berlinski, Schady, Berlinksy and Schady2015). There is also evidence from Uruguay on SES gradients on the home environment and the FCI (Lopez Boo et al., Reference Lopez Boo, Cubides Mateus, Sorio, Garibotto and Berón2018).

Other authors have reported large variance in language input and infants’ language development within households of homogeneous socioeconomic characteristics (Weisleder & Fernald, Reference Weisleder and Fernald2013), as well as associations over and above SES (Zimmerman et al., Reference Zimmerman, Gilkerson, Richards, Christakis, Xu, Gray and Yapanel2009; Romeo et al., Reference Romeo, Leonard, Robinson, West, Mackey, Rowe and Gabrieli2018), suggesting that SES by itself does not determine the infant language trajectory. Sperry et al. (Reference Sperry, Sperry and Miller2019) test Hart and Risley's (Reference Hart and Risley1995) (HR) findings using data from five communities in the USA. Their results do not support HR findings: they do not find a clear pattern between the number of words spoken by the primary caregiver to the child and SES. Even when testing for a broader definition of the child's verbal environment (all speech directed to the child and speech heard by the child), they still do not find a consistent relation between these measures and SES.

Either way, creating language-rich experiences for children is extremely important to prevent language disparities that continue throughout the life cycle. Parental interventions that target caregivers’ responsiveness and communication patterns have been found to be effective. For example, a six-month home-visiting program (3Ts: Tune In, Talk More, and Take Turns with the child) for low SES households in the USA that focuses on the home language environment had immediate positive effects on caregivers’ knowledge, the quantity of linguistic input, and the quality of caregiver interactions (Leung et al., Reference Leung, Hernandez and Suskind2020). Some aspects of the intervention that the researchers point out as being key to effectively changing parents’ behavior at home are video modeling (strategies are presented through true-to-life videos), concrete feedback (based on quantitative measures of linguistic input from LENAFootnote 2 recordings), and goal setting (goals are set in terms of the linguistic input measures collected from the previous week).

McGillion et al. (Reference McGillion, Pine, Herbert and Matthews2017) evaluate another language intervention that focuses on contingent talk and find immediate effects on caregivers. Households are visited twice in a two-week period. In the first visit, the importance of contingent talk is introduced, a short video with examples is presented, and caregivers are asked to set aside 15 minutes per day to practice what was learned. In the second visit, further questions are solved. Immediately after the end of the intervention, the study finds that parents assigned to the intervention engage in more contingent talk. However, a year later, no effects are found. This fade-out suggests the importance of guiding and providing the caregivers with more tools to effectively support the infant's language learning through the different stages of its development.

A recent set of studies evaluates the incorporation of technology in early childhood interventions that promote literacy skills. Doss et al. (Reference Doss, Fahle, Loeb and York2019) evaluate ‘Ready for K!’, a text-based intervention in the USA to help parents support their preschoolers’ development. The program increased parental involvement at home and at the school and improved child literacy skills. Doss et al. (Reference Doss, Fahle, Loeb and York2019) show that personalizing ‘Ready for K!’ messages increases the likelihood that parents read to their children by 50%. Hurwitz et al. (Reference Hurwitz, Lauricella, Hanson, Raden and Wartella2015) and Meuwissen et al. (Reference Meuwissen, Giovanelli, Labella and Susman-Stillman2017) also report that the use of text messages increased parental engagement in literacy and other learning activities with their children. Mayer et al. (Reference Mayer, Kalil, Oreopoulos and Gallegos2018) evaluate an intervention that provided parents with a tablet containing children's books and a goal-setting tool, reminders, and social gratification messages. The authors find that parents in the treatment group used the tablet more and read more to their children.

Measurement of language input and output

In the economics literature, the norm has been to use standardized tests, such as Bayley Scales (Bayley, Reference Bayley1969), the Denver Developmental Screening Test (Frankenburg et al., Reference Frankenburg, Dodds, Archer, Shapiro and Bresnick1992), and MacArthur Communicative Development Inventory (Fenson et al., Reference Fenson, Dale, Reznick, Thal, Bates, Hartung, Pethick and Reilly1993), to measure early language development. We know these measures are predictive of future outcomes (Fernald et al., Reference Fernald, Kariger, Engle and Raikes2009) and that early stimulation strongly impacts language as measured by these tests (NICHD, 2000; Rubio-Codina & Grantham-McGregor, Reference Rubio-Codina and Grantham-McGregor2020). However, these measures are subject to limitations including the fact that they are expensive to implement, depend on evaluators’ training and profile, and measure the child's development in a particular setting. An alternative to standardized development tests is audio recordings that measure the child's language skills in a natural setting in addition to collecting other measures, such as maternal vocabulary and conversational turns. These audio recordings are usually obtained without the mediation of a trained facilitator and provide critical information for understanding the infant's language trajectory and environment.

There are broadly two methods to capture both the quantity and quality of language input and output through recording, namely short audio- or video recordings, or day-long audio recordings (Lavechin et al., Reference Lavechin, de Seyssel, Gautheron, Dupoux and Cristia2021). In the psycholinguistic literature, it is most common for the former to be transcribed by trained annotators, and for the latter to be analyzed automatically, typically using speech technology algorithms that do not produce orthographic transcriptions (not technically possible today), but which instead provide estimates of who spoke when and for how long (e.g., Zimmerman et al., Reference Zimmerman, Gilkerson, Richards, Christakis, Xu, Gray and Yapanel2009). Human transcription of adult speech in such conversations is a time-consuming process, with most annotators requiring 20 times the recording duration, that is, 1 minute of interaction takes 20 minutes to transcribe. Transcription is even more time-consuming for speech by children aged 1–3 years, with estimates for transcription time reaching 40 times the recording duration. It is beyond the scope of this article to fully lay out the advantages and disadvantages of short versus long recordings, but we do point out that day-long recordings remain quite challenging to collect in large-scale projects like ours because of logistical, legal, cost, and ethical reasons, which led us to prioritize short observations. We believe our study is among a few to analyze the impact of a behavioral and parenting intervention at scale and measure language outcomes using short videotaped recordings (a cost-effective method vis-à-vis day-long recordings).Footnote 3

In short video- or audio-recorded observations, the quantity of a caregiver's speech can be measured in terms of the number of words, number of vocalizations, or amount of time spoken. We are not aware of research showing that longer vocalization durations correlate with child development in short observations, but in transcription-based research, children who are more advanced produce more vocalizations (Gilkerson et al., Reference Gilkerson, Richards, Warren, Montgomery, Greenwood, Kimbrough Oller and Paul2017) and utterances with more words (i.e., with a higher ‘mean length of utterance’, e.g., Allen & Dench, Reference Allen and Dench2015), which, all else equal, should be correlated with greater child vocalization duration. We also expect more parental input (a longer duration of parental vocalizations) to have a positive impact on the child (e.g., Ferjan Ramírez et al., Reference Ferjan Ramírez, Lytle, Fish and Kuhl2019, Reference Ferjan Ramírez, Lytle and Kuhl2020).

Communication quality, on the other hand, may be captured through the number of conversational turns, parental responsiveness, and the caregiver's pitch. A child is more likely to be engaged when the conversation takes place in turns rather than when the caregiver talks nonstop. There is research also suggesting that more responsive caregivers, who vocalize after the child vocalized, help yield better language outcomes in day-long recording data (Ferjan Ramírez et al., Reference Ferjan Ramírez, Lytle and Kuhl2020). Another way to engage the child is via producing speech that captures the child's attention. This is often done by using a child-directed speech register (or Parentese speech), characterized by a higher pitch (and in some cultures, a larger pitch range) than adult-directed speech. It is, in part, because of the higher pitch and larger pitch range that infants show a strong preference for infant- over adult-directed speech (Fernald & Kuhl, Reference Fernald and Kuhl1987; The ManyBabies Consortium, 2020), and this heightened attention boosts performance in a wide range of tasks (a systematic review in Cristia, Reference Cristia2013). Caregivers can be trained to use this register, leading to positive child outcomes (Ferjan Ramírez et al., Reference Ferjan Ramírez, Lytle, Fish and Kuhl2019, Reference Ferjan Ramírez, Lytle and Kuhl2020).

The Crianza Positiva text and audio messaging intervention

The Crianza Positiva text and audio program is a six-month-long intervention for families with children aged 0–2 years. Messages were designed to help caregivers change their decision-making strategies in order to adopt what previous research suggests to be positive parenting practices and sustain these practices over time, to ultimately change their children's development. Participant families started receiving messages right after completing a workshop of eight weekly sessions that was organized around four parenting competences: attachment bond, stimulation, protection, and reflective function. Families in the treatment group received messages three times a week both in text (via SMS) and in audio format (via WhatsApp). The contents of the text and audio messages were exactly the same, except for the fact that text messages were personalized with the name and gender of the child. Control families received a single SMS message thanking them for their participation in the workshop.

The message structure is based on behavioral economics insights. Its design recognizes the existence of four key behavioral biases associated with low parental investment. First, parents’ behavior may be influenced by the present bias (Thaler, Reference Thaler2015), which implies attaching low valuation to costly activities that imply rewards only in the future. Another key bias is limited attention and cognitive fatigue as it deviates cognitive resources from important decisions (Shah et al., Reference Shah, Mullainathan and Shafir2012; Mani et al., Reference Mani, Mullainathan, Shafir and Zhao2013; Schilbach et al., Reference Schilbach, Schofield and Mullainathan2016) such as those relating to parenting. Lastly, negative identities can lead to suboptimal parental investment by making parents attach a low value to their existing resources and individual performance. Having a positive identity as a parent is important because self-esteem can increase effort through higher levels of motivation (Bénabou & Tirole, Reference Bénabou and Tirole2002).

As stated above, the literature on language development highlights the importance of exposing the child to language stimulation in the context of a sensitive and caring interaction. Throughout the program, several modules of messages focus on providing the right setting for an adequate development of literacy skills. A first module promotes parental sensitive observation, interpretation, and response to the child's needs and signals, gentle treatment, and the expression of affection (17% of the messages). This module emphasizes observing the child carefully, trying to interpret his/her expressions, gestures, and signals, seeking to understand what the child is feeling and thinking, exploring how to get closer to the child by responding to these signals, demonstrating love, and communicating assertively. A second module explicitly emphasizes the benefits of parental involvement in specific home-literary practices (17% of messages) and suggests concrete activities, such as engaging in conversations with the child, asking open questions, describing objects, places and activities, singing and listening to songs together, and engaging in book-reading and story-telling routines. The rest of the modules cover other parenting practices such as child safety at home and the incorporation of routines (17% of messages), free play and parental involvement in enjoyable activities with the child (17%), and parental self-care and reflective function (32%). To the extent that all messages influence the quality of the environment and of the parent–child interaction, they are directly or indirectly setting the grounds for better communication patterns and improved language development.

Treated families receive messages three times a week at the same time (specifically, Mondays, Tuesdays, and Fridays at 6 pm). A fundamental challenge of home visits and other interventions seeking to improve parenting practices is to be able to help parents overcome potential behavioral barriers they face at home in key moments when parenting decisions are critical. The message structure is designed to address some of these biases and always follows the same pattern. On Mondays, families receive a message with information on the benefits to the child of engaging in a certain parenting practice. This message aims at addressing present bias and inattention. Reminders have been successfully implemented in many contexts (from reminding people about voting to reducing missed appointments in hospitals or increasing medication adherence) and work well especially if the timing is right (Dale & Strauss, Reference Dale and Strauss2009; Gurol-Urganci et al., Reference Gurol-Urganci, de Jongh, Vodopivec-Jamsek, Atun and Car2013).

On Tuesdays, families receive a suggestion to engage in a specific activity with the child in relation to that week's topic and to repeat it during the week. This activity does not take much time and is usually concrete and easy to implement. The purpose of this message is to simplify parental decision-making in the face of the complexity of parenting and to help parents establish routines by engaging repetitively in the mentioned practices. The message addresses inattention and cognitive fatigue. Doss et al. (Reference Doss, Fahle, Loeb and York2019) show that providing activity tips to parents can help reduce parental cognitive load and improve parental ability to process information, particularly in the context of highly stressful environments.

On Fridays, families are invited to reflect on their performance during the week and on their personal feelings regarding the task proposed and usually receive a motivational or encouragement message. This message seeks to help parents overcome negative identities and strengthen self-efficacy.

For example, during the week that focused on vocabulary development and the importance of speaking to the child, parents received the following messages:

MONDAY:

Children's brains are like sponges, they absorb everything: the sounds, the pitches of voices, the language they listen to. The more words your child listens to at this stage, the more [she] will develop [her] language. It is therefore very important that you speak to [child's name], this will impact heavily on [her] ability to learn.

TUESDAY:

Take advantage of every moment to tell [child's name] what you're doing when you're with [her]. For example, say ‘I'm changing your diapers’, ‘Now I'm preparing the food’, or ‘Let's go for a walk’. This will help enrich [her] language.

FRIDAY:

The more you speak to [child's name], the better will [her] language develop and the more [she] will learn. Today and in the following days, remember and repeat this thought: ‘I take advantage of all the moments with [child's name] to speak to [her].’

The program included some additional components that provided parents with further sources of information and material. For example, we coupled messages promoting reading and singing to the child with a message that provided a username and password to access ‘Radio Butiá’, a Uruguayan web server that hosts Latin American songs and stories online. These messages addressed concerns relating to the possibility that families without children's books and music would not perform the suggested activities.

Methodology

Timeline and main features of program implementation and evaluation

Participant families were selected from a pool of candidates who had attended an eight-week parenting workshop at ‘Children and Family Care Centers’ (CAIF) of Uruguay. After workshop completion, in December 2017, we assigned families to a treatment and control group to evaluate the effects of the Crianza Positiva text and audio messages component. Following Baird et al. (Reference Baird, Bohren, McIntosh and Özler2018), we selected a two-stage randomization strategy in order to measure spillovers. First, we stratified the sample by average maternal education and, within each stratum, randomly allocated early childhood centers to receive or not receive messages using a 60–40% rule. As a result of the randomization, we assigned 14 early childhood centers to the treatment group and 10 centers to the control group. Next, within each center in the treatment arm, we randomized families into receiving or not receiving messages. In centers that were treated, 80% of families were selected to receive messages. Out of the initial 529 families, 237 families were assigned to receive messages, 59 were assigned not to receive messages but belonged to a treated center, and 233 families were assigned to a pure control group.

Figure 1 illustrates the timeline of the implementation and evaluation of the Crianza Positiva messaging program. Treated families received messages between January 5, 2018, and June 27, 2018, and control families received a single SMS message on January 26, 2018, thanking them for participating in the parenting workshop.

Figure 1. Crianza Positiva Timeline.

Messages were sent to all contact numbers we had for each family (mothers, fathers, and other caregivers), which in total added to 373 mobile phones. SMS (text) messages were sent through a platform that enabled us to personalize the messages by including the name of the baby and gender-specific pronouns. We also created a WhatsApp broadcast list to deliver text and audio messages.Footnote 4

We collected data on families’ SES and other characteristics at three points in time. Baseline information was collected before the initiation of the workshops (in August 2017) and after the workshop ended (November–December 2017). We conducted a follow-up assessment two months after the messaging intervention ended (between August and November 2018). Baseline evaluations were carried out at the CAIF center, and follow-up evaluations were carried out either at the family's home or at the center depending on the family's preferences.Footnote 5 In this article, we focus on a specific component of the family assessment: a 10-minute video recording of the interaction between the caregiver and the child in a free play activity. Evaluators (most of them psychologists, therapists or social workers, or advanced students of these specialties) provided the dyad with a set of toys that they could use during the activity. The mother received instructions to play freely while being recorded. The evaluator was instructed not to interfere.

Outcomes

The first set of outcomes we consider includes the counts and duration of vocalizations for both the child and the adult. A vocalization is a section of audio that has been classified automatically as being speech-like and attributed to an adult or a child.Footnote 6 As summarized in the section ‘Measurement of language input and output’, previous research suggests that better input is associated with more and/or longer child-directed caregiver vocalizations (e.g., Ferjan Ramírez et al., Reference Ferjan Ramírez, Lytle, Fish and Kuhl2019, Reference Ferjan Ramírez, Lytle and Kuhl2020). As for the child, more advanced children produce more vocalizations (Gilkerson et al., Reference Gilkerson, Richards, Warren, Montgomery, Greenwood, Kimbrough Oller and Paul2017) and potentially longer ones (Allen & Dench, Reference Allen and Dench2015).

Second, we capture communication quality through the number of conversational turns between child and adult. There is a turn whenever there is a threshold of at most 0.3 seconds between a vocalization offset and the next vocalization's onset. We also account for parental responsiveness by considering the proportion of child vocalizations that are followed by an adult vocalization in less than 2 seconds (adult response ratio). If the intervention improves communication quality, this may be reflected in higher turn counts and higher adult response ratios in the treatment compared with the controlled group (see the section ‘Measurement of language input and output’).

Finally, we account for two qualitative measures of adult vocalizations: pitch range and pitch average. Fundamental frequency (f0) range is the difference between the highest and the lowest f0 frequency reached within a given vocalization. The f0 average is the average pitch of that vocalization. Research summarized in the section ‘Measurement of language input and output’ leads to the prediction that caregivers in the treatment group should exhibit higher pitch and larger pitch ranges than caregivers in the control group.

All outcomes were obtained from automatic analyses of the 10-minute videos using the DiViMe open-source package (Le Franc et al., Reference Le Franc, Riebling, Karadayi, Wang, Scaff, Metze and Cristia2018) via the avutils package (Neumann, Reference Neumann2020). For the main outcomes, we first conducted manual annotations of the outcome for a random set of 30 videos and then checked the Pearson correlation between the manual and the automatic annotations. These correlations are reported in Figures A1 to A6 in Appendix 1. This allowed us to select the DiViMe diarizer (and parameters such as interturn duration) leading to the best performance. We used the Yunitate broad class diarizer, a recurrent neural network that has been trained to classify audio into three broad classes: male adult, female adult, and child. With this diarizer, we find correlations of r = 0.78 between manual and automatic child vocalization counts, of r = 0.64 for child vocalization durations, and of r = 0.77 for conversational turns. The correlations between manual and automatic annotations for adult vocalizations are r = 0.74 for vocalization counts, r = 0.66 for vocalization duration, and r = 0.7 for adult response rate within 2 seconds. Correlations are measures of association between two variables; a correlation of 0.7 is very high (given Cohen's 1988 guidelines, where correlations above r = 0.5 are ‘large’), and it indicates that the two variables share 70% of the variance.

Because child language may be difficult to assess at these ages, we correlated child outcomes with age in months. We expected that child measurements should correlate with age, because we would like them to be sensitive to language development and age is a good proxy for development (older children have better language than younger children). Unfortunately, we did not find a statistically significant correlation between count or duration of child vocalizations and child's age. This raises some concerns about our ability to measure children's language outcomes with precision. We did find, however, a negative correlation between the duration of adult vocalizations and children's age (r = −0.11; p < 0.1), suggesting that adults vocalize less as children vocalize more in the context of the proposed fixed time activity, and that the measures of adult vocalizations may be more reliable than those of the child. We found no association between maternal age or education and language outcomes.

Compliance

Issues like having the incorrect mobile number or sending messages to a line that is unavailable or has no credit could lead to SMS failure. In our intervention, we sent messages via SMS and WhatsApp to reduce concerns that this could happen. One month after program initiation, we recontacted all treated families to verify whether they were receiving the SMS messages. We repeated this for a random sample of treated families three months after program initiation. We learnt from these follow-ups that out of the 237 families in the treatment group, 197 were receiving SMSs correctly, while 40 were not. We asked the CAIF centers to provide us with new contact numbers of these families. We could update nine contact numbers, which reduced our SMS failure rate to 13%. WhatsApp messages had a high reception rate: in 87% of families at least one family member received the messages. The percentage of families that read the messages was also high: families that received the messages read on average 69% of them. Overall, we found that 27 out of the 237 families assigned to treatment did not get SMS nor WhatsApp messages. The correlation between assignment to the messaging treatment and receiving the messages was r = 0.9.

Attrition and balance

We managed to collect videos on 50% of the 529 families at baseline (either in August or in November–December 2017) and on 60% of the families at follow-up (317 families overall, 122 assigned to receive messages and 195 in the control group). During baseline, control families were less likely than treated ones to consent to being videotaped, while the opposite was true at follow-up. We ended up with 178 families (34%) with a recorded video both at baseline and at follow-up, 80 of which were in the messaging treatment group and 98 in the control group.Footnote 7 There was no differential attrition in these families across treatment and control arm.

Based on WhatsApp message reception markers, 80% of treated families with a video at follow-up read or listened to the WhatsApp messages at least once. Among these, families opened, on average, 79% of the messages. Regarding families with videos at baseline and follow-up, 85% read or opened at least one WhatsApp message, and on average, they opened 83% of the messages.

Table 1 reports descriptive statistics of family characteristics at baseline as well as the balance of these characteristics across treated and control subjects (ITT = 1 and ITT = 0, respectivelyFootnote 8), both for the sample of families with videos at baseline and follow-up and for the sample only with a video recorded at follow-up. Regarding the first sample (see columns (1) through (3)), more than 90% of caregivers correspond to the child's mother. The average age of the mother is around 30 years old, nearly half of the children are male, around 11% were born premature, and 20% are still being breastfed. Three out of four children live in a family with both their biological parents. On average children live with another sibling in the house, and 15% live with other adults. We constructed a wealth index that was a weighted average of household assets, with the weight of each asset in inverse proportion to its availability across households. The index is skewed to the left: within a range of 0–1, the average household has an index value of 0.25. Two out of three households receive governmental assistance, and 85% report having participated in at least six of the eight Crianza Positiva workshop sessions. Twenty-nine percent of mothers completed middle school, but not high school, and another 28% completed high school or more. This implies that more than 40% of mothers have less than middle school. Forty percent of mothers work full time, around 20% work part time, and 80% of fathers have a full-time job. Half of the families suffered a negative shock in the 12 months prior to the baseline survey, such as a divorce, a death, a job dismissal, or a disease within the family, among other issues.

Table 1. Descriptive statistics and balance of family characteristics at baseline

Notes: Columns (1)–(3) refer to the sample with videos at baseline and follow-up and Columns (4)–(6) refer to the sample only with videos in follow-up. Columns (1) and (4) show mean values for the variables in each row across subjects with ITT = 0. Columns (2) and (5) show mean values for the characteristics in each row for subjects with ITT = 1. Standard deviations are reported in parentheses for nondichotomous variables. Columns (3) and (6) show the differences in covariates at baseline by ITT status that are obtained by regressing the covariate on a dummy variable that takes the value of 1 if ITT = 1 and of 0 if ITT = 0. Due to missing values, the number of observations for each covariate may differ slightly from the N in the last row. The difference is statistically significant at: ***1%, **5%, *10%.

Regarding behavioral biases, we use the parent's discount rate to capture present bias, the number of negative shocks faced by the family to measure cognitive fatigue, and a parental efficacy scale to capture negative identities. Specifically, we use Kirby et al.'s (Reference Kirby, Petry and Bickel1999) Monetary Choice Questionnaire (MCQ) to measure the discount rate. Because the number of observations for the MCQ at baseline was quite small (N = 115), and we did not expect treatment to affect the parameter, we work with the MCQ as measured at follow-up (N = 148).Footnote 9 The negative shocks variable reflects the total number of the following events experienced by the family in the 12 months prior to the follow-up survey: divorce, separation, moving, income falls, debt, drug problems in the family, the death of a close friend or a family member, unemployment, job problems, or problems with the law. We use the measure at follow-up because we do not expect treatment to affect these events, and we seek to capture family stress or emotional fatigue during the time the intervention took place. Finally, we use the Johnston and Mash's (Reference Johnston and Mash1989) version of the Parental Efficacy Subscale in the Parental Sense of Competence Scale (PSOC).Footnote 10 In this case, we choose to work with the baseline measure because we do expect treatment to affect parental efficacy.

Column (3) in Table 1 checks for balance in family baseline characteristics across treated and control groups. Out of 22 characteristics, we only find a statistically significant difference at 5% or less in the mother's age: treated mothers are 2.6 years older than control mothers. At a 10% level of significance, we also find that treated families have, on average, a larger number of children. The fact that we only find a difference at a 5% level in 1 out of 22 tested features suggests that our sample is balanced.

The sample with videos at follow-up (but not necessarily at baseline) is quite similar in terms of family characteristics to the other sample.Footnote 11 But unlike the former, it shows differential attrition: treated families are 15 percentage points less likely to be videotaped at follow-up than control families.Footnote 12 This results in a slightly larger imbalance of baseline characteristics. At a 5% significance or less, mothers in the treatment group are older than those in the control, and the child is less likely to be an only child. There are more differences at the 10% level, with children in the treatment group being younger and living with a larger number of other children and adults in the household. While only 2 out of 22 covariates are unbalanced at normal levels of significance, because 5 covariates are unbalanced at a p-level of 0.10, in the next section, we device some strategies to address the differential attrition.

Descriptive statistics of language outcomes

Table 2 describes the main language outcomes we assess in the analysis. We present outcomes for control group families at follow-up, both for the sample with videos at baseline and follow-up and for the sample with videos at follow-up but not necessarily at baseline. On average, there are 112 child vocalizations in a 10-minute free play interaction between the child and the caregiver. This average is above the number of vocalizations found by Soderstrom and Wittebolle (Reference Soderstrom and Wittebolle2013) in a similar setting (they report 55 vocalizations on average). The total duration of vocalizations is 200 s in the first sample and 195 in the second one. Adults show around 110 vocalizations, with a total duration of 146 sconds in the first sample and 158 in the second one. These counts are similar to those found by Hurtado et al. (Reference Hurtado, Marchman and Fernald2008) in a similar context of 10 minutes of organized playtime. There are around 90 conversational turns defined by a threshold of 0.3 seconds between vocalization off- and onset. Between 55% and 60% of the time, adults respond to children's vocalizations in less than 2 seconds. The average frequency of the adult pitch is around 5.5 Equivalent Rectangular Bandwidths (ERB), and its range (the maximum frequency minus the minimum one) is around 2.4 ERB.

Table 2. Description of language outcomes for control families at follow-up (N = 98)

We also assess correlations across language outcomes (see Table A1). Children's count and duration of vocalizations are strongly associated with each other (r = 0.64; p < 0.01) as with conversational turns (r = 0.52; p < 0.01 and r = 0.46; p < 0.01, respectively). Adult count and duration of vocalizations show also a strong positive bivariate correlation, and both are associated with conversational turns and response ratio. Interestingly, while the count of adult vocalizations is positively, although not too strongly, associated with the count of child vocalizations (r = 0.18; p < 0.01), the duration of adult vocalizations is negatively associated with the duration of child vocalizations (r = −0.19; p < 0.05), suggesting a possible crowding-out effect given the proposed fixed time activity. Finally, adult pitch range is positively correlated with count (r = 0.24; p < 0.01) and duration (r = 0.42; p < 0.01) of adult vocalizations, but it is negatively correlated with the count of child vocalizations (r = −0.18; p < 0.05). Adult pitch average is negatively correlated with the duration of child vocalizations (r = −0.37; p < 0.01) and uncorrelated with adult vocalization count and duration.

Estimation approach

We run ordinary least squares regressions of each language outcome on an indicator of having been randomly assigned to receive messages. The coefficient of the latter indicator provides us with the Intention-to-Treat (ITT) effect, or the effect of having been randomly assigned to receive messages, on language outcomes. In our core analysis, we estimate ITT effects using the sample with videos both at baseline and at follow-up. We then run robustness analyses on the sample with videos at follow-up (but not necessarily at baseline). While the first sample is smaller, it is more balanced across treatment and control subjects and allows us to estimate standard errors with more precision by accounting for the outcome at baseline. We also explore spillover effects by including in our estimations an indicator if the family was assigned to the control group but belonged to a center that was randomized into treatment. Note, however, that due to the small number of videos observed in this subsample, we do not expect estimates on spillovers to be precise and do not emphasize this feature in the analysis.

To improve precision and address potential sample selection due to attrition, in our core specification, we adjust for a set of covariates including (1) the value of the outcome at baseline;Footnote 13 (2) the variable used to stratify the randomization at the CAIF center level (a dummy equal to one if the average level of education of mothers participating in Crianza Positiva at the center was at least middle school); (3) covariates that were unbalanced after attrition at a significant level of 10% or less (the mother's age and the number of other children in the household); and (4) other covariates that could help improve the precision of the estimation, such as the child's age and gender, mother's education, whether the child lived with both biological parents, father's working status, and the number of months that elapsed between the date when the intervention began and the final assessment. For statistically significant outcomes, we report an adjusted p-value of the ITT coefficient after accounting for the clustering of observations around centers using wild bootstrapping (Cameron et al., Reference Cameron, Gelbach and Miller2008). We also adjust p-values for multiple hypotheses testing using the Romano Wolf algorithm (Clarke et al., Reference Clarke, Romano and Wolf2020).

We check for robustness by adjusting for different combinations of covariates in the main sample. In addition, we address the differential probability of being assigned to treatment and control arms by using inverse probability weighting: we estimate the probability that the family belongs to the treatment, conditional on the observed covariates at baseline, and then weight treated observations by the inverse of such probability, and control observations by the inverse of 1 minus such probability. We also run the analysis in the full sample with videos at follow-up, without adjusting for baseline outcomes.

Results

Main specification

Our core specification estimates ITT effects for the sample with videos both at baseline and at follow-up. Table 3 reports the effects of the intervention on the different language outcomes: measures of count and duration of vocalizations (in columns 1–5) and measures of conversational quality (in columns 6–9). At a 10% level of statistical significance, we find evidence that the intervention decreased the total duration of child vocalizations and increased the total duration of adult vocalizations. The estimates on the ITT indicator are −45 seconds per vocalization for the child, compared with an average duration for control children of 200 seconds (−23%), and 29 s for adults, compared with an average of 146 seconds for control subjects (20%). The finding that adults speak more is in line with our hypotheses, but not the finding that children vocalize less. We find no statistically significant effects over the summation of child and adult vocalizations (total vocalization duration), suggesting that parental vocalizations are crowding out child vocalizations. There is no evidence of effects on the count of child or adult vocalizations. Once we account for clustering of observations around centers, the effect on the duration of adult vocalizations loses statistical significance at traditional levels. Also, none of the effects are robust to adjustment for multiple hypotheses testing.

Table 3. ITT effects of CP messages on language outcomes

Notes: *p < 0.1, ***p < 0.01. The table shows the effects of randomization into treatment (ITT) or into the spillover sample on quantitative language outcomes after adjusting for the following set of covariates: randomization strata, outcome at baseline, mother's education, mother's age, child's gender, child's age in months when the intervention began, the number of months that elapsed between the final assessment and the date when the intervention began, the number of people in the household, single child, whether the father worked full time at baseline, and indicators for the timing when the family was videotaped at baseline (in August 2017, in November–December 2017, or both). When there are two baseline measures, we use the average. Robust standard errors are reported in parentheses. For significant outcomes, we report in squared brackets the p-values after adjusting standard errors for clustering of observations around centers using wild bootstrapping. We also report p-values after adjusting for multiple hypotheses testing (mht) using the Romano Wolf algorithm and considering two families of outcomes: count and duration of vocalizations, and conversational quality.

Regarding outcomes that assess communication quality, the first outcomes analyzed, conversational turns, and parental responsiveness show no differences across treated and control subjects. Coefficients appear to be imprecisely estimated. On the other hand, we find a strong and positive effect on a measure of speech quality: the range of adult pitch. This estimate is 0.4 higher for parents in the intervention group, 18% above the frequency range observed for control parents (2.4 ERB). The effect remains statistically significant at p < 0.10 after clustering standard errors around centers using wild bootstrapping techniques and at p < 0.05 after adjusting p-values for multiple hypotheses testing. We also find that the intervention had a positive and statistically significant effect at the 10% level on the range of adult pitch in the spillover sample, robust to wild bootstrapping but not to multiple hypotheses testing. These effects support our hypothesis that the intervention contributed to improve the quality of language input.

Robustness

Table 4 shows ITT effects conditional on different sets of covariates. We include outcomes for which the effect was statistically significant in Table 3. Specification 1 replicates results in Table 3 (same coefficients as in columns (2), (4), and (9)). Specification 2 shows ITT effects after adjusting for sociodemographic status and the outcome variable at the last baseline survey, if available, or at the first baseline survey otherwise. The third specification shows ITT results adjusting for SES and the outcome at the first baseline survey, or at the second baseline survey otherwise. Specification 4 shows ITT effects when adjusting for the average of the outcome at the two baseline surveys only (no adjustment for SES). The final specification weights treatment and control observations for the inverse of the estimated propensity score of being in each group. We report wild bootstrapping p-values in square brackets. The results are very robust when assessing the effect of treatment on adult pitch range. Coefficients on the duration of child and adult vocalizations, on the other hand, are more sensitive to alternative specifications; and as in Table 3, first panel, most of the estimates lose statistical significance after adjusting standard errors for clustering.

Table 4. Robustness of ITT effects to different sets of covariates

Notes: *p < 0.1; ***p < 0.01. The table shows the effects of randomization into treatment (ITT) on language outcomes after adjusting for different sets of covariates. The columns indicate different outcomes and rows specifications with different sets of covariates in the regressions. For each outcome/specification combination, we show the ITT coefficient from the regression, its robust standard error in parentheses, and in square brackets the p-values after clustering standard errors around centers using wild bootstrapping (WB p). Specification 1 shows ITT estimates after adjusting for sociodemographic characteristics (SES) and the average of the outcome measured at the two baseline surveys. Specification 2 shows ITT effects after adjusting for SES and the outcome variable at the last baseline survey if available or at the first baseline survey otherwise. Specification 3 shows ITT results adjusting for SES and the outcome at the first baseline survey or at the second one otherwise. Specification 4 shows ITT effects adjusting for the average of the outcome at the two baseline surveys, but without adjustment for SES. Specification 5 is similar to 1, but weights observation for the inverse probability of having been assigned to treatment.

In Table A2 we show the results of estimating the Treatment on the Treated (TOT), rather than ITT, effects. We use two-stage least-square estimation techniques and instrument the probability of having received at least one message with the indicator of random assignment to treatment. The results are very similar to those in Table 3. Exposure to the messages decreases the duration of child vocalizations by 47 seconds (wild bootstrapping p-value = 0.09) and increases the duration of adult vocalizations by 40 seconds, although this effect becomes marginally insignificant once we cluster standard errors around centers. Receiving parenting messages continues to show a strong positive effect on adult pitch range (0.50 ERB), which remains statistically significant at 5% after clustering standard errors.

Next, we estimate ITT results analyzing the video's last 300 seconds rather than the full video (which theoretically should have an extension of 600 seconds). We do so for two reasons: (1) to measure outcomes in a similar timeframe for all observations and (2) because we expect mother and child to be more relaxed and less influenced by socially desirable norms in the second half of the video. The results continue to show a positive effect of the intervention on the adult pitch range – of 0.43 ERB – and some evidence of an increase in the duration of adult vocalizations, which marginally loses statistical significance after adjusting standard errors for clustering (see Table A3).

In Table A4, we show results when running the ITT analysis for the full sample of families with a video recording at follow-up (regardless of having or not a video at baseline). Again, we identify a positive ITT effect on the adult pitch range and a negative effect on the duration of child vocalizations, but the magnitudes are smaller than in the core specification and lose statistical significance after clustering standard errors around centers.

Mechanisms

We explore mechanisms in two ways. First, we investigate whether exposure to the message content led parents to increase their involvement in activities associated with language development. As mentioned before, 17% of messages encouraged parents to engage in language-enhancing activities such as holding conversations with the child, asking open questions, describing objects, places and activities, reading to the child, or singing and listening to songs together. Furthermore, other messages were oriented toward strengthening parental–child interactions through the development of sensitivity and responsiveness, key features for good communication and language development.

In a post-treatment survey (administered between one and three months after the messaging intervention finished), we asked parents for their frequency of involvement in several activities with the child, including singing songs, reading, telling stories, and naming, telling, or drawing things. Each measure ranged from 1 to 7, where 1 was never in the week and 7 was every day or nearly every day. We standardized each outcome dividing it by the standard deviation of the outcome in the control group. In addition, we constructed an aggregate index of parental involvement in literacy activities using the first component of a principal component analysis on the six parental involvement outcomes. This component explained 53% of the overall variability in the outcomes and was also standardized. We regressed all standardized outcomes on the ITT indicator, adjusting for the same set of covariates as in our core specification.Footnote 14 The results are reported in Table 5. Being assigned to treatment increased the frequency of reading books, telling stories, and naming, telling, or drawing things to the child by 0.30, 0.27, and 0.44 standard deviations, respectively. The overall effect on the aggregate index is 0.37 standard deviations. All effects are robust to wild bootstrapping.

Table 5. Effects of CP messages on parental frequency of involvement in literacy activities

Notes: *p < 0.1, **p < 0.05, ***p < 0.01. Each outcome is standardized (expressed in terms of the standard deviation of the outcome in the control sample). Robust standard errors in parentheses and p-value after clustering standard errors with wild bootstrapping in squared brackets (only for significant outcomes). Regressions adjust for strata, an indicator of nontreated families within treated centers, outcome at baseline, mother's education, mother's age, child's gender, child's age in months when the intervention began, the number of months that elapsed between the final assessment and the date when the intervention began, the number of people in the household, single child, and whether the father worked full time at baseline. The sample includes observations with videos at baseline and follow-up.

We also investigate whether these parental involvement outcomes correlate with the pitch range and the duration of adult vocalization at follow-up, the two parental language outcomes for which we find a positive effect of the intervention. We find a correlation of r = 0.13 (p < 0.1) between telling stories to the child and the adult pitch range. We do not find a statistically significant association between the measures of parental engagement and the adult vocalization duration, but do find a significant correlation between the aggregate index of parental engagement and the total duration of vocalizations (r = 0.14; p < 0.1).

Next, we explore whether ITT effects are heterogeneous across parents with different degrees of behavioral biases. Recall that the messages aimed at addressing behavioral barriers such as present bias, cognitive fatigue, and negative identities. The first message of the week presented information and reminded parents on the importance of a specific parenting practice (making future benefits more salient); the second one addressed cognitive fatigue and inattention by proposing simple and concrete activities to do during the week; and the third one targeted parental identities, via messages of encouragement and reflection. If the messages work by circumventing these behavioral biases, we would expect parents with stronger biases to react more to the treatment. As shown in Table 1, we measured these dimensions by using parents’ discount rate (present bias), negative shocks faced by the family (cognitive fatigue), and parents’ sense of competence (parental identity). We expect the treatment to have stronger impacts among parents with higher discount rates, households with more negative shocks, and parents with a lower sense of competence. For each language outcome that was affected by the intervention, we run a regression that adds to the core specification a main effect for a behavioral barrier and an interaction between this barrier and the ITT variable (see Table 6). Rather than using the measures for discount rates, negative shocks, and parental efficacy displayed in Table 1, we work with dichotomous indicators of these variables being above (discount rate, negative shocks) or below (parental efficacy) the mean. We find that parents with high discount rates or low efficacy are more likely to show increases in the pitch range after treatment at p < 0.05 and p < 0.1, respectively. Both effects are robust to wild-bootstrapped standard errors and have the expected sign. Impatient parents are more likely to react to messages that remind them about the benefits of speaking and reading to the child, and of responding sensibly to the child's cues. Parents with a low sense of efficacy are more likely to react to messages of encouragement and support. The coefficient on the other interaction (ITT times parental cognitive fatigue) is also positive but statistically insignificant.

Table 6. Heterogeneity of ITT effects by behavioral barrier

Notes: *p < 0.1; **p < 0.05. Robust standard errors in parentheses. When the interaction of interest is statistically significant, we report in squared brackets p-values after adjusting for the clustering of observations around centers using wild bootstrapping. The estimation sample includes observations that have a video in baseline and follow-up. Each pair of rows denotes a specific model in which the ITT is interacted with a behavioral barrier measured at baseline. The first two rows correspond to a high present bias, proxied with a discount rate above the mean. The next couple of rows correspond to cognitive fatigue. The variable that measures the latter behavioral barrier is an indicator that takes the value of 1 if the family was exposed to 2 or more negative shocks in the previous 12 months to the survey. The last two rows correspond to negative identities. The variable that measures the latter behavioral barrier is an indicator that takes the value of 1 if the parental efficacy at baseline is less or equal than the mean. Each column denotes a different outcome. For each outcome, we show the coefficient and standard error for the ITT main effect, and the coefficient and standard error for the interaction between ITT and the behavioral barrier is analyzed. The estimations control for the main effect of the behavioral barrier analyzed, an indicator of whether the subject was randomized to control within treated CAIF centers (the ‘spillover’ sample), for randomization strata, SES, and outcomes at baseline.

Discussion and conclusions

In this article, we assess the effects of the Crianza Positiva text and audio e-messaging program on caregiver–child language interaction patterns. The Crianza Positiva e-messaging program is a three-times-a-week, six-month-long intervention for families with children aged 0–2 years aimed at improving parenting competences. Its design recognizes that parental biases, such as present bias, inattention and cognitive fatigue, and negative identities, may lead to suboptimal parenting practices and exploits behavioral tools such as reminders, suggestions of actions, and messages of encouragement to reinforce and sustain positive parenting practices.

Crianza Positiva messages address the parental–child interaction by reinforcing sensitive observation, response, caring, and stimulation. These features are key components of a high-quality communication between the child and the caregiver, and set the grounds for language development. In addition, several modules of the Crianza Positiva e-messaging program (almost one out of five of the messages) are aimed specifically at language stimulation, encouraging parents to engage in conversations with the child, ask open questions, describe objects, places, and activities, read books, tell stories, and sing to and with the child.

We evaluate the program using a randomized controlled trial in 24 early childhood centers in Uruguay. Families that had completed an eight-week workshop at the center were randomized into receiving or not receiving mobile messages. After the program, we videotaped 10-minute sessions of free play between the caregiver and the child and decoded language patterns using automated techniques. We find that the intervention was successful at influencing parental behavior: it increased both the duration and the quality of parental vocalizations, as captured by the pitch range. While the effect on the duration of parental vocalization was sensitive to alternative specifications, the effect on pitch range was very robust. Higher pitch has been associated with better infant–adult connection both in individual variation (e.g., a comparison of depressed and nondepressed mothers; Porritt et al., Reference Porritt, Zinser, Bachorowski and Kaplan2014) and in experimental settings (i.e., when the mother is shown a supposedly live video of her infant, but the infant can versus cannot actually hear the mother; Smith & Trainor, Reference Smith and Trainor2008). Furthermore, studies show that speech conveying pitch variations (larger ranges of speaker intonations) also results in higher levels of attention, more word recall, and more recognition of information by the listener than those with homogenous pitch ranges across sentences (Rodero et al., Reference Rodero, Potter and Prieto2017). Parentese speech, or speech characterized by higher pitch, slower tempo, and exaggerated intonation ranges, has been positively associated with early spoken word recognition and better acquisition of language by infants and has been shown to be preferred by children relative to standard speech (Liu et al., Reference Liu, Tsao and Kuhl2009; Singh et al., Reference Singh, Nestor, Parikh and Yull2009; Ramírez-Esparza et al., Reference Ramírez-Esparza, García-Sierra and Kuhl2017). In turn, better language outcomes are related to better long-run educational achievement and earnings (Gertler et al, Reference Gertler, Heckman, Pinto, Zanolini, Vermeerch, Walker, Chang and Grantham-McGregor2014). These results are promising because pitch range is not a variable that parents can easily and consciously manipulate, and, thus, it could be less sensitive to desirability biases.

Consistent with parental language results, we find that the intervention also increased parental involvement in reading, telling stories, and describing things to the child, as self-reported by parents in a follow-up survey. Furthermore, we find a positive correlation between telling stories and parental pitch range, as well as between an aggregate index of parental engagement in literacy activities and the total duration of vocalizations. The findings above suggest that the effects on Parentese speech and adult vocalization duration may have been mediated by an impact of the intervention on parental engagement in language interactions with the child and language-stimulating activities. We also explore the heterogeneous effects of the intervention by parental characteristics that may be capturing behavioral biases. We find evidence that the messages were most effective on parents with high discount rates, supporting our hypothesis that reminders help present-oriented parents to make more salient the benefits of positive parental engagement with the child.

It is important to bear in mind that our study may be among the first to analyze short video recordings automatically in the context of large-scale studies (see Background and sections ‘Language development in early childhood’ to ‘Measurement of language input and output’). We, therefore, first checked our proposed metrics in terms of their reliability against manual coding, as well as external validity (i.e., correlations with age, which is our best proxy for language development in this sample; see the section ‘Outcomes’). Our results suggested low external validity of child measures, as only one of our measures correlated significantly with child age (a proxy for language development available in the same data set). The one measure that had some external validity was child vocalization duration, but this measure exhibited a decrease among the treated children as compared with those in the control group. Since we observe that there is basically a trade-off between adult speech duration and child speech duration, we attribute this negative result to a crowding-out effect by the caregiver in the context of a fixed 10-minute suggested activity and a more proactive parental role. We believe that the most reasonable interpretation for this effect is not a negative effect of the intervention on the child, but rather an unfortunate masking of potential effects on the child due to greater caregiver engagement. This suggests that our analyses, while sufficient to reveal changes in parental behavior, may not be ideal to measure potential positive effects on the child. Instead, perhaps qualitative analyses of children's vocalization content (such as an estimation of syntactic complexity and lexical diversity) may be more sensitive. Unfortunately, given the state of automatic speech recognition, it is not feasible to automatically generate transcriptions of these videos. Transcriptions for the 59 hours of dataFootnote 15 in the smaller sample would have required at least 1770 hours of human work,Footnote 16 which was also not feasible.

Our work contributes to a nascent literature exploiting text-based communication and behavioral economic insights to boost early childhood development. It is worth emphasizing that in our intervention, messages were sent after completing a workshop with the families, so compliance and message efficacy may have been larger due to this prior contact. Moreover, we also add value to the literature by rigorously measuring language outcomes, which are seldom measured in the field due to the methodological difficulties with caregiver reporting and child observation (Weber et al., Reference Weber, Fernald and Diop2017). Here, via the use of videos and automatic transcriptions, we are able to identify some interesting outcomes previously unexplored in the literature, such as parental intonation or pitch range. Our results demonstrate the large potential of these very low-cost interventions, based on mobile technology and on the understanding of behavioral biases, in changing and sustaining parental behavioral patterns that ultimately lead to improved child development.

Acknowledgments

We acknowledge financial support from Reaching U Foundation, the Inter-American Development Bank, and Agencia Nacional de Investigación e Innovación (ANII, Uruguay). A.C. acknowledges Agence Nationale de la Recherche (ANR-17-CE28-0007 LangAge, ANR-16-DATA-0004 ACLEW, ANR-14-CE30-0003 MechELex, ANR-17-EURE-0017) and the J. S. McDonnell Foundation. The RCT design for this project was registered in the AEA RCT registry with RCT ID number AEARCTR-0003585. The project was approved by the Institutional Ethics Review Board at Universidad Católica del Uruguay, resolution A 22-08-17. A.B. discloses her position as a board member of Reaching U Foundation, one of the funding agencies. F.L.B. and M.P.F. work for the IADB, another funding agency in the project.

Conflict of interest

There are no other conflicts of interest to disclose.

Appendix 1: Correlations

Figure A. Pearson Correlations between Manual and Automatic Annotations. Main Outcomes.

Figure A1. Count of Child Vocalizations.

Figure A2. Duration of Child Vocalizations.

Figure A3. Count of Adult Vocalizations.

Figure A4. Duration of Adult Vocalizations.

Figure A5. Adult–Child Turns.

Figure A6. Adult Response Rate 2 s.

Appendix 2: Robustness estimations

Table A1. Pairwise correlations across outcomes

Table A2. TOT effects of CP messages on language outcomes

Table A3. ITT effects of CP messages on language outcomes in the last 300 s of video

Table A4. ITT effects on a larger sample with videos at follow-up

Footnotes

1 Most of the referenced studies analyze populations from developed countries (the USA or the UK) of varied SES. Some exceptions are Snow and Ferguson (Reference Snow and Ferguson1977) that analyze populations from mostly developed and a few less-developed countries; Mendive et al. (Reference Mendive, Mascareño Lara, Aldoney, Pérez and Pezoa2020), that study a varied SES population in Chile; and Rubio-Codina and Grantham-McGregor (Reference Rubio-Codina and Grantham-McGregor2020), who work with a low- and middle-income population in Colombia.

2 The Language ENvironment Analysis (LENA) system is a relatively new recording technology that can be used to investigate typical child language acquisition. More details can be found here: https://www.lena.org/.

3 Some previous evaluations of parenting programs such as Rodriguez et al. (Reference Rodriguez, Dumont, Mitchell-Herzfeld, Walden and Greene2010) for Health Families New York, Love et al. (Reference Love, Kisker, Ross, Raikes, Constantine, Boller and Fuligni2005) for Early Head Start, and Olds et al. (Reference Olds, Kitzman, Cole, Robinson, Sidora, Luckey and Holmberg2004) for Nurse Family Partnership have used measurements based on video recordings to analyze mother–child interactions, which could, thus, provide a view both on input and on child output. More recently, Conti et al. (Reference Conti, Poupakis, Sandner and Kliem2020) also used 3-minute videotaped play sessions to evaluate a home-visiting program in Germany and focused on reciprocity and the initiation of interactions.

4 Due to the specificities of WhatsApp, only mobile phones that saved the mobile number of Crianza Positiva received the messages. In our welcome message, we asked families to add our cellphone to their contact list.

5 This was a family choice and was unrelated to treatment status.

6 The speech detection software is unable to classify overlapping speech.

7 Eighteen of these were control families were within treated centers.

8 ITT = 1 whenever the family is assigned to receive messages. It is 0 when the family participates in a center not assigned to be a part of the treatment, or participates in a center selected for treatment, but is not assigned to receive messages.

9 The geometric mean of the MCQ discount rate k in our sample (ln(0.08) = −2.53) is larger than that found in Phillips et al. (Reference Phillips, Hurley, Angadi, Todd, Berardi, Hovell and Adams2020) for a sample of overweight adults aged 18–60, and in Kirby et al. (Reference Kirby, Petry and Bickel1999) for a sample of opioid-dependent patients (3.96). Our measure is within the upper quartile of the k distribution in the cited articles.

10 The average of the PSOC efficacy subscale in our sample is 4.13, similar to the 4.16 found among Spanish families with children aged 0–5 (Alvarez et al., Reference Alvarez, Byrne and Rodrigo2021), but below the average scale in other English-speaking samples, such as in Karp et al. (Reference Karp, Lutenbacher and Wallston2015) (4.85).

11 Some minor differences include being less likely to have premature babies, being more likely to breastfeed, having more children and more adults in the household, and being more likely to have a negative shock in the past 12 months.

12 We offered families the option of being videorecorded at the early childhood center or at home. Some centers were more generous than others in offering us physical space and operation hours to record the videos. Because randomization was conducted at the center level, any barriers to video recording imposed by a single center could substantially affect the balance. The observed imbalance is, in fact, largely explained by one of the centers assigned to treatment, which accounted for more than 10% of the treated families, and in which we were only able to video-record 8% of the dyads. In all the other centers, participation was at least 40%.

13 To maximize sample size, we consider observations that had a recording in at least one of the two preintervention assessments (August 2017 or November–December 2017). We control for dummy variables that account for the timing of this assessment.

14 This includes strata indicators, an indicator of being a part of the spillover sample, outcomes at baseline, and a set of covariates capturing family SES.

15 178 dyads × 2 waves × 10 minutes = 3560 minutes or 59 hours.

16 59 × 30 = 1770. The transcription of 1 minute of adult vocalization content requires approximately 20 minutes, while the transcription of 1 minute child's vocalization content involves approximately 40 minutes.

Notes: *p < 0.1, **p < 0.05, ***p < 0.01.

Notes: *p < 0.1, **p < 0.05, ***p < 0.01.

Notes: *p < 0.1, **p < 0.05. The table shows ITT effects on language outcomes measured in the last 300 s of video after adjusting for the following covariates: randomization strata, outcome at baseline, mother's education, mother's age, child's gender, child's age in months when the intervention began, the number of months that elapsed between the final assessment and the date when the intervention began, the number of people in the household, single child, whether the father worked full time at baseline, and indicators for the timing when the family was videotaped at baseline (in August 2017, in November–December 2017, or both). When there are two baseline measures, we use the average. Robust standard errors are reported in parentheses. For significant outcomes, we report in squared brackets the p-value on the ITT coefficient after adjusting standard errors for clustering around centers using wild bootstrapping.

Notes: *p < 0.1, **p < 0.05, ***p < 0.01. The table shows the effects of randomization into treatment (ITT) on language outcomes using a larger sample with videos at follow-up but not necessarily at baseline. Regressions adjust for the following set of covariates: randomization strata, mother's education, mother's age, child's gender, child's age in months when the intervention began, the number of months that elapsed between the final assessment and the date when the intervention began, the number of people in the household, single child, and whether the father worked full time at baseline. Robust standard errors are reported in parentheses. For significant outcomes, we report in squared brackets the p-values after adjusting standard errors for clustering of observations around centers using wild bootstrapping (WB p).

References

Aboud, F. E., Singla, D. R., Nahil, M. I. and Borisova, I. (2013), ‘Effectiveness of a parenting program in Bangladesh to address early childhood health, growth and development’, Social Science and Medicine, 97 (Special Issue: Critical Periods in Early Human Development, Guest Editors: Pia Rebello Britto, UNICEF, USA; Rafael Pérez-Escamilla, Yale University, USA): 250258.CrossRefGoogle ScholarPubMed
Ajzenman, N. and Lopez Boo, F. (2019), Lessons from behavioral economics to improve treatment adherence in parenting programs: An application to SMS. IZA Discussion Paper No. 12808. Retrieved from: https://www.iza.org/publications/dp/12808/lessons-from-behavioral-economics-to-improve-treatment-adherence-in-parenting-programs-an-application-to-sms.10.18235/0001981CrossRefGoogle Scholar
Allen, S. E. and Dench, C. (2015), ‘Calculating mean length of utterance for Eastern Canadian Inuktitut’, First Language, 35(4-5): 377406.CrossRefGoogle Scholar
Alvarez, M., Byrne, S. and Rodrigo, M. J. (2021), ‘Social support dimensions predict parental outcomes in a Spanish early intervention program for positive parenting’, Children and Youth Services Review, 121: 105823.CrossRefGoogle Scholar
Baird, S., Bohren, J. A., McIntosh, C. and Özler, B. (2018), ‘Optimal design of experiments in the presence of interference’, Review of Economics and Statistics, 100(5): 844860.10.1162/rest_a_00716CrossRefGoogle Scholar
Bayley, N. (1969), Bayley Scales of Infant Development [Measurement instrument]. San Antonio, TX: The Psychological Corporation.Google Scholar
Bénabou, R. and Tirole, J. (2002), ‘Self-confidence and personal motivation’, The Quarterly Journal of Economics, 117(3): 871915.CrossRefGoogle Scholar
Berlinski, S. and Schady, N. (2015), ‘More bang for the buck: Investing in early childhood development’, in Berlinksy, S., and Schady, N. (eds), The early years, New York: Palgrave Macmillan, 149178.CrossRefGoogle Scholar
Bloomfield, J., Balsa, A. and Cid, A. (2020), Using behavioral insights in early childhood interventions: The effects of Crianza Positiva e-messaging program on parental investment. Universidad de Montevideo. Working Paper.Google Scholar
Bornstein, M. H., Putnick, D. L., Cote, L. R., Haynes, O. M. and Suwalsky, J. T. D. (2015), ‘Mother-infant contingent vocalizations in 11 countries’, Psychological Science, 26(8): 12721284.10.1177/0956797615586796CrossRefGoogle ScholarPubMed
Cameron, A. C., Gelbach, J. B. and Miller, D. L. (2008), ‘Bootstrap-based improvements for inference with clustered errors’, The Review of Economics and Statistics, 90(3): 414427.CrossRefGoogle Scholar
Clarke, D., Romano, J. P. and Wolf, M. (2020), ‘The Romano–Wolf multiple-hypothesis correction in Stata’, The Stata Journal, 20(4): 812843.CrossRefGoogle Scholar
Conti, G., Poupakis, S., Sandner, M. and Kliem, S. (2020), The effects of home visiting on mother-child interactions: evidence from a randomized trial using dynamic micro-level data. IZA DP No. 12937.CrossRefGoogle Scholar
Cristia, A. (2013), ‘Input to language: The phonetics and perception of infant-directed speech’, Language and Linguistics Compass, 7(3): 157170.CrossRefGoogle Scholar
Dale, A. and Strauss, A. (2009), ‘Don't forget to vote: Text message reminders as a mobilization tool’, American Journal of Political Science, 53(4): 787804.CrossRefGoogle Scholar
Doss, C., Fahle, E. M., Loeb, S. and York, B. N. (2019), ‘More than just a nudge supporting kindergarten parents with differentiated and personalized text messages’, Journal of Human Resources, 54(3): 567603.CrossRefGoogle Scholar
Fenson, L., Dale, P. S., Reznick, J. S., Thal, D., Bates, E., Hartung, J. P., Pethick, S. and Reilly, J. S. (1993), The MacArthur communicative development inventories: User's guide and technical manual. San Diego: Singular Publishing Group.Google Scholar
Fenson, L., Dale, P. S., Reznick, J. S., Bates, E., Thal, D. J. and Pethick, S. J. (1994), ‘Variability in early communicative development’, Monographs of the Society for Research in Child Development, 59(5): 1173.CrossRefGoogle ScholarPubMed
Ferjan Ramírez, N., Lytle, S. R., Fish, M. and Kuhl, P. K. (2019), ‘Parent coaching at 6 and 10 months improves language outcomes at 14 months: A randomized controlled trial’, Developmental Science, 22(3): e12762.CrossRefGoogle ScholarPubMed
Ferjan Ramírez, N., Lytle, S. R. and Kuhl, P. K. (2020), ‘Parent coaching increases conversational turns and advances infant language development’, Proceedings of the National Academy of Sciences, 117(7): 34843491.CrossRefGoogle ScholarPubMed
Fernald, A. and Kuhl, P. (1987), ‘Acoustic determinants of infant preference for motherese speech’, Infant Behavior and Development, 10(3): 279293.CrossRefGoogle Scholar
Fernald, L., Kariger, P., Engle, P. and Raikes, A. (2009), Examining child development in low-income countries: a toolkit for the assessment of children in the first five years of life. Washington, DC: The World Bank.CrossRefGoogle Scholar
Fernald, A., Marchman, V. A. and Weisleder, A. (2013), ‘SES differences in language processing skill and vocabulary are evident at 18 months’, Developmental Science, 16(2): 234248.CrossRefGoogle ScholarPubMed
Frankenburg, W. K., Dodds, J., Archer, P., Shapiro, H. and Bresnick, B. (1992), ‘The Denver II: A Major Revision and Restandardization of the Denver Developmental Screening Test’, Pediatrics, 89(1): 9197.CrossRefGoogle Scholar
Gennetian, L., Darling, M. and Aber, J. L. (2016), ‘Behavioral economics and developmental science: A new framework to support early childhood interventions’, Journal of Applied Research on Children, 7(2): 2.Google Scholar
Gertler, P., Heckman, J., Pinto, R., Zanolini, A., Vermeerch, C., Walker, S., Chang, S. M. and Grantham-McGregor, S. (2014), ‘Labor market returns to an early childhood stimulation intervention in Jamaica’, Science, 344(6187): 9981001.CrossRefGoogle Scholar
Gilkerson, J., Richards, J. A., Warren, S. F., Montgomery, J. K., Greenwood, C. R., Kimbrough Oller, D. and Paul, T. D. (2017), ‘Mapping the early language environment using all-day recordings and automated analysis’, American Journal of Speech-language Pathology, 26(2): 248265.CrossRefGoogle ScholarPubMed
Gurol-Urganci, I., de Jongh, T., Vodopivec-Jamsek, V., Atun, R. and Car, J. (2013), ‘Mobile phone messaging reminders for attendance at healthcare appointments’, Cochrane Database of Systematic Reviews, 2012 Jul 11(7): CD007458.Google Scholar
Hart, B. and Risley, T. R. (1995), Meaningful differences in the everyday experience of young American children. Baltimore, MD: Paul H. Brookers Publishing.Google Scholar
Hurtado, N., Marchman, V. A. and Fernald, A. (2008), ‘Does input influence uptake? Links between maternal talk, processing speed and vocabulary size in Spanish-learning children’, Developmental Science, 11(6): 3139.CrossRefGoogle ScholarPubMed
Hurwitz, L. B., Lauricella, A. R., Hanson, A., Raden, A. and Wartella, E. (2015), ‘Supporting Head Start parents: impact of a text message intervention on parent–child activity engagement’, Early Child Development and Care, 185(9): 13731389.CrossRefGoogle Scholar
Johnston, C. and Mash, E. J. (1989), ‘A measure of parenting satisfaction and efficacy’, Journal of Clinical Child Psychology, 18(2): 167175.CrossRefGoogle Scholar
Karp, S. M., Lutenbacher, M. and Wallston, K. A. (2015), ‘Evaluation of the parenting sense of competence scale in mothers of infants’, Journal of Child and Family Studies, 24(11): 34743481.CrossRefGoogle Scholar
Kirby, K. N., Petry, N. M. and Bickel, W. K. (1999), ‘Heroin addicts have higher discount rates for delayed rewards than non-drug-using controls’, Journal of Experimental Psychology: General, 128(1): 7887.CrossRefGoogle ScholarPubMed
Lavechin, M., de Seyssel, M., Gautheron, L., Dupoux, E. and Cristia, A. (2021), Reverse-engineering language acquisition with child-centered long-form recordings. https://doi.org/10.31234/osf.io/pt9xq.CrossRefGoogle Scholar
Law, J., Rush, R., Schoon, I. and Parsons, S. (2009), ‘Modeling developmental language difficulties from school entry into adulthood: literacy, mental health, and employment outcomes’, The Journal of Speech, Language, and Hearing Research, 52(6): 14011416.CrossRefGoogle ScholarPubMed
Le Franc, A., Riebling, E., Karadayi, J., Wang, Y., Scaff, C., Metze, F. and Cristia, A. (2018), The ACLEW DiViMe: An Easy-to-use Diarization Tool. In INTERSPEECH Conference Proceedings (pp. 1383–1387).CrossRefGoogle Scholar
Leung, C. Y. Y., Hernandez, M. W. and Suskind, D. L. (2020), ‘Enriching home language environment among families from low-SES backgrounds: A randomized controlled trial of a home visiting curriculum’, Early Childhood Research Quaterly, 50: 2435.CrossRefGoogle Scholar
Liu, H. M., Tsao, F. M. and Kuhl, P. K. (2009), ‘Age related changes in acoustic modifications of Mandarin maternal speech to preverbal infants and five-year-old children: A longitudinal study’, Journal of Child Language, 36: 909922.CrossRefGoogle ScholarPubMed
Lopez Boo, F., Cubides Mateus, M., Sorio, R., Garibotto, G. and Berón, C. (2018), Measuring the quality of the home environment of young children in Uruguay: socioeconomic gradients in the home inventory. Inter-American Development Bank Technical Notes. IDB-TN-1550.Google Scholar
Love, J. M., Kisker, E. E., Ross, C., Raikes, H., Constantine, J., Boller, K. and Fuligni, A. S. (2005), ‘The effectiveness of early head start for 3-year-old children and their parents: Lessons for policy and programs’, Developmental Psychology, 41(6): 885.CrossRefGoogle ScholarPubMed
Mani, A., Mullainathan, S., Shafir, E. and Zhao, J. (2013), ‘Poverty impedes cognitive function’, Science, 341(6149): 976980.CrossRefGoogle ScholarPubMed
Mayer, S. E., Kalil, A., Oreopoulos, P. and Gallegos, S. (2018), ‘Using behavioral insights to increase parental engagement: The parents and children together intervention’, Journal of Human Resources, 54(4): 900925.CrossRefGoogle Scholar
McGillion, M., Pine, J. M., Herbert, J. S. and Matthews, D. (2017), ‘A randomized controlled trial to test the effect of promoting caregiver contingent talk on language development in infants from diverse socioeconomic status backgrounds’, Journal of Child Psychology and Psychiatry, 58(10): 11221131.CrossRefGoogle ScholarPubMed
Mendive, S., Mascareño Lara, M., Aldoney, D., Pérez, J. C. and Pezoa, J. P. (2020), ‘Home language and literacy environments and early literacy trajectories of low-socioeconomic status Chilean children’, Child Development, 91(6): 20422062.CrossRefGoogle ScholarPubMed
Meuwissen, A., Giovanelli, A., Labella, M. and Susman-Stillman, A. (2017), Text2Learn: An early literacy texting intervention by community organizations.Google Scholar
Neumann, C. (2020), Avutils. Retrieved from: https://github.com/gobbios/avutils.Google Scholar
NICHD (2000), ‘The relation of child care to cognitive and language development’, Child Development, 71(4): 960980.CrossRefGoogle Scholar
Olds, D. L., Kitzman, H., Cole, R., Robinson, J., Sidora, K., Luckey, D. W. and Holmberg, J. (2004), ‘Effects of nurse home-visiting on maternal life course and child development: age 6 follow-up results of a randomized trial’, Pediatrics, 114(6): 15501559.CrossRefGoogle ScholarPubMed
Phillips, C. B., Hurley, J. C., Angadi, S. S., Todd, M., Berardi, V., Hovell, M. F. and Adams, M. A. (2020), ‘Delay discount rate moderates a physical activity intervention testing immediate rewards’, Behavioral Medicine, 46(2): 142152.CrossRefGoogle ScholarPubMed
Porritt, L. L., Zinser, M. C., Bachorowski, J. A. and Kaplan, P. S. (2014), ‘Depression diagnoses and fundamental frequency-based acoustic cues in maternal infant-directed speech’, Language Learning and Development, 10(1): 5167.CrossRefGoogle ScholarPubMed
Ramírez-Esparza, N., García-Sierra, A. and Kuhl, P. K. (2017), ‘Look who's talking NOW! Parentese speech, social context, and language development across time’, Frontiers in Psychology, 8: 1008.CrossRefGoogle ScholarPubMed
Rodero, E., Potter, R. F. and Prieto, P. (2017), ‘Pitch range variations improve cognitive processing of audio messages’, Human Communication Research, 43(3): 397413.CrossRefGoogle Scholar
Rodriguez, M. L., Dumont, K., Mitchell-Herzfeld, S. D., Walden, N. J. and Greene, R. (2010), ‘Effects of healthy families New York on the promotion of maternal parenting competencies and the prevention of harsh parenting’, Child Abuse & Neglect, 34(10): 711723.CrossRefGoogle ScholarPubMed
Romeo, R. R., Leonard, J. A., Robinson, S. T., West, M. R., Mackey, A. P., Rowe, M. L. and Gabrieli, J. D. E. (2018), ‘Beyond the 30-million-word gap: children's conversational exposure is associated with language-related brain function’, Psychological Science, 29(5): 700710.CrossRefGoogle ScholarPubMed
Rowe, M. L., Raudenbush, S. W. and Goldin-Meadow, S. (2012), ‘The pace of vocabulary growth helps predict later vocabulary skill’, Child Development, 83(2): 508–25.CrossRefGoogle ScholarPubMed
Rubio-Codina, M. and Grantham-McGregor, S. (2020), Validez predictiva de pruebas cortas comúnmente usadas para medir el desarrollo infantil en estudios a gran escala. Inter-American Development Bank Working Paper. DB-WP-1174.CrossRefGoogle Scholar
Schilbach, F., Schofield, H. and Mullainathan, S. (2016), ‘The psychological lives of the poor’, American Economic Review, 106(5): 435–40.CrossRefGoogle ScholarPubMed
Shah, A. K., Mullainathan, S. and Shafir, E. (2012), ‘Some consequences of having too little’, Science, 338(6107): 682685.CrossRefGoogle ScholarPubMed
Shneidman, L. A. and Goldin-Meadow, S. (2012), ‘Language input and acquisition in a Mayan village: How important is directed speech?’, Developmental Science, 15(5): 659673.CrossRefGoogle Scholar
Singh, L., Nestor, S., Parikh, C. and Yull, A. (2009), ‘Influences of infant-directed speech in early word recognition’, Infancy, 14: 654666.CrossRefGoogle ScholarPubMed
Smith, N. A. and Trainor, L. J. (2008), ‘Infant-directed speech is modulated by infant feedback’, Infancy, 13(4): 410420.CrossRefGoogle Scholar
Snow, C. E. and Ferguson, C. A. (1977), Talking to children. Cambridge, UK: Cambridge University Press.Google Scholar
Soderstrom, M. and Wittebolle, K. (2013), ‘When do caregivers talk? The influences of activity and time of day on caregiver speech and child vocalizations in two childcare environments’, PLoS One, 8(11): e80646.CrossRefGoogle ScholarPubMed
Sperry, D. E., Sperry, L. L. and Miller, P. J. (2019), ‘Reexamining the verbal environments of children from different socioeconomic backgrounds’, Child Development, 90(4): 13031318.CrossRefGoogle ScholarPubMed
Thaler, R. H. (2015), Misbehaving: The making of behavioral economics. New York: WW Norton.Google Scholar
The ManyBabies Consortium (2020). ‘Quantifying sources of variability in infancy research using the infant-directed-speech preference’, Advances in Methods and Practices in Psychological Science.Google Scholar
Weber, A., Fernald, A. and Diop, Y. (2017), ‘When cultural norms discourage talking to babies: Effectiveness of a parenting program in rural Senegal’, Child Development, 88(5): 15131526.CrossRefGoogle ScholarPubMed
Weisleder, A. and Fernald, A. (2013), ‘Talking to children matters: early language experience strengthens processing and builds vocabulary’, Psychological Science, 24(11): 21432152.CrossRefGoogle ScholarPubMed
Zangl, R. and Mills, D. L. (2007), ‘Increased brain activity to infant-directed in 6-and 13-month-old infants’, Infancy, 11(1): 3162.CrossRefGoogle Scholar
Zimmerman, F. J., Gilkerson, J., Richards, J. A., Christakis, D. A., Xu, D., Gray, S. and Yapanel, U. (2009), ‘Teaching by listening: The importance of adult-child conversations to language development’, Pediatrics, 124(1): 342349.CrossRefGoogle ScholarPubMed
Figure 0

Figure 1. Crianza Positiva Timeline.

Figure 1

Table 1. Descriptive statistics and balance of family characteristics at baseline

Figure 2

Table 2. Description of language outcomes for control families at follow-up (N = 98)

Figure 3

Table 3. ITT effects of CP messages on language outcomes

Figure 4

Table 4. Robustness of ITT effects to different sets of covariates

Figure 5

Table 5. Effects of CP messages on parental frequency of involvement in literacy activities

Figure 6

Table 6. Heterogeneity of ITT effects by behavioral barrier

Figure 7

Figure A1. Count of Child Vocalizations.

Figure 8

Figure A2. Duration of Child Vocalizations.

Figure 9

Figure A3. Count of Adult Vocalizations.

Figure 10

Figure A4. Duration of Adult Vocalizations.

Figure 11

Figure A5. Adult–Child Turns.

Figure 12

Figure A6. Adult Response Rate 2 s.

Figure 13

Table A1. Pairwise correlations across outcomes

Figure 14

Table A2. TOT effects of CP messages on language outcomes

Figure 15

Table A3. ITT effects of CP messages on language outcomes in the last 300 s of video

Figure 16

Table A4. ITT effects on a larger sample with videos at follow-up