Hostname: page-component-cd9895bd7-fscjk Total loading time: 0 Render date: 2024-12-26T13:21:32.622Z Has data issue: false hasContentIssue false

Estimating Parties’ Policy Positions in Uruguay: Comparing Scaling Methods Based on Legislative Speeches and Roll-Call Votes

Published online by Cambridge University Press:  05 June 2023

Diego Luján
Affiliation:
Diego Luján is a professor. [email protected].
Nicolás Schmidt
Affiliation:
Nicolás Schmidt is a assistant professor. [email protected].
Juan A. Moraes
Affiliation:
Juan A. Moraes is a professor. [email protected].
Rights & Permissions [Opens in a new window]

Abstract

This research note takes advantage of a novel dataset to analyze legislators’ behavior in Uruguay’s Parliament. Comparing the positions of legislators based on floor speeches and roll-call voting, it discusses the relationship between discourse and voting among individual legislators and parties. The dataset contains more than 57,000 speeches from more than 1,000 Uruguayan legislators between 1985 and 2015 and its related R package. The study estimates the parties’ policy positions on the basis of two data sources, roll-call votes and floor speeches, and then compares both results. Contrary to expectations, no clear association appears between the two scaling methods, demonstrating that vote and legislative speech may reflect the behavior of individual legislators with potentially conflicting goals. Strategic calculations or party discipline may be plausible explanations for the divergent results obtained from text and roll-call scaling methods.

Type
Research Notes
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of the University of Miami

Political scientists frequently attribute a relevant role to actors’ policy preferences in the explanation of a wide range of political phenomena and have sought to estimate them as accurately as possible. In recent years, text analysis has appeared as a low-cost and reliable alternative to more restrictive data-gathering strategies based on roll-call voting in legislatures, and as a matter of fact, it has been suggested as a reasonable substitute where roll-call data are simply not available (Slapin and Proksch Reference Slapin and Proksch2014). The underlying presumption is that the use of different scaling techniques using archival, documental, or speech data is convergent with ideal point estimation using roll-call data.

This research note asks if this is the case. It uses a novel dataset gathering more than 57,000 speeches from about 1,000 Uruguayan legislators between 1985 and 2015. As a way of illustrating the potential uses of the dataset, we address a theoretically relevant question on legislative behavior applied to the Uruguayan Parliament: How is legislative speech related to voting? The study takes advantage of the functionalities included in the R package related to the database to collect all roll-call votes during the period, and then it compares voting patterns with text analysis among the same legislators, in line with some previous studies for advanced democracies (Schwarz et al. Reference Schwarz, Traber and Benoit2017).

We find an overall low level of convergence between the speech and the voting patterns of individual legislators, contrary to the underlying presumption but in line with the findings from previous studies like Proksch and Slapin Reference Proksch and Slapin2012, Schwarz et al. Reference Schwarz, Traber and Benoit2017, and Izumi and Medeiros Reference Izumi and Medeiros2021. While both techniques allow for intraparty differentiation, the analysis of roll calls exhibits greater potential to capture differences between parties, showing that individual legislators may use speech and vote to pursue different goals.

Uruguay is an interesting case to address this issue because of its political stability, as well as the longevity and institutionalization of its main political parties (Mainwaring Reference Mainwaring2018; Piñeiro and Rosenblatt Reference Piñeiro Rodríguez and Rosenblatt2020). Additionally, it is a case of high reelection rates among both deputies and senators (Chasquetti Reference Chasquetti2014). This allows us to systematically compare individual legislators and parties across a long period. Uruguayan political parties also diverge in their policy positions and programmatic profiles, allowing us to analyze the relationship between voting and speech in Parliament.

This research provides a twofold contribution to the study of text analysis in social science and legislative studies. The original dataset overcomes a series of limitations in text analysis, making available flexible and ready-to-use data. This dataset is the result of applying different computational techniques implemented on the statistical software R (R Core Team 2017) to solve the trade-offs between extension and depth in dealing with large volumes of written content (Welbers et al. Reference Welbers, Van Atteveldt and Benoit2017). Likewise, this work enables the treatment of a particular type of content, related to collective actors (like parliaments), which comprise a large number of individual speakers who belong to other collective actors, such as parties, factions, or legislative committees.

The second contribution is to discuss the relationship between vote and speech in the Uruguayan Parliament. For this purpose, the data were used to collect every roll-call vote in a legislature that marginally uses this kind of voting rule.Footnote 1 That process allowed for the estimation of W-nominate scores for each available vote during the period and further comparison of voting behavior with Wordscores as a standard scaling method for text analysis.

The article proceeds as follows. First it introduces the basic traits of Uruguayan politics and the set of techniques involving the dataset and briefly describes the data. Then it outlines two scaling methods for estimating the positions of individual legislators and parties, based on two types of data: roll-call votes and floor speeches. This application illustrates the potential utility of the dataset and its related R package for students interested in the ideological location of parties in the policy space. To conclude, it reviews the main findings of the research and suggests a research agenda for the study of a large array of political phenomena based on legislative speeches.

Gathering Legislative Speeches in the Uruguayan Parliament

Uruguay is one of the oldest and stablest democratic regimes in Latin America, leading, along with Costa Rica, most rankings measuring the quality of democracy in the region (Coppedge et al. Reference Coppedge, Gerring and Lindberg2017; Polity I 2012; Freedom House 2014). Its highly institutionalized party system (Mainwaring Reference Mainwaring2018), established by Nationalists and Colorados (PN and PC) during the nineteenth century and lately the Frente Amplio (FA, born in 1971), is considered the backbone of Uruguayan democracy.

To a large extent, such persistence of parties in Uruguayan politics is rooted in the ability of partisan elites to preserve clear programmatic and ideological stances in party competition. For this reason, Uruguay is an interesting case to reconstruct the preferences of parties and legislators over time. Given this opportunity, the main concern of this research note is whether the positions of Uruguayan parties are identifiable and differentiable across legislatures using text analysis. And more important: are different scaling methods used to estimate the preferences of parties convergent or divergent over time? Because these questions have received little attention in the research, particularly among Latin American countries (for an exception see Kim et al. Reference Kim, Londregan and Ratkovic2018; and Izumi and Medeiros Reference Izumi and Medeiros2021 for the Brazilian case), the goal here is to contribute to current and future research on these topics, comparing two scaling methods that make use of a different type of data to address our research questions.

Traditionally, studies of political behavior have used legislative surveys, expert surveys, and party manifestos to reconstruct the preferences of political actors (Benoit and Laver Reference Benoit and Laver2006, Reference Benoit and Laver2007; Knutsen Reference Knutsen1998; Laver and Hunt Reference Laver and Ben Hunt1992). However, despite their widespread use and undeniable virtues, these sources have their own shortcomings (Slapin and Proksch Reference Slapin and Proksch2014). On the one hand, there are temporal limitations to rebuilding the preferences of actors, due to generations of deceased voters, extinct political parties, or disappeared economic elites, among many other limitations, especially in third wave democracies. On the other hand, spatial variation is also challenging, as a great number of countries and other subnational institutions have made only limited advances in data availability.

More recently, scholars have developed new techniques to complement attitudinal studies, with data coming from roll-call votes in legislatures (Poole and Rosenthal Reference Poole and Rosenthal2000; Carroll and Poole Reference Carroll and Poole2014), social networks (Bond and Messing Reference Bond and Messing2015), archival data, and public speeches (Kim et al. Reference Kim, Londregan and Ratkovic2018; Laver and Garry Reference Laver and Garry2000; Huber and Inglehart Reference Huber and Inglehart1995; Dinas and Gemenis Reference Dinas and Gemenis2010; Laver et al. Reference Laver, Benoit and Garry2003; Welbers et al. Reference Welbers, Van Atteveldt and Benoit2017; Ramey et al. Reference Ramey, Klingler and Hollibaugh2019; Slapin and Proksch Reference Slapin and Proksch2014). In all cases, the goal has been to collect and systematize data to infer the preferences of different individual and collective actors over time and under different political and institutional contexts. Among these alternatives, W-nominate scores (Poole et al. Reference Poole, Lewis, Lo and Carroll2011) and other ideal point estimation techniques (Jackman Reference Jackman2001; Martin and Quinn Reference Martin and Quinn2002; Kellermann Reference Kellermann2012) based on roll calls have become one of the most reliable pieces of evidence to capture the preferences of political actors. Yet roll-call data availability is very limited beyond most advanced democracies.

In this context, text analysis has emerged as a reasonable option to bypass the limitations of roll-call records in most democracies around the world. As a matter of fact, text analysis has been considered a viable alternative to solve the limitations of time and space usually attributed to roll-call data (Curini et al. Reference Curini, Hino and Osaka2018; Lauderdale and Herzog Reference Lauderdale and Herzog2016). However, the content analysis techniques (of examining speeches, documents, or written records of events) also presents some weaknesses that have captured the attention of experts (Gabel and Huber Reference Gabel and Huber2000; Klemmensen et al. Reference Klemmensen, Hobolt and Hansen2007). On the one hand, the method requires a considerable amount of resources, particularly when it is applied to important volumes of documents or speeches. The need for reliable and systematic coding leads to significant demand for manual work. On the other hand, the ability to extend this technique to analyze content across time and space has important technical limitations. Coding and systematizing large volumes of text from manual work is onerous for one or even several encoders. Due to these limitations, extant applications have mainly focused on industrialized democracies. Our new dataset aims to expand those limits.

We built on extant techniques to develop an original dataset implemented on R (R Core Team 2017). The sequence starts with web scraping to obtain the Actas de Sesión; that is, the formal registry of all the activities involved in an individual legislative session. Some of these available documents on the Uruguayan legislative website are in HTML and some are in PDF format. The remaining documents that were not available on the web were manually scanned and transformed into PDF format.

The data generation process is flexible, giving researchers different options to compile the resulting dataset according to their interests. Depending on the research goal, scholars could be interested in different portions or contents of legislative speeches. The package includes a function allowing users to compile and recompile the dataset with different structures. For example, if someone is interested in scaling the position of parties, the dataset can be compiled using this unit of analysis. Also, the dataset can be recompiled with individual legislators as the unit of analysis, which enables adding new variables like factions, electoral districts, and district size, among many others that might interest students of legislative bodies and other research fields. Additionally, it is possible to track specific debates within and among legislatures (e.g., abortion, human rights, budgeting) using functions included in packages like quanteda (Benoit et al. Reference Benoit, Watanabe and Wang2018).

Figure 1 displays the building process of our dataset. First, the resulting structure of the text-mining process considers that some or several legislators speak at different times during the same session. Notice that this is a dataset of speeches performed by legislators who invariably belong to a certain political party. This rule excludes some individuals who participate in legislative sessions, like cabinet ministers, and staff members who are frequently invited to participate in committee and general sessions by request. The main function of the package has a search algorithm based on regular expressions that eliminate these cases.

Figure 1. Sequence of the Data Generation Process

Formally, figure 2 shows the resulting structure of the dataset. In order to work with individual speeches, we extract and cluster all the interventions for each legislator who speaks in a single session. Then we check whether the legislators’ names are correctly written and correct the multiple typing errors in the original document to cluster the interventions for each legislator. Compiling the dataset by individual legislators, the first variable is the legislator’s name, followed by other variables like the legislature number, the date and content of the speech, the legislator’s party, and the document type.Footnote 2 Depending on the research interest, the dataset can be recompiled using the legislator´s party as the unit of analysis or any other added variable of interest, such as gender, district magnitude, or others.

Figure 2. Resulting Structure of the DGP

The resulting dataset contains more than 57,000 observations, covering more than 1,000 unique legislators from 7 political parties across 6 legislative periods for 3 different chambers: the Chamber of Representatives, the Senate, and the Asamblea General (General Assembly) (as the fusion of the upper and lower chambers). One of the main features of the dataset and the speech package is the easy integration with available packages and tools for computational text analysis, such as quanteda (Benoit et al. Reference Benoit, Watanabe and Wang2018) and tm (Feinerer and Hornik Reference Feinerer and Hornik2018; Feinerer et al. Reference Feinerer, Hornik and Meyer2008). The structure of the dataset was conceived for that purpose, allowing researchers to compute scores, obtain summary statistics and frequency tables, and generate word clouds and other descriptive measures of interest. Also, it is possible to estimate scores for an individual legislator across different legislative periods, or for a specific set of legislators, parties, or any other combination in which the researcher could be interested. Table 1 shows the variables included in the dataset. In addition to the content of legislative speeches, several variables allow working with auxiliary information, like the legislator’s party, gender, or electoral district.

Table 1. Variables Included in the Dataset

Table 2 and figure 3 present descriptive statistics related to the speeches and the number of legislators in the dataset by legislative period. For example, in the first legislature after the transition to democracy (1985–90), a total of 236 legislators (between incumbents and alternates) spoke on the floor, delivering 7,645 speeches in 424 sessions. The average length of those speeches was 1,156 words. Figure 3 shows the distribution of legislative speeches among parties and across legislative periods. There is some systematic variation, suggesting that government parties tend to deliver shorter speeches on the floor while opposition parties tend to do the opposite.

Table 2. Descriptive Statistics of Legislative Speeches in the Uruguayan Parliament, 1985–2015

Figure 3. Number of Words in Individual Legislators’ Speeches in Uruguayan Parliament, 1985–2015, by party

During the first legislative period (1985–90), the legislators of the governing party (PC) delivered speeches with a lower median compared to the opposition parties (PN and FA). During the following three legislative periods, the PC and the PN governed together as a coalition, and both parties’ legislators made shorter speeches compared to the main opposition party, the FA. This was reversed in 2005, when the FA became the governing party. During the following two legislative periods (2005–15), the FA’s legislators delivered shorter legislative speeches than legislators of both opposition parties, the PN and the PC. This is a reasonable finding; time is important for legislative work, so governing parties are mainly concerned with approving legislation, while opposition parties may be more interested in delivering speeches for position-taking purposes.

Estimating Policy Positions of Uruguayan Parties Using Roll-Call Votes and Legislative Speeches

Given the importance of its political parties, Uruguayan democracy has been characterized as a partidocracia (Caetano et al. Reference Caetano, Rilla and Pérez1987). The continuity of the main party labels, as well as the salience of their ideological profiles, is a propitious context to address the main goal of this research note. That is, to estimate their policy positions through different scaling methods based on individual legislative speeches and roll-call voting, and compare them to find whether systematic differences emerge.

Although estimating ideological or policy positions from text and roll calls raises some concern because of the limitations that individual legislators might face when voting, both sources of data continue to be the central piece for estimating parties’ and individuals’ preferences (Carroll and Poole Reference Carroll and Poole2014, Slapin and Proksch Reference Slapin and Proksch2014). For example, previous studies focused on the Uruguayan Senate and the Brazilian Congress have shown that ideal point estimates may be reflecting latent dimensions different from ideology (Izumi Reference Izumi2016; Zucco Reference Zucco2009, Reference Zucco2013; Izumi and Medeiros Reference Izumi and Medeiros2021). Also, according to some studies, legislative speeches can be restricted for strategic reasons in a way similar to voting (Proksch and Slapin Reference Proksch and Slapin2012). However, other studies argue that legislative speeches are less constrained than voting, as long as legislators vote according to the party line (Schwarz et al. Reference Schwarz, Traber and Benoit2017).

Therefore, we ask whether Uruguayan legislators and parties are more or less constrained by these factors. We are aware that under some circumstances the government-opposition cleavage or party discipline can influence legislators’ behavior, and that is one additional reason to perform the comparison between speech and vote. If there were no restrictions on the behavior of individual legislators, one would expect to find congruence between what they say on the floor and how they vote. But precisely because it is expected that those restrictions are operative, we seek to systematically compare parties’ positions estimated from legislative speeches and roll calls.

To address our main concern, we first estimated parties’ positions from their legislators’ floor speeches. One of the most widespread methods to infer the parties’ policy positions from political texts is Wordscores (Laver et al. Reference Laver, Benoit and Garry2003; Lowe Reference Lowe2008; Lowe et al. Reference Lowe, Benoit, Mikhaylov and Laver2011; Klemmensen et al. Reference Klemmensen, Hobolt and Hansen2007). Based on all available floor speeches during the period 1985–2005, figure 4 shows the Wordscores for the three main political parties in the General Assembly.Footnote 3 Since Wordscores requires setting reference texts with known scores in the relevant dimension (Slapin and Proksch Reference Slapin and Proksch2014), we used the speeches of individual legislators known for their ideological positions.Footnote 4 As the figure shows, the comparison among parties shows little ideological differentiation at the party level. Nevertheless, figure 4 also reveals an important intraparty variation in policy preferences. This variation reflects the well-known factional nature of Uruguayan parties, showing the differences that may be associated with the policy positions of factions (Buquet et al. Reference Buquet, Chasquetti and Moraes1998; Moraes Reference Moraes, Siavelis and Morgenstern2008; Magar and Moraes Reference Magar and Moraes2012). Thus, based on text scaling, Uruguayan parties display a very similar ideological outlook, with an internal differentiation that we attribute to their traditionally factionalized structure, in line with the literature on Uruguayan parties.

Figure 4. Wordscores in Uruguayan General Assembly, 1985–2005

Second, we proceeded to estimate parties’ positions from roll-call data. Nominal votes are exceptional in Latin American politics, and Uruguay is not the exception (Carey Reference Carey2009; Zucco Reference Zucco2013; Ainsley et al. Reference Ainsley, Carrubba and Crisp2020). Because the dataset allows for a specific search of words and text patterns, we searched for all the available roll calls during the period. In line with Ainsley et al. (Reference Ainsley, Carrubba and Crisp2020), we identified roll-call votes from an automatized search for regular expressions.Footnote 5 However, unlike Ainsley et al. (Reference Ainsley, Carrubba and Crisp2020) or Zucco (Reference Zucco2013), we decided to keep only the roll calls associated with the treatment of executive vetoes that take place in the Asamblea General, since the vast majority of nominal votes in the lower or upper chambers refer to specific issues on administrative decisions of the chamber (e.g., the election of authorities and other administrative personnel) that cannot be coerced into yes or no, which is crucial to running the W-nominates algorithm.

Additionally, the treatment of executive vetoes is the only legislative instance in which nominal voting is mandatory, and thus it constitutes the most reliable and comparable source of roll calls in the Uruguayan Parliament. Therefore, limiting the analysis of roll calls to the executive vetoes allows us to deal with a more homogeneous corpus of nominal votes, restricted to substantive and divisive issues that emerge from potentially conflicting interests between legislative and executive powers.Footnote 6 Thus, we collected 77 roll-call votes during the 4 legislative periods under consideration, which were used to calculate another standard measure of ideal point estimation: W-nominate scores.Footnote 7

Figure 5 displays the W-nominate scores for the three main Uruguayan parties for the period 1985–2005, using all available roll calls.Footnote 8 Unlike Wordscores, W-nominate scores display a substantive level of party differentiation. Such differentiation at the party level is largely consistent with the programmatic or ideological differences observed by domestic and comparative students of parties who rely on elite survey data or public opinion polls. Furthermore, our W-nominate data also show a great deal of within-party differentiation, which resembles our estimations in Wordscores. Therefore, parties differentiate not only from each other but also among themselves, as students have observed using different sources of empirical evidence (Buquet et al. Reference Buquet, Chasquetti and Moraes1998; Moraes Reference Moraes, Siavelis and Morgenstern2008; Magar and Moraes Reference Magar and Moraes2012; Chasquetti Reference Chasquetti2014). Overall, W-nominate scores are closer to the findings observed in the specialized literature.

Figure 5. W-nominate Scores in Uruguayan General Assembly, 1985–2005

We went on to compare the results of W-nominate scores with Wordscores. Interestingly, a large body of literature in recent years has pointed out that political discourse is an appropriate way to capture the policy preferences of different political agents (Benoit and Herzog Reference Benoit, Herzog, Bachner, Wagner Hill and Ginsberg2017). This means that if there is an essential difference among parties (and alternatively among factions), it should be captured by analyzing political speeches, as well as other political expressions using documents, newspapers, and other sources. Yet our results using Wordscores do not reveal such differentiation at the party level. Instead, the only observed differentiation occurs within parties, as can be seen in figure 5.

Regarding the systematic differences among scaling methods, figure 6 shows that there is no clear association. The overall correlation is low (r = 0.22), and it is statistically significant at 99 percent confidence. When we look at partial correlations across legislative periods, some differences emerge. It can be seen that for the periods 1985–90 and 2000–2005, there is no clear association between speech and roll-call votes. But for the periods 1990–95 and 1995–2000. there is a moderate to high correlation (0.64 and 0.76, respectively). Not surprisingly, these two legislative periods were characterized by the functioning of governing coalitions between the PC and the PN that reconfigured Uruguayan politics and laid the foundations for the electoral reform of 1997, motivated by the electoral rise of the FA.

Figure 6. Comparing Parties’ Policy Positions Through Wordscores and W-nominate in Uruguayan General Assembly, 1985–2005

According to our estimations using speech, legislators seem to agree on a similar set of terms, using a highly homogeneous vocabulary that does not allow us to make a clear differentiation among parties with legislative representation. Such lack of differentiation contrasts with the voting patterns of parties in the legislature, where they are distinct from each other. Arguably, legislators may speak with a very similar jargon or set of words and expressions, from which we cannot identify different policy orientations. However, differentiation appears when parties and factions step up to vote.

These results are somewhat surprising, given scholars’ observations on the nature of programmatic parties in Uruguay, which would lead us to expect more congruence between speech and voting patterns. The divergence we found between the two scaling methods, based on a different type of data, does not let us confirm this expectation. While estimating the parties’ positions through roll-call data leads to differentiation among parties (as well as inside parties), the estimation using floor speeches does not confirm that differentiating pattern. These results are similar to those of Schwarz et al. (Reference Schwarz, Traber and Benoit2017) for the Swiss legislature, who also find that text scaling from legislative speeches reveals intraparty differentiation more clearly than roll calls.

The divergence among scaling methods may be due to the methods themselves, or to the fact that both data sources are the product of different goals on the part of individual legislators. In other words, it is possible that when an individual legislator speaks on the floor, he or she intends to maximize the expressive function of legislative work (Andeweg Reference Andeweg2014). But when the same legislator votes, he or she may be maximizing party goals, especially among disciplined parties like those in Uruguay. In this case, we expect to find some divergence among the policy positions based on legislative speeches compared with those coming from roll-call data, in line with what Izumi and Medeiros (Reference Izumi and Medeiros2021) found for the Brazilian Senate.

To check the robustness of our results, we replicated the analysis using two alternative, unsupervised scaling methods based on both speech and roll calls. We estimated the parties’ policy positions through Wordfish (Slapin and Proksch Reference Slapin and Proksch2008) to overcome the issues that may arise by selecting the reference texts. Wordfish is an unsupervised scaling algorithm designed to estimate policy positions based on word frequencies that, unlike Wordscores, does not require the identification of reference texts (Slapin and Proksch Reference Slapin and Proksch2008). Our results using Wordfish show a very similar picture in terms of differentiation within and among parties. We also estimated parties’ positions from roll-call votes through IDEAL (Jackman Reference Jackman2001; Clinton et al. Reference Clinton, Jackman and Rivers2004), a Bayesian scaling method that allows us to overcome the problems that may arise as a consequence of the low number of roll calls, and perhaps more important, to avoid identifying the “polar” legislators, as in W-nominates. As with Wordfish, the estimations from IDEAL show very similar results to those obtained using W-nominates.Footnote 9

Both robustness tests allow us to be confident that our results are not heavily dependent on the decisions we made by selecting the reference texts and the “polar” legislators when scaling parties’ positions through Wordscores and W-nominates. In turn, and most important, they also suggest that the divergence we found between speech and vote is not dependent on the scaling method but on the fact that they are reflecting different behavior by individual legislators, factions, and parties. It seems that floor speeches in the General Assembly do not allow for capturing the variation in parties’ policy positions. Unlike roll calls, legislative discourse does not seem to be enough to differentiate among parties, but it does allow for intraparty differentiation, which is a key factor for the study of Uruguayan parties, given their high levels of fractionalization.

In sum, legislative speeches are not necessarily a good proxy to substitute for legislative voting using roll-call data, if we are to identify the ideological basis of political parties (Kim et al. Reference Kim, Londregan and Ratkovic2018). Instead, the fact that speech and votes are not highly correlated indicates that partisan structures are strong enough to impose certain decisions beyond the preferences or political discourse of individual legislators. If this is the case, the role of leadership within parties and factions must be subject to further research.

Concluding Remarks

This research note introduces a novel dataset and its related R package, resulting from a data generation process that involved web scraping, cleaning, and organizing more than 57,000 floor speeches in Uruguayan Parliament between 1985 and 2015. Taken as a whole, the dataset constitutes an effort to address the difficulties involved in analyzing a large amount of data coming from text and discourse, especially when researchers have to deal with multiple speakers in deliberative bodies. The first contribution this research aims to make for researchers interested in these issues is the accessibility of the data, as well as the replicability of the entire data generation process. A set of new and preexisting techniques were used to build our dataset, which we expect will create incentives to explore new applications for a wide variety of research fields, based on specific needs.

Regarding the main concern of this research note, our results indicate that quantitative text analysis reveals diverging outcomes compared to scaling methods using roll-call data. More specifically, voting patterns in the Uruguayan case seem to be a more effective method to capture the ideological preferences of political parties, compared to legislative speeches that do not reveal differentiation among parties. The divergence between scaling parties’ positions from legislative speech and vote could be due to strategic reasons, as suggested by previous works. As Zucco (Reference Zucco2013) and Izumi (Reference Izumi2016) have demonstrated, the government-opposition dimension can alter the way legislators vote regardless of their ideological or policy preferences.

Additionally, as some studies have shown for advanced and developing democracies, legislative speeches may be a more reliable source for determining the sincere policy positions of individual legislators and parties, since they are less constrained by party leaders, compared to voting behavior (Proksch and Slapin Reference Proksch and Slapin2012; Schwarz et al. Reference Schwarz, Traber and Benoit2017; Izumi and Medeiros Reference Izumi and Medeiros2021). This research note has shown that for the Uruguayan case, legislative speech shows intraparty differentiation but does not allow for differentiation between parties. Roll-call voting, in turn, shows greater potential to capture differences in party positions while also revealing important intraparty differences. Whether these differences are due to strategic reasons or to the government-opposition dimension is a relevant question that could be addressed in detail in future research.

From a methodological standpoint, our results show that scaling techniques based on text, like Wordscores or Wordfish, may not substitute for those based on roll calls, like W-nominate or other ideal point estimation techniques, and vice versa; instead, they complement each other in finding relevant differences between what legislators say and how they vote. This can be seen as our second contribution to the research on these topics, to the extent that the gap may be explained by the different goals that individual legislators face during the legislative work. These partially conflicting goals may lead legislators to balance speech and vote in a way that maximizes the benefits of party unity (e.g., belonging to a united party that constitutes a useful label for electoral purposes) while they signal differential positions to specific constituencies that are important for the legislator individually (e.g., they can help to define a personal profile for legislators seeking to send a signal to specific social groups that form the legislator’s basis of electoral support). Indeed, the divergence among scaling methods has to be seen as a puzzle open to future research, as it may indicate the role played by partisan structures, as well as leadership, in determining party discipline.

Acknowledgment

The authors acknowledge the support of the Comisión Sectorial de Investigación Científica (Universidad de la República) through the Proyecto de I+D N° 293 “Partidos programáticos y comportamiento electoral en América Latina.”

Supporting Information

For replication data, see the author’s file on the Harvard Dataverse website: https://dataverse.harvard.edu/dataverse/laps

Footnotes

All in the Departamento de Ciencia Política, Universidad de la República, Montevideo, Uruguay.

Competing interests: The authors declare none.

1 In the Uruguayan Parliament, roll-call votes are used only in the Asamblea General (the union of the Senate and the Chamber of Representatives) for the treatment of executive vetoes. As an exception, individual legislators can request a roll-call vote for some specific voting, but it has to be approved by a majority of the chamber.

2 Although the current version of the dataset covers the period from 1985 to 2015, the R package allows us to update it to the present.

3 To be able to compare both scaling methods, we had to use data from the Asamblea General, as roll-call votes are available only for executive vetoes, which are treated by this chamber. Except for certain bills for which there is a majority in each chamber to vote on a nominal basis, all bills are passed by an ordinal voting rule, thus limiting us to comparing scaling methods based on the Asamblea General.

4 As Slapin and Proksch (Reference Slapin and Proksch2014) argue, the choice of reference texts is an important step for estimating and analyzing Wordscores. Therefore, to select the reference texts, we had to take care of several considerations. First, for comparability reasons, we selected individual legislators with activity in more than one legislative period. Second, having restricted our analysis to the sessions in the Asamblea General, we constrained the selection to those legislators whose speeches were long enough for the Wordscores algorithm to differentiate between them. Third, we selected several legislators for each legislative period and assigned them the reference scores from left to right on the ideological dimension. The list of individual legislators is as follows: 1985–1990: José Araujo, Reinaldo Gargano (FA), Yamandú Fau (FA), Luis Heber (PN), Luis Lacalle (PN), Pablo Millor (PC); 1990–1995: Guillermo Chifflet (FA), Reinaldo Gargano (FA), Yamandú Fau (NE), Luis Heber (PN), Daniel García Pintos (PC); 1995–2000: José Mujica (FA), Guillermo Chifflet (FA), Yamandú Fau (PC), Luis Heber (PN), Daniel García Pintos (PC); 2000–2005: Guillermo Chifflet (FA), José Mujica (FA), Yamandú Fau (PC), Luis Heber (PN), Daniel García Pintos (PC).

5 The R package related to our database contains a function that allows for detecting and extracting nominal votes from the Actas de Sesión (parliamentary session minutes).

6 Because of the small number of available roll calls, we used a Monte Carlo simulation to reach a number of observations that enable the scaling method to perform the estimation of W-nominate scores for each legislator (Martin and Quinn Reference Martin and Quinn2002). The number of roll-call votes in the Uruguayan Congress is low because, as noted, this voting procedure is exceptional and is mandatory only for the legislative treatment of executive vetoes. This poses some difficulties for the scaling process necessary to estimate the legislators’ ideal points. Facing this problem, which is not well developed in the literature, we implemented an algorithm that generates roll-call votes with Gibbs sampling from MCMC. This implementation has the advantage that it generates votes with the same distribution by party for each session in which there was nominal voting. This is important because it allows us to solve a strictly technical problem without introducing perturbations in the distribution of the data that can make the scaling results weak. For more details on this procedure, see our replication file, which is available online.

7 For our empirical estimations, we dropped the legislative periods 2005–10 and 2010–15 because they did not have enough executive vetoes to run the W-nominates. During its two first administrations, the Frente Amplio held the majority in both the Senate and the Chamber of Representatives, so President Tabaré Vázquez (2005–10) used the veto power only twice, while President José Mujica (2010–15) did not use it during his administration.

8 Estimating the W-nominates scores requires identifying a “polar” legislator with a positive ideal point on the dimension of interest. In our estimations of W-nominates scores, we selected individual legislators who are known for being located at the left of the ideological dimension: Nelson Lorenzo Rovira (FA) for 1985–90; Hugo Cores (FA) for 1990–95; Helios Sarthou (FA) for 1995–2000; and José Mujica (FA) for 2000–2005.

9 The results of the robustness tests using Wordfish and IDEAL can be seen on the supplementary material available online together with the database and the replication files.

References

Ainsley, Caitlin, Carrubba, Clifford, Crisp, Brian, et al. 2020. Roll-call Vote Selection: Implications for the Study of Legislative Politics. American Political Science Review 114, 3: 691706.CrossRefGoogle Scholar
Andeweg, Rudy B. 2014. Roles in Legislatures. In Martin et al. 2014. 267–86.Google Scholar
Benoit, Kenneth, and Herzog, Alexander. 2017. Text Analysis: Estimating Policy Preferences from Written and Spoken Words. In Analytics, Policy and Governance, ed. Bachner, Jennifer, Wagner Hill, Kathryn, and Ginsberg, Benjamin. New Haven: Yale University Press. 137–59.Google Scholar
Benoit, Kenneth, and Laver, Michael. 2006. Party Policy in Modern Democracies. London: Routledge.CrossRefGoogle Scholar
Benoit, Kenneth, and Laver, Michael. 2007. Estimating Party Policy Positions: Comparing Expert Surveys and Hand-Coded Content Analysis. Electoral Studies 26, 1: 90107.CrossRefGoogle Scholar
Benoit, Kenneth, Watanabe, Kohei, Wang, Haiyan,et al. 2018. quanteda: An R Package for the Quantitative Analysis of Textual Data. Journal of Open Source Software 3, 30: 774. DOI 10.21105/joss.00774. https://quanteda.io CrossRefGoogle Scholar
Bond, Robert, and Messing, Solomon. 2015. Quantifying Social Media’s Political Space: Estimating Ideology from Publicly Revealed Preferences on Facebook. American Political Science Review 109, 1: 6278.CrossRefGoogle Scholar
Buquet, Daniel, Chasquetti, Daniel, and Moraes, Juan A.. 1998. ¿Un enfermo imaginario? Fragmentación política y gobierno en Uruguay. Montevideo: FCS-ICP.Google Scholar
Caetano, Gerardo, Rilla, José, and Pérez, Romeo. 1987. La partidocracia uruguaya. Historia y teoría de la centralidad de los partidos políticos. Cuadernos del CLAEH 44, 4: 3762.Google Scholar
Carey, John M.k 2009. Legislative Voting and Accountability in Latin America. Cambridge: Cambridge University Press.Google Scholar
Carroll, Royce, and Poole, Keith. 2014. Roll Call Analysis and the Study of Legislatures. In Martin et al. 2014. 103–25.Google Scholar
Chasquetti, Daniel. 2014. Parlamento y carreras legislativas en Uruguay: un estudio sobre reglas, partidos y legisladores en las cámaras. Montevideo: Universidad de la República, Facultad de Ciencias Sociales, Instituto de Ciencia Política.Google Scholar
Clinton, Joshua, Jackman, Simon, and Rivers, Douglas. 2004. The Statistical Analysis of Roll Call Data. American Political Science Review 98, 2: 355–70.CrossRefGoogle Scholar
Coppedge, Michael, Gerring, John, Lindberg, Staffan, et al. 2017. V-dem dataset v7. https://www.v-dem.net/en CrossRefGoogle Scholar
Curini, Luigi, Hino, Airo, and Osaka, Atsushi. 2018. The Intensity of Government–Opposition Divide as Measured Through Legislative Speeches and What We Can Learn from It: Analyses of Japanese Parliamentary Debates, 1953–2013. Government and Opposition 55, 2: 184201.CrossRefGoogle Scholar
Dinas, Elias, and Gemenis, Kostas. 2010. Measuring Parties’ Ideological Positions with Manifesto Data: A Critical Evaluation of the Competing Methods. Party Politics 16, 4: 427–50.10.1177/1354068809343107CrossRefGoogle Scholar
Feinerer, Ingo, and Hornik, Kurt. 2018. tm: Text Mining Package. R package version 0.7-6. https://CRAN.R-project.org/package=tm Google Scholar
Feinerer, Ingo, Hornik, Kurt, and Meyer, David. 2008. Text Mining Infrastructure in R. Journal of Statistical Software 25, 5: 154. http://www.jstatsoft.org/v25/i05 CrossRefGoogle Scholar
Freedom House. 2014. Freedom in the World 2014: The Annual Survey of Political Rights and Civil Liberties. Lanham: Rowman and Littlefield.Google Scholar
Gabel, Matthew, and Huber, John D.. 2000. Putting Parties in Their Place: Inferring Party Left-Right Ideological Positions from Party Manifestos Data. American Journal of Political Science 44, 1: 94103.CrossRefGoogle Scholar
Huber, John, and Inglehart, Ronald. 1995. Expert Interpretations of Party Space and Party Locations in 42 Societies. Party Politics 1, 1: 73111.CrossRefGoogle Scholar
Izumi, Mauricio. 2016. Governo e oposição no senado brasileiro (1989–2010). Dados 59, 1: 91138.CrossRefGoogle Scholar
Izumi, Mauricio, and Medeiros, Danilo. 2021. Government and Opposition in Legislative Speechmaking: Using Text-as-Data to Estimate Brazilian Political Parties’ Policy Positions. Latin American Politics and Society 63, 1: 145–64.10.1017/lap.2020.36CrossRefGoogle Scholar
Jackman, Simon. 2001. Multidimensional Analysis of Roll Call Data via Bayesian Simulation: Identification, Estimation, Inference, and Model Checking. Political Analysis 9: 227–41.CrossRefGoogle Scholar
Kellermann, Michael. 2012. Estimating Ideal Points in the British House of Commons Using Early Day Motions. American Journal of Political Science 56, 3: 757–71.CrossRefGoogle Scholar
Kim, In Song, Londregan, John, and Ratkovic, Marc. 2018. Estimating Spatial Preferences from Votes and Text. Political Analysis 26, 2: 210–29.CrossRefGoogle Scholar
Klemmensen, Robert, Hobolt, Sara B., and Hansen, Martin E.. 2007. Estimating Policy Positions Using Political Texts: An Evaluation of the Wordscores Approach. Electoral Studies 26, 4: 746–55.10.1016/j.electstud.2007.07.006CrossRefGoogle Scholar
Knutsen, Oddbjorn. 1998. Expert Judgements of the Left-Right Location of Political Parties: A Comparative Longitudinal Study. West European Politics 21, 2: 6394.CrossRefGoogle Scholar
Lauderdale, Benjamin E., and Herzog, Alexander. 2016. Measuring Political Positions from Legislative Speech. Political Analysis 24, 3: 374–94.CrossRefGoogle Scholar
Laver, Michael, and Garry, John. 2000 Estimating Policy Positions from Political Texts. American Journal of Political Science 44, 3: 619–34.10.2307/2669268CrossRefGoogle Scholar
Laver, Michael, and Ben Hunt, W.. 1992. Policy and Party Competition. New York: Routledge.Google Scholar
Laver, Michael, Benoit, Kenneth, and Garry, John. 2003. Extracting Policy Positions from Political Texts Using Words as Data. American Political Science Review 97, 2: 311–31.CrossRefGoogle Scholar
Lowe, Will. 2008. Understanding Wordscores. Political Analysis 16, 4: 356–71.CrossRefGoogle Scholar
Lowe, Will, Benoit, Kenneth, Mikhaylov, Slava, and Laver, Michael. 2011. Scaling Policy Preferences from Coded Political Texts. Legislative Studies Quarterly 36, 1: 123–55.CrossRefGoogle Scholar
Magar, Eric, and Moraes, Juan A.. 2012. Factions with Clout: Presidential Cabinet Coalition and Policy in the Uruguayan Parliament. Party Politics 18, 3: 427–51.CrossRefGoogle Scholar
Mainwaring, Scott. 2018. Party Systems in Latin America: Institutionalization, Decay, and Collapse. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Martin, Andrew D. and Quinn, Kevin M.. 2002. Dynamic Ideal Point Estimation via Markov Chain Monte Carlo for the U.S. Supreme Court, 1953–1999. Political Analysis 10, 2: 134–53. http://mqscores.lsa.umich.edu/media/pa02.pdf CrossRefGoogle Scholar
Martin, Shane, Saalfeld, Thomas, and Strøm, Kaare, eds. 2014. The Oxford Handbook of Legislative Studies. Oxford: Oxford University Press.CrossRefGoogle Scholar
Moraes, Juan A. 2008. Why Factions? Candidate Selection and Legislative Politics in Uruguay. In Pathways to Power: Political Recruitment and Candidate Selection in Latin America, ed. Siavelis, Peter and Morgenstern, Scott. University Park: Penn State University Press. 164–85.Google Scholar
Piñeiro Rodríguez, Rafael, and Rosenblatt, Fernando. 2020. Stability and Incorporation: Toward a New Concept of Party System Institutionalization. Party Politics 26, 2: 249–60.CrossRefGoogle Scholar
Polity, I. 2012. Polity IV Project: Political Regime Characteristics and Transitions, 1800–2012. http://www.systemicpeace.org/polity/polity4.htm Google Scholar
Poole, Keith T., and Rosenthal, Howard. 2000. Congress: A Political-Economic History of Roll Call Voting. Oxford: Oxford University Press.Google Scholar
Poole, Keith, Lewis, Jeffrey, Lo, James, and Carroll, Royce. 2011. Scaling Roll Call Votes with W-nominate in R. Journal of Statistical Software 42, 14: 121.CrossRefGoogle Scholar
Proksch, Sven-Oliver, and Slapin, Jonathan B.. 2012. Institutional Foundations of Legislative Speech. American Journal of Political Science 56, 3: 520–37.CrossRefGoogle Scholar
R Core Team. 2017. R: A Language and Environment for Statistical Computing. Vienna: R. Foundation for Statistical Computing. https://www.R-project.org Google Scholar
Ramey, Adam J., Klingler, Jonathan D., and Hollibaugh, Gary E.. 2019. Measuring Elite Personality Using Speech. Political Science Research and Methods 7, 1: 163–84.CrossRefGoogle Scholar
Schwarz, Daniel, Traber, Denise, and Benoit, Kenneth. 2017. Estimating Intra-party Preferences: Comparing Speeches to Votes. Political Science Research and Methods 5, 2: 379–96.CrossRefGoogle Scholar
Slapin, Jonathan B., and Proksch, Sven-Oliver. 2008. A Scaling Model for Estimating Time-Series Party Positions from Texts. American Journal of Political Science 52, 3: 705–22.CrossRefGoogle Scholar
Slapin, Jonathan B., and Proksch, Sven-Oliver. 2014. Words as Data: Content Analysis in Legislative Studies. In Martin et al. 2014. 126–44.Google Scholar
Welbers, Kasper, Van Atteveldt, Wouter and Benoit, Kenneth. 2017. Text Analysis in R. Communication Methods and Measures 11, 4: 245–65.CrossRefGoogle Scholar
Zucco, Cesar Jr. 2009. Ideology or What? Legislative Behavior in Multiparty Presidential Settings. Journal of Politics 71, 3: 1076–92.CrossRefGoogle Scholar
Zucco, Cesar Jr. 2013. Legislative Coalitions in Presidential Systems: The Case of Uruguay. Latin American Politics and Society 55, 1: 96118.CrossRefGoogle Scholar
Figure 0

Figure 1. Sequence of the Data Generation Process

Figure 1

Figure 2. Resulting Structure of the DGP

Figure 2

Table 1. Variables Included in the Dataset

Figure 3

Table 2. Descriptive Statistics of Legislative Speeches in the Uruguayan Parliament, 1985–2015

Figure 4

Figure 3. Number of Words in Individual Legislators’ Speeches in Uruguayan Parliament, 1985–2015, by party

Figure 5

Figure 4. Wordscores in Uruguayan General Assembly, 1985–2005

Figure 6

Figure 5. W-nominate Scores in Uruguayan General Assembly, 1985–2005

Figure 7

Figure 6. Comparing Parties’ Policy Positions Through Wordscores and W-nominate in Uruguayan General Assembly, 1985–2005