Mitigating gender inequality in women's voices: the role of normative gender-egalitarian messages

Ryo Takahashi

doi:10.1017/bpp.2024.41

Mitigating gender inequality in women's voices: the role of normative gender-egalitarian messages

Published online by Cambridge University Press: 25 September 2024

Ryo Takahashi

Show author details

Ryo Takahashi*: Affiliation:
Graduate School of Economics, Waseda University, Tokyo, Japan
*: Email: [email protected]

Article contents

Abstract
Introduction
Hypotheses
Experimental design and data collection
Methodology
Results
Discussion
Conclusion
Conflict of interest
Ethical approval
Footnotes
References

Rights & Permissions

Abstract

This study empirically examines gender inequality in tolerance for women's opinions and identifies how the provision of normative gender-egalitarian message can mitigate this inequality by conducting online randomized experiments in Japan. In this experiment, I asked the participants to evaluate the agreement score for 10 anonymous statements and implemented two types of random interventions: disclosing the gender of the statement poster and providing normative statement for gender equality. The results of both cross-sectional and panel data analyses showed that people significantly reduced the agreement score for women's opinions compared with men's and non-gender disclosure opinions. Meanwhile, the negative impact of female gender disclosure was neutralized when participants were provided with a normative message.

Keywords

social norms gender bias online randomized experiment Japan

Type: Article
Information: Behavioural Public Policy , First View , pp. 1 - 18

DOI: https://doi.org/10.1017/bpp.2024.41 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2024. Published by Cambridge University Press

Introduction

Women often experience gender discrimination and bias in various situations, such as the hiring process (Coffman et al., Reference Coffman, Exley and Niederle2021), wage levels (Mulligan and Rubinstein, Reference Mulligan and Rubinstein2008; Flabbi, Reference Flabbi2010; Card et al., Reference Card, Cardoso and Kline2016; Biasi and Sarsons, Reference Biasi and Sarsons2022), promotion (Babcock et al., Reference Babcock, Recalde, Vesterlund and Weingart2017; Régner et al., Reference Régner, Thinus-Blanc, Netter, Schmader and Huguet2019), work-environment (Antecol et al., Reference Antecol, Barcus and Cobb-Clark2009), educational attainment (Carlana, Reference Carlana2019; Brenøe and Zölitz, Reference Brenøe and Zölitz2020) and bargaining outcomes (Ayres and Siegelman, Reference Ayres and Siegelman1995; Dittrich et al., Reference Dittrich, Knabe and Leipold2014; Ge et al., Reference Ge, Knittel, MacKenzie and Zoepf2016; Hernandez-Arenaz and Iriberri, Reference Hernandez-Arenaz and Iriberri2018). A large body of literature has shown that gender inequality also exists in evaluating women's abilities and statements; women are underestimated in their abilities, even when they have the same abilities as men (Boring, Reference Boring2017; Huang et al., Reference Huang, Gates, Sinatra and Barabási2020; Ayalew et al., Reference Ayalew, Manian and Sheth2021). These studies suggest that women's abilities and opinions may be devalued by gender, not by their content.

The undervaluation of women's abilities and opinions can be attributed, in part, to prevailing social norms that are unfavorable to women. Social norms are customary or ideal forms of behavior that individuals in a group try to conform to, influencing human behavior through the willingness to punish those who breach them (Elster, Reference Elster1989; Fehr and Gächter, Reference Fehr and Gächter2000; Benabou and Tirole, Reference Benabou and Tirole2011; Burke and Young, Reference Burke, Young, Benhabib, Bisin and Jackson2011; Krupka and Weber, Reference Krupka and Weber2013; Buckholtz, Reference Buckholtz2015; Adriani and Sonderegger, Reference Adriani and Sonderegger2018). Specifically, in patriarchal cultures, such as many in Asia, Africa and the Middle East, male-preferential norms may act to discourage women from asserting themselves publicly.

This study had two main objectives. First, I empirically investigated whether tolerance for opinions decreases when expressed by women through online randomized experiments in Japan, a country with a strong patriarchal culture. In this experiment, I presented 10 anonymous statements to the participants and asked them to evaluate the agreement score for each statement. At that time, I disclosed the gender of the statement poster to randomly selected participants. Since the disclosure of gender and the type of gender were determined randomly, in the absence of gender bias, the agreement score was expected to be similar regardless of the poster's gender.

The second objective of this study was to examine how a normative gender-egalitarian message (hereafter, ‘normative message’) can mitigate gender inequality in tolerance for women's opinions. This normative message aims to raise awareness that male-preferential norms are often misperceived and to encourage individuals to judge women's opinions based on their content rather than the gender of the speaker. This study tested the effectiveness of the normative message in reducing gender inequality by analyzing changes in agreement scores for women's opinions.

This study contributes to the literature on gender bias in women's abilities. Previous empirical literature reported that women's abilities are underestimated because of their gender (Hoisl and Mariani, Reference Hoisl and Mariani2017). For example, Azmat and Ferrer (Reference Azmat and Ferrer2017) reported that female lawyers earn less than half as much as male lawyers from new clients, even after controlling for individual characteristics. Similar gender bias has been observed in academia (Hechtman et al., Reference Hechtman, Moore, Schulkey, Miklos, Calcagno and Aragon2018; Bosquet et al., Reference Bosquet, Combes and García-Peñalosa2019; Huang et al., Reference Huang, Gates, Sinatra and Barabási2020; Ersoy and Pate, Reference Ersoy and Pate2023). For example, Knobloch-Westerwick et al. (Reference Knobloch-Westerwick, Glynn and Huge2013) found that the scientific quality of female scientists was underestimated, especially in male-dominated fields. Ginther and Kahn (Reference Ginther and Kahn2021) reported that female economists were 15% less likely to be promoted to associate professor, even after controlling for academic achievements, such as cumulative publications, citations and grants. However, it remains unclear whether, in general, people change their tolerance for women's opinions.

In addition, this study contributes to the literature on gender inequality and role of normative message. Previous literature discussed how the provision of normative message mitigates gender inequality (Boring and Philippe, Reference Boring and Philippe2021). For example, Okuyama (Reference Okuyama2021) found that normative messages delivered through a radio program increased women's political participation during the Allied Occupation of Japan. In addition, Bursztyn et al. (Reference Bursztyn, González and Yanagizawa-Drott2020) demonstrated that correcting misperceptions of social norms stimulated female labor participation in Saudi Arabia. However, to the best of my knowledge, none of the studies have investigated whether normative message efficiently reduce gender inequality in tolerance for women's opinions, particularly in settings with strong patriarchal norms that are unfavorable to women.

In this regard, Japan provides an ideal setting to examine the effectiveness of correcting misperceptions about male-preferential norms through normative messages. In general, Japan is a male-dominated country with a low awareness of gender equality (Lee, Reference Lee2019; Ogasawara and Komura, Reference Ogasawara and Komura2021). In fact, Japan ranks 125th out of 146 countries in the Global Gender Gap Report 2021, which is the second lowest among OECD countries (World Economic Forum, 2023). Despite this, there is a strong demand to mitigate gender inequality in Japan. For example, a 2019 public opinion survey revealed that over 90% of respondents believed that the government should implement policies to promote gender equality (Cabinet Office, 2019). By providing a normative message emphasizing the societal demand for gender equality, this study aimed to address and correct misperception about male-preferential norms, thereby assessing their impact on the tolerance of women's opinions.

Hypotheses

In this section, I developed hypotheses regarding tolerance for women's opinions and the expected impact of providing the normative message. As discussed, women are often undervalued because of their gender, even when they have abilities similar to men (Boring, Reference Boring2017; Hechtman et al., Reference Hechtman, Moore, Schulkey, Miklos, Calcagno and Aragon2018; Bosquet et al., Reference Bosquet, Combes and García-Peñalosa2019; Mengel et al., Reference Mengel, Sauermann and Zölitz2019; Huang et al., Reference Huang, Gates, Sinatra and Barabási2020; Ayalew et al., Reference Ayalew, Manian and Sheth2021; Ersoy and Pate, Reference Ersoy and Pate2023). This undervaluation is likely influenced by social norms that are unfavorable to women, which may lead to the devaluation of their abilities and opinions.

Although many studies have indicated that norm enforcement helps to sustain cooperation in society (Fehr and Gächter, Reference Fehr and Gächter2000; Gürerk et al., Reference Gürerk, Irlenbusch and Rockenbach2006), social norms can also exacerbate gender disparities (Gneezy et al., Reference Gneezy, Leonard and List2009; Field et al., Reference Field, Jayachandran and Pande2010; Alesina et al., Reference Alesina, Giuliano and Nunn2013; Bertrand et al., Reference Bertrand, Kamenica and Pan2015; Jayachandran, Reference Jayachandran2015, Reference Jayachandran2021). Specifically, when social norms dictate behaviors and ideologies that disadvantage women, such as prescribing how they should act, look, think and feel, they perpetuate gender inequalities (Cislaghi and Heise, Reference Cislaghi and Heise2020). Consequently, even individuals who personally hold egalitarian views may conform to unequal norms under societal pressures.

This dynamic is particularly pronounced in patriarchal cultures like Japan, where norms favoring men over women are deeply entrenched (Hamada, Reference Hamada2024). In such contexts, behaviors perceived as assertive or challenging to male opinions, especially when exhibited by women, are socially stigmatized (Seguino, Reference Seguino2000; Jayachandran, Reference Jayachandran2015; Lecoutere et al., Reference Lecoutere, d'Exelle and Van Campenhout2015). As a result, women's opinions and ideas may receive less acceptance, regardless of their quality. This argument leads to the first hypothesis:

Hypothesis 1. Tolerance for women's opinions is lower compared with the same opinions of men.

In contrast, previous literature suggests that providing general normative messages can lead to various prosocial behaviors (Dimant et al., Reference Dimant, Van Kleef and Shalvi2020; Takahashi, Reference Takahashi2021a, Reference Takahashi2021b; Takahashi and Tanaka, Reference Takahashi and Tanaka2021; Bhattacharya and Dugar, Reference Bhattacharya and Dugar2022). Moreover, some studies demonstrate that normative messages specifically addressing gender equality can stimulate gender-egalitarian behavior (Boring and Philippe, Reference Boring and Philippe2021; Okuyama, Reference Okuyama2021).

One reason the provision of normative messages promotes gender-egalitarian behavior is by correcting misperceptions through these messages. In societies with unequal gender norms, individuals may support gender equality but misperceive societal norms due to prevailing stereotypes or biases. In such cases, transmitting normative messages endorsing gender-egalitarian behavior can correct these misperceptions and promote attitudes aligned with gender equality (Cislaghi and Heise, Reference Cislaghi and Heise2020). If gender inequality exists in the tolerance for opinions due to such misperceived norms, providing the normative message could mitigate this inequality. Accordingly, I propose the following hypothesis:

Hypothesis 2. The provision of a normative gender-egalitarian message mitigates gender inequality in the tolerance for women's opinions.

Experimental design and data collection

To test the above hypotheses, I conducted two online randomized experiments in Japan. The first experiment was conducted on August 3 and 4, 2021, targeting 1,600 individuals through the online survey platform ‘iResearch’.Footnote ¹ I conducted a second survey a month later (between September 3 and 7) to construct panel data. Although I invited 1,000 participants from the first survey, only 774 participated in the second survey (attrition rate was 22.6%). For each survey, participants received a participation allowance of 35 yen (approximately US$0.35), which is the standard price fixed by the survey company. The participants took an average of 6 min to complete the two tasks: (1) a demographic questionnaire survey and (2) evaluation of an anonymous statement.Footnote ²

Evaluation of anonymous statements

The main objective of this study was to identify whether people changed their attitudes toward statements depending on the gender of the statement poster. For this purpose, I asked the participants to evaluate their preferences for anonymous statements at the end of the survey.

Specifically, the participants were informed that 10 statements would be presented on the screen one at a time, and all statements were made by anonymous individuals. Table 1 shows the 10 statements used in the first and second surveys. Statements 1 through 4 in Table 1 are related to gender equity (hereafter, ‘gender-sensitive statements’). The first three statements focus on gender equity issues, which are of significant social concern in Japan. If male-preferential social norms or conservative beliefs are entrenched, people may strongly resist accepting these statements when made by women. In contrast, the fourth statement, which posits that women should stay home to raise their children, was included as a male-preferential opinion.

Table 1. Ten statements presented during the first and second surveys

Note: The same statements were used consistently in the first and second surveys. The order of the statements presented to the participants was randomized.

Statements 5 through 8 pertain to environmental issues. Given the pervasive gender assumption that women are innately compassionate toward protecting the environment (Lau et al., Reference Lau, Kleiber, Lawless and Cohen2021), it is possible that pro-environmental statements made by women may be more readily accepted without strong opposition. The last two statements were included as neutral statements that are not considered to be particularly affected by gender.

To avoid ordering effects, the order of the statements presented to the participants was randomized. Then, the participants were asked to rate how much they agreed or disagreed with each statement based on a 7-point scale (hereafter, ‘agreement score’), ranging from ‘Strongly disagree’ to ‘Strongly agree’. For the analysis, I set a response of neither agree nor disagree as zero, while ‘Strongly disagree’ and ‘Strongly agree’ answers were scored −3 and 3, respectively.Footnote ³

Random interventions

To empirically test the hypotheses, I implemented two types of interventions. The first intervention involved disclosing the gender of the statement posters (gender disclosure treatment). I indicated whether the posters were anonymous women or anonymous men when presenting each statement, while participants without the gender disclosure treatment saw ‘anonymous person’. The gender disclosed to participants was randomly selected; in the gender disclosure treatment, 50.3% of statements were presented as women's opinions.

In the second intervention, I provided a normative message related to gender equality in Japan to randomly selected participants before the evaluation of statement agreement (normative message treatment). Specifically, participants were presented with the following message:

The following is a summary of the results of a public opinion survey conducted by the Cabinet Office in 2019.

According to the survey, more than 70% of the respondents feel that men are more privileged in society and that gender inequality persists.

Furthermore, more than 90% of respondents require the government to implement policies to promote gender equality.

By providing the normative message, I expected to correct the misperception of male-preferential norms. It is important to note that the message did not explicitly indicate the potential influence of gender inequality on the tolerance for women's opinions. Instead, the normative message suggested that it is socially desirable to mitigate gender-unequal behavior.

It is important to note that the gender disclosure treatment is considered deceptive because the gender of the poster is randomly assigned. This deception could be avoided by using actual written statements in the experiment. However, using actual statements might blur the distinction between whether differences in tolerance result from the speaker's gender or from differences in the statement's wording, as the wording may not align perfectly. Hence, to effectively test the hypotheses of this study, I opted for deception in the second intervention. This approach allows me to eliminate the influence of wording discrepancies and clearly discern gender inequality in tolerance. To mitigate the potential negative effects of deception, participants received debriefing information at the end of the survey (refer to Supplementary Appendix A). Furthermore, the experiment was conducted with approval from the Institutional Review Board.

Overview of the experimental design

Figure 1 shows an overview of the experimental design of this study. In the first survey, 1,600 participants were randomly assigned to one of the four groups. A total of 600 out of 1,600 participants received one or two interventions (groups 1–3 in Figure 1). It is important to note that for groups 1 and 3, which received the normative message treatment, the normative message was presented before participants were asked to evaluate their agreement with the 10 statements. The remaining 1,000 participants did not receive any intervention and served as the control group.

Figure 1. Experimental design overview. Notes: The two interventions (i.e., the disclosure of poster's gender and the provision of normative message) are illustrated in dash box. Numbers in parentheses indicate the number of observations.

Since the control group participants in the first survey were not exposed to the interventions, they were exclusively invited to participate in the second survey to assess how the treatments influenced changes in agreement scores. A total of 774 individuals participated in the second survey, resulting in an attrition rate of 22.6%. This attrition rate is moderate compared with the average attrition rate of approximately 15% reported for field experiments (Ghanem et al., Reference Ghanem, Hirshleifer and Ortiz-Beccera2023). For the second survey, participants were again randomly assigned to one of four groups and were asked to rate their agreement with the same set of 10 statements presented in the initial survey.

The participants' demographic characteristics and the balance between the groups are reported in Supplementary Appendix B. Scheffe's multiple comparison test confirmed that there were no statistical differences in the average demographic characteristics between the four groups. Table 2 reports the average agreement scores of the groups for the first and second surveys. The total averages of agreement score were 0.85 and 0.86 in the first and second surveys, respectively. In both surveys, the average agreement score was relatively smaller for the groups with the gender disclosure treatment (i.e., both treatments and gender disclosure groups). While Table 2 presents the mean scores for the 10 statements at the individual level, the estimations use the agreement score for each statement as the dependent variable.

Table 2. Average agreement scores at the individual level by the groups

Note. Standard deviations are in parentheses. The gender of the poster is disclosed only for the participants in the both treatments group and gender disclosure group.

Methodology

To identify how the agreement level was affected by the gender of the poster, this study employed both cross-sectional and panel data analyses. First, I started with a prefecture-level fixed effects regression model using observations from the first survey (cross-sectional analysis), as follows:

(1)$${\rm Scor}{\rm e}_{ij} = \alpha + \beta _1{\rm Femal}{\rm e}_{ij} + \beta _2{\rm Mal}{\rm e}_{ij} + \beta _3{\rm Nor}{\rm m}_i + \beta _4( {{\rm Femal}{\rm e}_{ij} \times {\rm Nor}{\rm m}_i} ) \\ + \beta _5( {{\rm Mal}{\rm e}_{ij} \times {\rm Nor}{\rm m}_i} ) + \gamma {\rm Stat}{\rm e}_j + \delta X_i + \rho _i + \varepsilon _{ij}, \;$$

where Score_ij is the agreement scale ranging from −3 to 3 for statement j for individual i. Female_ij and Male_ij are the dummy variables representing the gender disclosure treatment of individual i for statement j (hereafter, ‘female disclosure dummy’ and ‘male disclosure dummy,’ respectively). More precisely, Female_ij takes a value of 1 if the gender of the poster of statement j is disclosed as female, while Male_ij takes a value of 1 if the gender of statement j is disclosed as male to individual i. Norm_i is a dummy variable that takes a value of 1 if individual i receives the normative message. In Equation (1), I include two interaction terms between each gender disclosure dummy and the normative message dummy, shown as Female_ij × Norm_i and Male_ij × Norm_i. State_j denotes a set of dummy variables for each statement. X_i indicates a set of observable demographic characteristics of individual i (see Supplementary Appendix Table B1). ρ_i is the prefecture-specific fixed effect for individual i, which reduces the unobserved time-invariant differences between prefectures. Standard errors are clustered at the treatment level to account for autocorrelations in the error term ɛ_ij.

Next, I conducted an individual fixed effects model using panel data:

(2)$${\rm Agreemen}{\rm t}_{ijt} = \alpha + \beta _1{\rm Femal}{\rm e}_{ijt} + \beta _2{\rm Mal}{\rm e}_{ijt} + \beta _3{\rm Nor}{\rm m}_{it} + \beta _4( {{\rm Femal}{\rm e}_{ijt} \times {\rm Nor}{\rm m}_{it}} ) \\ + \beta _5( {{\rm Mal}{\rm e}_{ijt} \times {\rm Nor}{\rm m}_{it}} ) + \upsilon _i + \tau _t + u_{ijt}, \;$$

where t is the time of the survey round. υ_i and τ_t represent individual-specific fixed effects and time dummy. Note that Equation (2) excludes time-invariant variables.

In addition to Equations (1) and (2), I performed the regression by excluding the gender-sensitive statements (statements 1 through 4 in Table 1). The potential concern with including gender-sensitive statements is that the results, particularly for the gender disclosure treatment, could be affected by social desirability bias, male backlash or other confounding effects when participants encounter gender-sensitive statements. In addition, I excluded each statement from the observation and performed the estimation, which results are provided in Supplementary Appendix C.

Hypothesis 1 is tested by examining whether the agreement score is decreased when the gender of the poster was disclosed as female compared with when the poster's gender is disclosed as male. In Equations (1) and (2), the non-disclosure of the statement poster's gender is set as the baseline category. Therefore, β ₁ in Equations (1) and (2) represents the difference in agreement scores when the poster's gender is disclosed as female compared with when it is not disclosed. Since the exact same statements were presented to all participants in both surveys, if the agreement scores were determined solely by the content of the statements, the scores should be similar regardless of the disclosed gender (i.e., β ₁ = β ₂). However, if unfavorable social norms reduce tolerance for women's opinions, the coefficient for the female disclosure dummy is expected to be lower than that for the male disclosure dummy (β ₁ < β ₂).

Meanwhile, as proposed in Hypothesis 2, providing the normative message may mitigate the influence of social norms and improve tolerance for women's opinions. The general impact of normative message is captured in β ₃, while the study focuses on its specific impact on the agreement score for women's statements, indicated by the interaction term between the female disclosure dummy and the normative message dummy (β ₄). A positive β ₄ would suggest that the normative message helps correct the misperception of norms.

If the normative message treatment successfully offsets the underestimation of women's opinions, there should be no significant gender difference in the agreement scores. This is expressed as β ₁ + β ₄ = β ₂ + β ₅, meaning that the negative effect of disclosing a female poster's gender (β ₁) combined with the positive effect of the normative message (β ₄) should equal the effect of disclosing a male poster's gender (β ₂) combined with the normative message (β ₅). In other words, the normative message should neutralize the gender bias, resulting in no difference in agreement scores between genders.

Results

Results of the benchmark estimations

The estimation results are presented in Table 3.Footnote ⁴ First, the results of the cross-sectional analysis with the gender-sensitive statements indicated that the female gender disclosure dummy negatively affected the agreement score (column 1). The coefficient indicates that participants decreased the agreement score for women's opinions by 0.23 compared with when the gender of the statement poster was not disclosed, even though the exact same statements were presented. Given that the average agreement score in the first survey was 0.89, this coefficient represents approximately a 26% reduction in the agreement score due to female disclosure.

Table 3. Effect of the gender disclosure and normative message on agreement score

Note: The female disclosure variable represents the gender disclosure dummy for women. Female disclosure × normative message is the interaction term between female disclosure and the normative message dummy. The male disclosure variable is a dummy variable indicating the disclosure of the statement poster's gender as male. The normative message variable denotes whether an individual receives the normative message treatment. Standard errors are clustered at the group level in parentheses; ***, ** and * indicate statistical significance at the 1, 5 and 10% levels, respectively.

Similarly, the male disclosure dummy also had a negative and significant impact on the agreement score, reducing it by approximately 19%. These results indicate that, on average, agreement scores decrease when the poster's gender is disclosed, regardless of gender. However, testing the coefficients of the two variables revealed that the coefficient for the female disclosure dummy was significantly lower than that for the male disclosure dummy (p < 0.01).

Additionally, even after controlling for the general effect of the normative message treatment, I observed a significantly positive effect of the interaction term between the female gender disclosure dummy and the normative message dummy. In contrast, the coefficient of the interaction term with the male disclosure dummy was positive but not significant. To test whether the normative message fully offsets the underestimation of women's opinions, I compared the combined effects of the gender disclosure dummies and their interaction terms with the normative message. The test did not reveal a significant difference (p = 0.57), suggesting that the normative message treatment neutralizes the negative effect of female gender disclosure.

Furthermore, I performed the regression excluding the gender-sensitive statements, and the results are presented in column 2 of Table 3. While there were concerns about potential estimation bias from including gender-sensitive statements, the findings showed that the coefficients were slightly smaller but did not change significantly. Similar to the results in column 1, there were significant differences between the female and male gender disclosure dummies (p < 0.01). However, when the effects of the interaction terms were combined, no significant difference was found (p = 0.75). These findings suggest that including gender-sensitive statements does not introduce major biases.

Columns 3 and 4 of Table 3 present the results of the panel data analysis with and without gender-sensitive statements, respectively. Consistent with the cross-sectional results, the coefficient for female gender disclosure indicates that participants reduced the agreement score by 0.089 (approximately a 16% reduction) when they were aware that the poster was female. A similar finding was observed in the results excluding the gender-sensitive statements, with a reduction of 0.118 (equivalent to 12%).

In contrast, unlike the cross-sectional results, the coefficients for the male disclosure dummy were smaller and became insignificant for both the estimates with and without gender-sensitive statements. However, when testing for differences between female and male disclosure, combined with their interaction terms, there was consistently no significant difference (p = 0.35).

Attrition

While the attrition rate of 22.6% in the second survey is not excessively high, I cannot rule out the possibility that it may have influenced the results of the panel data analysis. Specifically, if participants with certain characteristics were more likely to participate in the second survey, systematic differences between the first and second control groups could arise, potentially leading to over- or underestimation of the panel data analysis results. To address this concern, I conducted a t-test to examine differences in the average agreement score and demographic characteristics between the control group in the first survey (1,000 participants) and the control group in the second survey (184 participants).

The results of the t-test are reported in Supplementary Appendix Table C2. The average agreement score for the control group in the first survey was 0.89, compared with 0.81 for the control group in the second survey. This difference was not statistically significant, indicating that individuals who scored particularly high or low in the first survey did not disproportionately continue to the second survey. Similarly, there were no significant differences between the groups in the eight demographic characteristics used in the estimation. These findings suggest that attrition is unlikely to have caused estimation bias into the panel data analysis results.

Heterogeneity

This section presents the results of two types of heterogeneity analyses. The first analysis examines heterogeneity based on the gender gap at the prefectural level. Regions with larger gender gaps may exhibit stronger male-dominated social norms, potentially leading to a greater undervaluation of women's opinions. In this study, I used the gender composition of prefectural assembly parliament posts as a proxy for the gender gap in each prefecture. In Japan, there are significant regional differences in the gender composition of prefectural assembly posts, ranging from 31% in areas with the highest gender composition to as low as 5% in areas with the lowest composition. For the estimation, observations were divided into two groups based on the median gender composition (16%). Subsequently, panel data analysis was conducted separately for prefectures with large and small gender gaps.

The results are presented in Table 4, with columns 1 and 2 showing the results for prefectures with large and small gender gaps, respectively. In prefectures with large gender gaps, the coefficient for the female disclosure variable is negative and significant, while the interaction term with the normative message is positive and significant. In contrast, no significant effects were found for the male disclosure variable. These findings are largely consistent with the benchmark results.

Table 4. Panel data analysis for prefectures with large and small gender gaps

Note: The term ‘gender gap’ refers to the gender gap in prefectural assembly parliament posts; ***, ** and * indicate statistical significance at the 1, 5 and 10% levels, respectively.

In prefectures with smaller gender gaps, as shown in column 2, the signs of the female disclosure variable and its interaction term were the same, but the coefficients were small and insignificant. This insignificance might be attributed to the reduced prevalence of social norms unfavorable to women in regions with smaller gender gaps. Additionally, I found that the normative message had a significantly positive effect, unlike the benchmark results. These findings imply that the provision of the normative message effectively improves tolerance for women's opinions only in regions with large gender gaps, whereas in areas with small disparities, such intervention increases tolerance in general, regardless of gender.

The second heterogeneity analysis examines whether the effects of the normative message treatment vary by participant demographic characteristics. Specifically, I investigate how concern for gender issues influences the effect of normative message provision. Participants who have pre-existing concerns about gender issues may respond more strongly to the normative message, which could primarily explain the positively significant results observed in the benchmark analysis.

To test for this heterogeneity, I used two demographic variables: the high gender concern dummy and the female dummy, both of which are included in X in Equation (1). The high gender concern dummy takes a value of 1 if the participant strongly agrees that gender inequality needs to be addressed, while the female dummy takes a value of 1 if the participant is female. In the estimation, I included interaction terms between each of these two demographic variables and the interaction terms in Equation (1).

The results including the interaction terms with each of the two demographic variables are presented in column 1 of Table 5, while columns 2 and 3 indicate the results with the interaction terms with either demographic variable.Footnote ⁵ Although the coefficients of the interaction terms with each demographic variable were positive (0.068 and 0.031), they were not statistically significant. In contrast, the interaction term between female gender disclosure and normative message remained significant and positive. These findings suggest that the main results of this study are not driven by participants' pre-existing concerns about gender issues.

Table 5. Results by participant demographic characteristics and normative message treatment

Note: This table shows the results of the interaction terms between the female gender dummy and the other variables. The variable ‘×Normative message’ is the interaction term between the female gender dummy and normative message dummy, while the other two variables (‘×Normative message × High gender concern’ and ‘×Normative message × Female dummy’) are the interaction terms of three variables, including each of the demographic variables. High gender concern is a dummy variable representing a high level of interest in gender issues before the experiment. Female dummy is a dummy variable indicating the gender of the respondent. Standard errors are clustered at the group level in parentheses; ** indicates statistical significance at the 1% level.

Discussion

The results of both cross-sectional and panel data analyses showed that tolerance significantly decreased when the poster's gender was disclosed as female. This finding is consistent with previous studies indicating that women's abilities are often underestimated (Boring, Reference Boring2017; Hechtman et al., Reference Hechtman, Moore, Schulkey, Miklos, Calcagno and Aragon2018; Bosquet et al., Reference Bosquet, Combes and García-Peñalosa2019; Mengel et al., Reference Mengel, Sauermann and Zölitz2019; Huang et al., Reference Huang, Gates, Sinatra and Barabási2020; Ayalew et al., Reference Ayalew, Manian and Sheth2021; Ersoy and Pate, Reference Ersoy and Pate2023). These findings clearly demonstrate that people undervalue women's opinions based on gender rather than content. Hence, based on these findings, Hypothesis 1 is supported.

Furthermore, the heterogeneity analysis revealed that the underestimation of women's opinions was most pronounced in regions with larger gender gaps. Although this study cannot definitively identify the precise mechanisms driving this phenomenon, the findings strongly suggest that social norms unfavorable to women could be a contributing factor to this undervaluation. These results underscore the significant role that unfavorable social norms play in perpetuating gender inequality.

In contrast, the provision of the normative message significantly increased the agreement score for women's opinions. The test results showed that providing the normative message offset the negative effect of disclosing the female gender, resulting in agreement scores that were no longer significantly different from those for men's opinions. Therefore, the findings also confirm Hypothesis 2.

In addition, a notable observation from the study is that gender disclosure (whether female or male) leads to a more substantial reduction in agreement scores compared with non-disclosure. This phenomenon could imply that a situation where people are not conscious of gender might lead to more unbiased evaluations. This notion aligns with the findings from blind audition studies, such as those by Goldin and Rouse (Reference Goldin and Rouse2000), which demonstrated reduced gender bias when anonymity was maintained. In the context of this study, it appears that disclosing gender – regardless of whether it is male or female – introduces a bias that lowers the perceived value of the statement.

This observation could be understood through the lens of implicit bias, which refers to unconscious attitudes or stereotypes that affect our understanding, actions and decisions (Greenwald and Banaji, Reference Greenwald and Banaji1995). The reduction in agreement scores upon gender disclosure could be a manifestation of such implicit biases, where awareness of gender activates subconscious stereotypes and prejudices held by individual participants, leading to biased evaluations. This interpretation aligns with the differences observed between the cross-sectional and panel data results. The inclusion of individual fixed effects in the panel data analysis, which yielded a significant negative effect only for female disclosure, indicates that unobserved individual factors, such as personal biases and experiences, significantly influence sensitivity to gender disclosure.

Overall, these findings suggest that implicit biases could be more pronounced when gender is disclosed, whether female or male, influencing the evaluation process. This is consistent with research indicating that merely increasing the number of women in leadership positions does not necessarily promote gender equality (Bagues and Esteve-Volart, Reference Bagues and Esteve-Volart2010; Bagues et al., Reference Bagues, Sylos-Labini and Zinovyeva2017). Therefore, in addition to providing normative messages to mitigate gender inequality, these findings also underscore the importance of maintaining anonymity in assessment and decision-making processes to mitigate such biases.

Conclusion

By conducting randomized online experiments with 1,600 individuals in Japan, this study reported empirical evidence on gender inequality in tolerance for women's opinions. In our experiment, although the exact same statements were presented to all participants, the results of both cross-sectional and panel data analyses indicated that people reduced the agreement score when the gender of the statement poster was disclosed as female. These results suggest that people are likely to be less tolerant of women's opinions. However, the negative impact of female gender disclosure was neutralized when participants were provided with the normative gender-egalitarian message.

These findings have policy implications for mitigating gender inequality. First, it is important to recognize that there is a risk of underestimating women's opinions, even unconsciously. I believe that the participants did not intendedly reduce the agreement score for women's opinions in order to oppress them. In fact, approximately 60% of participants reported that they have a strong or relatively strong concern on gender inequality issue. However, this study found a statistical difference in the score between female and male disclosure, suggesting that people may unintendedly decline women's opinions based on gender, not by its quality. This point is practically important because in a society where women's opinions are disregarded, their views will not be reflected in policy, which may reproduce a male-dominated society (Chattopadhyay and Duflo, Reference Chattopadhyay and Duflo2004).

Second, it is essential to disseminate the messages of normative gender egalitarianism. As the results of this study suggested, if undervaluation of women's opinions is generated by social norms unfavorable to women, correcting such misperceptions of norms is of paramount importance. In fact, efforts to disseminate information on gender equality have been undertaken for a long time (Beach and Hanlon, Reference Beach and Hanlon2019; Lau et al., Reference Lau, Kleiber, Lawless and Cohen2021; Okuyama, Reference Okuyama2021). Likewise, this study suggests that people may refrain from making gender inequality when they correct their perception of gender norms through normative messages, even in a patriarchal, male-dominated country like Japan.

Third, to prevent bias in the evaluation process, the non-disclosure of gender is crucial. The findings of this study reveal that the mere act of disclosing gender – whether male or female – introduces a bias that reduces the perceived value of the statement. While it may not be feasible to implement gender non-disclosure in all areas, this study suggests that maintaining gender anonymity in evaluation processes can contribute to more equitable evaluations for both men and women. This recommendation aligns with the broader literature on implicit bias, which emphasizes the benefits of anonymity in reducing gender-based biases in various evaluative contexts (Goldin and Rouse, Reference Goldin and Rouse2000). Therefore, policy measures that promote or enable gender non-disclosure in assessments could be a significant step toward achieving gender equality.

Finally, I discuss the limitations of this study. The primary limitation is that it is not clear how individuals perceived the normative message treatment, so the actual mechanisms by which message provision improved the agreement scores were not clearly identified. Furthermore, the generalizability of the findings is limited. The study was conducted within the specific cultural context of Japan, which may have unique social norms and gender biases. While the findings may have some applicability to other patriarchal cultural countries, the extent to which they apply to other cultural or societal contexts remains uncertain. Future research should focus on how social norms and the provision of normative messages influence behavior toward women in diverse cultural settings.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/bpp.2024.41.

Acknowledgments

This paper was supported by JSPS KAKENHI Grant-in-Aid for Scientific Research (B) Number 19H01492. The author would like to thank Kahori Ishibashi for her help with data collection.

Conflict of interest

The author declares that there is no competing financial interests or personal relationships that could have influenced the work reported in this paper.

Ethical approval

All participants provided informed consent, and the study design was approved by the institutional review board of Waseda University (Application Number: 2021-111).

Footnotes

¹ The individuals registered in the online survey platform ‘iResearch’ were recruited for the survey. After finalizing the instructions and experiment design, the survey company ‘Neo Marketing’ constructed the electronic questionnaire.

² The translated version of the questions and instructions used in this study are presented in Supplementary Appendix A.

³ The scale values were not presented to the participants.

⁴ The full results, including the estimates for control variables, are presented in Supplementary Appendix Table C1.

⁵ Although it was not reported in Table 4, the interaction terms with the male gender disclosure dummy were included in the estimation model.

References

Adriani, F. and Sonderegger, S. (2018), ‘The signaling value of punishing norm-breakers and rewarding norm-followers’, Games, 9: 102.Google Scholar

Alesina, A., Giuliano, P. and Nunn, N. (2013), ‘On the origins of gender roles: women and the plough’, Quarterly Journal of Economics, 128: 469–530.Google Scholar

Antecol, H., Barcus, V. E. and Cobb-Clark, D. (2009), ‘Gender-biased behavior at work: exploring the relationship between sexual harassment and sex discrimination’, Journal of Economic Psychology, 30: 782–792.Google Scholar

Ayalew, S., Manian, S. and Sheth, K. (2021), ‘Discrimination from below: experimental evidence from Ethiopia’, Journal of Development Economics, 151: 102653.Google Scholar

Ayres, I. and Siegelman, P. (1995), ‘Race and gender discrimination in bargaining for a new car’, American Economic Review, 85: 304–321.Google Scholar

Azmat, G. and Ferrer, R. (2017), ‘Gender gaps in performance: evidence from young lawyers’, Journal of Political Economy, 125: 1306–1355.Google Scholar

Babcock, L., Recalde, M. P., Vesterlund, L. and Weingart, L. (2017), ‘Gender differences in accepting and receiving requests for tasks with low promotability’, American Economic Review, 107: 714–747.Google Scholar

Bagues, M. and Esteve-Volart, B. (2010), ‘Can gender parity break the glass ceiling? Evidence from a repeated randomized experiment’, The Review of Economic Studies, 77: 1301–1328.Google Scholar

Bagues, M., Sylos-Labini, M. and Zinovyeva, N. (2017), ‘Does the gender composition of scientific committees matter?’ American Economic Review, 107: 1207–1238.Google Scholar

Beach, B. and Hanlon, W. W. (2019), Censorship, family planning, and the historical fertility transition. https://www.nber.org/papers/w25752.Google Scholar

Benabou, R. and Tirole, J. (2011), Laws and norms. https://www.nber.org/papers/w17579.Google Scholar

Bertrand, M., Kamenica, E. and Pan, J. (2015), ‘Gender identity and relative income within households’, Quarterly Journal of Economics, 130: 571–614.Google Scholar

Bhattacharya, H. and Dugar, S. (2022), ‘Business norm versus norm-nudge as a contract-enforcing mechanism: evidence from a real marketplace’, European Economic Review, 144: 104078.Google Scholar

Biasi, B. and Sarsons, H. (2022), ‘Flexible wages, bargaining, and the gender gap’, Quarterly Journal of Economics, 137: 215–266.Google Scholar

Boring, A. (2017), ‘Gender biases in student evaluations of teaching’, Journal of Public Economics, 145: 27–41.Google Scholar

Boring, A. and Philippe, A. (2021), ‘Reducing discrimination in the field: evidence from an awareness raising intervention targeting gender biases in student evaluations of teaching’, Journal of Public Economics, 193: 104323.Google Scholar

Bosquet, C., Combes, P. P. and García-Peñalosa, C. (2019), ‘Gender and promotions: evidence from academic economists in France’, Scandinavian Journal of Economics, 121: 1020–1053.Google Scholar

Brenøe, A. A. and Zölitz, U. (2020), ‘Exposure to more female peers widens the gender gap in stem participation’, Journal of Labor Economics, 38: 1009–1054.Google Scholar

Buckholtz, J. W. (2015), ‘Social norms, self-control, and the value of antisocial behavior’, Current Opinion in Behavioral Sciences, 3: 122–129.Google Scholar

Burke, M. A. and Young, H. P. (2011), ‘Social Norms’, in Benhabib, J., Bisin, A., and Jackson, M. (eds), Handbook of Social Economics, Amsterdam: Elsevier, 311–338.Google Scholar

Bursztyn, L., González, A. L. and Yanagizawa-Drott, D. (2020), ‘Misperceived social norms: women working outside the home in Saudi Arabia’, American Economic Review, 110: 2997–3029.Google Scholar

Cabinet Office (2019), Public opinion poll on gender-equal society (in Japanese). https://survey.gov-online.go.jp/r01/r01-danjo/gairyaku.pdf.Google Scholar

Card, D., Cardoso, A. R. and Kline, P. (2016), ‘Bargaining, sorting, and the gender wage gap: quantifying the impact of firms on the relative pay of women’, Quarterly Journal of Economics, 131: 633–686.Google Scholar

Carlana, M. (2019), ‘Implicit stereotypes: evidence from teachers’ gender bias’, Quarterly Journal of Economics, 134: 1163–1224.Google Scholar

Chattopadhyay, R. and Duflo, E. (2004), ‘Women as policy makers: evidence from a randomized policy experiment in India’, Econometrica, 72: 1409–1443.Google Scholar

Cislaghi, B. and Heise, L. (2020), ‘Gender norms and social norms: differences, similarities and why they matter in prevention science’, Sociology of Health & Illness, 42: 407–422.Google Scholar

Coffman, K. B., Exley, C. L. and Niederle, M. (2021), ‘The role of beliefs in driving gender discrimination’, Management Science, 67: 3551–3569.Google Scholar

Dimant, E., Van Kleef, G. A. and Shalvi, S. (2020), ‘Requiem for a nudge: framing effects in nudging honesty’, Journal of Economic Behavior & Organization, 172: 247–266.Google Scholar

Dittrich, M., Knabe, A. and Leipold, K. (2014), ‘Gender differences in experimental wage negotiations’, Economic Inquiry, 52: 862–873.Google Scholar

Elster, J. (1989), ‘Social norms and economic theory’, Journal of Economic Perspectives, 3: 99–117.Google Scholar

Ersoy, F. and Pate, J. (2023), ‘Invisible hurdles: Gender and institutional differences in the evaluation of economics papers’, Economic Inquiry, 61(4): 769–1136.Google Scholar

Fehr, E. and Gächter, S. (2000), ‘Cooperation and punishment in public goods experiments’, American Economic Review, 90: 980–994.Google Scholar

Field, E., Jayachandran, S. and Pande, R. (2010), ‘Do traditional institutions constrain female entrepreneurship? A field experiment on business training in India’, American Economic Review, 100: 125–129.Google Scholar

Flabbi, L. (2010), ‘Gender discrimination estimation in a search model with matching and bargaining’, International Economic Review, 51: 745–783.Google Scholar

Ge, Y., Knittel, C. R., MacKenzie, D. and Zoepf, S. (2016), ‘Racial and gender discrimination in transportation network companies’. https://www.nber.org/papers/w22776.Google Scholar

Ghanem, D., Hirshleifer, S. and Ortiz-Beccera, K. (2023), ‘Testing attrition bias in field experiments’, Journal of Human Resources, 0920-11190R2. https://doi.org/10.3368/jhr.0920-11190R2.Google Scholar

Ginther, D. K. and Kahn, S. (2021), ‘Women in academic economics: have we made progress?’ AEA Papers and Proceedings, 111: 138–142.Google Scholar

Gneezy, U., Leonard, K. L. and List, J. A. (2009), ‘Gender differences in competition: evidence from a matrilineal and a patriarchal society’, Econometrica, 77: 1637–1664.Google Scholar

Goldin, C. and Rouse, C. (2000), ‘Orchestrating impartiality: the impact of ‘blind’ auditions on female musicians’, American Economic Review, 90: 715–741.Google Scholar

Greenwald, A. G. and Banaji, M. R. (1995), ‘Implicit social cognition: attitudes, self-esteem, and stereotypes’, Psychological Review, 102: 4.Google Scholar

Gürerk, Ö, Irlenbusch, B. and Rockenbach, B. (2006), ‘The competitive advantage of sanctioning institutions’, Science, 312: 108–111.Google Scholar

Hamada, I. (2024), ‘Double Truth: Employment Insecurity and Gender Inequality in Japan's Neoliberal Promotion of Side Jobs’, Japan Forum, 36: 329–351.Google Scholar

Hechtman, L. A., Moore, N. P., Schulkey, C. E., Miklos, A. C., Calcagno, A. M., Aragon, R., et al. (2018), ‘NIH funding longevity by gender’, Proceedings of the National Academy of Sciences, 115: 7943–7948.Google Scholar

Hernandez-Arenaz, I. and Iriberri, N. (2018), ‘Women ask for less (only from men): evidence from bargaining in the field’, Journal of Economic Behavior & Organization, 152: 192–214.Google Scholar

Hoisl, K. and Mariani, M. (2017), ‘It's a man's job: income and the gender gap in industrial research’, Management Science, 63: 766–790.Google Scholar

Huang, J., Gates, A. J., Sinatra, R. and Barabási, A.-L. (2020), ‘Historical comparison of gender inequality in scientific careers across countries and disciplines’, Proceedings of the National Academy of Sciences, 117: 4609–4616.Google Scholar

Jayachandran, S. (2015), ‘The roots of gender inequality in developing countries’, Annual Review of Economics, 7: 63–88.Google Scholar

Jayachandran, S. (2021), ‘Social norms as a barrier to women's employment in developing countries’, IMF Economic Review, 69: 576–595.Google Scholar

Knobloch-Westerwick, S., Glynn, C. J. and Huge, M. (2013), ‘The Matilda effect in science communication: an experiment on gender bias in publication quality perceptions and collaboration interest’, Science Communication, 35: 603–625.Google Scholar

Krupka, E. L. and Weber, R. A. (2013), ‘Identifying social norms using coordination games: Why does dictator game sharing vary?’ Journal of the European Economic Association, 11: 495–524.Google Scholar

Lau, J. D., Kleiber, D., Lawless, S. and Cohen, P. J. (2021), ‘Gender equality in climate policy and practice hindered by assumptions’, Nature Climate Change, 11: 186–192.Google Scholar

Lecoutere, E., d'Exelle, B. and Van Campenhout, B. (2015), ‘Sharing common resources in patriarchal and status-based societies: evidence from Tanzania’, Feminist Economics, 21: 142–167.Google Scholar

Lee, J. F. (2019), ‘In the pursuit of a gender-equal society: do Japanese EFL textbooks play a role?’ Journal of Gender Studies, 28: 204–217.Google Scholar

Mengel, F., Sauermann, J. and Zölitz, U. (2019), ‘Gender bias in teaching evaluations’, Journal of the European Economic Association, 17: 535–566.Google Scholar

Mulligan, C. B. and Rubinstein, Y. (2008), ‘Selection, investment, and women's relative wages over time’, Quarterly Journal of Economics, 123: 1061–1110.Google Scholar

Ogasawara, K. and Komura, M. (2021), ‘Consequences of war: Japan's demographic transition and the marriage market’, Journal of Population Economics, 35: 1037–1069.Google Scholar

Okuyama, Y. (2021), Empowering women through radio: evidence from occupied Japan. https://papers.okuyamayoko.com/Okuyama_Womens_radio_in_Occupied_Japan.pdf.Google Scholar

Régner, I., Thinus-Blanc, C., Netter, A., Schmader, T. and Huguet, P. (2019), ‘Committees with implicit biases promote fewer women when they do not believe gender bias exists’, Nature Human Behaviour, 3: 1171–1179.Google Scholar

Seguino, S. (2000), ‘Accounting for gender in Asian economic growth’, Feminist Economics, 6: 27–58.Google Scholar

Takahashi, R. (2021a), ‘How to stimulate environmentally friendly consumption: evidence from a nationwide social experiment in Japan to promote eco-friendly coffee’, Ecological Economics, 186: 107082.Google Scholar

Takahashi, R. (2021b), ‘Who is attracted to purchase green products through information provision: a nationwide social experiment to promote eco-friendly coffee’, Environmental Science & Policy, 124: 593–603.Google Scholar

Takahashi, R. and Tanaka, K. (2021), ‘Social punishment for breaching restrictions during the COVID-19 pandemic’, Economic Inquiry, 59: 1467–1482.Google Scholar

World Economic Forum (2023), Global Gender Gap Report 2023. https://www..weforum.org/docs/WEF_GGGR_2021.pdf.Google Scholar

Table 1. Ten statements presented during the first and second surveys

Figure 1. Experimental design overview. Notes: The two interventions (i.e., the disclosure of poster's gender and the provision of normative message) are illustrated in dash box. Numbers in parentheses indicate the number of observations.

Table 2. Average agreement scores at the individual level by the groups

Table 3. Effect of the gender disclosure and normative message on agreement score

Table 4. Panel data analysis for prefectures with large and small gender gaps

Table 5. Results by participant demographic characteristics and normative message treatment

Takahashi supplementary material

File 219.8 KB

Article contents

Mitigating gender inequality in women's voices: the role of normative gender-egalitarian messages

Abstract

Keywords

Introduction

Hypotheses

Experimental design and data collection

Evaluation of anonymous statements

Random interventions

Overview of the experimental design

Methodology

Results

Results of the benchmark estimations

Attrition

Heterogeneity

Discussion

Conclusion

Supplementary material

Acknowledgments

Conflict of interest

Ethical approval

Footnotes

References

Takahashi supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests