The digit ratio (2D:4D) and economic preferences: no robust associations in a sample of 330 women

Elle Parslow; Eva Ranehill; Niklas Zethraeus; Liselott Blomberg; Bo von Schoultz; Angelica Lindén Hirschberg; Magnus Johannesson; Anna Dreber

doi:10.1007/s40881-019-00076-y

The digit ratio (2D:4D) and economic preferences: no robust associations in a sample of 330 women

Published online by Cambridge University Press: 01 January 2025

Angelica Lindén Hirschberg ,

Magnus Johannesson and

Anna Dreber

Show author details

Elle Parslow: Affiliation:
Department of Economics, Stockholm School of Economics, P.O Box 6501, 11383 Stockholm, Sweden
Eva Ranehill: Affiliation:
Department of Economics, University of Gothenburg, Gothenburg, Sweden
Niklas Zethraeus: Affiliation:
Department of Learning, Informatics, Management and Ethics, Karolinska Institutet, Solna, Sweden
Liselott Blomberg: Affiliation:
Karolinska University Hospital, Karolinska Institutet, Solna, Sweden
Bo von Schoultz: Affiliation:
Karolinska University Hospital, Karolinska Institutet, Solna, Sweden
Angelica Lindén Hirschberg: Affiliation:
Karolinska University Hospital, Karolinska Institutet, Solna, Sweden
Magnus Johannesson: Affiliation:
Department of Economics, Stockholm School of Economics, P.O Box 6501, 11383 Stockholm, Sweden
Anna Dreber*: Affiliation:
Department of Economics, Stockholm School of Economics, P.O Box 6501, 11383 Stockholm, Sweden Department of Economics, University of Innsbruck, Innsbruck, Austria
*: e-mail: [email protected]

Article contents

Abstract
Introduction
Method
Results
Discussion
Footnotes
References

Rights & Permissions

Abstract

Many studies report on the association between 2D:4D, a putative marker for prenatal testosterone exposure, and economic preferences. However, most of these studies have limited sample sizes and test multiple hypotheses (without preregistration). In this study we mainly replicate the common specifications found in the literature for the association between the 2D:4D ratio and risk taking, the willingness to compete, and dictator game giving separately. In a sample of 330 women we find no robust associations between any of these economic preferences and 2D:4D. We find no evidence of a statistically significant relation for 16 of the 18 total regressions we run. The two regression specifications which are statistically significant have not previously been reported and the associations are not in the expected direction, and therefore they are unlikely to represent a real effect.

Keywords

2D:4D Economic preferences Experiment Testosterone

JEL classification

C91: Laboratory, Individual Behavior

Type: Replication Paper
Information: Journal of the Economic Science Association , Volume 5 , Issue 2 , December 2019 , pp. 149 - 169

DOI: https://doi.org/10.1007/s40881-019-00076-y [Opens in a new window]
Creative Commons: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright: Copyright © The Author(s) 2019

1 Introduction

Testosterone has been hypothesised to be associated with a wide range of economic decision making. One aspect of this hypothesis is the theory that prenatal testosterone exposure impacts brain development and therefore can explain some of the heterogeneity in behaviour between individuals. A putative proxy for the level of prenatal testosterone exposure is the ratio of the length of the second digit to the length of the fourth digit (2D:4D) on each hand, as suggested by Manning et al. (Reference Manning, Scutt, Wilson and Iwan Lewis-Jones1998). Subsequently, many studies have reported associations between 2D:4D and a variety of traits, such as sexual orientation, spatial ability and personality traits, although the results are often conflicting [and with some possibility of publication bias, see, e.g., Puts et al. (Reference Puts, McDaniel, Jordan and Marc Breedlove2008), Voracek and Loibl (Reference Voracek and Loibl2009), Grimbos et al. (Reference Grimbos, Dawood, Burriss, Zucker and Puts2010), Voracek et al. (Reference Voracek, Pietschnig, Nader and Stieger2011), but see Hönekopp and Schuster (Reference Hönekopp and Schuster2010) and Hönekopp and Watson (Reference Hönekopp and Watson2011), who do not find evidence for publication bias]. Furthermore, a sizeable literature uses 2D:4D to explore the effect of prenatal testosterone exposure on economic decisions, also with mixed results.

This paper aims to test hypotheses in previous papers in relation to the association between 2D:4D and risk taking, dictator game giving, and the willingness to compete. These preferences are relevant for explaining variation in many economic outcomes. We use a sample of 330 women—which is large given most sample sizes that have previously been reported—in an experiment to measure 2D:4D and economic preferences.

Whilst the 2D:4D measure has been used in many studies, the link between prenatal testosterone and 2D:4D is not strongly established (McIntyre Reference McIntyre2006). The oft-cited study by Lutchmaya et al. (Reference Lutchmaya, Baron-Cohen, Raggatt, Knickmeyer and Manning2004), which indirectly investigates the link between 2D:4D and prenatal testosterone exposure, finds a statistically significant negative correlation in a sample of 29 children between the testosterone-to-estradiol ratio in amniotic fluid and right hand 2D:4D only, even after controlling for gender (the left hand is reported insignificant). An additional method of investigation is to compare same sex and opposite sex twins, based on the theory of sex-hormone transfer in utero (Miller Reference Miller1994). van Anders et al. (Reference van Anders, Vernon and Wilbur2006) find that females with a male rather than female co-twin have lower left hand 2D:4D, which the authors argue is due to hormone transfer from male to female foetuses, however, they find no statistically significant results for the right hand. Whilst Voracek and Dressler (Reference Voracek and Dressler2007) in a similar study report a statistically significant result for mean 2D:4D, among studies with much larger sample sizes there is a failure to find statistically significant differences (Hiraishi et al. Reference Hiraishi, Sasaki, Shikishima and Ando2012; Cohen-Bendahan Reference Cohen-Bendahan2005; Medland et al. Reference Medland, Loehlin and Martin2008).Footnote ¹ In a study looking at umbilical cord androgen and estrogen concentrations and 2D:4D measured as young adults, Hollier et al. (Reference Hollier, Keelan, Jamnadass, Maybery, Hickey and Whitehouse2015) find no statistically significant association for either hand, using a mixed gender sample of 341 participants. Lastly, other methods of establishing a link between 2D:4D and androgen exposure both post- and peri-natally include using congenital adrenal hyperplasia (CAH) and the CAG repeat polymorphism (McIntyre Reference McIntyre2006; Brown et al. Reference Brown, Hines, Fane and Marc Breedlove2002), and here also there is a mix of positive and null results.

Even though the link between 2D:4D and prenatal testosterone is not well established, there are many papers investigating the association of 2D:4D with economic decision making. Whilst 2D:4D is an easy-to-measure way to proxy for prenatal testosterone exposure, many of these papers use multiple tests and have relatively small sample sizes. As far as we are aware, none of the previous studies pre-register their analyses. There are often multiple hypotheses involving different ways of measuring the explanatory variable (left hand, right hand, average of both hands or even squared 2D:4D), as well as which controls to include (such as gender, age or sexual orientation) and which subsamples to analyse (such as ethnicity and gender), giving rise to many ‘forking paths’ (Gelman and Loken Reference Gelman and Loken2013) and researcher degrees of freedom (Simmons et al. Reference Simmons, Nelson and Simonsohn2011). As discussed in Simmons et al. (Reference Simmons, Nelson and Simonsohn2011), researchers have many options available in choosing among outcome variables, controls and subsample selection, creating ambiguity in the research process and potentially generating higher rates of false positives than 5%, even if researchers do not intend to do so. In our review of the literature in the following subsections, we consider statistically significant results to be cases where the p value is less than 0.05 and report anything above that threshold as insignificant, as is typically used. We present tables to summarise the results of studies that use comparable measures of economic preferences to our experiments.Footnote ² However, in our own results in this paper, we instead consider a p value less than 0.05 to indicate suggestive evidence, whilst statistical significance requires a p value less than 0.005, following Benjamin et al. (Reference Benjamin, Berger, Johannesson, Nosek, Wagenmakers, Berk, Bollen, Brembs, Brown and Camerer2018).

Benjamin et al. (Reference Benjamin, Berger, Johannesson, Nosek, Wagenmakers, Berk, Bollen, Brembs, Brown and Camerer2018) suggest a change in the p value defining statistically significant new discoveries from 0.05 to 0.005, to improve the reproducibility of scientific studies (in terms of reducing rates of false positives). The authors propose that where p values are below 0.05 but above 0.005, this should be interpreted as suggestive evidence. Whilst our study aims to be a replication of past studies, the results of past studies are mixed and therefore we think it is appropriate to use the more conservative 0.005 threshold for statistical significance. An additional motivation for a more conservative threshold than 0.05 is that we, following the existing literature, run several tests for each outcome measure.

1.1 Dictator game giving

Several papers have looked at the relationship between 2D:4D and giving in the dictator game.Footnote ³ The dictator game removes any repercussions of failure to reciprocate (unlike the ultimatum game), and in all the below studies the participants were told that the recipient in the game is another participant whose identity is unknown.Footnote ⁴ The hypothesised relationship between 2D:4D and dictator game giving is positive, with higher exposure to testosterone (low 2D:4D) being associated with lower levels of dictator game giving. The results from studies using the dictator game are summarised in Table 1, showing that insignificant findings are common. When statistically significant, regressions using squared 2D:4D measures find an inverse U-shaped relationship between 2D:4D and dictator game giving (low dictator game giving is associated with both low and high testosterone). From the five previous papers summarised in Table 1, 2 out of the 43 total tests find statistically significant positive results, 1 out of 43 finds statistically significant negative results, 8 out of 43 find an inverse U-shaped relationship and 32 out of 43 find no statistically significant results (where here significance is $p < 0.05$ ).

Table 1 Dictator game giving studies

Study	Men						Women						Both sexes
	L	L	R	R	M	M	L	L	R	R	M	M	L	L	R	R	M	M
	L	$L^{2}$	R	$R^{2}$	M	$M^{2}$	L	$L^{2}$	R	$R^{2}$	M	$M^{2}$	L	$L^{2}$	R	$R^{2}$	M	$M^{2}$
Millet and Dewitte (Reference Millet and Dewitte2009), Study 1 neutral prime															S− (119 $\hat{)}$
Millet and Dewitte (Reference Millet and Dewitte2009), Study 1 aggressive prime															NS (119 $\hat{)}$
Millet and Dewitte (Reference Millet and Dewitte2009), Study 2 neutral prime															NS (90 $\hat{)}$
Millet and Dewitte (Reference Millet and Dewitte2009), Study 2 aggressive prime															S+ (90 $\hat{)}$
Buser (Reference Buser2012)													NS (221)		NS** (221)		NS** (221)
Brañas-Garza et al. (Reference Brañas-Garza, Kovářík and Neyse2013), 2010 study		S+, S− (87)		S+, S− (88)				NS, NS (61)		S+, S− (61)			NS* (170)	S+, S− (170)	NS* (171)	S+, S− (171)
Brañas-Garza et al. (Reference Brañas-Garza, Kovářík and Neyse2013), 2011 study		NS, NS (68)		NS, NS (69)				NS, NS (53)		S+, S− (53)			NS* (126)	NS, NS (126)	S+* (127)	S+, S− (127)
Galizzi and Nieboer (Reference Galizzi and Nieboer2015), all													NS (602)	NS, NS (602)	NS (602)	NS, NS (602)
Galizzi and Nieboer (Reference Galizzi and Nieboer2015), Caucasian													NS (201)	NS, NS (201)	NS (201)	S+, S− (201)
Galizzi and Nieboer (Reference Galizzi and Nieboer2015), Chinese													NS (221)	NS, NS (221)	NS (221)	NS, NS (221)
Galizzi and Nieboer (Reference Galizzi and Nieboer2015), South Asian													NS (81)	NS, NS (81)	NS (81)	NS, NS (81)
Brañas-Garza et al. (Reference Brañas-Garza, Espín, Garcia-Muñoz and Kovářík2019)													NS (560)	NS, NS (560)	NS (560)	NS, NS (560)

^Sample sizes in parentheses

L left hand, R right hand, M mean of left and right hands, NS not statistically significant, S+ statistically significant positive relationship, S− statistically significant negative relationship

*Does not control for gender, **Statistically significant positive result for binary variable where 1 indicates 4D longer and 0 indicates all other scenarios (i.e., same as 2D

or shorter), authors note results do not change for genders separately, they also control for age, nationality and experience in previous games

$^{\hat{}}$ Sample size is the total sample for that study, the authors do not state the sample split for neutral or aggressive prime groups

1.2 Risk taking

While several review papers find that women are on average more risk averse than men, [see, e.g., Eckel and Grossman (Reference Eckel and Grossman2008), Croson and Gneezy (Reference Croson and Gneezy2009), Charness and Gneezy (Reference Charness and Gneezy2012)], there is also evidence from a meta-analysis by Nelson (Reference Nelson2015) suggesting that the difference (in terms of effect size) is not very large. Nevertheless, there is a substantial literature looking into a biological explanation for this gender difference through prenatal testosterone exposure and the 2D:4D ratio. As far as we are aware, only one study finds an association between 2D:4D and risk tasking in men and not in women (Stenstrom et al. Reference Stenstrom, Saad, Nepomuceno and Mendenhall2011). The hypothesis is that risk taking is negatively related to 2D:4D—higher testosterone exposure is associated with higher risk taking (and lower risk aversion). The results from studies using risk taking tasks are summarised in Table 2. We limit our analysis of the previous literature to the areas of financial or general risk taking. There are numerous ways to measure risk-taking in experimental tasks, as well as the digit ratio (such as by scanner, or calliper etc.), which can add measurement error. From the 18 previous papers summarised in Table 2, 1 out of the 109 total tests finds positive statistically significant results, 15 out of 109 find negative statistically significant results, and 93 out of 109 find no statistically significant results (significance here is $p < 0.05$ ).

Table 2 Risk-taking studies

Study	Men					Women					Both sexes
	L	L	R	R	M	L	L	R	R	M	L	L	R	R	M
	L	$L^{2}$	R	$R^{2}$	M	L	$L^{2}$	R	$R^{2}$	M	L	$L^{2}$	R	$R^{2}$	M
Dreber and Hoffman (Reference Dreber and Hoffman2007), study 1											S− (120)		NS (116)
Dreber and Hoffman (Reference Dreber and Hoffman2007), study 2											NS (116)		NS (115)
Apicella et al. (Reference Apicella, Dreber, Campbell, Gray, Hoffman and Little2008)	NS (85)		NS (88)
Sapienza et al. (Reference Sapienza, Zingales and Maestripieri2009)	NS (116)		NS (116)		NS (116)	NS (65)		NS (65)		NS (65)	NS (181)		NS (181)		NS (181)
Coates and Page (Reference Coates and Page2009)			S− (47)
Brañas-Garza and Rustichini (Reference Brañas-Garza and Rustichini2011), risk aversion			NS (72)					S+ (116)					NS (188)
Brañas-Garza and Rustichini (Reference Brañas-Garza and Rustichini2011), combined risk aversion			S− (72)					NS (116)					NS (188)
Garbarino et al. (Reference Garbarino, Slonim and Sydnor2011)*															S− (151)
Stenstrom et al. (Reference Stenstrom, Saad, Nepomuceno and Mendenhall2011), financial risk $^{\hat{}}$			S− (219)					NS (194)
Sytsma (Reference Sytsma2014), gain domain*	NS (105)		NS (98)		NS (92)	S− (29)		S− (24)		NS (23)	S− (134)		NS (122)		NS (115)
Sytsma (Reference Sytsma2014), loss domain*	S− (105)		NS (98)		NS (92)	NS (29)		NS (24)		NS (23)	NS (134)		NS (122)		NS (115)
Sytsma (Reference Sytsma2014), average*	S− (105)		NS (98)		NS (92)	NS (29)		NS (24)		NS (23)	S− (134)		NS (122)		NS (115)
Aycinena et al. (Reference Aycinena, Baltaduonis and Rentschler2014)	NS (106)	NS, NS (106)	NS (106)	NS, NS (106)		NS (78)	NS, NS, (78)	NS (78)	NS, NS, (78)		NS (184)	NS, NS (184)	NS (184)	NS, NS (184)
Drichoutis and Nayga (Reference Drichoutis and Nayga2015)														NS, NS (138)
Schipper (Reference Schipper2014), gains*			NS (103)					NS (71)
Schipper (Reference Schipper2014), losses*			NS (111)					NS (80)
Schipper (Reference Schipper2014), gains white*			NS (47)					NS (25)
Schipper (Reference Schipper2014), losses white*			NS (50)					NS (27)
Schipper (Reference Schipper2014), gains asian*			NS (52)					NS (41)
Schipper (Reference Schipper2014), losses asian*			NS (56)					NS (48)
Bönte et al. (Reference Bönte, Procher and Urbig2016), investment risk $^{\hat{}}$ *													NS (432)
Bönte et al. (Reference Bönte, Procher and Urbig2016), general risk $^{\hat{}}$ *													S− (432)
Barel (Reference Barel2017), general risk $^{\hat{}}$											NS (211)		S− (211)
Barel (Reference Barel2017), financial risk $^{\hat{}}$											NS (211)		NS (211)
Chicaiza-Becerra and Garcia-Molina (Reference Chicaiza-Becerra and Garcia-Molina2017), full											NS (123)		NS (123)
Chicaiza-Becerra and Garcia-Molina (Reference Chicaiza-Becerra and Garcia-Molina2017), midland											NS (115)		NS (115)
Brañas-Garza et al. (Reference Brañas-Garza, Galizzi and Nieboer2018), risk preference											S− (664)		S− (664)
Brañas-Garza et al. (Reference Brañas-Garza, Galizzi and Nieboer2018), general risk attitude $^{\hat{}}$											NS (704)		NS (704)
Lima de Miranda et al. (Reference de Miranda, Neyse and Schmidt2018)											NS (144)	NS, NS (144)	NS (145)	NS, NS (145)
Alonso et al. (Reference Alonso, Di Paolo, Ponti and Sartarelli2018)											NS (390)		NS (390)
Neyse et al. (Reference Neyse, Vieider, Ring, Probst, Kaernbach, van Eimeren and Schmidt2019), gains Germany* $^{+}$											NS (181)		NS (183)
Neyse et al. (Reference Neyse, Vieider, Ring, Probst, Kaernbach, van Eimeren and Schmidt2019), losses Germany* $^{+}$											NS (185)		NS (187)
Neyse et al. (Reference Neyse, Vieider, Ring, Probst, Kaernbach, van Eimeren and Schmidt2019), mixed Germany* $^{+}$											NS (188)		NS (188)
Neyse et al. (Reference Neyse, Vieider, Ring, Probst, Kaernbach, van Eimeren and Schmidt2019), gains Vietnam* $^{+}$											NS (162)		NS (162)
Neyse et al. (Reference Neyse, Vieider, Ring, Probst, Kaernbach, van Eimeren and Schmidt2019), losses Vietnam* $^{+}$											NS (162)		NS (161)
Neyse et al. (Reference Neyse, Vieider, Ring, Probst, Kaernbach, van Eimeren and Schmidt2019), mixed Vietnam* $^{+}$											NS (162)		NS (162)

Sample sizes in parentheses

L left hand, R right hand, M mean of left and right hands, NS not statistically significant, S+ statistically significant positive relationship, S− statistically significant negative relationship

*Multiple other controls, $^{\hat{}}$ Questionnaire elicitation of risk, $^{+}$ Sample size refers to number of subjects

1.3 Competitiveness

Whilst there is evidence for gender differences in self-selection into competition (Niederle and Vesterlund Reference Niederle and Vesterlund2007; Dariel et al. Reference Dariel, Kephart, Nikiforakis and Zenker2017),Footnote ⁵ there exists substantially less literature looking at the relation between prenatal testosterone exposure and willingness to compete, relative to the other economic preferences discussed. Given the gender differences observed in this scenario, the hypothesis tested in the existing literature is that higher testosterone is associated with higher competitiveness, leading to a negative relationship between 2D:4D and the willingness to compete. Table 3 summarises the results from previous studies. Out of the 10 total tests reported across previous studies, 2 find statistically significant negative results and 8 find no statistically significant results (here significance is $p < 0.05$ ).

Table 3 Competitiveness studies

Study	Men						Women						Both sexes
	L	L	R	R	M	M	L	L	R	R	M	M	L	L	R	R	M	M
	L	$L^{2}$	R	$R^{2}$	M	$M^{2}$	L	$L^{2}$	R	$R^{2}$	M	$M^{2}$	L	$L^{2}$	R	$R^{2}$	M	$M^{2}$
Apicella et al. (Reference Apicella, Dreber, Gray, Hoffman, Little and Campbell2011)*	NS (83)		NS (86)
Bönte et al. (Reference Bönte, Procher, Urbig and Voracek2017), study 1 behavioural measure													NS (461)		NS (461)
Bönte et al. (Reference Bönte, Procher, Urbig and Voracek2017), study 1 self-reported measure													NS (461)		S− (461)
Bönte et al. (Reference Bönte, Procher, Urbig and Voracek2017), study 2 behavioural measure													NS (150)		NS (150)
Bönte et al. (Reference Bönte, Procher, Urbig and Voracek2017), study 2 self-reported measure													NS (618)		S− (618)

Sample sizes in parentheses

L left hand, R right hand, M mean of left and right hands, NS not statistically significant, S+ statistically significant positive relationship, S− statistically significant negative relationship

*Study includes a control for sexual orientation

2 Method

2.1 Experimental procedures and design

The data on 2D:4D were collected in conjunction with a study on the influence of the oral contraceptive pill (Ranehill et al. Reference Ranehill, Zethraeus, Blomberg, von Schoultz, Hirschberg, Johannesson and Dreber2017). The pre-analysis plan specifying the analysis prior to completion of data collection for this study was posted on the Open Science Framework website on the 21st of August 2015 (available at http://osf.io/he8nb/). However, the 2D:4D measure was not part of the main planned analyses in this double-blind randomised study. The exact analyses for the 2D:4D measure were therefore not specified in the pre-analysis plan. Instead it was stated in the pre-analysis plan that the 2D:4D data would be used to carry out tests of previous 2D:4D results reported as statistically significant in the literature (i.e., the data were collected to be able to replicate previous findings). The previously reported results in the literature are therefore the starting point for our analyses, but ideally our tests should have been exactly specified in the pre-analysis plan.

The participants in the study were 340 healthy women aged 18–35 years recruited following the criteria used in the oral contraceptive study.Footnote ⁶ Participants in this study thus had agreed to participate in a randomized controlled trial on the effects of the contraceptive pill. Participants participated in two sessions for the overall study: once at baseline, and once during the follow-up (the end of the study medication treatment period). Both sessions took place at the Karolinska University Hospital. The economic experiment was performed during the second session. During both sessions, we first collected blood samples for the participants before they filled out surveys on sexual function, general well-being and depressive symptoms. Participants then filled out a survey on facial preferences. In the second session, participants participated in the economic experiment after the survey of facial preferences. The economic experiment was computerized.Footnote ⁷ The economic part took about 30 minutes, while the other parts took about 20 minutes. Participants were not informed about their earnings for any task during the experiment but were paid at a later date (within 2 months after having participated in the experiment).

For details on how participants were recruited, the criteria for inclusion and exclusion, and further sample characteristics see Ranehill et al. (Reference Ranehill, Zethraeus, Blomberg, von Schoultz, Hirschberg, Johannesson and Dreber2017). Approximately 60% of participants reported an education level of university studies (ongoing) or a university degree. Unfortunately, we do not have ethnicity data for our sample of participants. While the majority of the participants were Caucasian, we cannot rule out that controlling for ethnicity would affect our results. The statistical analysis is based on 330 participants as 10 participants did not complete the data collection (7 discontinued treatment and thus did not complete the data collection, and 3 had missing hand measurements).

The economic experiments on decision making were also reported and analysed in Ranehill et al. (Reference Ranehill, Zethraeus, Blomberg, von Schoultz, Hirschberg, Johannesson and Dreber2017). The tests measured dictator game giving, financial risk taking, and willingness to compete. The order of the experimental tasks was kept constant across all participants, starting with the dictator game, the risk task, and thereafter the three stages of the competitiveness task.Footnote ⁸ Participants were not informed about their earnings for any task during the experiment but were paid at a later date (within 2 months after having participated in the experiment). The economic experiment was computerized and took about 30 minutes.

The dictator game giving measure was elicited in a modified dictator game where the participant was asked to allocate SEK 100Footnote ⁹ between herself and a charitable organization, repeated five times with a different charity organisation in each repetition. The average donation across the five decisions is used as our measure of dictator game giving. We include five dictator game decisions to reduce measurement error.

We measure risk taking with repeated lottery choices, involving 18 decisions between a certain payoff, and a 50:50 gamble to win either a larger amount of money than the safe option or SEK 0. The certain payoff amounts varied from SEK 40 to 280, and the gamble amounts were either SEK 200, 300 or 400. The percentage of choices of the gamble (i.e., the number of times the gamble was chosen over the certain payoff) is used as our measure of risk taking.

Measuring willingness to compete consisted of asking participants to solve simple tasks of adding numbers for 3 minutes, first under a non-competitive piece-rate payment scheme of SEK 5 for each correct answer, and then under a competitive tournament payment scheme of SEK 10 for each correct answer only if more tasks were solved than a random competitor (a participant selected from a previous session), otherwise the pay was zero (with SEK 5 for each person in the case of a tie). Then, in the last part, the participant could select to be paid either under the non-competitive piece rate scheme or the competitive tournament scheme. For our willingness to compete measure, we used the choice of competitive tournament scheme in this part (dummy variable where 1 is choice of competitive tournament scheme).

2D:4D results in the literature are sometimes presented for the left hand, sometimes for the right hand, and sometimes for the average of both hands. Following the existing literature, we therefore present results for all these three 2D:4D measures. In the literature results are sometimes presented for a linear model and sometimes a squared term is added to allow for a non-linear relationship. Following the existing literature, we therefore present results both without (the linear model) and with a quadratic term. In total we therefore estimate 18 regression models; 6 models for each outcome measure. In the models with a squared term we evaluate the significance of 2D:4D as the significance of the regression coefficient for the squared 2D:4D, but we also report the significance of an F-test for the joint significance of 2D:4D and the squared 2D:4D.

2.2 Power calculations

We first estimate our power to detect previous statistically significant results, based on all statistically significant findings in the literature (for models without the squared term and where the necessary information was available) and we have the following ranges of power calculations. For dictator game giving, the range of power is 0.896 to 0.999 with a mean of 0.941 at the 5% level and 0.656 to 0.994 with a mean of 0.791 at the 0.5% level. For risk taking, the range of power is 0.423 to 0.999 with a mean of 0.748 at the 5% level and 0.148 to 0.999 with a mean of 0.535 at the 0.5% level. For the willingness to compete, the range is 0.441 to 0.468 with a mean of 0.454 at the 5% level and 0.159 to 0.176 with a mean of 0.167 at the 0.5% level. However, we note that there are drawbacks to doing such power calculations, since it is very likely that original results are biased in terms of being exaggerated even if they are true positives [see, e.g., Gelman and Carlin (Reference Gelman and Carlin2014)]. Lastly, with our sample size of 330, we have 90% power to find a small effect size of $r = 0.17$ with $α = 0.05$ , and $r = 0.22$ with $α = 0.005$ .

2.3 Measuring 2D:4D

Digit measurement expressed in millimetres (mm) was performed for digit two (2D) and digit four (4D), using a Vernier digital calliper 0–150 mm (USA, Cocraft) with a precision of 0.01mm. Digit length was directly measured by two raters from the mid-point of the proximal crease of the proximal phalanx to the distal tip of the distal phalanx for 2D and 4D on both left and right hand. The reliability of direct measurement of digits was tested, demonstrating a high repeatability and differences between subjects greater than measurement errors (Savic et al. Reference Savic, Frisen, Manzouri, Nordenstrom and Hirschberg2017). The mean value of two measurements of the 2D and 4D length was calculated and then divided to create the 2D:4D ratio, which was used for further statistical analysis.

3 Results

Overall we report results for 18 regression variations, with 6 different specifications for the explanatory variables run separately using OLS for the 3 dependent variables, representing our outcome measures of dictator game giving, risk taking, and the willingness to compete. We note that the Pearson correlation between left and right hand 2D:4D in our sample is 0.63.Footnote ¹⁰ Table 4 shows the means and standard deviations for the 2D:4D measures and the outcome variables.

Table 4 Summary statistics

	Mean	SD
Giving	40.748	30.356
Risk	0.550	0.186
Comp.	0.424	0.495
2D:4D LH	0.967	0.033
2D:4D RH	0.980	0.031
2D:4D Avg	0.973	0.029
2D:4D LH sqr	0.935	0.063
2D:4D RH sqr	0.961	0.062
2D:4D Avg sqr	0.948	0.056
Observations	330

We report the regression results in the following three tables, grouped by outcome measure. Table 5 shows the results for the dictator game giving measure, whilst Table 6 shows risk taking and Table 7 shows the willingness to compete as the dependent variable.

Table 5 Dictator game giving results

	(1)	(2)	(3)	(4)	(5)	(6)
	Giving	Giving	Giving	Giving	Giving	Giving
2D:4D LH	23.2			− 930.5
2D:4D LH	(50.87)			(1655.11)
2D:4D RH		6.50			3729.8
2D:4D RH		(55.70)			(1993.72)
2D:4D Avg			18.7			2648.6
2D:4D Avg			(57.81)			(2537.98)
2D:4D LH sqr				491.8
2D:4D LH sqr				(854.80)
2D:4D RH sqr					− 1892.6
2D:4D RH sqr					(1012.69)
2D:4D Avg sqr						− 1350.6
2D:4D Avg sqr						(1306.45)
Constant	18.3	34.4	22.5	480.1	− 1795.0	− 1256.6
Constant	(49.20)	(54.62)	(56.29)	(800.98)	(981.03)	(1232.14)
N	330	330	330	330	330	330
F	0.21	0.014	0.11	0.25	1.75	0.63
p	0.65	0.91	0.75	0.78	0.18	0.53

This table reports OLS regressions for six specifications where the dependent variable is a measure for dictator game giving (the average donation across the five decisions). LH, RH and Avg correspond to left hand, right hand and average of both hands 2D:4D, respectively. LH sqr, RH sqr and Avg sqr correspond to the square of the left, right and average of both hands 2D:4D measures, respectively. The lower panel shows the F statistic and the p value from a test of the significance of each regression model, and the sample size N. Robust standard errors in parentheses.

* $p < 0.05$ , ** $p < 0.005$

Table 6 Risk-taking results

	(1)	(2)	(3)	(4)	(5)	(6)
	Risk	Risk	Risk	Risk	Risk	Risk
2D:4D LH	0.42			− 6.21
2D:4D LH	(0.31)			(11.15)
2D:4D RH		0.29			27.3
2D:4D RH		(0.34)			(13.89)
2D:4D Avg			0.44			16.0
2D:4D Avg			(0.36)			(16.01)
2D:4D LH sqr				3.42
2D:4D LH sqr				(5.75)
2D:4D RH sqr					− 13.7
2D:4D RH sqr					(7.04)
2D:4D Avg sqr						− 7.99
2D:4D Avg sqr						(8.22)
Constant	0.14	0.27	0.12	3.35	− 13.0	− 7.45
Constant	(0.30)	(0.34)	(0.35)	(5.41)	(6.85)	(7.79)
N	330	330	330	330	330	330
F	1.83	0.69	1.48	1.13	2.13	1.23
p	0.18	0.41	0.23	0.32	0.12	0.29

This table reports OLS regressions for six specifications where the dependent variable is a measure for risk taking (the percentage of choices of the gamble). LH, RH and Avg correspond to left hand, right hand and average of both hands 2D:4D, respectively. LH sqr, RH sqr and Avg sqr correspond to the square of the left, right and average of both hands 2D:4D measures, respectively. The lower panel shows the F statistic and the p-value from a test of the significance of each regression model, and the sample size N. Robust standard errors in parentheses.

* $p < 0.05$ , ** $p < 0.005$

Table 7 Willingness to compete results

	(1)	(2)	(3)	(4)	(5)	(6)
	Comp.	Comp.	Comp.	Comp.	Comp.	Comp.
2D:4D LH	1.19			73.4**
2D:4D LH	(0.80)			(22.83)
2D:4D RH		0.76			43.2
2D:4D RH		(0.86)			(36.33)
2D:4D Avg			1.21			97.3**
2D:4D Avg			(0.92)			(33.93)
2D:4D LH sqr				− 37.2**
2D:4D LH sqr				(11.84)
2D:4D RH sqr					− 21.6
2D:4D RH sqr					(18.46)
2D:4D Avg sqr						− 49.3**
2D:4D Avg sqr						(17.44)
Constant	− 0.73	− 0.32	− 0.76	− 35.7**	− 21.2	− 47.5**
Constant	(0.78)	(0.85)	(0.89)	(11.00)	(17.87)	(16.49)
N	330	330	330	330	330	330
F	2.21	0.77	1.74	7.46	1.07	5.22
p	0.14	0.38	0.19	0.00068	0.34	0.0059

Notes: This table reports OLS regressions for six specifications where the dependent variable is a binary measure for the willingness to compete (the value 1 represents choosing the competitive tournament scheme). LH, RH and Avg correspond to left hand, right hand and average of both hands 2D:4D, respectively. LH sqr, RH sqr and Avg sqr correspond to the square of the left, right and average of both hands 2D:4D measures, respectively. The lower panel shows the F statistic and the p-value from a test of the significance of each regression model, and the sample size N. Robust standard errors in parentheses.

* $p < 0.05$ , ** $p < 0.005$

We find no evidence of a statistically significant relation between 2D:4D and either dictator game giving or risk taking ( $p > 0.05$ ). For competitiveness we find no evidence in the linear models either ( $p > 0.05$ ). When we add a squared term we find statistically significant evidence ( $p < 0.005$ for the squared 2D:4D coefficient) in both the regression for left hand 2D:4D and competitiveness, and the regression for the average 2D:4D of the two hands and competitiveness.Footnote ¹¹ However, these regression specifications are not among those that have previously been reported in the literature for the willingness to compete. We plot the predicted relationships from these statistically significant specifications to illustrate the interpretation of the predicted relationships, using the range of 2D:4D that we see in our data (Fig. 1).Footnote ¹²

Fig. 1 Plot of the predicted relationships between 2D:4D and willingness to compete, for the regression with left hand 2D:4D and left hand 2D:4D squared, and also for the regression with average 2D:4D and average 2D:4D squared

The willingness to compete outcomes predicted by our regression equations show an inverse U-shaped relationship where, across a range of 2D:4D values from 0.85 to 1.1, low 2D:4D (synonymous with high prenatal testosterone exposure) predicts low competitiveness, which does not fit with the pre-existing hypothesis that high testosterone correlates with high competitiveness.Footnote ¹³ The highest willingness to compete is instead associated with mid-range 2D:4D for this predicted relationship. If the hypothesis tested in the existing literature was to hold here, we would see a decreasing relationship. As most 2D:4D measurements are below 1, we see that most of the distribution of observations would lie to the left of the peak, in the region of an increasing relationship, which is the opposite to the hypothesised relationship. The estimated inverse U-shaped relationship is thus unlikely to represent a real effect.

4 Discussion

In this study we find little evidence of 2D:4D correlating with economic preferences in a sample of 330 women. The only two statistically significant regression specifications ( $p < 0.005$ ) are not in the hypothesised direction and are not consistent with any previous findings, and are thus likely to be a false positive. The study by Ranehill et al. (Reference Ranehill, Zethraeus, Blomberg, von Schoultz, Hirschberg, Johannesson and Dreber2017) that was run in conjunction, but looking at the effect of the oral contraceptive pill, also did not find any impact of the pill on economic preferences.

Our null results could be due to several reasons. First, 2D:4D may be a reliable proxy of prenatal testosterone exposure but prenatal testosterone exposure may not correlate with economic preferences and previous results are false positive results. Second, 2D:4D may be a reliable proxy of prenatal testosterone but the relation between prenatal testosterone exposure and economic preferences is so weak that with 330 women we do not have sufficient statistical power to detect true positive results. Third, 2D:4D may be a weak or noisy proxy of prenatal testosterone but the relation between prenatal testosterone exposure and economic preferences is actually strong; but again we could then be underpowered to detect true positive results. Fourth, 2D:4D may be a weak or noisy proxy of prenatal testosterone and there is also a weak relation between prenatal testosterone exposure and economic preferences; again we could then be underpowered to detect true positive results. Fifth, 2D:4D may not correlate with economic preferences among women, thus our study would be set up to not find anything since we have only women in our sample. Given previous literature it is not clear to us why this should make a difference but additional high-powered studies, with pre-analysis plans, on men or mixed gender would be useful.

Sixth, perhaps there is something special about our sample that makes us not find a true correlation between 2D:4D and economic preferences that exist in more general samples. The editor pointed out that the selection of women who are non-smokers and who are willing to use oral contraceptives might generate a sample that is more risk-averse than the general population, or have a higher 2D:4D ratio. With respect to risk taking, the closest comparison of our sample to the general population is Boschini et al. (Reference Boschini, Dreber, von Essen, Muren and Ranehill2018) who explore risk preferences in a random sample of 487 Swedish women in a similar risk preference elicitation task of choices over lotteries versus safe options. In these samples, the average switching point is very similar—just below the risk neutral point. With respect to 2D:4D, our sample is within a similar range to previous studies.Footnote ¹⁴ In sum, more work is needed to disentangle these six possible explanations for our null results.

In a related vein, the evidence linking sex hormone administration to economic preferences is also inconclusive with most studies failing to reject the null hypothesis of no effect. The few statistically significant findings (as well as the null results) need, however, to be interpreted with caution because of low statistical power and the many researcher degrees of freedom [see the recent review by Dreber and Johannesson (Reference Dreber, Johannesson, Schultheiss and Mehta2018) for more information].

In sum, more work is needed with larger sample sizes and pre-registered hypotheses to have enough statistical power to find small effects of 2D:4D on economic preferences. Additionally, studies using improved indicators of prenatal testosterone exposure may be warranted.

Acknowledgements

Open access funding provided by Stockholm School of Economics. We thank David Bilén for research assistance, and Levent Neyse, Pablo Brañas-Garza and Thomas Buser for helpful comments. This work was supported by research grants from the Jan Wallander and Tom Hedelius Foundation (Grants P2010-0133:1, P2012-0002:1, P2013-0156:1, P2017-0143:1, and H2015-0408:1), the Knut and Alice Wallenberg Foundation (Wallenberg Academy Fellows Grant to A. Dreber), the Swedish Council for Working Life and Social Research (Grant 2006-1623), the Swiss National Science Foundation (Grant 100010-149451), the Austrian Science Fund (FWF, SFB F63), the Swedish Research Council (Grant 20324), Karolinska Institutet, and the regional agreement on medical training and clinical research (ALF) between Stockholm County Council and Karolinska Institutet (Grant 20130313).

Footnotes

Electronic supplementary material The online version of this article (https://doi.org/10.1007/s40881-019-00076-y) contains supplementary material, which is available to authorized users.

¹ Studies finding statistically significant results have sample sizes of 24 and 28 (van Anders et al. Reference van Anders, Vernon and Wilbur2006 and Voracek and Dressler Reference Voracek and Dressler2007 respectively) whereas studies finding no statistically significant differences report sample sizes of 55, 55 and 449 (Cohen-Bendahan Reference Cohen-Bendahan2005; Hiraishi et al. Reference Hiraishi, Sasaki, Shikishima and Ando2012 and Medland et al. Reference Medland, Loehlin and Martin2008, respectively).

² The results we include in the table for mixed gender samples are specifications which include a gender control only, unless noted otherwise.

³ Some studies have looked at other games, such as public goods games, and interpret behaviour as altruistic or pro-social, such as Millet and Dewitte (Reference Millet and Dewitte2006).

⁴ However, Millet and Dewitte (Reference Millet and Dewitte2009) use a hypothetical dictator game.

⁵ Although there is some evidence that this effect is context dependent (Gneezy et al. Reference Gneezy, Leonard and List2009)

⁶ Such as having a body mass index between 19–30, willing to start using oral contraceptives, being fluent in the Swedish language, being a non-smoker, not being pregnant and so on.

⁷ See online appendix for the economic experiment instructions as well as the instructions for the non-economic parts.

⁸ The order is constant for all participants for logistical reasons—the experiment was performed at the university hospital by the hospital staff over a period of several years, and randomizing the order of tasks was not something the staff wanted.

⁹ SEK 100 corresponds to roughly USD 11.

¹⁰ Whilst similar correlations have been found previously [see, e.g., Brañas-Garza et al. (Reference Brañas-Garza, Espín, Garcia-Muñoz and Kovářík2019)], our correlation appears to be on the low side (e.g., Neyse et al. (Reference Neyse, Bosworth, Ring and Schmidt2016) find correlations of 0.727 for men and 0.765 for women), however, other studies also find even lower correlations (e.g., Bönte et al. (Reference Bönte, Procher, Urbig and Voracek2017)).

¹¹ We also perform two additional analyses for the regression with willingness to compete and 2D:4D, which were not pre-specified in our pre-analysis plan. We include these in the online appendix Tables 8 and 9. The first analysis includes risk taking as a control variable. We find that adding this control variable leads to no change in the qualitative conclusions, except for the regression of average 2D:4D with the squared term, which now shows only suggestive evidence of a relation. The second analysis adds the piece-rate task performance as a control variable. In this case the regressions with left hand and left hand squared, and average squared now indicate suggestive evidence of a relation, as the p values for the coefficients are p < 0.05 . The results for the other regressions do not change.

¹² Our range of 2D:4D is within that commonly seen in the literature of around 0.8–1.2.

¹³ We note that some of the predicted values for the willingness to compete are negative, which is an implication of using OLS for estimation.

¹⁴ Neyse et al. (Reference Neyse, Bosworth, Ring and Schmidt2016) find that women in their sample on average have a left hand 2D:4D of 0.970 (with standard deviation of 0.0341) and for right hand 2D:4D the average is 0.967 (with standard deviation of 0.0362). Brañas-Garza et al. (Reference Brañas-Garza, Galizzi and Nieboer2018) find that in their female subsample the average left hand 2D:4D 0.9733 (with standard deviation of 0.0321) while average right hand 2D:4D is 0.9770 (with standard deviation of 0.0325). Our results are average left hand 2D:4D of 0.9667 (with standard deviation of 0.0327) and average right hand 2D:4D of 0.9801 (with standard deviation of 0.0313).

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Alonso, J., Di Paolo, R., Ponti, G., Sartarelli, M. (2018). Facts and misconceptions about 2D:4D, social and risk preferences. Frontiers in Behavioral Neuroscience, 12, 22.CrossRef Google Scholar PubMed

Apicella, C. L., Dreber, A., Campbell, B., Gray, P. B., Hoffman, M., Little, A. C. (2008). Testosterone and financial risk preferences. Evolution and Human Behavior, 29(6), 384–390.CrossRef Google Scholar

Apicella, C. L., Dreber, A., Gray, P. B., Hoffman, M., Little, A. C., Campbell, B. C. (2011). Androgens and competitiveness in men. Journal of Neuroscience, Psychology, and Economics, 4(1),54.CrossRef Google Scholar

Aycinena, D., Baltaduonis, R., Rentschler, L. (2014). Risk preferences and prenatal exposure to sex hormones for ladinos. PloS One, 9(8),e103332.CrossRef Google Scholar PubMed

Barel, Efrat. (2017). 2D:4D, Optimism, and Risk Taking. Current Psychology, 38(1), 204–212.CrossRef Google Scholar

Benjamin, D. J., Berger, J. O., Johannesson, M., Nosek, B. A., Wagenmakers, E.-J., Berk, R., Bollen, K. A., Brembs, B., Brown, L., Camerer, C. et al., (2018). Redefine statistical significance. Nature Human Behaviour, 2(1),6.CrossRef Google Scholar PubMed

Bönte, W., Procher, V. D., Urbig, D. (2016). Biology and selection into entrepreneurship—The relevance of prenatal testosterone exposure. Entrepreneurship Theory and Practice, 40(5), 1121–1148.CrossRef Google Scholar

Bönte, W., Procher, V. D., Urbig, D., Voracek, M. (2017). Digit ratio (2D:4D) predicts self-reported measures of general competitiveness, but not behavior in economic experiments. Frontiers in Behavioral Neuroscience, 11, 238.CrossRef Google Scholar

Boschini, A., Dreber, A., von Essen, E., Muren, A., & Ranehill, E. (2018). Gender, risk preferences and willingness to compete in a random sample of the Swedish population. Available at SSRN 3241415.Google Scholar

Brañas-Garza, P., Galizzi, M. M., Nieboer, J. (2018). Experimental and self-reported measures of risk taking and digit ratio (2D:4D): Evidence from a large, systematic study. International Economic Review, 59(3), 1131–1157.CrossRef Google Scholar

Brañas-Garza, P., Espín, A. M., Garcia-Muñoz, T., Kovářík, J. (2019). Digit ratio (2D:4D) and pro-social behavior in economic games: No direct correlation with generosity, bargaining or trust-related behaviors. Biology Letters, 10.1098/rsbl.2019.0185CrossRef Google Scholar

Brañas-Garza, P., Kovářík, J., Neyse, L. (2013). Second-to-fourth digit ratio has a non-monotonic impact on altruism. PloS One, 8(4),e60419.CrossRef Google Scholar

Brañas-Garza, P., Rustichini, A. (2011). Organizing effects of testosterone and economic behavior: Not just risk taking. PloS One, 6(12),e29842.CrossRef Google Scholar PubMed

Brown, W. M., Hines, M., Fane, B. A., Marc Breedlove, S. (2002). Masculinized finger length patterns in human males and females with congenital adrenal hyperplasia. Hormones and Behavior, 42(4), 380–386.CrossRef Google Scholar PubMed

Buser, T. (2012). Digit ratios, the menstrual cycle and social preferences. Games and Economic Behavior, 76(2), 457–470.CrossRef Google Scholar

Charness, G., Gneezy, U. (2012). Strong evidence for gender differences in risk taking. Journal of Economic Behavior and Organization, 83(1), 50–58.CrossRef Google Scholar

Chicaiza-Becerra, L. A., Garcia-Molina, M. (2017). Prenatal testosterone predicts financial risk taking: Evidence from Latin America. Personality and Individual Differences, 116, 32–37.CrossRef Google Scholar

Coates, J. M., Page, L. (2009). A note on trader Sharpe Ratios. PloS One, 4(11),e8036.CrossRef Google Scholar PubMed

Cohen-Bendahan, C. (2005). Biological roots of sex differences: A longitudinal twin study, Nijmegen: C. Cohen-Bendahan.Google Scholar

Croson, R., Gneezy, U. (2009). Gender differences in preferences. Journal of Economic Literature, 47(2), 448–474.CrossRef Google Scholar

Dariel, A., Kephart, C., Nikiforakis, N., Zenker, C. (2017). Emirati women do not shy away from competition: Evidence from a patriarchal society in transition. Journal of the Economic Science Association, 3(2), 121–136. 10.1007/s40881-017-0045-yCrossRef Google Scholar

de Miranda, K. L., Neyse, L., Schmidt, U. (2018). Risk preferences and predictions about others: No association with 2D:4D ratio. Frontiers in Behavioral Neuroscience, 12, 9.CrossRef Google Scholar

Dreber, A., Hoffman, M. (2007). Portfolio selection in utero, Stockholm: Stockholm School of Economics.Google Scholar

Dreber, A., Johannesson, M., & Schultheiss, O., Mehta, P. (2018). Sex hormones and economic decision making in the lab: A review of the causal evidence Routledge international handbook of social neuroendocrinology, Milton Park: Taylor & Francis Group.Google Scholar

Drichoutis, A. C., Nayga, R. M. (2015). Do risk and time preferences have biological roots? Southern Economic Journal, 82(1), 235–256.CrossRef Google Scholar

Eckel, C. C., Grossman, P. J. (2008). Men, women and risk aversion: Experimental evidence. Handbook of Experimental Economics Results, 1, 1061–1073.CrossRef Google Scholar

Galizzi, M. M., Nieboer, J. (2015). Digit ratio (2D:4D) and altruism: evidence from a large, multi-ethnic sample. Frontiers in Behavioral Neuroscience, 9, 41.CrossRef Google Scholar PubMed

Garbarino, E., Slonim, R., Sydnor, J. (2011). Digit ratios (2D:4D) as predictors of risky decision making for both sexes. Journal of Risk and Uncertainty, 42(1), 1–26.CrossRef Google Scholar

Gelman, A., & Loken, E. (2013). The garden of forking paths: Why multiple comparisons can be a problem, even when there is no “fishing expedition” or “p-hacking” and the research hypothesis was posited ahead of time. Working Paper.Google Scholar

Gelman, A., Carlin, J. (2014). Beyond power calculations: Assessing type S (sign) and type M (magnitude) errors. Perspectives on Psychological Science, 9(6), 641–651.CrossRef Google Scholar PubMed

Gneezy, U., Leonard, K. L., List, J. A. (2009). Gender differences in competition: Evidence from a matrilineal and a patriarchal society. Econometrica, 77(5), 1637–1664.Google Scholar

Grimbos, T., Dawood, K., Burriss, R. P., Zucker, K. J., Puts, D. A. (2010). Sexual orientation and the second to fourth finger length ratio: A meta-analysis in men and women. Behavioral Neuroscience, 124(2),278.CrossRef Google Scholar PubMed

Hiraishi, K., Sasaki, S., Shikishima, C., Ando, J. (2012). The second to fourth digit ratio (2D:4D) in a Japanese twin sample: Heritability, prenatal hormone transfer, and association with sexual orientation. Archives of Sexual Behavior, 41(3), 711–724.CrossRef Google Scholar

Hollier, L. P., Keelan, J. A., Jamnadass, E. S. L., Maybery, M. T., Hickey, M., Whitehouse, A. J. O. (2015). Adult digit ratio (2D: 4D) is not related to umbilical cord androgen or estrogen concentrations, their ratios or net bioactivity. Early Human Development, 91(2), 111–117.CrossRef Google Scholar PubMed

Hönekopp, J., Schuster, M. (2010). A meta-analysis on 2D:4D and athletic prowess: Substantial relationships but neither hand out-predicts the other. Personality and Individual Differences, 48(1), 4–10.CrossRef Google Scholar

Hönekopp, J., Watson, S. (2011). Meta-analysis of the relationship between digit-ratio 2D:4D and aggression. Personality and Individual Differences, 51(4), 381–386.CrossRef Google Scholar

Lutchmaya, S., Baron-Cohen, S., Raggatt, P., Knickmeyer, R., Manning, J. T. (2004). 2nd to 4th digit ratios, fetal testosterone and estradiol. Early Human Development, 77(1), 23–28.CrossRef Google Scholar

Manning, J. T., Scutt, D., Wilson, J., Iwan Lewis-Jones, D. (1998). The ratio of 2nd to 4th digit length: a predictor of sperm numbers and concentrations of testosterone, luteinizing hormone and oestrogen. Human Reproduction (Oxford, England), 13(11), 3000–3004.CrossRef Google Scholar PubMed

McIntyre, M. H. (2006). The use of digit ratios as markers for perinatal androgen action. Reproductive Biology and Endocrinology, 4(1),10.CrossRef Google Scholar PubMed

Medland, S. E., Loehlin, J. C., Martin, N. G. (2008). No effects of prenatal hormone transfer on digit ratio in a large sample of same-and opposite-sex dizygotic twins. Personality and Individual Differences, 44(5), 1225–1234.CrossRef Google Scholar

Miller, E. M. (1994). Prenatal sex hormone transfer: A reason to study opposite-sex twins. Personality and Individual Differences, 17(4), 511–529.CrossRef Google Scholar

Millet, K., Dewitte, S. (2006). Second to fourth digit ratio and cooperative behavior. Biological Psychology, 71(1), 111–115.CrossRef Google Scholar PubMed

Millet, K., Dewitte, S. (2009). The presence of aggression cues inverts the relation between digit ratio (2D:4D) and prosocial behaviour in a dictator game. British Journal of Psychology, 100(1), 151–162.CrossRef Google Scholar

Nelson, J. A. (2015). Are women really more risk-averse than men? A re-analysis of the literature using expanded methods. Journal of Economic Surveys, 29(3), 566–585.CrossRef Google Scholar

Neyse, L., Vieider, F. M., Ring, P., Probst, C., Kaernbach, C., van Eimeren, T., & Schmidt, U. (2019). Risk attitudes and digit ratio (2D:4D): Evidence from prospect theory. Available at SSRN 3409084.CrossRef Google Scholar

Neyse, L., Bosworth, S., Ring, P., Schmidt, U. (2016). Overconfidence, incentives and digit ratio. Scientific Reports, 6, 23294.CrossRef Google Scholar PubMed

Niederle, M., Vesterlund, L. (2007). Do women shy away from competition? Do men compete too much? The Quarterly Journal of Economics, 122(3), 1067–1101.CrossRef Google Scholar

Puts, D. A., McDaniel, M. A., Jordan, C. L., Marc Breedlove, S. (2008). Spatial ability and prenatal androgens: Meta-analyses of congenital adrenal hyperplasia and digit ratio (2D:4D) studies. Archives of Sexual Behavior, 37(1),100.CrossRef Google Scholar PubMed

Ranehill, E., Zethraeus, N., Blomberg, L., von Schoultz, B., Hirschberg, A. L., Johannesson, M., Dreber, A. (2017). Hormonal contraceptives do not impact economic preferences: Evidence from a randomized trial. Management Science, 64(10), 4515–4532.CrossRef Google Scholar

Sapienza, P., Zingales, L., Maestripieri, D. (2009). Gender differences in financial risk aversion and career choices are affected by testosterone. Proceedings of the National Academy of Sciences, 106(36), 15268–15273.CrossRef Google Scholar PubMed

Savic, I., Frisen, L., Manzouri, A., Nordenstrom, A., Hirschberg, A. L. (2017). Role of testosterone and Y chromosome genes for the masculinization of the human brain. Human Brain Mapping, 38(4), 1801–1814.CrossRef Google Scholar PubMed

Schipper, B. C. (2014). Sex hormones and choice under risk. Working Paper.Google Scholar

Simmons, J. P., Nelson, L. D., Simonsohn, U. (2011). False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychological Science, 22(11), 1359–1366.CrossRef Google Scholar PubMed

Stenstrom, E., Saad, G., Nepomuceno, M. V., Mendenhall, Z. (2011). Testosterone and domain-specific risk: Digit ratios (2D:4D and rel2) as predictors of recreational, financial, and social risk-taking behaviors. Personality and Individual Differences, 51(4), 412–416.CrossRef Google Scholar

Sytsma, T. (2014). Handling risk: Testosterone and risk preference, evidence from Dhaka, Bangladesh. Master’s thesis, The University of San Francisco.Google Scholar

van Anders, S. M., Vernon, P. A., Wilbur, C. J. (2006). Finger-length ratios show evidence of prenatal hormone-transfer between opposite-sex twins. Hormones and Behavior, 49(3), 315–319.CrossRef Google Scholar PubMed

Voracek, M., Dressler, S. G. (2007). Digit ratio (2D:4D) in twins: Heritability estimates and evidence for a masculinized trait expression in women from opposite-sex pairs. Psychological Reports, 100(1), 115–126.CrossRef Google Scholar PubMed

Voracek, M., Loibl, L. M. (2009). Scientometric analysis and bibliography of digit ratio (2D:4D) research, 1998–2008. Psychological Reports, 104(3), 922–956.CrossRef Google Scholar PubMed

Voracek, M., Pietschnig, J., Nader, I. W., Stieger, S. (2011). Digit ratio (2D:4D) and sex-role orientation: Further evidence and meta-analysis. Personality and Individual Differences, 51(4), 417–422.CrossRef Google Scholar