1. Introduction
Governments regularly conduct benefit–cost analyses to help policymakers understand the merits of major policy proposals that affect mortality and morbidity risk. While there is a rich literature on willingness to pay (WTP) to reduce mortality risk, it is widely recognized that there are relatively few studies on willingness to pay to reduce the risk of non-fatal outcomes (Cameron, Reference Cameron2014). To address this gap, the OECD has been implementing a coordinated set of stated preference surveys, the Surveys on Willingness-to-pay to Avoid negative Chemicals-related Health Effects (SWACHE) project.Footnote 1
We present the results of a stated preference study, implemented in 10 countries, estimating WTP for reduced risk of serious kidney impairment, defined as stage 3 and 4 chronic kidney disease, the serious chronic phase of kidney disease that is followed by permanent kidney failure (end-stage renal disease (ESRD)) in about 35 % of cases. Serious kidney impairment has been linked to exposure to several chemicals, including heavy metals, certain organic solvents, polycyclic aromatic hydrocarbons (PAHs), and biotoxins (Kataria et al., Reference Kataria, Trasande and Trachtman2015). Kidney effects have also been associated with exposures to “GenX” chemicals (a trade name for a technology used to make high-performance fluoropolymers) and perfluorobutane sulfonic acid (PFBS) (U.S. EPA, 2018). In addition, some foodborne or waterborne biological organisms, e.g. Shiga toxin-producing E. coli, can produce biotoxins that can cause serious kidney damage (Obrig, Reference Obrig2010).
Few studies have estimated WTP to avoid or reduce the risk of chronic kidney disease. Herold (Reference Herold2010) conducted a stated preference survey of U.S. patients with permanent kidney failure (or ESRD) to explore factors influencing their WTP to secure a kidney for transplant, reporting non-zero WTP ranging from less than $2,000 to over $50,000. Kjaer et al. (Reference Kjær, Bech, Kronborg and Morkbak2013) used a discrete choice experiment of 206 respondents to estimate WTP to establish nephrology treatment facilities in Greenland for patients with permanent kidney failure.
The European Chemicals Agency (ECHA) (2014) estimated WTP to avoid acute kidney injury and WTP to reduce the risk of permanent kidney failure (ESRD) in the Czech Republic, the United Kingdom, the Netherlands, and Italy. Using a chaining approach (Carthy et al., Reference Carthy, Chilton, Covey, Hopkins, Jones-Lee, Loomes, Pidgeon and Spencer1998), WTP to avoid ESRD was estimated at less than $5,000. The authors suggest estimates be “treated with caution” in part because responses do not seem to be responsive to illness length or severity.
Rigby et al. (Reference Rigby, Burton, Payne, Wright, O’Brien, Hardstaff, Ozdemiroglu, Gianferrera and Mistry2017) estimated WTP to avoid permanent kidney failure in adults and children and found WTP per statistical case of about $80,000 and nearly $250,000, respectively. As with the ECHA study, the survey does not consider the longer and more common phase of chronic kidney disease that usually precedes permanent kidney failure.
These studies estimate WTP for an aspect of kidney disease (obtain a transplant or avoid kidney failure), but there appear to be no studies that provide WTP estimates for serious chronic kidney disease generally and that produce results applicable to valuing the reduction in risk of serious chronic kidney disease and can be used for benefit–cost analysis. The results here are the first WTP values for kidney disease that are broadly applicable for benefit–cost analysis, available for several countries and internationally comparable.
2. Survey design
We estimate values for symptomatic kidney disease, including specific impacts from disease progression, a more broadly applicable endpoint for valuing the impacts of chemical and biotoxin exposure that can cause kidney disease. Specifically, we value the risk of stages 3 and 4 chronic kidney disease, labeled as “serious kidney disease,” and the probability that it progresses to kidney failure (stage 5).
The survey presents an accurate but succinct description of chronic kidney disease, its impacts, and its causes. The survey explains the symptoms associated with serious kidney disease (e.g. lower back pain, nausea, difficulty concentrating) and how kidney disease can lead to other major health complications (e.g. high blood pressure, cardiovascular disease, fluid in lungs, weakened ability to fight infections). The survey description also includes the 35 % likelihood that serious kidney disease can lead to permanent kidney failure and provides a simplified, but detailed, description of what is involved in being on dialysis, the chances of finding a donor kidney, and lifestyle changes and medical care needed following a kidney transplant. The description of kidney disease concludes with information on the impact of serious kidney disease on life expectancy.
Early pretesting made it clear that a scenario based on treatment was not viable because it raised questions about why the treatment was not covered by existing public health programs in many countries, and how the elicited payments related to those programs. Consequently, the survey relies on a scenario where the mechanism for reducing risk is not specified, but the costs are clearly out-of-pocket expenses. This approach has been used successfully in prior stated preference surveys (Krupnick et al., Reference Krupnick, Alberini, Cropper, Simon, O’Brien, George and Heintzelman2002; Hoffmann et al., Reference Hoffmann, Krupnick and Qin2017). Respondents were informed that only the risk of kidney disease would be affected and that no other effects would occur.
We designed the survey instrument through a multi-step process followed by all surveys in the SWACHE project. This started with a description of the health endpoint and an evaluation of how to characterize the endpoint in the survey, followed by the development of the risk reduction mechanism, the payment vehicle, and the elicitation method. A steering group of experts, including academics, BCA practitioners, regulators, and health professionals, provided regular feedback throughout the process.
The draft survey was initially tested in 51 one-on-one interviews across Chile, Denmark, Italy, and the United States. The near-final version of the survey instrument was translated into the necessary languages for all countries, with the translations verified by native speakers, and subjected to 50-person pilot tests in each country prior to it being fielded.
The survey uses a double-bounded dichotomous choice (DBDC) question with an open-ended follow-up question. Risks are characterized as chances per thousand over 5 years, displayed numerically and graphically, together with a short tutorial on risk, risk changes, and how these are presented in the survey. Aggregating risk over 5 years was necessary for risk changes to be plausible and effectively communicated to respondents given baseline risk levels for kidney diseases, particularly for the youngest age group.
Respondents for each country were divided into three age groups (18–40, 40–60, and over 60) to establish the baseline risk of serious kidney disease (15, 25, and 60 in 1,000).Footnote 2 Members of each age group were randomly assigned into one of two hypothetical risk reductions (2 in 1,000 and 5 in 1,000) and one of four possible starting bids. The follow-up bid for the second dichotomous choice question was the first bid multiplied by either two or one-half, following conventional approaches as described in Carson and Hanneman (Reference Carson, Hanneman, Mäler and Vincent2006).Footnote 3
The bid structure is shown in Table 1.
The payment is stated as both an annual figure and as a total over the 5 years. An example of the valuation question is shown in Figure 1.
After the risk tutorial, 91.5 % of respondents correctly answered a simple test employing the risk communication device in which they were asked to identify the scenario with the highest probability of occurring.
All surveys in the SWACHE project share a common set of debriefing questions to identify common issues of concern. Key questions include whether respondents understood that the risk reductions were to occur only over the 5 years described in the survey and were not permanent, and whether they considered benefits beyond those described in the scenario.
3. Data
The survey was administered between June 2021 and June 2022 to a sample drawn from a large panel of individuals maintained by Ipsos, a global marketing and survey research firm. Representative samples were drawn for each country based on quotas matching key country-specific demographic characteristics: gender, age group, level of education, and geographic region. Respondents were screened to exclude those with chronic kidney disease. A total of 14,641 individuals started the survey after passing the screening questions, and 12,614 finished the survey. Ipsos removed 614 respondents due to a low-quality response score based on survey speeding (completing the survey in less than 1/3rd of the median survey time), straight-lining, and the proportion of “don’t know” answers, leaving a sample of 12,000.
The data were then screened based on core principles for empirical analysis agreed upon by the SWACHE researchers.Footnote 4 Screening included removing individuals who failed the survey’s probability test (8.47 %) and “speeders” who completed the survey or valuation questions in less than 48 % of the country-specific median time of the sample (12.4 %).Footnote 5 This screening is in addition to the screening done by Ipsos, and is based on the recommendations of Survey Sampling International (2013) and Mitchell (Reference Mitchell2014), and reduced the sample to 9,709 observations. Additional factors that might affect WTP were also identified, including gender, monthly household income, level of education, health expenditure paid for out of pocket, higher or lower average perceived health, having a friend or relative who experienced kidney failure, being diagnosed with some other chronic disease, and being diagnosed with or having a friend or relative diagnosed with COVID-19. These additional factors were used as explanatory variables in the econometric analysis.
Respondents were asked to indicate their household’s monthly income after income taxes had been paid and were presented with 10 income ranges corresponding to income deciles in their respective countries. Income deciles correspond to equivalized income. Unequivalized income deciles are derived by multiplying equivalized income deciles in 2019 from OECD Income (IDD) database by the number of “equivalent adults” using data on family composition from OECD Family database.Footnote 6 Respondents who did not indicate their range for the income deciles were presented with bigger ranges corresponding to income quintiles in their respective countries. Most respondents (91 %) provided information about the total income of their household. Income ranges were then converted into a single amount to facilitate the use of income data in the empirical analysis. For the smallest income range between 0 and the first decile, the income is set equal to 0.5 times the first decile. For the largest income range above the last decile, the income is computed as equal to 1.5 times the top decile. For all the other income ranges, the computed income is the simple average between the two deciles. All income values were then converted into purchasing power parity adjusted U.S. dollars (USD PPP) using PPP for actual individual consumption data for 2019 from the PPPs and exchange rates OECD database.Footnote 7
We imputed missing income responses by performing country-level Ordinary Least Square (OLS) regression analyses of logged income as a function of indicators for age, couples, females, high education, number of people in the household, employment, part-time work, and whether the respondent was retired.
Table 2 presents the sampling target and the achieved sample for target characteristics of the screened sample. The table shows an over-representation of females in all countries except Denmark, Norway, and Türkiye. This over-representation was present in the full sample received by Ipsos but was larger after our screening. The achieved sample is close to the sample target for age groups. China and Türkiye show an under-representation and Demark an over-representation of individuals over 60. Low and medium levels of education were collapsed into a single group because of an under-representation of lower-educated respondents in all countries. Higher educated respondents are close to the target except for Norway and Türkiye.
A respondent’s monthly income is the average of the income range selected, converted to USD PPP. The median monthly income for Canada, Denmark, Italy, Norway, and the United States closely matches the population median. The achieved sample for Chile, Germany, and the United Kingdom show a slightly higher deviation, with an under-representation of low-income individuals. The two outliers are Türkiye and China, which show much higher incomes in the achieved samples than the population medians. The sample median income is 162 % higher than the target for Türkiye and 227 % higher for China.
Given that the Chinese sample has a median income of more than 225 % of the Chinese population median and that 28 % of the Chinese respondents completed the survey and valuation question exceptionally quickly, China was removed from the baseline analysis, but is included as a sensitivity analysis in the main results table and in the country-level parametric estimations of WTP.
To test for sensitivity to scope and scale, Tables 3 shows the response to the first dichotomous choice question is broken down by the risk reduction offered and the starting bid. For presentation purposes, the starting bid is presented as the 5-year cost for U.S. respondents, as described in Table 1. In practice, the starting bid is a continuous variable because the U.S. dollar values were rounded after being converted to the local currency. For both risk reduction values, the per cent of respondents who answered “Yes” to the first question declines with the starting bid, indicating that respondents are less likely to be willing to pay for a risk reduction as the bid increases. This is the expected sensitivity to scale. Additionally, for the three starting bids that are common to both risk reduction values, the per cent of respondents who answered “Yes” is higher for the larger risk reduction. This suggests that respondents are willing to pay more for a larger risk reduction, which is the expected sensitivity to scope.
Tables 4 and 5 display the summary statistics for the baseline sample of 8,905 respondents who remained after screening those who failed the probability test or were identified as speeders and after removing China. Table 4 provides the summary statistics for the important continuous variables used in the analysis. The first block of data (5 rows) shows the time to complete the survey and the time respondents spent on the two dichotomous choice questions. Respondents answered the second risk question twice as fast as the first one, probably due to increased understanding after answering the first question. The second block of data shows the bid values for the first and second dichotomous choice questions, with the second bid being either one-half or twice the first bid value. Twenty-five per cent of respondents chose the lower risk (responded “yes” to pay the cost) for both risk questions, and 40 % of respondents chose the current risk (responded “no” to pay the bid price) for both questions.
The last rows in Table 4 provide statistics for baseline risk and income, continuous variables considered to be important factors affecting willingness to pay. The baseline risk is 15, 25, or 60, depending on the age group. The PPP-adjusted reported median income ranges from $263 (in Chile) to $17,552 (in the U.S.). Almost 10 % of the baseline sample declined to report their income, but the income with the imputed values used for the missing responses produces a very similar income distribution and provides 8,905 observations for the estimation.
Table 5 provides the summary statistics for the indicator (0/1) variables used in the analysis. The table reports the number of observations for which the variable is equal to one, and the per cent of the total 8,905 observations. The first block on the left-hand side reports the observations for each country after screening for speeders and those who failed the probability test. China is not included in this list because it was removed from the baseline sample. Denmark, Türkiye, and the U.S. have a slightly lower per cent of the observations because of a larger number of speeders and respondents who failed the probability test. The second two blocks on the left-hand side of Table 5 show the age–gender distribution of the baseline sample. As noted previously, there is an over-representation of females in this sample.
The first block on the right-hand side of Table 5 shows the distribution of responses to both risk questions. Twenty-five per cent of respondents chose the lower risk (responded “yes” to pay the cost) for both risk questions, and 40 % of respondents chose the current risk (responded “no” to pay the bid price) for both questions. Approximately 15 % and 20 % of the respondents answered “yes-no” and “no-yes” to the first and second dichotomous choice questions, respectively.
The second block on the right-hand side provides statistics for the indicator variables considered important factors affecting willingness to pay. A large per cent of respondents (38.7 %) believe that their health is better than average, while 15.9 % self-report lower than average health. The majority of respondents (65.8 %) have never been diagnosed with a chronic disease, but over a third (34.8 %) had a relative or friend with kidney disease. Only 8.5 % of respondents report having been diagnosed with COVID-19, but almost 40 % have a close friend or relative who had been diagnosed with COVID-19.
The final block on the right-hand side of Table 5 contains the statistics for controls used to test the robustness of the screening analysis. 5.8 % of the respondents reported that they thought the risk reduction was permanent. 9.2 % considered changes in other health issues not described in the survey (co-benefits) when they made their choices. Eleven per cent of the respondents strongly agreed (on a 5-point Likert scale) with the statement that they would pay almost any amount to reduce risks to their health. Combining those who strongly agreed to pay almost anything with the 25 % of respondents who answered Yes-Yes to the two dichotomous choice questions produces 6.2 % of respondents classified as yea-sayers. 3.4 % of respondents reported that they did not answer as they would have in real life or said that the survey did not provide them enough information to make informed choices. These respondents were classified as protesters.
4. Estimation model
To derive mean and median WTP estimates for a reduction in the risk of serious kidney disease, we employ a maximum likelihood estimator using the binary outcomes of the double-bounded dichotomous choice.
We assume a Weibull distribution for the utility error as our baseline because it generally has a shorter right tail than the log-normal and, in its “spike” configuration, usually performs well (Kriström, Reference Kriström1997; Carson & Hanneman, Reference Carson, Hanneman, Mäler and Vincent2006).
We employ a spike model to account for the possibility that a small share of the population might still choose the status quo even if it costs them nothing. This spike could be significant in the case of serious kidney disease because the baseline risk is relatively small for younger respondents. Carson and Hanemann argue that failing to include a spike parameter can, in some cases, lead to overestimating WTP.
We measure the spike by using responses to the dichotomous question and responses to the open-ended question that followed the double-bounded dichotomous choice: “What would be the most you would be willing to pay, if anything, to reduce your chance of getting serious kidney disease within 5 years?”. The spike dummy variable equals one when respondent $ i $ chooses no to both valuation questions and responds with a value of zero to the open-ended questions.
A Weibull distribution $ \theta =\left\{k,\lambda \right\} $ is characterized by a shape parameter $ k $ and a scale parameter $ \lambda $ . All estimations assume a shape parameter equal to 1. The baseline specification of the scale parameter when $ b>0 $ is
where $ {\Delta R}_i $ is the risk reduction proposed to respondent $ i $ , $ \ln {b}_i $ is the logged cost or bid proposed to respondent $ i $ , $ {d}_{ic} $ is a country dummy equal to 1 when respondent $ i $ lives in country c, and $ {\omega}_i $ is the post-stratification weight of respondent $ i $ . Including post-stratification weights, $ {\omega}_i $ , as a control allows us to capture the fact that some categories of people were slightly under- or over-represented in the sample compared to the actual population.Footnote 8 The more respondent $ i $ is underrepresented in the sample, the higher his weight $ {\omega}_i $ . It is necessary to interact country dummies with the weights because the weights are defined at the country level.
The spike parameter when $ b=0 $ is
The model is also estimated when the scale parameter includes additional explanatory variables as follows:
$ {\mathrm{Female}}_i $ is a dummy variable equal to 1 when respondent $ i $ identifies as a female, $ \ln {y}_i $ is the logged monthly income for the household of respondent $ i $ , $ {\mathrm{HighEduc}}_i $ is a dummy variable equal to 1 when respondent $ i $ achieved high education outcome, and $ {\mathrm{Baseline}}_i $ is the baseline risk presented to respondent $ i $ .
The model is also estimated when the scale parameter includes information on other important factors. These factors include whether respondents must pay for health expenditures out-of-pocket; whether they perceive their health as below or above the average of people of their gender and age; whether they know a relative who had kidney failure; whether they are diagnosed with any other chronic disease; and whether they or a relative was ever diagnosed with COVID-19.
The mean WTP for an average 3.5 in 1,000 risk reduction in serious kidney disease is computed as a simple average of the individual mean WTP follows:
The individual mean WTP is computed by integrating the probability of responding yes to the valuation question over the interval from 0 to maximum bid with adjustment:
where $ f $ is the density function of the Weibull distribution and k denotes the shape parameter. Truncation at maximum bid level $ {b}_{max} $ is necessary since the right tail is not null when the cost goes to infinity. The adjustment of the denominator compensates for the fact that the support of $ f\left({\lambda}_{ic}(b),k\right) $ does not stop at $ {b}_{max} $ . The median WTP is computed as a simple average of individual median WTP as follows:
$ {\alpha}_2 $ is the parameter for the logged bid value as indicated in equation (1).
Mean and median WTP are also presented as the value of a statistical case of kidney disease avoided. VSC is calculated using the standard method of dividing annual WTP by the average annual risk reduction. Because this survey asks about risk changes over a 5-year period and payments over this same period, VSC can equivalently be calculated as 5 x annual WTP divided by the average 5-year risk reduction of 3.5 per 1,000.
5. Results
The parametric estimation results of the dichotomous choice model are presented in Table 6. Column 1 shows the baseline estimation results. The size of the risk reduction has a positive and statistically significant effect on the joint probabilities of choosing the reduced risk options, indicating scope sensitivity. Consistent with expectations, the additional cost of choosing the reduced-risk option has a negative and statistically significant effect on the likelihood that it is chosen. The spike variable equals 0.035 and is statistically different from zero. In other words, the average probability that people are indifferent to the valued item is 3.5 % of the estimation sample. This spike at zero is small but high enough to justify using a spike model. For an average reduction of 3.5 in 1,000 in the risk of serious kidney disease over 5 years, mean WTP is $2,609 and median WTP is $764. The mean value of a statistical case (VSC) of serious kidney disease is $805,000 and the median VSC is $224,000.
Note: The baseline estimation corresponds to a maximum likelihood estimation of the joint probabilities, assuming a Weibull distribution with a spike configuration. The baseline sample excludes survey responses from China. All columns exclude survey and valuation speeders as well as respondents who failed the risk tutorial test. Standard errors are given in parentheses. Signif. codes: 0 “***” 0.001 “**” 0.01 “*” 0.05 “+” 0.1. ° integral truncated at maximum bid level with adjustments.
These results are robust to alternative methodological choices. Column 2 shows the estimation results when the post-stratification weights and its interactions with country dummies are not included as regressors. Column 3 excludes the possibility of a spike at zero. Column 4 assumes a log-logistic distribution rather than a Weibull distribution, while column 5 assumes a log-normal distribution. Column 6 includes survey responses from China, and column 7 excludes survey responses from Türkiye. China was excluded from the baseline model for the reasons described above. Türkiye was excluded in column 7 because the median income of respondents from Türkiye is twice as high as the median income of the population of Türkiye. All columns show statistically significant scope sensitivity and a negative impact of cost on the joint probabilities to choose the reduced risk option that is statistically different from zero. The mean VSC varies from $698 (without a spike) to $1,093 (using a log-normal distribution), and the median VSC varies from $176 (from the log-normal model) to $373 (when including China). The largest deviation of the mean WTP from the baseline estimate is when a log-logistic and log-normal distribution is used, and when survey responses from China are included in the estimation sample. The Weibull distribution was chosen because of a lower AIC score, and as explained previously, the Chinese respondents in this sample are richer than the actual Chinese population.
Table 7 shows the baseline model with likely determinants of WTP added as additional cofactors. The statistically significant determinants are the size of the risk reduction, income, and not having another serious disease. Below average health is marginally significant at the 5 % level. The cost has a negative effect on the probability of choosing the reduced risk option, and income and the size of the risk reduction have positive impacts. Not being diagnosed with another serious disease has a negative effect on the probability of choosing the reduced risk option. Surprisingly, people who perceive their own health as below the average of people their age and gender are less likely to choose the reduced-risk option. This could capture respondents who have lower preferences for a healthy lifestyle. Gender, education, health insurance, and one’s own or a relative’s COVID-19 diagnosis have no statistically significant impact on WTP.
Note: The baseline estimation corresponds to a maximum likelihood estimation of the joint probabilities, assuming a Weibull distribution with a spike configuration. The baseline sample excludes survey responses from China. All columns exclude survey and valuation speeders as well as respondents who failed the risk tutorial test. Standard errors are given in parentheses. Signif. codes: 0 “***” 0.001 “**” 0.01 “*” 0.05 “+” 0.1. ° integral truncated at maximum bid level with adjustments. °° Marginal effects as a result of a 1/1,000 increase in risk reduction or baseline risk reduction for each observation. °°° Marginal effects as a result of a USD 500 per month increase in in baseline income for each observation. K means thousand.
To illustrate the impact and relative magnitude of these determinants, marginal effects on the mean WTP are reported in the last column of Table 7. The marginal impact for a dummy variable was calculated by running the model twice, once with the indicator variable set to zero for all observations and a second time with the indicator variable set to 1, and then recording the change in the mean WTP. The WTP of people who have not been diagnosed with a chronic disease is 15 %, or $387, lower than the WTP of people who have a chronic disease. Respondents who believe their health is below average would be willing to pay 11 % or USD $279 less. The marginal effect for the continuous variables – risk reduction, logged income, and baseline risk – was calculated by running the model twice, once using the baseline model and a second time increasing the variable by the same amount for all observations. An increase in the risk reduction of 1 in 1,000 for each observation raises the mean WTP by 20 %, or USD $518. Increasing the baseline risk for each observation by 1 in 1,000 reduces the mean by a negligible $1. An increase in income of $500 per month for each observation raises the WTP by 3.6 %, or $94. Lastly, increasing the income of every observation by 1 % increases the mean WTP by $5.32, which is 0.2 % of the baseline mean WTP of $2,609, implying an income elasticity of 0.2.
The baseline estimation results are robust to different screening choices, as shown in Table 8. When excluding respondents who thought that the change in the risk was, the mean VSC equals $806,000, as in column 9. This is almost exactly the same as the baseline estimate of $805,000. Removing co-benefiters, respondents who considered changes in other health issues not described in the survey, slightly reduces the VSC to $796,000. When people who responded yes to both valuation questions and who indicated that they strongly agree that they would pay almost anything to reduce their risk (the yea-sayers) are removed from the estimation sample, the mean VSC equals $806,000. Excluding all people who indicated that they strongly agree that they would pay almost anything to reduce their risk, which is more restrictive than removing the yea-sayers, barely shifts the mean VSC to $805,000. Finally, removing protest responses, defined as those who strongly disagree that the survey gave sufficient information to decide or who did not answer as they would in real life, reduced the mean VSC to $804,000. Overall, the WTP and VSC estimates from these different screenings is effectively the same as the baseline estimate.
Note: The baseline estimation corresponds to a maximum likelihood estimation of the joint probabilities, assuming a Weibull distribution with a spike configuration. The baseline sample excludes survey responses from China. All columns exclude survey and valuation speeders as well as respondents who failed the risk tutorial test. Standard errors are given in parentheses. Signif. codes: 0 “***” 0.001 “**” 0.01 “*” 0.05 “+” 0.1. ° integral truncated at maximum bid level with adjustments.
Country-level estimates
The parametric estimation results of modeling each country separately are presented in Table 9. For all countries, the coefficients have signs and magnitudes that are consistent with the results from the baseline model. Scope sensitivity (i.e. the coefficient on risk reduction) is statistically significant for all countries, lowest in Germany and Türkiye, and highest in the United Kingdom and Chile. In all countries, the cost for the reduced risk option (i.e. scale sensitivity) has a negative effect on the probability to choose the reduced risk option and is statistically different from zero.
Note: All models correspond to a maximum likelihood estimation of the joint probabilities, assuming a Weibull distribution with a spike configuration. All columns exclude survey and valuation speeders as well as respondents who failed the risk tutorial test. Standard errors are given in parentheses. Signif. codes: 0 “***” 0.001 “**” 0.01 “*” 0.05 “+” 0.1. ° integral truncated at maximum bid level with adjustments. K means thousand.
The highest country-specific mean WTP is $8,198 for China, which is a clear outlier. There are various factors that can explain such a high value. First, the median income of the sample of Chinese respondents is three times higher than the median income of the Chinese population, so the mean WTP estimated for China is biased upward, given the positive effect of income on WTP. Second, the health system in China likely incentivizes Chinese respondents to choose the reduced risk option, other things equal. Despite progress in recent years, China’s social security system provides limited protection, and medical bankruptcies pose a serious threat to many households. Insurance for sick leave and for healthcare costs is also uncommon; many Chinese respondents likely have limited or no sick pay and sick leave. In sum, having a chronic disease is more costly for individuals in China than for those in the other countries sampled where the health systems are more protective. When excluding China, the mean (median) WTP varies from $1,920 ($419) in the United Kingdom to $4,046 ($1,390) for Türkiye. The small median WTP for Canada is consistent with the high share of respondents who are indifferent to the valued item, shown by the large and statistically significant spike at zero.
Table 10 presents the estimation of country-level values using the pooled baseline model. This model includes post-stratification weights times country dummies, which not only corrects for the under- or over-representation of individuals in the sample compared to the actual population, but also captures “cultural” differences between the countries. It also increases the statistical power of the country-level estimate.
Country-level estimates of the mean WTP from the pooled baseline model are generated by recovering the individual-level WTP estimates for each observation. As described above, the individual mean WTP is computed by integrating the probability of an individual responding yes to the valuation question over the interval from 0 to the maximum bid, with an adjustment. The country-level mean WTP is the average of the individual mean WTP estimate for all observations from that country. The same approach is used for the median WTP and the VSC estimates.
The country-level results in Table 10 are generally consistent with the results in Table 9. A country-level estimate is not provided for China because it was not included in the baseline model. The largest impact of using the pooled model is on the estimate for Chile. The mean WTP and VSC increased by over 28 % to $3,163 and $994, respectively, and the median values decreased by over 20 % to $994 and $292. Most of the other changes in country-level estimates are less than 10 %, except for the VSC estimates for the U.S. The mean (median) WTP varies from USD $1,832 ($455) in the United Kingdom to $3,870 ($1,378) for Türkiye. The mean (median) VSC varies from $556,000 ($133,000) in the United Kingdom to $1,203,000 ($401,000) for Türkiye.
A bivariate regression indicates that GDP is a statistically significant determinant of cross-country differences in the mean VSC (p-value = 0.018, Adj. R 2 = 0.51). However, preferences for reduced risk of serious kidney disease differ across countries and cannot be predicted by differences in GDP per capita alone. Other factors, such as differences in health systems, prevalence of kidney disease, and demographic factors, such as age distribution or cultural differences, might be more relevant, although these were not explicitly tested in this report. Overall, the significant variation in the mean VSC across countries illustrates why eliciting WTP values in various countries is important.
5. Conclusion
This study provides the first WTP estimates for pre-ESRD chronic kidney disease available for policy analysis, both directly in the surveyed countries, and through benefit transfer for others. Because preferences for health risk reductions can be expected to vary by country in ways that cannot fully be controlled for in the pooled values, country-specific estimates are more applicable for policy analysis. The country-specific values, with the exception of China, reflect the preferences of fairly representative sets of respondents in those countries. Consistency with the principles of benefit–cost analysis requires that benefits be valued as those who are affected would value them, and the country-specific estimates are the best reflection of preferences in each country.
The values developed here provide the basis for a more complete and rigorous characterization of the benefits of reduced risk of kidney disease. Importantly, these estimates are not limited to the “worst case” of kidney failure but are for symptomatic chronic kidney disease with the chance that it may result in kidney failure and premature mortality. As such, they are more representative of chronic kidney disease broadly, and more likely to be applicable for benefit–cost analysis of risk-reducing policies in many contexts, including chemicals regulation and food safety.
Looking forward, this study, in conjunction with the other surveys conducted under the SWACHE project, is likely to be informative for designing and implementing future cross-country surveys. For example, the survey shows that substantive survey revisions, in addition to translation verification, are likely to be necessary for a successful survey implementation in some countries. It may be the case that the results for China in this survey are due to language issues contributing to completion time anomalies, and more work may be needed for testing and refining the instrument after translating to mitigate such issues. In addition, as WTP estimates are systematically developed for more health endpoints in multiple countries, benefit-transfer both across countries and across health endpoints can be more fully evaluated.
Acknowledgements
The findings and conclusions in this paper are those of the authors and should not be construed to represent any official determination of policy of their affiliated institutions, the OECD, or any funding countries. We would like to thank all of those involved in the SWACHE project and associated OECD working parties who offered valuable comments and insights as this research developed. We thank the National Kidney Foundation staff for reviewing the description of kidney disease, treatment, and health outcomes presented in the survey. This research was funded by contributions from the European Union, and the governments of Canada, France, the Netherlands, Norway, Poland, Sweden, Switzerland, the United Kingdom, and the United States.