1 Introduction
In this paper we present research that shows how different performance-based incentive structures affect performance on a variant of the secretary problem. In particular, we examined the effect of a constant (flat) fee for participation, a commission-based incentive (earnings are proportional to chosen prices), and a best only incentive that was only obtained if the participant made the very best (optimal) choice possible in that round. The key dependent variables were the length of the search and the number of optimal choices made.
Little attention has been paid to the structure of the performance-based incentive used in laboratory experiments. Yet using inappropriate incentive structures in experiments runs the risk of impairing performance (e.g., Reference Ariely, Gneezy, Loewenstein and MazarAriely, Gneezy, Loewenstein & Mazar, 2009, Reference Ariely, Bracha and MeierAriely, Bracha & Meier, 2009) or producing bad decisions (e.g., Reference Cole, Kanz and KlapperCole, Kanz & Klapper, 2015). The problem of not knowing the effect of incentive structure on the intended performance measures can be particularly acute in addressing research questions. Participants who do not perform well because of an inappropriate incentive structure may be mistakenly perceived as not performing well as a result of the conditions. Ultimately, understanding the effectiveness of different incentive structures is an important design issue.
This study also investigates the interactive effect of context and incentive structures. Framing the task with a meaningful context is commonly used in the laboratory to facilitate understanding of the instructions. Does context influence the effectiveness of different incentive structures?
1.1 The effectiveness of monetary incentive on performance
The presence of a monetary incentive could convey that the task is unattractive or undesirable (Reference Frey and Oberholzer-GeeFrey & Oberholzer-Gee, 1997) and change how people perceive the task (Reference Cole, Kanz and KlapperCole et al., 2015). On occasion, extrinsic motivation, for example, a monetary incentive, may crowd out the intrinsic motivation, for example, the sense of accomplishment, resulting in an incentive having a reduced or no effect on performance (e.g., Frey, 1997; Reference Gneezy, Meier and Rey-BielGneezy, Meier & Rey-Biel, 2011; Reference Gneezy and RustichiniGneezy & Rustichini, 2000a; Reference Gneezy and RustichiniGneezy & Rustichini, 2000b).
Monetary incentives are generally found to have a beneficial effect on performance with mundane tasks that are effort dominant, and which people lack intrinsic motivation to perform; for example, clerical tasks (Reference Riedel, Nebeker and CooperRiedel, Nebeker & Cooper, 1988), and item recognition and recall tasks (e.g., Reference Kahneman and PeavlerKahneman & Peavler, 1969; Reference Libby and LipeLibby & Lipe, 1992). As the participants are likely to derive no satisfaction or enjoyment from doing the task, the monetary incentive provides extrinsic motivation to perform. However, in this case, a monetary incentive is not always effective in motivating performance, because an increase in effort does not necessarily lead to improved performance (e.g., Fryer 2011).
Even in tasks that require basic cognitive skills, e.g., attention and memory, or tasks requiring some creative or motor skills, people sometimes produce worse performance with a higher payment than a lower one (Reference Ariely, Gneezy, Loewenstein and MazarAriely et al., 2009). Monetary incentives may elevate the level of arousal and lead people to choke under pressure. According to the Yerkes-Dodson law (Reference Yerkes and DodsonYerkes & Dodson, 1908), performance often improves with increased arousal, but excessive arousal can lead to a decrement in performance. People in the two performance-based incentives in the experiment described below might experience a higher level of arousal than the people with a flat fee incentive. This higher level of arousal could lead either to better or worse performance, although our expectation — given that the incentives were not huge — was that the effect was more likely to be positive than negative.
1.2 The secretary problem
Many real-life decision-making situations are sequential, and such decisions often need to be made immediately and cannot be revisited (for example, finding a partner, buying or selling houses). This type of sequential decision-making situation often displays the features of the secretary problem (see Ferguson, 1989, and Freeman, 1983, for historical reviews). The basic form of the secretary problem, often referred to as the classical secretary problem, has been specified in the following way. A known number of n candidates is presented randomly in a sequence. The decision-maker must either accept or reject the presented candidate immediately and the decision cannot be recalled. (That is, the decision maker cannot withdraw or revise any previously made decision.) A positive payoff is earned only if the decision-maker chooses the best overall candidate.
The optimal strategy of this classical version of the secretary problem (henceforth the classical strategy) allows the decision-maker to maximize the probability of finding the best candidate. The classical strategy states that the decision-maker should reject the first n/e (equal to approximately 0.37 as n approaches infinity) of the candidates and then accept the first candidate who is better than all of the previously rejected candidates, where n is the total number of candidates available in the candidate pool and e is Euler’s number, approximately equal to 2.718 (Reference Gilbert and MostellerGilbert & Mosteller, 1966, p. 40). This strategy yields an approximately 37% chance of finding the best candidate (depending on the size of n; see Lindley, 1961; Reference Gilbert and MostellerGilbert & Mosteller, 1966 for detailed proofs. Note that there is also a risk of losing the best candidate because the best occurs after the choice or the best occurs in the first 37% of the candidate pool.) People who are not familiar with the classical strategy face at least two major difficulties in solving the problem. First, the distribution of the candidate quality is unknown, and the decision maker does not know which candidate may potentially be the best or better candidate to accept. Second, an immediate decision must be made without recall. If the decision maker realizes the best candidate has been rejected, she cannot go back and accept the previously rejected candidate.
Researchers have explored many different features and variations of the secretary problem. A number of assumptions have been relaxed and their implications investigated both theoretically (e.g., Reference Gilbert and MostellerGilbert & Mosteller, 1966; Reference LindleyLindley, 1961; Reference MorigutiMoriguti, 1993; Reference TamakiTamaki, 1979; Reference YeoYeo, 1998) and experimentally (e.g., Reference Bearden, Murphy and RapoportBearden, Murphy & Rapoport, 2005; Reference Bearden, Rapoport and MurphyBearden, Rapoport & Murphy, 2006; Reference Seale and RapoportSeale & Rapoport, 1997, 2000; Reference Zwick, Rapoport, Lo and MuthukrishnanZwick, Rapoport, Lo & Muthukrishnan, 2003).
A common empirical result for the secretary problem is that people tend to stop their search too soon (e.g., Reference Seale and RapoportSeale & Rapoport, 1997, 2000). Experimental researchers have also found that people do not generally perform better when a monetary reward is offered (e.g., Hey, 1987; Reference Campbell and LeeCampbell & Lee, 2006).Footnote 1 Evidence from a computer simulation indicated that people’s experimental search behavior coincided better with the classical strategy when a 1% search cost was assumed (Reference Seale and RapoportSeale & Rapoport, 1997). Hsiao (2018, Chapter Two) examined the effect of time search on search behavior in the secretary problem and indeed found that people shortened their search with a higher time cost. Other processing costs such as cognitive load may also influence the search behavior and lead to a shorter search. The incentive structure can potentially help offset these processing costs and motivate performance.
Different secretary problem situations in life often have different incentive structures. Sports tournaments often feature a winner takes all structure. Real estate agents or car salesman often received payoffs proportional to the sale price; these are commission-based. An interesting question is whether performance changes when the same incentive structure is applied to different contexts. Contextual instructions are known to have an effect in changing behavior in experimental research that helps understanding of the task and reduces confusion among participants, especially in tasks that required reasoning and cognitive abilities to perform (see Reference Alekseev, Charness and GneezyAlekseev, Charness & Gneezy, 2017 for a review of when and why contextual instructions matter). Dual-processing theory that posits system one (intuitive/heuristic) and system two (analytic/executive) processes has been proposed to explain why different decisions result from the context (e.g., Kahneman, 2003; Reference EvansEvans, 2008). Context effects can be caused by different decision-making schemas belonging to system one. Hence we varied the context of the experiment. Half the participants performed with no context – they were simply presented numbers – and the other half were asked to consider they were selling a house.
1.3 Decision strategies
People unfamiliar with the secretary problem might employ at least three different strategies in the experiment, besides a variant of the classical strategy. They might choose randomly. In our experimental setting, this would yield only a 5% chance of finding the optimal (there are 20 offers in each round), and 95% of the time the participant would not receive any payoff when in the best only incentive. This would not be an effective strategy for people for either commission base or best only incentives, where the payoffs are based on performance. Secondly, they might choose any price above their own reservation price. This strategy is most likely to be effective when the distribution of the price offers is available to the participants, unlike in our study. The chance of finding the best candidate is enhanced drastically to approximately 58% as n approaches infinity (Reference Gilbert and MostellerGilbert & Mosteller, 1966), when information about the distribution of the quality of candidate becomes available. Thirdly, they might attempt to refine a strategy by trial and error.
We conducted a computer simulation of the effect of stopping at different points in a 20-round sequence. Details of the simulation are shown in Appendix A. However, the key result of the simulation was to show that the chance of finding the optimal offer is actually quite sensitive to how long the search continues and when it is terminated, but the average price obtained from accepting an offer is only slightly affected by stopping position. Thus the simulation indicated that the best only incentive was likely to have more effect on stopping position compared to a commission-based incentive than any measure based on the amount the participants receive. Hence the results that follow focus mostly on the stopping position and number of optimal choices made rather than on average earnings obtained.
1.4 Research questions
Based on the preceding sections, we formed the following research questions:
Research question 1:
Will people search longer and find more optimal offers when they receive a monetary reward only when the optimal offer is found (best only) than when they receive a monetary reward with every offer (commission base)?
Research question 2:
Will people search longer and find more optimal offers when their payoff is proportional to the price obtained (commission base) than when they receive a flat fee payment?
Economic experiments demonstrate that context affects incentivized behavior. Alekseev et al. (2017) surveyed the literature and concluded that context often, but not invariably, improves performance. Improvement is more likely if the task requires sophisticated reasoning. They did not, however, include secretary problem studies in their survey, but the problem seems to qualify as one requiring sophisticated reasoning. The effect of context when the earnings are proportional to the chosen prices (commission base) is explored in Hsiao, Kemp and Servátka (2020). This experiment found that framing the secretary problem as selling houses leads to better decisions than a context-free frame.
Research question 3:
Will people search longer and find more optimal offers with a house selling context than without?
2 Method
2.1 Participants
A total of 178 undergraduate students from the University of Canterbury participated individually in the experiment. All participants received two 100-level course credits for participating in the experiment (a kind of show-up payment). In addition, they could receive different cash incentives. The average actual pay for each condition is summarized in Appendix B. There was an age range from 18 to 40 years old, with the median in the range 18–21 years old. The actual time spent on the sequential search task is reported in Appendix C.
2.2 Experimental design and procedures
The experiment was a 2 × 3 between-subjects design, with two contexts, house selling and no context, and three monetary incentive structures, commission base, best only, and flat fee. Participants performed their entire experiment in one condition combination only. The number of participants that participated in each condition is presented in Appendix B. In the house selling context conditions, the task was framed as selling 10 houses; one house per each round. A description of a house, consisting of the floor area, the number of bedrooms, suburb and the year the house was built in, was presented at the beginning of the round, prior to any price offer.
In each round, a participant could review up to 20 price offers for the given house. The price offers were presented one at a time. Once a price offer was presented, the participant decided whether to reject the offer or whether to accept it. Once the decision was made, there was no recall. If the participant had not accepted an offer prior to the final (20th) offer, the participant was forced to accept the final offer regardless of its value. Similar procedures were employed in the no context conditions, except that the participants were asked to accept “numbers” and there was no mention of houses, only “rounds” instead.
The actual price offer sequences used in the experiment are presented in Appendix D. Appendix E shows the expected stopping positions if the classical strategy was used. To allow comparisons between participants, the same 10 random sequences generated prior to the experiment were employed for each participant in each session. (For a similar use of cardinal values in the secretary problem, see Reference Teodorescu, Sang and ToddTeodorescu, Sang & Todd, 2018.) All conditions had two practice rounds prior to the 10 paid rounds.
The participants in the commission base incentive with house selling context received the following instructions about their payoffs.
The payoffs will be denoted in experimental currency units (ECUs). 735 ECUs = 1 NZD.
Your ECUs will be converted into NZD at this rate, and you will be paid in NZD when you leave the lab. The more ECUs you earn, the more NZD you earn.
Your payoffs are determined as follows:
Total ECUs you earn = Accepted price offer for House 1 + Accepted price offer for House 2 + ….+ Accepted price offer for House 10.
The participants in the best only incentive and house selling context received the following instructions about their payoffs.
Your will earn NZD 4.60 if you have choose the highest price offer for each house.
Your payoffs are determined as follows:
Total NZD you earn = number of houses that you have selected the highest price offer * NZD 4.60
The participants in the flat fee and house selling context simply received $9.50 for their participation. They were instructed to find the highest price they could in the flat fee condition.
Similar instructions for all three incentive structures were used in the no context conditions, except that the instructions used “number” and “round” instead of “price offer” and “house”.
The conversion rate in the commission base incentive and the cash payoff for selecting an optimal offer in the best only incentive were based on the results of previous findings (Hsiao, 2018, Chapter Two). So all conditions were intended to yield roughly the same payoff regardless of the incentive structures, which was set to be NZD 10. In Hsiao (2018, Chapter Two), participants in the condition with no search cost had an average chosen price of 7339 ECUs and they found 2.17 optimal offers. Using these results as a start point, suggested cash payoffs of NZD 4.60 for finding an optimal offer ($10 ÷ 2.17) in the best only condition and a conversion rate of 735 ECU to 1 NZD (7339 ECUs ÷ NZD 10) in the commission base condition. These were intended to yield average payoff of NZD 10.
Upon arriving at the lab, the participants were randomly assigned a cubicle to read the instructions at their own pace. Any questions were answered in private. The participants were also requested to complete a short research exercise, where they were asked what they thought the purpose of the study was and how they thought the results might be applied. They also completed a regret questionnaire (Reference Schwartz, Ward, Monterosso, Lyubomirsky, White and LehmanSchwartz et al., 2002). The results from these exercises turned out not to be very informative or relevant to the rest of the study and are not reported below. The participants were paid individually in private when all the tasks were completed.
3 Results
Three dependent variables are examined in this section. First, and most important, the position in the sequence where the participant accepted the offers (henceforth stopping position) was evaluated for each round. Second, we calculated the number of rounds in which the optimal offer was selected (henceforth optimal offer count). Third, we conducted a limited amount of analysis with the total sum of the 10 chosen prices in ECUs (henceforth total chosen price). (As remarked earlier, the simulation indicated that this is not a sensitive measure of performance.)
First, we test the research questions for the performance measures of overall participant decision-making obtained from the experiment. Second, we examine to what extent the effect of incentive structure on stopping position generalizes over rounds. Third, we report a path analysis that examined the relationship between the incentive structures and performance measures.
3.1 Performance
3.1.1 Stopping position
As shown in Table 1, Panel A the participants in the best only incentive searched the longest with both the house selling context and no context. Analysis of variance showed a significant effect of incentive structure (F(2, 172) = 4.55, MSerror = 8.08, p = 0.01, partial η2 = 0.50), and Tukey HSD post hoc tests showed a significant (p < 0.05) difference between the commission base and best only (p = 0.04, Research question 1), as well as between the best only and flat fee (p = 0.02). But there was no significant difference between the commission base and flat fee (p = 0.98, Research question 2). Participants chose to stop at a significantly later position in the sequence (M = 9.3) under the no context than the house selling (M = 8.4) context (F(1, 172) = 4.08, p = 0.05, partial η2 = 0.02). There was no statistically significant (p = 0.52) interactive effect of context and incentive structure.
* Sequence optimal refers to the actual optimal offer from the sequences used in the experiment.
** This is the result predicted by the classical strategy and prescribes an information set of 7 offers, see Appendix E for more details on the prediction. This classical optimal is useful as a benchmark for the best only incentive.
3.1.2 Optimal offer count
Participants assigned the best only incentive on average selected the highest number of optimal offers (M = 2.7) compared to the commission base (M = 2.0) and flat fee incentives (M = 1.6); (F(2, 172) = 12.45, MSerror = 1.51, p < 0.001, partial η2 = 0.13). Tukey HSD Post Hoc tests confirmed significant differences between the best only and commission base (p = 0.002, Research question 1), and the best only and flat fee (p < 0.001). There was no significant difference between the commission base and flat fee (p = 0.31, Research question 2). There were no other statistically significant (p < 0.05) main or interactive effects. See Table 1, Panel B for more descriptive results in each condition.
Taken over both contexts, 30.6% of the participants in the best only incentive performed better than or as well as the classical theorem in finding the optimal offer (by finding 4 or more optimal offers), and 10.2% of them outperformed the classical theorem (by finding 5 or more optimal offers); 5.2% of the commission-based structure participants performed as predicted by the classical theorem and 1.9% performed better. Only 3.2% of the flat fee participants performed as well as the classical theorem and 0% outperformed.
3.1.3 Total chosen price
The house-selling context returned a higher total chosen price (M = 7160.1 ECUs) than no context (M = 6986.1 ECUs; F(1, 172) = 4.23, MSerror = 3.18 × 105, p = 0.01, partial η2 = 4.23, Research question 3). However, there was no significant (p < 0.05) effect of the incentive structure; nor was there a significant interactive effect.
3.2 Differences between rounds
To examine whether the longer search found in the best only incentive was general across rounds, Kendall’s tau correlation analysis was used to examine the relationship between the stopping position and the incentives in each round. Table 2, Panel A summarizes the results comparing the best only with the other two incentives. Table 2, Panel B compares the commission base and flat fee incentives, and excludes the best only.
Note, first, that the commission base and flat fee comparison shows little difference across the rounds. Overall the commission base incentive led to longer search in four rounds (two of them statistically significant); the flat fee incentive longer search in six (one statistically significant).
However, the best only incentive structure led to longer search in nine rounds (not round 5), and the effect was statistically significant in seven of them. A simple binomial test assesses the chances of obtaining nine or more rounds in the same direction as p = 0.011. Thus the significant effect of the best only incentive structure on stopping position found in the previous section does not appear to have arisen from unusual sequences in a few rounds.
3.3 Path analysis
Stopping position could be seen as a mediating variable, as well as a dependent variable. To investigate the role of stopping position in determining the chosen price and optimal offer count while also considering the effect of incentive structure and context, we performed a path analysis.
The path analysis results are summarized in Figure 1. Choosing later stopping positions led to both higher chosen prices and a higher optimal offer count. There is an indirect (that is, mediated by stopping position) effect of the best only incentive on both chosen price and optimal offer count. The mediation is partial for the effect of the best only incentive on optimal offer count. Context has opposing direct and indirect effects. The direct effect shows that, if you disregard stopping position, the house selling context produces higher chosen prices and optimal offer counts. However, no context produces a later stopping position and an opposing mediating effect.
4 Discussion
This study investigated the effect of different incentive structures on a common sequential search task which requires some cognitive ability to perform well. The main results of the experiment were that the best only incentive lengthened the number of searches and produced an increased number of optimal results. The commission base incentive did not differ from a flat fee in its effect on these variables. As could be predicted from a simulation, the incentive structure had no marked effect on the total chosen price over the different rounds. However, total chosen prices were on average a little higher when a house selling context was introduced.
Overall, the participants in this study do not appear to adopt a variant of the classical strategy or a variant of the reservation price strategy consistently. The main behavioral difference found between the best only and commission base/flat fee incentives was the willingness to conduct longer searches. Perhaps the effect of the incentive structures might be understood in terms of satisficing theory: Simon (1956) suggested a bounded rationality approach to sequential search decisions. Instead of optimizing the expected outcome as suggested by traditional economic theory, people set an aspirational level and are satisfied once the aspirational level is achieved. Such an approach allows decision-makers to effectively achieve a variety of needs in situations where the optimal strategy is unknown to the decision-maker; that is, there is no need for a utility function to be postulated. For example, when a rat forages for food, it learns to choose time-conserving paths that lead to sufficient food for survival rather than paths which might obtain the maximum amount of food but potentially risk survival in the process. In this study, participants in the flat fee and commission base conditions may be satisficing. They were trying to get a “good enough” result (which does not take too much time and effort) instead of investing time and cognitive effort in figuring out the optimal, which they may or may not obtain. The participants in the best only condition, however, were encouraged to optimize, by being more careful with their searches and conducting (relatively) longer searches, even at the risk of over search and missing the optimal offers.
This study found no interactive effects of the incentive structures with the context. People with a house selling context conducted less search and had similar number of optimal offers as the people in the no context. However, path analysis found that the house selling context had a significant direct and positive effect on the chosen price. This potentially confirms a schema hypothesis (e.g., Hsiao et al., 2020), according to which we might form a fuller mental model of how to make sequential search decisions from experiencing various sequential decisions in daily life, such as finding a parking space or a partner. Such a schema could be activated when a similar situation arises, for example, selling houses, and result in applying an effective strategy to making decisions even without having any previous experience in buying or selling a house. Hsiao et al.(2020) provides evidence that schema may be activated solely with a house selling frame and that additional descriptive information (e.g., the suburb the house is located, the year it was built, etc.) is not necessary. Incentive structures may then be excess information that does not take part in activating the decision-making schema; similar to the house descriptive information, hence no interactive effects of the incentive structures with the context.
There are limitations in this experiment. First, the flat fee incentive received NZD 9.50 instead of NZD 10. The original design aimed to avoid participants in the flat fee incentive receiving a higher average payoff than the other two incentives, and the average payoff for the other two incentives was set to be approximately NZD10. However, for some reason, the predictions were slightly astray. Second, the payoffs of the best only incentive were presented in NZD directly, while in the commission base, they were presented in ECUs first then converted to NZD. However, Drichoutis, Lusk and Nayga (2015) reported that the use of experimental currency unit conversion affected behavior only when a 1 to 1 conversion to a monetary term is imposed. In practice, it is not easy to see how this difficulty could have been avoided. Either the house selling context would have employed unrealistically tiny commission sums or the numbers would have had to be different between the two contexts. Finally, the underlying mechanisms of how the best only condition produces more optimal offers, or why similar performance was found in the commission base and flat fee conditions remain unclear, thus diluting the conclusions we can draw from this study. We view these as promising future areas of research.
To conclude, the experiment showed, firstly, that having a commission-based proportional incentive did not produce a better performance than simply having a flat payment for any of the different dependent variables considered. However, paying only for the best did lead to longer searches and to more frequently obtaining the very best price. It is plausible that the effect of the best only incentive arose because the structure itself led the participants to optimize rather than satisfice, and thus led them to search more even when it is risky, and hence allow them to obtain the best price more often. A more general implication is that not all performance-based incentive structures motivate performance.
Appendix A: Simulation
To derive a prediction for the impact of incentive structures on search behavior and performance outcomes, we conducted a simulation that allows for evaluating the performance of different variants of the classical strategy. Each simulation compares the performance measures resulting from 20 different decision strategies (as there was a maximum of 20 offers; each decision strategy prescribes how many offers to reject, followed by accepting the next highest offer), which contain all possible stopping positions (i.e., a decision strategy stops the search by accepting the nth offer in an iteration; where 1 ≤ n ≤ 20). Each simulation iteration generated a set of 20 random offers using the mean and standard deviation for each round from the values used in the experiment. The simulation ran separately for each round (10 rounds in total) and with 1.2 million iterations for each round.
To compare the performance of decision strategies across incentive structures, we compared the performance of all 20 possible decision strategies using both the chosen price (in Experimental Currency Units - ECUs) they yield in total, and the frequency of which the decision strategy found the optimal (highest) offer (in %). The chosen price statistic indicates which decision strategy yields the highest payoff. The optimal offer count statistic shows which decision strategy finds the greatest number of optimal offers.
The results are shown in Figures A1 and A2 (next page). Perhaps the key feature to emerge from the simulation is that the optimal offer count is much more sensitive to the actual stopping position than is the chosen price, and hence the value of any commission received on the basis of total price.
Appendix C
Table C1 shows the actual time taken to complete the search task (in minutes). The result shows that participants in the best only incentive averaged across both house selling context and no context, spent a longer time in making decisions (M = 6.2) than the commission base incentive (M = 5.5) and flat fee incentive (M = 5.2).
A 2 x 3 factorial (ANOVA) found a significant main effect of the incentive structures on the average time taken (F(2, 172) = 3.70, MSerror = 4.20, p =0.03, partial η2 = 0.04). Tukey HSD post hoc tests showed a significant difference between the best only and flat fee incentives (p = 0.02), but no significant difference between the commission base and best only (p = 0.13), or the commission base and flat fee (p = 0.73). Analysis of variance results also showed that participants spent significantly less time (M = 4.7) under the no context than the house selling (M = 6.5) context (F(1, 172) = 35.05, p < 0.001, partial η2 = 0.17). There was no statistically significant (p = 0.17) interactive effect of context and incentive structure.