1. Introduction
Incomplete information is a key ingredient in many hiring processes, where full knowledge about the true capacity of a potential employee is rarely available to the employer at the hiring time. Instead, if the candidate is hired, such information is gradually revealed to the employer over time. On the other hand, the potential employee would typically possess more accurate information, and would use this additional information when providing their salary claim. Naturally, a high salary is costly for the employer, and thus increases the risk for the employee of being fired. Therefore, there is a trade-off in the choice between a high salary claim to increase personal income and a small salary claim to decrease the risk of being fired.
To model one possible instance of the strategic interaction between an employer and a potential employee, we set up and study a stochastic game with asymmetric information between two players. The game is informally described as follows.
-
(i) The capacity $\mu$ of the employee (Player 1) is a random variable with a known two-point distribution.
-
(ii) At time $t=0$ , Player 1 learns about the realization of the random variable $\mu$ , and presents to the employer (Player 2) a non-negotiable salary claim C; the salary can only take two values.
-
(iii) At time 0, Player 2 observes the salary claim, and subsequently also noisy observations of $\mu$ , based upon which a choice is made of a stopping time $\tau$ to terminate the employment; here, $\tau=0$ corresponds to a case in which the salary claim is not accepted (no hiring), $0<\tau<\infty$ corresponds to an accepted salary claim, but with firing in finite time, and $\tau=\infty$ to an accepted salary claim, with no firing taking place.
-
(iv) Up to the termination time $\tau$ , Player 1 receives compensation at the chosen rate C per unit of time. Player 2, on the other hand, earns a net payment stream consisting of increments of a stochastic process $\mu t+ \sigma W_t-Ct$ , where W is a Brownian motion which accounts for random fluctuations about the mean rate $\mu-C$ .
This is a signaling game, with two possible types of Player 1, corresponding to the two possible values of $\mu$ , and where Player 1 sends a signal by choosing the salary level C. As such, there is an incomplete and asymmetric information structure since the players have different knowledge about $\mu$ .
Variants of such signaling games with asymmetric information have a long history in the literature on hiring of staff and salary formation. A classical reference is [Reference Spence23], where an example with a job seeker that can have two different types is studied. The job seeker knows their own type and chooses an education level, where the cost of education depends on the type, thereby conveying information to the employer. In the setup of [Reference Spence23], all information is conveyed at the initial time, and the signal that consists of the chosen education level does not influence the actual ability of the employee. Extensions are studied in [Reference Alós-Ferrer and Prat1, Reference Daley and Green6] with a type-dependent continuation value for the job seeker, thus allowing for a future change in the salary level. In particular, the setup includes cases of employer learning based on an additional report (such as an interview phase) or based on the employee’s on-the-job performance. For related studies allowing for uncertainty about an individual’s own type, see [Reference Weiss24], and for more possible types of job-seeker and competition between employers, see [Reference Dato, Grunewald, Kräkel and Müller7]. Also, a signaling game outside the job market is explored in [Reference Daley and Green5], where an owner of a company and several potential buyers are considered. The seller holds private information of the company type, and buyers learn gradually from noisy observations of the unknown type and from the actions (lack of actions) of the seller. The hiring problem that we consider is also related to so-called principal-agent problems (see, e.g., [Reference Cvitanić, Possamaï and Touzi4, Reference Holmström and Milgrom15, Reference Sannikov21]), in which a principal seeks to set up a compensation scheme to control the effort of the agent.
The methods we use to derive an equilibrium in the signaling game rely on a combination of stochastic filtering and stochastic control theory. For stochastic filtering of the drift of a diffusion process we refer to [Reference Liptser and Shiryaev19], and for a classical application to sequential hypothesis testing of the drift of Brownian motion, where filtering and optimal stopping theory is combined, see [22, Chapter 4]. For other studies of combined filtering and control, see, e.g., [Reference De Angelis8, Reference Ekström and Vaicenavicius10, Reference Lakner17] for single-agent problems, and [Reference Cardaliaguet and Rainer3, Reference Ekström, Lindensjö and Olofsson9, Reference Grün13] for strategic setups.
In contrast to the setup of [Reference Spence23] and many subsequent papers, the current study does not use education level as a signal. Instead, we equip the employee with the right to provide a salary claim, which is then paid out continuously until a possible firing time. This allows us to study the trade-off between a high personal income and the risk of becoming a burden for the company. In line with the literature on signaling games as above (see also [Reference Fudenberg and Tirole12, Reference Harsanyi14, Reference Osborne and Rubinstein20]) we use the concept of perfect Bayesian equilibrium (PBE) as a solution concept. We show the existence of a semi-separating PBE, in which the strong type always chooses the high salary, whereas the weak type randomizes between the low and the high salary.
In Section 2, a precise formulation of our hiring-and-firing game is presented, and Section 3 recalls some standard results from filtering theory. In Section 4 we argue heuristically to derive a candidate equilibrium of strategies; the candidate equilibrium is then verified to in fact be a PBE. While our Bayesian game setup is rather simplistic, with only two possible types of employee and two possible salary claims, it may serve as a benchmark for more involved problems. This is illustrated in Section 5, where a few such extensions are briefly discussed. For these extensions, the solution of the benchmark case presented in Sections 2–4 is utilized.
2. Setup
To describe the game in further detail, let W be a standard Brownian motion, and let $\mu$ be a modified Bernoulli-distributed random variable independent of W with $\mathbb{P}(\mu=\mu_1)=p=1-\mathbb{P}(\mu=\mu_0)$ , where $\mu_0$ , $\mu_1$ , and p are known constants with $\mu_0<\mu_1$ and $p\in(0,1)$ . We assume that the employee (Player 1) generates a payment stream to the employer (Player 2) modeled as the increments of a process $X_t = \mu t + \sigma W_t$ , where $\sigma$ is a positive constant. The random variable $\mu$ will be referred to as the capacity of the employee.
Player 1 knows their own type, i.e. the realization of their capacity $\mu$ , and gives at the initial time $t=0$ a salary claim C in the set $\{c_0,c_1\}$ , where $0<c_0<c_1$ . More precisely, allowing for randomized strategies, a strategy of Player 1 consists of a pair $a=(a_0,a_1)\in\mathcal P$ , where $\mathcal P=[0,1]^2$ is the unit square. Here, $a_i$ represents the conditional probability of choosing $C=c_1$ given that Player 1 is of type i, $i=0,1$ .
To describe the possible strategies of Player 2, denote by $\mathbb F^{X}=(\mathcal F^{X}_t)_{t\geq 0}$ the augmentation of the filtration generated by the process X, and by $\mathbb F=(\mathcal F_t^{X,C})_{t\geq 0}$ the augmentation of the filtration generated by the process X and the random variable C. Also, let $\mathcal T^X$ be the collection of $\mathbb F^{X}$ -stopping times, and $\mathcal T$ be the collection of $\mathbb F$ -stopping times. Clearly, since C only takes two possible values, any stopping time $\tau\in\mathcal T$ can be decomposed as
where $(\tau_0,\tau_1)\in\mathcal T^X\times\mathcal T^X$ . Conversely, defining $\tau$ in this way for a given pair $(\tau_0,\tau_1)\in\mathcal T^X\times\mathcal T^X$ yields that $\tau\in\mathcal T$ . Thus we may identify $\mathcal T$ with $\mathcal T^X\times\mathcal T^X$ , and we therefore write $\tau=(\tau_0,\tau_1)$ .
In addition to a pair $(a,\tau)\in\mathcal P\times \mathcal T$ of strategies, the definition of a perfect Bayesian equilibrium (given below) also requires the specification of a belief system $\Pi_0=\big(\Pi_0^0,\Pi_0^1\big)\in\mathcal P$ . Here, $\Pi_0^i$ represents the probability that Player 2 assigns to the event $\{\mu=\mu_1\}$ conditional on the signal $C=c_i$ , $i=0,1$ .
The payoff structure of the game is now described as follows. Up to the stopping time $\tau$ , Player 1 receives compensation for their work at rate C per unit of time. Player 2, on the other hand, receives increments of the net payment stream
per unit of time. Both players seek to maximize their expected discounted future payoff. More precisely, for a given triple $(a,\tau,\Pi_0)\in \mathcal P\times\mathcal T\times \mathcal P$ with $a=(a_0,a_1)$ , $\tau=(\tau_0,\tau_1)$ , and $\Pi_0=\big(\Pi_0^0,\Pi_0^1\big)$ , and for a given discount rate $r>0$ , define
Then $J_1^i$ represents the expected payoff for Player 1 given the capacity $\mu=\mu_i$ , $i=0,1$ . Similarly, define
so that $J_2^i$ is the expected payoff for Player 2 given that $C=c_i$ , $i=0,1$ . Here, the subindex in the expected value indicates that the expected value is calculated using the belief system $\Pi_0$ as the initial probability of the type $\mu=\mu_1$ .
Remark 1. Note that Player 1 is equipped with randomized strategies, whereas Player 2 is not. The intuitive reason for this is as follows. Player 1 acts at time 0 (revealing the realization of C), and once this is done, the game collapses to a single-player optimization problem of choosing a non-anticipative stopping strategy for Player 2. In such a Markovian optimal stopping problem (cf. Section 4.1), however, the optimal value is attained for a hitting time (a pure strategy), and there are typically no other strategies (e.g. mixed strategies) that are optimal.
Remark 2. While a key ingredient in our setup is asymmetric information about the capacity $\mu$ , we point out that the setup itself, including the numerical values of all parameters p, $\mu_0$ , $\mu_1$ , $\sigma$ , r, $c_0$ , and $c_1$ , is common knowledge to both players.
We now introduce the solution concept that we will use.
Definition 1. (Perfect Bayesian equilibrium.) We call a triplet $(a^*,\tau^*,\Pi_0)\in\mathcal P\times\mathcal T \times \mathcal P$ a perfect Bayesian equilibrium (PBE) if the following conditions are satisfied.
-
(A) Rationality: $J^i_1(a^*,\tau^*)\geq J^i_1(a,\tau^*)$ for $i=0,1$ , and $J^i_2(\tau^*,\Pi_0)\geq J^i_2(\tau,\Pi_0)$ , $i=0,1$ , for all pairs $(a,\tau)\in\mathcal P\times\mathcal T$ .
-
(B) Bayesian updating: If $\min \big\{a^*_0,a^*_1\big\}<1$ , then
\begin{align*}\Pi_0^0=\frac{p(1-a^*_1)}{p(1-a^*_1) + (1-p)(1-a^*_0)},\end{align*}and if $\max\big\{a^*_0,a^*_1\big\}>0$ , then\begin{align*}\Pi_0^1=\frac{pa^*_1}{pa^*_1+ (1-p)a^*_0}.\end{align*}
Remark 3. In Definition 1, condition (A) requires the strategy triplet to form an equilibrium in the usual sense that neither of the players wants to unilaterally deviate from it. Condition (B) requires Player 2 to update their belief system using Bayes’ rule on events with positive probability. If $a_0^*=a_1^*=1$ then Player 1 always chooses $C=c_1$ , and if $a_0^*=a_1^*=0$ , then Player 1 always chooses $C=c_0$ , and in these cases the belief system can be chosen with no restriction.
Remark 4. If $a_0^*=a_1^*\in\{0,1\}$ , then the PBE is of pooling type; if $a_0^*=1-a_1^*\in\{0,1\}$ , then the PBE is separating; finally, if $a^*_i\in\{0,1\}$ and $a_{1-i}^*\notin\{0,1\}$ , then the PBE is semi-separating.
3. Filtering
From the perspective of Player 2, the problem is a two-source learning problem: at time $t=0$ , the salary claim C is observed and the prior distribution of $\mu$ is updated in accordance with the specified belief system; at subsequent times $t>0$ , the posterior distribution is updated using observations of X.
Given $\pi\in[0,1]$ , define the process $\tilde\Pi\,:\!=\,\tilde\Pi^{\pi}$ by $\tilde \Pi_t\,:\!=\,\mathbb{P}_\pi(\mu=\mu_1\mid \mathcal{F}_t^{X})$ , where the index $\pi$ indicates that the conditional probability is calculated using an initial estimate $\pi$ for the event $\{\mu=\mu_1\}$ . Thus, $\tilde\Pi$ is the probability that $\mu=\mu_1$ conditioned merely on observations of X, and calculated with an initial belief $\pi$ . It is well known from filtering theory (see [Reference Liptser and Shiryaev19]) that the conditional probability $\tilde\Pi$ satisfies ${\mathrm{d}}\tilde\Pi_t=\omega\tilde\Pi_t(1-\tilde\Pi_t)\,{\mathrm{d}}\hat W_t$ , where $\omega\,:\!=\,(\mu_1-\mu_0)/\sigma$ is the signal-to-noise ratio and the innovations process
is an $\mathcal{F}^X$ -Brownian motion. In particular, the process $\tilde \Pi$ is strong Markov.
Now, given a belief system $\Pi_0=\big(\Pi_0^0,\Pi_0^1\big)\in\mathcal P$ , we define the conditional probability process
If the Bayesian updating property (B) holds, then $\Pi_t$ coincides with $\mathbb{P}(\mu=\mu_1\mid \mathcal{F}^{X,C}_t)$ on the event $\{C=c_i\}$ provided that $\mathbb{P}(C=c_i)>0$ .
Lemma 1. Let $(\tau,\Pi_0)\in\mathcal T\times\mathcal P$ . Then, for $i=0,1$ ,
Proof. By conditioning,
where $\Pi_t\,:\!=\,\mathbb{P}_{\Pi_0^i}(\mu=\mu_1\mid\mathcal{F}^X_t)$ . Moreover, by an application of Itô’s formula and optional sampling,
Consequently,
4. A semi-separating PBE
Note that if $c_0\geq \mu_1$ , then the net drift $\mu-C$ in (1) is non-positive, and Player 2 should choose immediate firing ( $\tau=0$ ). Similarly, if $\mu_0\geq c_1$ , then $\tau=\infty$ would always be optimal. Thus, to rule out degenerate cases, a minimal assumption is that $\mu_0< c_1<\mu_1$ . Moreover, we will make the additional assumption that $c_0\leq\mu_0$ so that the net drift $\mu-C$ in (1) is non-negative on the event $\{C=c_0\}$ . We thus impose the parameter ordering
It is straightforward to see that there is no PBE of separating type. Indeed, in a separating equilibrium with $a^*=(0,1)$ , the strong type would never be fired, and therefore the weak type would have an incentive to deviate and choose $c_1$ ; similarly, if $a^*=(1,0)$ , then the weak type would be fired immediately, and thus again have an incentive to deviate. The aim of the current section is to derive a perfect Bayesian equilibrium of semi-separating type under the assumption (3). In Sections 4.1 and 4.2 we use intuitive arguments to derive a candidate equilibrium, which is then verified in Section 4.3.
4.1. The employer’s perspective
Under the assumption (3), the lower salary level $c_0$ is smaller than the capacity $\mu$ with probability 1. Thus, if (3) holds, it is clear that if Player 1 chooses $C=c_0$ , then an optimal response for Player 2 should be to choose $\tau_0=\infty$ .
On the other hand, on the event $\{C=c_1\}$ , Player 2 would stop if there is sufficient evidence that $\mu=\mu_0$ . More precisely, we expect a boundary level b such that
is an optimal response for Player 2. To determine b, standard optimal stopping theory based on the dynamic programming principle (see, e.g., [Reference Shiryaev22]) suggests that the pair (V, b), where
solves the free-boundary problem
where
Here, the two boundary conditions at b constitute the so-called condition of smooth fit, and the boundary condition at $\pi=1$ corresponds to receiving (discounted) payments at rate $\mu_1-c_1$ until time $\tau=\infty$ . Note that the ordinary differential equation (ODE) in (5) is of second order, so there are two degrees of freedom; additionally, the boundary b is unknown. On the other hand, there are three boundary conditions, so we would expect that (5) is well-posed.
To solve the free-boundary problem (5), we readily verify that the general solution of the ODE is given by
where $\gamma_1<0$ and $\gamma_2>1$ are the solutions of the quadratic equation
and $A_1$ and $A_2$ are arbitrary constants. Imposing the boundary condition at $\pi=1$ , we must have $A_2=0$ , and so the two remaining boundary conditions yield
Eliminating $A_1$ , we find that
Thus, the candidate optimal response for Player 2 when Player 1 chooses the higher salary $C=c_1$ is to stop when the conditional probability process $\Pi$ falls below the constant boundary b in (7). Moreover, the candidate value for the employer is then
For a graphical illustration of the function V and the threshold b, see Figure 1.
4.2. The employee’s perspective
We now take the perspective of Player 1. We will construct an equilibrium in which Player 1 always chooses $C=c_1$ on the event $\{\mu=\mu_1\}$ , and on the event $\{\mu=\mu_0\}$ uses a strategy such that $\mathbb{P}(C=c_1\mid\mu=\mu_0)=a_0=1-\mathbb{P}(C=c_0\mid\mu=\mu_0)$ for some $a_0\in[0,1]$ to be determined. Thus, in the notation of Section 2, we consider the strategy $a=(a_0,1)\in\mathcal P$ .
As noted above, on the event $\{C=c_0\}$ , Player 2 would use $\tau_0=\infty$ . By the indifference principle in game theory (see, e.g., [Reference Ferguson11] or [Reference Lindahl18]), to have an equilibrium with a strategy pair $(a^*,\tau^*)$ in which Player 1 uses a mixed strategy $a^*=(a_0,1)$ with $a_0\in(0,1)$ and Player 2 uses $\tau^*=(\infty,\tau_1)$ with $\tau_1$ as in (4), we need the expected payoffs $J^0_1((0,1),\tau^*)$ and $J^0_1((1,1),\tau^*)$ to coincide. Clearly, choosing $C=c_0$ gives the expected payoff $J^0_1((0,1),(\infty,\tau_1))=c_0/r$ for Player 1.
To determine the expected payoff $J^0_1((1,1),\tau^*)$ for Player 1 on the event $\{C=c_1\}$ , note that on the event $\{C=c_1\}$ , Player 2 would first re-evaluate the probability that Player 1 has the larger capacity $\mu=\mu_1$ according to the specified belief system $\Pi_0$ . Moreover, by the Bayesian updating requirement of the belief system, we have
Thus, $\Pi_t$ makes an initial jump from $\Pi_{0-}=p$ up to
and then it diffuses with dynamics ${\mathrm{d}}\Pi_t=\omega\Pi_t(1-\Pi_t)\,{\mathrm{d}}\hat W_t$ . From the perspective of Player 1, however, $\hat W$ is not a Brownian motion since Player 1 knows the true value of $\mu$ . Instead,
Consequently, if Player 1 chooses $C=c_1$ , then their value is $U(\Pi^1_0)$ , where
solves
This ODE has the general solution
where $\gamma_1<0$ and $\gamma_2>1$ are the solutions of (6) as before, and $B_1$ and $B_2$ are arbitrary constants. As before, the boundary condition at $\pi=1$ yields $B_2=0$ , and then the boundary condition at $\pi=b$ gives
so
Now recall that by the indifference principle we are looking for $a_0\in(0,1)$ such that
This is possible only if $U(p)<c_0/r$ , i.e. if
Equivalently, we need to have
where $\hat p$ is the indifference point such that $U(\hat p)=c_0/r$ . Moreover, in that case, $a_0$ should be chosen so that
i.e.
That is, the candidate optimal strategy for Player 1 is as follows. If $\mu=\mu_1$ , then the high salary is chosen with probability 1 ( $a_1=1$ ). On the other hand, if Player 1 is of the weak type ( $\mu=\mu_0$ ), then the high salary $c_1$ is chosen with probability $a_0$ , where
For a graphical illustration of the value function U and the indifference point $\hat p$ , see Figure 2.
4.3. Verification of equilibrium
We now summarize the strategies described above in Theorem 1. We then verify that these strategies together constitute a perfect Bayesian equilibrium.
Let b be defined as in (7), and define the strategy $a^*=(a_0^*,a^*_1)$ of Player 1 by
and $a_1^*=1$ , where $\hat p$ is as in (9). Moreover, let $\Pi_0\,:\!=\,\big(\Pi_0^0,\Pi_0^1\big)=(0,\hat p\vee p)$ , and let $\tau^*=(\tau^*_0,\tau^*_1)$ be defined by $\tau^*_0\,:\!=\,\infty$ and $\tau^*_1\,:\!=\,\inf\big\{t\geq 0\colon\tilde\Pi_t^{\Pi_0^1}\leq b\big\}$ .
Theorem 1. Assume that (3) holds. Then the triplet $(a^*,\tau^*, \Pi_0)$ specified above is a perfect Bayesian equilibrium. Moreover, if $p<\hat p$ , the equilibrium is semi-separating; if $p\geq\hat p$ , then the equilibrium is of pooling type.
Proof. We first note that, by construction, the belief system $\Pi_0$ satisfies the Bayesian updating property. The proof of rationality is divided into two parts.
Optimality of $\tau^*$
First note that $\Pi_0^0=0$ yields that
for any $\tau\in\mathcal T$ , so $\tau_0^*=\infty$ is a rational response to $C=c_0$ .
Next, if the employer observes the event $\{C=c_1\}$ , then the stopping time $\tau^*_1\,:\!=\,\inf\{t\geq 0\colon Z_t\leq b\}$ is used, where $Z_t\,:\!=\,\tilde{\Pi}_t^{\Pi^1_0}=\mathbb{P}_{\Pi_0^1}(\mu=\mu_1\mid\mathcal{F}_t^{X})$ , cf. (2). By Section 3, ${\mathrm{d}} Z_t = \omega Z_t(1-Z_t)\,{\mathrm{d}}\hat W_t$ , so an application of Itô’s formula together with (5) shows that
is a bounded supermartingale. For any stopping time $\tau^{\prime}=(\tau^{\prime}_0,\tau^{\prime}_1)\in\mathcal T$ , optional sampling therefore gives that
as $T\to\infty$ by bounded convergence. Since
by Lemma 1, we find that
for all $\tau^{\prime}\in\mathcal T$ .
Furthermore, for $\tau^*$ , the stopped process $Y_{t\wedge\tau^*_1}$ is a martingale, so optional sampling and bounded convergence give
as $T\to\infty$ , which together with (10) implies that $\tau^*_1$ is an optimal response to $C=c_1$ .
Optimality of $a^*$
We have that
where the inequality holds since if $p>\hat p$ then we have $U(\hat p\vee p)=U(p)\geq c_0/r$ and $a_0^*=1\geq a_0$ , and if $p\leq \hat p$ then we have $U(\hat p\vee p)=U(\hat p)=c_0/r$ .
Similarly,
where the inequality follows from the inequalities
Remark 5. As is often the case for signaling games, there is no uniqueness of PBEs. Indeed, consider the strategy pair $(a,\tau)$ , where $a=(0,0)$ and $\tau=(\infty,0)$ ; in words, Player 1 always chooses $C=c_0$ (regardless of their type) and Player 2 never stops if $C=c_0$ and stops immediately if $C=c_1$ . Then $(a,\tau,\Pi_0)$ with $\Pi_0=(p,\Pi_0^1)$ is also a perfect Bayesian equilibrium (of pooling type) provided the belief $\Pi_0^1$ is chosen small enough (e.g. $\Pi_0^1\leq b$ ).
There have been substantial efforts in the literature to refine the notion of PBE (cf. [Reference Banks and Sobel2, Reference Kreps and Wilson16]) in order to rule out some non-intuitive equilibria. Rather than taking that path, however, we simply note that in the pooling equilibrium both types have the same equilibrium value $c_0/r$ , which is dominated by the corresponding equilibrium values in Theorem 1, so the semi-separating PBE is preferred by the first-mover of our game.
Remark 6. We have analyzed the game under the assumption (3) that $c_0\leq\mu_0<c_1<\mu_1$ . In the alternative ordering $\mu_0<c_0<c_1<\mu_1$ , the smaller salary $c_0$ provides a negative running reward for the employer in the case of a weak-type employee, and a semi-separating (or separating) equilibrium is not feasible. As in Remark 5, we can construct a pooling PBE with $a=(0,0)$ , but we also obtain a pooling equilibrium with $a=(1,1)$ , supported by a sufficiently small belief $\Pi_0^0$ . While semi-separating and separating PBEs are not feasible, it remains an open question whether mixing between the two pooling equilibria is possible in this parameter regime.
5. Extensions
In this section we briefly discuss a few extensions of the basic setup we have presented. All of these extensions can be easily solved using the methods of the current article, thus demonstrating the robustness of the benchmark case studied. For the sake of brevity, we merely outline the solutions and leave out the full arguments.
5.1. Firing cost
In this section we consider the specification (i)–(iv) in the introduction, but with the addition that
-
(v) At the firing time $\tau$ , Player 2 pays a firing cost $\epsilon$ , where $\epsilon\in(0,({c_1-\mu_0})/{r})$ .
Note that the assumption $\epsilon<({c_1-\mu_0})/{r}$ implies that the firing cost is smaller than the maximal possible loss for Player 2. Adding assumption (v), the expected payoff for Player 2 is now
by Lemma 1. Note that the choice $\tau_i=0$ gives rise to no firing cost, with the interpretation that no hiring takes place.
Replacing the first boundary condition in the free-boundary problem (5) with $V(b)=-\epsilon$ , wer obtain a stopping boundary
where $\gamma_1$ is the negative solution to the quadratic equation (6). Due to the parameter ordering in (3), we can verify that indeed $b^\epsilon\in(0,1)$ when $\epsilon\in(0,({c_1-\mu_0})/{r})$ . Arguing as in Section 4.2, we find that the indifference point $\hat p$ at which $U(\hat p)=c_0/r$ is given by
which leads to the candidate strategy $a^*=(a^*_0,1)$ with
Thus we specify a triplet $(a^{*},\tau^{*}, \Pi_0)\in\mathcal P\times\mathcal T\times\mathcal P$ by $a^{*}=(a_0^{*},1)$ , where $\Pi_0=(0,\hat p\vee p)$ and $\tau^{*}=(\infty, \tau_1)$ , with $\tau^{*}_1=\inf\big\{t\geq 0\colon\tilde\Pi_t^{\Pi_0^1}\leq b^\epsilon\big\}$ . It is then straightforward to verify that if $V(\hat p\vee p)> 0$ , then $(a^{*},\tau^{*}, \Pi_0)$ is a PBE. (On the other hand, if $V(\hat p\vee p)\leq 0$ , then ((0, 0), (0, 0), (p, p)), corresponding to always choosing $C=c_0$ and no hiring, is an equilibrium.)
The addition of a firing cost $\epsilon\in(0,({c_1-\mu_0})/{r})$ , together with $V(\hat p\vee p)> 0$ , then constitutes a PBE with a lower stopping boundary $b^\epsilon$ for Player 2 and a higher randomizing probability $a_0^*$ for a low-type Player 1 compared to the corresponding values in the benchmark case of Section 4.
5.2. Uncertainty about type
In this section we consider the same setup as specified in (i)–(iv), but where (ii) is replaced by
-
(iiʹ) At time $t=0$ , Player 1 first receives a noisy observation of their capacity $\mu$ , and then presents to Player 2 a salary claim $C\in\{c_0,c_1\}$ .
In this way, Player 1 also has incomplete information about their own capacity (cf. [Reference Weiss24]). More precisely, assume that the noisy signal is either of the two events ‘strong belief’ and ‘weak belief’, where $p_1\,:\!=\,\mathbb{P}(\mbox{strong belief})$ , $q\,:\!=\,\mathbb{P}(\mu=\mu_1\mid\mbox{strong belief})$ , and
With this notation, the probability (denoted p in the previous sections) that Player 1 has the large capacity is $p=p_1q$ . Note that when $q=1$ , this extension collapses to our original benchmark model. Also note that a consequence of (11) is that Player 1 has a tendency to overestimate their capacity: if Player 1 has weak belief, then they are always of the weak type, whereas if they have strong belief, then they may still be of the weak type. We adapt the strategy $a=(a_0,a_1)$ so that $a_0$ and $a_1$ now denote the conditional probabilities that Player 1 chooses $C=c_1$ given ‘weak belief’ and ‘strong belief’, respectively.
For simplicity, assume that $q>\hat p$ , where $\hat p$ is defined in (9). Now specify a triplet $(a^*,\tau^*, \Pi_0)\in\mathcal P\times\mathcal T\times\mathcal P$ by $a^*=(a_0^*,1)$ , $\Pi_0=(0,\hat p\vee (p_1q))$ , and $\tau^*=(\infty,\tau_1^*)$ , where
with b as in (7). It is then straightforward to check that $(a^*,\tau^*, \Pi_0)$ constitutes a PBE.
5.3. Adding an interview phase
As a last extension of the setup in (i)–(iv), consider a situation where (iii) is replaced with
-
(iiiʹ) At time 0, Player 2 observes the salary claim together with the result ‘weak result’ or ‘strong result’ of a test, and subsequently also noisy observations of $\mu$ , based upon which a choice is made of a stopping time $\tau$ to terminate the employment.
Thus, further to the salary claim C, Player 2 also receives information from an additional test result (cf. [Reference Alós-Ferrer and Prat1, Reference Daley and Green6]) with two possible outcomes. For definiteness, we assume that $\mathbb{P}(\mbox{strong result}\mid \mu=\mu_0)=q\in({c_0}/{c_1},1)$ and
which corresponds to a situation in which strong types always perform well in the interview, and weak types sometimes do. Note that the outcome of the interview test is not available for Player 1, which is similar to the situation studied in [Reference Alós-Ferrer and Prat1, Reference Daley and Green6].
A belief system and strategy for Player 2 can now be described as the quadruples $\Pi_0=\big(\Pi_0^{0,\mathrm{weak}},\Pi_0^{0,\mathrm{strong}},\Pi_0^{1,\mathrm{weak}},\Pi_0^{1,\mathrm{strong}}\big)$ and $\tau=\big(\tau_0^{\mathrm{weak}},\tau_0^{\mathrm{strong}},\tau_1^{\mathrm{weak}},\tau_1^{\mathrm{strong}}\big)$ , where for $i=0,1$ , $\Pi_0^{i,\mathrm{weak}}$ , $\Pi_0^{i,\mathrm{weak}}$ , $\tau_i^{\mathrm{strong}}$ , and $\tau_i^{\mathrm{strong}}$ are the beliefs and stopping times used provided $C=c_i$ and the test result ‘weak result’ or ‘strong result’ are observed, respectively.
By the assumption in (12), if Player 2 observes ‘weak result’ in the test, then Player 1 is automatically of the weak type ( $\Pi_0^{i,\mathrm{weak}}=0$ ), and immediate firing ( $\tau_1^{\mathrm{weak}}=0$ ) would then be optimal in the case $C=c_1$ . Therefore, choosing $C=c_1$ is risky if Player 1 is of the weak type, and the obtained value from choosing $C=c_1$ (for the weak-type player) is $qU\big(\Pi_0^{1,\mathrm{strong}}\big)$ , with U as in (8). We thus define the indifference point $\hat P$ via the indifference principle (cf. Section 4.2) so that $qU(\hat P)={c_0}/{r}$ (note that $\hat P$ is well-defined since $q>c_0/c_1$ by assumption), and let
with
It is then straightforward to check that $(a^*,\tau^*,\Pi_0)$ is a PBE.
Acknowledgements
We wish to thank Kristoffer Glover for enlightening discussions, and for sharing with us preliminary notes on a problem in which Bayesian updating of a fund manager’s skill plays a key role. We also thank Ola Andersson for stimulating discussions on the game-theoretical aspects of the current work.
Funding information
Funding from the Swedish Research Council is gratefully acknowledged.
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.