February 19, 1996. March 6, 2000. 
 This file was scanned in from the published version, and has lost the equations as a result. 

    RAND Journal of Economics     

         Vol. 18, No. 3. Autumn 1987   

           Moral hazard in risk-averse teams 

     Eric Rasmusen*

 * University of California, Los Angeles. I would like to thank Dilip Mookherjee, Sheridan Titman, Brett Trueman, and an anonymous referee for helpful comments.

    Holmstrom (1982) has shown that a non-budget-balancing contract induces a team of risk-neutral agents to choose the first-best effort levels. This is not generally true when agents are risk averse. Furthermore, a "massacre" contract, which punishes all but one agent when the outcome is low, can attain the first best over a wider range of parameters than any other budget-balancing contract.


    "No, they have no railroad accidents to speak of in France. But why? Because when one occurs, somebody has to hang for it! Not hang, maybe, but be punished at least with such vigor of emphasis as to make negligence a thing to be shuddered at by railroad officials for many a day thereafter. 'No blame attached to the officers' - that lying and disaster-breeding verdict so common to our soft-hearted juries, is seldom rendered in France."    -Mark Twain, The Innocents Abroad, chap. 12

 1. Introduction  


   Holmstrom's 1982 article, "Moral Hazard in Teams," begins with a theorem that shows that no contract that is "budget-balancing," allocating all of the team's output to the members of the team, can induce the members to choose the efficient effort levels. Instead, the efficient output is obtained by a contract giving each member of the team a payoff of zero if output is lower than if all members had chosen the efficient effort levels. Such a contract is not entirely satisfactory from a modelling point of view because it requires a commitment to discard an output that is even slightly below the efficient level. Once a low output is observed, all of the team members would like to repudiate their contract and share the output among themselves. To commit to losing it, they must introduce an outsider to whom they agree to surrender all of the output whenever it is insufficient.

   Holmstrom's theorem, however, depends on the agents' utility functions' being linear in money. If agents are risk averse, they can use their risk aversion to write the efficient budget-balancing contract that I shall describe below. Holmstrom is not in error, but readers of his article, accustomed to models in which risk aversion prevents, rather than permits, the first best to be obtained, might be misled. Fortunately, the validity of the results in Holmstrom's later sections is unaffected, since by relaxing the requirement that all contracts be budget-balancing, he does not exclude such contracts from his later propositions.

   I shall also show that of the many efficient budget-balancing contracts, the "massacre" contract, in which one randomly selected agent benefits and all the others are punished when an out-of-equilibrium outcome is observed, is feasible more often than any other contract. In particular, the massacre contract is feasible for a larger set of parameters than the "scapegoat" contract, in which one agent is punished and the others benefit.

   Section 2 describes the model and Holmstrom's original result. Section 3 constructs a budget-balancing contract of the scapegoat type and shows when it can induce the first-best effort level. This is followed by general comments on budget-balancing. Section 4 compares different kinds of budget-balancing contracts and shows that the massacre budget-balancing contract is most often feasible. Concluding remarks appear in Section 5.


    2. The Holmstrom model    

  I shall use Holmstrom's notation. Each of $n$ agents indexed by $i$ takes an unobservable action or effort level $a_i \in [0, \infty)$. Write

$$a_{-i} = (a_1, \ldots, a_{i-1}, a_{i+1}, \ldots, a_n) \quad \text{and} \quad a = (a_i, a_{-i}).$$

  The value of output is a function $x(a)$, which is strictly increasing, concave, and continuously differentiable. Output depends only on the effort levels; there is no random error.

  The compensation rule specifies $s_i(x)$ as agent $i$'s compensation if the output is $x$. Effort is not observable, so that compensation cannot be a direct function of $a_i$. Moreover, agent $i$ has limited liability, and his compensation cannot be less than the liability bound of $s_i = -w_i$, where we assume that $w_i \ge 0$. Agent $i$'s utility function is separable in money and effort and can be written in the form

$$U_i(s_i, a_i) = m_i(s_i) - v_i(a_i). \tag{1}$$

  The disutility of effort $v_i$ is continuously differentiable, strictly convex, and increasing. In Holmstrom's article utility is linear in money, so that $m_i = s_i$. Here I assume that $m_i$ is given by the function

$$m_i(s_i) = -e^{-\theta_i s_i}, \tag{2}$$

  which has constant absolute risk aversion equal to $\theta_i > 0$ for agent $i$. The particular form of function (2) is not important except for its convenient parameterization and its differentiability. We seek sharing rules $s_i(x) \ge -w_i$ such that we have budget balancing,

$$\sum_{i=1}^{n} s_i(x) = x \quad \text{for all } x, \tag{3}$$

  and the noncooperative game with payoffs

$$m_i(s_i(x(a))) - v_i(a_i), \quad i = 1, \ldots, n, \tag{4}$$

  has a Nash equilibrium $a^*$ that satisfies the following condition for Pareto optimality.1

    Condition 1. There do not exist efforts $\tilde{a}$ and sharing rules $\tilde{s}$ such that: (a) for all $i$, $E\,U_i(\tilde{s}_i, \tilde{a}_i) \ge E\,U_i(s_i, a_i^*)$; and (b) for some agent $j$, $E\,U_j(\tilde{s}_j, \tilde{a}_j) > E\,U_j(s_j, a_j^*)$.

   It may seem unnecessary to write Condition 1 in terms of expected utility, since there is no production uncertainty in the model, but doing so allows for the possibility of randomized sharing rules.

   Having assumed linearity of the utility functions, Holmstrom can simplify the payoffs to

$$s_i(x(a)) - v_i(a_i), \quad i = 1, \ldots, n, \tag{4a}$$

   which validates the following proposition.

   1 Holmstrom uses the linearity of $m_i(s_i)$ in his model to state the Pareto-optimality condition more simply than Condition 1.

     Proposition 1 (Holmstrom). There do not exist sharing rules $\{s_i(x)\}$ that satisfy (3) and yield $a^*$ as a Nash equilibrium in the noncooperative game with payoffs (4a).

     I shall not repeat Holmstrom's proof here, but the result is intuitive.2 If one agent shirks, he receives the full benefit of his diminished effort. The cost, on the other hand, which is lower output, is shared by all the agents.

     We shall see that when the payoffs are not (4a), but (4), Proposition 1 is no longer true. When agents are risk averse, an efficient budget-balancing contract exists if the agents are either sufficiently wealthy or sufficiently risk averse.
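
The free-rider intuition can be illustrated with a small numerical sketch in Python. The functional forms are assumptions made only for this illustration and do not appear in the model: output is $x(a) = \sum_j a_j$, the disutility of effort is $v(a_i) = a_i^2/2$, utility is linear in money, and the sharing rule is the equal split $s_i = x/n$, which is only one of the budget-balancing rules covered by the proposition. Under these assumptions the efficient effort is 1 for every agent, whereas each agent's best response is $1/n$.

```python
def payoff(a_i, others_total, n):
    """Risk-neutral payoff (4a) under the assumed equal split s_i = x / n."""
    x = a_i + others_total          # assumed output: x(a) = sum of efforts
    return x / n - 0.5 * a_i ** 2   # assumed disutility: v(a) = a^2 / 2


def best_response(others_total, n, a_max=2.0, grid=10001):
    """Grid search for the effort that maximizes the agent's own payoff."""
    candidates = [a_max * k / (grid - 1) for k in range(grid)]
    return max(candidates, key=lambda a: payoff(a, others_total, n))


if __name__ == "__main__":
    n = 4
    efficient = 1.0                    # solves 1 = v'(a) under the assumptions
    others = (n - 1) * efficient       # the other agents work efficiently
    shirk = best_response(others, n)
    print("best response:", shirk)
    print("payoff at efficient effort:", payoff(efficient, others, n))
    print("payoff when shirking:", payoff(shirk, others, n))
```

The shirker keeps the whole saving in effort cost but bears only $1/n$ of the lost output, so the deviation is profitable for any $n > 1$ in this example.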

    3. An efficient budget-balancing contract

Under the following contract, $a^*$ is a Nash equilibrium for some values of the parameters $w$ and $\theta$.

 If output is $x(a^*)$, let each agent $i$ receive a share $b_i$ such that the budget is balanced and Condition 1 is satisfied. If output is greater than $x(a^*)$, split the surplus evenly among the agents after giving each agent $i$ the amount $b_i$. If output is less than $x(a^*)$, choose one agent $j$ and let him receive $-w_j$. Let each of the remaining $(n - 1)$ agents $i$ receive $b_i + (b_j + w_j - x(a^*) + x)/(n - 1)$.

  Depending on the unlucky agent $j$'s wealth and whether he is paid more than his marginal product in equilibrium, $(b_j + w_j - x(a^*) + x)$ is greater or less than zero, and the lucky agents are paid more or less than they would have been had no one shirked.

  The sum of the rewards when output is below the Pareto-optimal level and agent $j$ is punished is

$$\sum_{k=1}^{n} s_k = -w_j + \Big(\sum_{i \ne j} b_i\Big) + (n - 1)\left[\frac{b_j + w_j - x(a^*) + x}{n - 1}\right] = x, \tag{5}$$

  so that the contract is budget-balancing.

  To a single agent $i$, who expects all of the other agents to choose the efficient effort level, the contract appears as contract (6):

$$s_i(x) = \begin{cases} b_i + (x - x(a^*))/n & \text{if } x \ge x(a^*), \\ b_i + z_i \ \text{ with probability } (n - 1)/n & \text{if } x < x(a^*), \\ -w_i \ \text{ with probability } 1/n & \text{if } x < x(a^*), \end{cases} \tag{6}$$

  where $z_i$ is a random variable taking the value $(b_j + w_j - x(a^*) + x)/(n - 1)$ with probability $1/(n - 1)$ for $j = 1, \ldots, n$, $j \ne i$. The agent chooses either the Pareto-optimal effort level and the reward $b_i$, or some lower effort level and a gamble in which with probability $1/n$ he receives $-w_i$ and with probability $(n - 1)/n$ he receives not only his own $b_i$ but also an amount depending on the unlucky agent's wealth and equilibrium share.

  Choose the $b_i$'s in (6) so that $\sum_i b_i = x(a^*)$ and Condition 1 is satisfied when $b_i$ replaces $s_i$.

  Agent $i$ does not want to exert an effort greater than $a_i^*$, given that the other agents are exerting the efficient level. Under the contract just described, increasing his effort beyond $a_i^*$ raises every other agent's utility, and if it raises $i$'s also, then $a^*$ is not the Pareto-optimal effort. We shall implicitly carry this result through the article, and in demonstrating that an efficient Nash equilibrium is attained, we shall be concerned only about low effort levels.
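
The budget-balancing property of the contract can be checked mechanically. The following is a minimal Python sketch, not part of the original analysis; the function name `contract_payouts` and the numbers in the example are hypothetical, and the shares $b_i$ are simply assumed to sum to $x(a^*)$.

```python
def contract_payouts(x, x_star, b, w, unlucky):
    """Payments to all agents for output x; `unlucky` is the index of agent j.

    b: equilibrium shares, assumed to sum to x_star.
    w: liability bounds, one per agent.
    """
    n = len(b)
    if x >= x_star:
        surplus = (x - x_star) / n             # any surplus is split evenly
        return [b_i + surplus for b_i in b]
    # Output is too low: agent j pays his bound, the others share the pot.
    bonus = (b[unlucky] + w[unlucky] - x_star + x) / (n - 1)
    return [-w[i] if i == unlucky else b[i] + bonus for i in range(n)]


if __name__ == "__main__":
    b, w, x_star = [3.0, 4.0, 5.0], [1.0, 2.0, 3.0], 12.0   # hypothetical values
    for j in range(3):
        pay = contract_payouts(10.0, x_star, b, w, j)
        print(j, pay, sum(pay))
```

Whichever agent is drawn as the unlucky one, the payments add up to the realized output, as equation (5) requires.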
With each agent $i$ is associated a "cheating effort" $\hat{a}_i \in [0, a_i^*]$ that represents the deviation from equilibrium most tempting to him. Agent $i$'s cheating effort maximizes his utility, given that the other agents choose $a_{-i}^*$ and the contract is replaced by the "deviation lottery" characterized in (6). If agent $i$ selects the lowest effort possible when he decides to cheat, then $\hat{a}_i = 0$, but he might choose a higher cheating effort because with probability $(n - 1)/n$ his monetary compensation rises with effort and output. The cheating effort for contract (6) solves the problem

$$\max_{a_i} \left[ \frac{n - 1}{n} E\,m_i(b_i + z_i) + \frac{1}{n} m_i(-w_i) - v_i(a_i) \right]. \tag{7}$$

The objective function in (7) is strictly concave in effort, because we have assumed that $m_i'' < 0$, $v_i'' > 0$, and $x'' < 0$. Given the concavity of problem (7), classical optimization tells us that the solution $\hat{a}_i$ exists and is unique.

2 Proposition 1 in this article is Holmstrom's Theorem 1 (1982, p. 339).

To induce the agents to select the efficient effort levels, the utility difference $Y_i$ between the efforts $a_i^*$ and $\hat{a}_i$, under the Nash assumption that the other agents choose $a_{-i}^*$, must be positive for each agent $i$:

$$Y_i \equiv \left[ m_i(b_i) - v_i(a_i^*) \right] - \left[ \frac{n - 1}{n} E\,m_i(b_i + z_i) + \frac{1}{n} m_i(-w_i) - v_i(\hat{a}_i) \right] > 0. \tag{8}$$

Proposition 2. If agents are risk averse, then provided that (a) punishments can be great enough ($w_i$ is large enough for every $i$) or (b) risk aversion is great enough ($\theta_i$ is large enough for every $i$), an efficient budget-balancing contract exists.

   Proof 
      We need to show that inequality (8) is true under either of the two conditions. We start by holding the levels of risk aversion fixed and showing that if the $w$'s are large enough, then $a^*$ is a Nash equilibrium under contract (6).

      The first derivative of the utility difference $Y_i$ with respect to the liability bound $w_i$ is (by invoking the envelope theorem to disregard the change in $\hat{a}_i$)

$$\frac{dY_i}{dw_i} = \frac{1}{n} m_i'(-w_i) > 0, \tag{9}$$

      and the second derivative is

$$\frac{d^2 Y_i}{dw_i^2} = -\frac{1}{n} m_i''(-w_i) > 0. \tag{10}$$

      Both expressions (9) and (10) are positive, because $m_i' > 0$ and $m_i'' < 0$ for a concave utility function. Hence, $Y_i$ is increasing in $w_i$ and increasing at an increasing rate, so that it does not converge to an asymptotic value, and if $w_i$ is chosen large enough, inequality (8) is true. This applies to any agent $i$. For every $\theta_i$ there is some level of $w_i$ that allows $a_i^*$ to be the equilibrium effort level, which proves the first part of Proposition 2.

      Even if the agents' liabilities are limited, if they are sufficiently risk averse, an efficient budget-balancing contract can exist. Rewriting expression (8) by using the full form of the function $m_i$ from equation (2) and substituting for $z_i$ from contract (6), we obtain

$$Y_i = -e^{-\theta_i b_i} - v_i(a_i^*) + \frac{1}{n} \sum_{j \ne i} e^{-\theta_i \left( b_i + \frac{1}{n - 1}\left[ b_j + w_j - x(a^*) + x(\hat{a}_i, a_{-i}^*) \right] \right)} + \frac{1}{n} e^{\theta_i w_i} + v_i(\hat{a}_i). \tag{11}$$

      As $\theta_i$ increases, the first and third terms on the right-hand side of (11) approach zero, the second is unaffected, and the fifth term may change as $\hat{a}_i$ changes, but it is bounded by $v_i(0)$ and $v_i(a_i^*)$. The fourth term of (11) increases exponentially with $\theta_i$, which implies that $Y_i$ can be made arbitrarily large. In particular, if $\theta_i$ is large enough, then $Y_i$ is greater than zero, and (6) is an efficient budget-balancing contract. Q.E.D.
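
Both parts of Proposition 2 can be illustrated numerically. The Python sketch below rests on assumptions that are not in the article: agents are identical, output is $x(a) = \sum_j a_j$, disutility is $v(a_i) = a_i^2/2$, utility of money is the exponential form $-e^{-\theta s}$, each equilibrium share is $b = x(a^*)/n$, and the efficient effort `a_star` is taken as given rather than derived. The function `utility_gap` is a hypothetical helper that evaluates the utility difference in (8), with the cheating effort found by grid search.

```python
import math


def cara(s, theta):
    """Assumed utility of money with constant absolute risk aversion theta."""
    return -math.exp(-theta * s)


def utility_gap(w, theta, n=3, a_star=1.0, grid=2001):
    """Utility difference (8) for one of n identical agents.

    Assumptions: x(a) = sum of efforts, v(a) = a^2 / 2, b = x(a*) / n,
    and a_star is supplied by the caller rather than solved for.
    """
    x_star = n * a_star
    b = x_star / n
    stay = cara(b, theta) - 0.5 * a_star ** 2
    best_deviation = -math.inf
    for k in range(grid - 1):                      # efforts in [0, a_star)
        a = a_star * k / (grid - 1)
        x = a + (n - 1) * a_star                   # the others choose a_star
        lucky = b + (b + w - x_star + x) / (n - 1)
        u = ((n - 1) / n) * cara(lucky, theta) \
            + (1 / n) * cara(-w, theta) - 0.5 * a ** 2
        best_deviation = max(best_deviation, u)
    return stay - best_deviation


if __name__ == "__main__":
    for w in (0.0, 0.5, 1.0, 2.0):
        print("theta=1, w=%.1f: gap=%.4f" % (w, utility_gap(w, 1.0)))
    for theta in (1.0, 2.0, 5.0):
        print("w=0.5, theta=%.1f: gap=%.4f" % (theta, utility_gap(0.5, theta)))
```

In this example the gap is negative when the liability bound is zero and becomes positive as either the bound or the risk-aversion parameter rises, which is the pattern the proposition describes.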

     Contract (6), like Holmstrom's non-budget-balancing contract, does not necessarily yield $a^*$ as the unique Nash equilibrium. An agent will not choose $a_i^*$ if he expects another agent to choose an inefficiently low effort. Agent $i$'s response might be either also to choose a low effort or to choose a high effort to compensate for the shirker and to avoid the random punishment, and there might exist other Nash equilibria in which some efforts are either insufficient or excessive. This does not mean that $a^*$ is not a strong Nash equilibrium: a player's solitary deviation to any other effort level would lower his utility. Note also that contract (6) is just one of the many possible contracts that rely on risk aversion to obtain the efficient equilibrium $a^*$, and efficient budget-balancing contracts can differ in their equilibrium outcomes as well as in the out-of-equilibrium punishments.

 General comments.

 Under the contract in the preceding subsection, if the outcome shows that the wrong effort level was chosen, the agents are subjected to risk. Ex post, shirking makes only one of the agents worse off, and all the others better off, unless the shirking decreased output by more than the $b_i + w_i$ lost by the unlucky agent. We may find random punishments morally distressing, but they are similar to the punishments in tournament contracts under uncertainty, a context in which most people feel that punishments conditioned on random events are fair.

 If the efforts of the agents are observable, even if only with error, the observed efforts can be used as the criteria for punishment. When the observation error has a high variance, the wrong agent would often be punished if output were low, but that does not detract from the contract's efficiency. Low observed output is merely an excuse to punish, although the shirker would have at least a slightly greater probability than the other agents of being punished. Depending on the variance of the observation error, the punishment is more or less random. This article describes a model in which the variance of the observation error is infinite, and hence the punishment seems completely capricious.

 If efforts are observable with error, but we continue to assume that there is no production uncertainty, then no punishment occurs in equilibrium. Although the observed individual efforts might be used to allocate punishment, the only punishment trigger needed to attain efficiency is the team's output. In equilibrium output is $x(a^*)$, and if at the same time the observed efforts are low, it is clear they must have been observed with error.

 If, however, individual efforts are observable with error, the first best may sometimes be achieved with budget balancing, even if the agents are risk neutral. The exact form of the contract depends on the specification of the observation error, but one possibility is to give the entire output to the agent whose observed effort is closest to his efficient effort. Such a plan resembles more a tournament contract than a team contract, and it relies heavily on risk neutrality since the equilibrium compensation (not just the out-of-equilibrium compensation) varies widely.

 I mentioned earlier that one reason for dissatisfaction with non-budget-balancing contracts is the perfectness problem of having to discard output if it is insufficient. One might wonder whether the random punishment contract addresses the perfectness problem any better. Both contracts rely on lowering the utility of all the agents after certain outcomes are observed. The random punishment contract differs in that some agents may be better off ex post, after the punishment actually occurs, and they would vote against any recontracting. Between observing the output and choosing the agent to be punished, however, all agents could raise their utility by abandoning the punishment scheme and reverting to a nonstochastic sharing rule. The contract must somehow prevent this.

 One way to prevent recontracting is to allocate the punishment immediately and automatically after low output is observed and before recontracting can occur.3 Once a victim has been chosen, the other agents block any recontracting. In some situations there is a time lag between the date of choosing effort and the date of observing output. Committing to punishment is then simple: the lottery for allocating possible future punishments is held between the two dates. Any agent who objects to the lottery must have shirked, since in equilibrium the lottery is harmless as its punishment is never imposed.

 Holmstrom has suggested that if the team adds a manager, whose sole function is to serve as the residual claimant, the perfectness problem for the non-budget-balancing contract can be solved. Output below the efficient level triggers a large payment to the manager, who therefore has the incentive to oppose any recontracting. This idea is particularly interesting because it depicts the manager not as a principal, but as an agent of the agents.4 He is truly a "public servant." But this introduces the possibility of new agency problems: the manager has an interest in seeing that the output is below the efficient level so that the punishment is triggered. Randomization schemes are free from this danger, despite the drawback of their greater complexity.

 3 One way to allocate shares, for example, would be to adopt a plan for allocating sealed-bid orders suggested, but not adopted, during the "Electric Conspiracy." This plan was to use stock market results published in the Wall Street Journal to allocate bids randomly, an interesting application of the theory of efficient markets (Sultan, 1974, p. 47).

    4. Scapegoats versus massacres 

    Under the randomizing contract described above, one agent is chosen to be the "scapegoat" when output is low, and the others benefit at his expense. Another extreme in the class of randomizing contracts is the "massacre" contract in which all of the agents are punished when output is low except for one, randomly chosen, who receives the entire output. We shall see that the massacre is a better contract in the sense that it is feasible for a strictly larger set of liability bounds and risk-aversion parameters. Since it should be clear now that heterogeneity of the agents is unimportant except for the amount of notation, we shall simplify the model of Sections 2 and 3 by assuming that the agents have identical utility functions and liability bounds.

    In the model with identical agents the scapegoat contract appears to the individual agent as

$$s_i(x) = \begin{cases} x/n & \text{if } x \ge x(a^*), \\ (x + w)/(n - 1) \ \text{ with probability } (n - 1)/n & \text{if } x < x(a^*), \\ -w \ \text{ with probability } 1/n & \text{if } x < x(a^*). \end{cases} \tag{12}$$

    The effort-level vector $a^*$ is a Nash equilibrium under the scapegoat contract if and only if the utility of choosing $a_i^*$ is greater than that of choosing a lower effort level; that is, if

$$\left[ m\!\left(\frac{x(a^*)}{n}\right) - v(a_i^*) \right] - \left[ \frac{n - 1}{n}\, m\!\left(\frac{x_s + w}{n - 1}\right) + \frac{1}{n}\, m(-w) - v(\hat{a}_s) \right] > 0, \tag{13}$$

    where $\hat{a}_s$ is the effort level chosen under the scapegoat contract by the single cheating agent and $x_s \equiv x(\hat{a}_s, a_{-i}^*)$. The massacre contract appears to the individual agent as

$$s_i(x) = \begin{cases} x/n & \text{if } x \ge x(a^*), \\ -w \ \text{ with probability } (n - 1)/n & \text{if } x < x(a^*), \\ x + (n - 1)w \ \text{ with probability } 1/n & \text{if } x < x(a^*). \end{cases} \tag{14}$$

    The vector $a^*$ is a Nash equilibrium under the massacre contract if and only if the utility of choosing $a_i^*$ is greater than that of choosing a lower effort level; that is, if

$$\left[ m\!\left(\frac{x(a^*)}{n}\right) - v(a_i^*) \right] - \left[ \frac{n - 1}{n}\, m(-w) + \frac{1}{n}\, m(x_m + [n - 1]w) - v(\hat{a}_m) \right] > 0, \tag{15}$$

    where $\hat{a}_m$ is the effort level chosen under the massacre contract by the single cheating agent and $x_m \equiv x(\hat{a}_m, a_{-i}^*)$. The existence and uniqueness of $\hat{a}_m$ follow from the concavity of the deviation lottery (14) by the same argument made for the existence of $\hat{a}_i$ in Section 3.

    We shall consider the class of contracts that treat identical agents identically, as do the scapegoat and massacre contracts. Otherwise there is some agent whose change in expected utility from deviating is highest, and he is the only relevant agent for determining whether the first best can be supported. That agent's compensation in equilibrium should be increased to make his loss from deviation greater, or his punishment after deviation should be increased while lightening that of the other agents. Carried to the extreme, this results in a symmetric contract, which gives each agent the same compensation in equilibrium and the same expected disutility from deviation.

    We shall now prove that the massacre contract attains efficiency for a larger set of parameters than the scapegoat contract or any other budget-balancing contract.

    4 Anyone interested in the problem of one agent serving several principals should see Bernheim and Whinston (1986).

 Proposition 3. If any budget-balancing contract can achieve a Pareto optimum, the massacre  contract can.5   

Proof 
    Under a Pareto-optimal contract, no agent has the incentive to choose an excessive effort level, so that output is no greater than the efficient level $x^*$. Each agent faces a choice between the desired effort level and a compensation of $x^*/n$, or the cheating effort level and a deviation lottery with an expected value of some smaller $x/n$ and a distribution that depends on the form of the contract. The expected values of different lotteries are different because the cheating efforts are different, but that will not be relevant to this proof.

  The advantage of the massacre contract is that its deviation lottery is the riskiest of any contract. Let us suppose for the moment that the cheating efforts, and thus the expected values, are the same for the massacre contract's deviation lottery and for some arbitrary contract $k$'s deviation lottery. We shall see that if the expected values are the same, the massacre lottery can be obtained from contract $k$'s deviation lottery by the addition of a series of mean-preserving spreads of the kind formalized by Rothschild and Stiglitz (1970). The massacre contract creates the two-point deviation lottery that places the highest possible probability, $(n - 1)/n$, on the lowest possible payoff, $-w$, and the probability of $1/n$ on the highest possible payoff, $(x + (n - 1)w)$. Any other lottery puts smaller probability on $-w$ and positive probability on payoffs less than $(x + (n - 1)w)$. If the means of the lotteries are equal, the massacre lottery is riskier than lottery $k$, because it takes probability mass away from the payoffs between $-w$ and $(x + (n - 1)w)$ and puts it on those extreme points.

  More precisely, if contract $k$ has a discrete probability function for its deviation lottery, then we can obtain the massacre lottery from $k$'s deviation lottery by a series of mean-preserving spreads of the form

$$f(m) = \begin{cases} \alpha \ge 0 & \text{for } m = -w, \\ -\alpha \le 0 & \text{for } m = -w + d, \\ -\beta \le 0 & \text{for } m = x + (n - 1)w - t, \\ \beta \ge 0 & \text{for } m = x + (n - 1)w, \\ 0 & \text{otherwise,} \end{cases} \tag{16}$$

  where the values of $d$, $t$, $\alpha$, and $\beta$ are chosen so that

$$-w < -w + d \le x + (n - 1)w - t \le x + (n - 1)w \tag{17}$$

  and $\alpha d = \beta t$. If contract $k$'s deviation lottery has a continuous density, the notation of (16) is inappropriate, but it can easily be adapted to find the desired mean-preserving spreads.
Rothschild and Stiglitz (1970) have shown that the expected utility of a risk-averse agent is lower with a lottery that is riskier in the sense of being obtained from other lotteries by a series of mean-preserving spreads. Since any deviation from $a^*$ triggers the deviation lottery specified by the contract in force, an agent's expected utility is lower when a given deviation occurs under a massacre contract than under any other contract. An agent who deviates to $\hat{a}_m$ will have lower utility under the massacre contract than under any other contract $k$.

Once we relax our temporary supposition that the cheating efforts are equal, contract $k$ is even less able to deter deviation because the agent will reoptimize his cheating effort from $\hat{a}_m$ to a cheating effort preferred under contract $k$, and this raises his utility when he deviates under that contract. Although the massacre contract may not always be able to support a Pareto optimum by making the cost (the shift to a lottery) greater than the benefit (the reduced effort), it makes the difference as great as possible, and hence will support the Pareto optimum whenever any other feasible contract can. Q.E.D.

5 I would like to thank Paul Milgrom for suggesting that I broaden Proposition 3 to its present form.
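
The step from an arbitrary deviation lottery to the massacre lottery can be made concrete with one worked case. The Python sketch below is only an illustration under assumed numbers ($n = 3$, $w = 1$, deviation output $x = 2$, and exponential utility with risk aversion 1); the helper names are hypothetical. It applies a single spread of the form (16) to the scapegoat lottery and checks that the result is the massacre lottery, with the same mean and lower expected utility.

```python
import math


def cara(s, theta=1.0):
    """Assumed exponential (constant absolute risk aversion) utility."""
    return -math.exp(-theta * s)


def mean(lottery):
    return sum(m * p for m, p in lottery.items())


def expected_utility(lottery, theta=1.0):
    return sum(p * cara(m, theta) for m, p in lottery.items())


def spread(lottery, low, high, d, t, alpha, beta):
    """Add one mean-preserving spread of the form (16) to a discrete lottery.

    `lottery` maps payoffs to probabilities; low = -w, high = x + (n - 1) w.
    """
    assert math.isclose(alpha * d, beta * t), "need alpha * d == beta * t"
    out = dict(lottery)
    shifts = [(low, alpha), (low + d, -alpha), (high - t, -beta), (high, beta)]
    for m, delta in shifts:
        out[m] = out.get(m, 0.0) + delta
    out = {m: p for m, p in out.items() if abs(p) > 1e-12}
    assert all(p > 0 for p in out.values()), "removed more mass than present"
    return out


n, w, x = 3, 1.0, 2.0                                   # assumed numbers
scapegoat = {(x + w) / (n - 1): (n - 1) / n, -w: 1 / n}
massacre = spread(scapegoat, low=-w, high=x + (n - 1) * w,
                  d=2.5, t=2.5, alpha=1 / 3, beta=1 / 3)
print(massacre, mean(scapegoat), mean(massacre))
print(expected_utility(scapegoat), expected_utility(massacre))
```

Here all of the mass on the intermediate payoff 1.5 is moved to the two extreme payoffs, so a single spread suffices; a lottery with several intermediate payoffs would need one spread for each.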

   The intuition behind Proposition 3 is that the massacre contract puts greater risk on an agent if he chooses low effort, and less risk-averse agents can be deterred by this larger amount of risk. The massacre contract deters shirking in types of agents who would shirk under the scapegoat contract, and hence we might expect to see the massacre contract employed more often. The advantage of the massacre contract, however, also points up a major weakness of the model: the absence of production uncertainty.6 If output were sometimes low, even if effort were high, the massacre contract would trigger greater accidental punishments than the scapegoat contract, and its superiority might well disappear. Moreover, I have said nothing about the output levels in the inefficient Nash equilibria that possibly exist even under efficient contracts, and the massacre contract may do badly in those inefficient equilibria.
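
The ranking of the two contracts can also be seen in a numerical example that evaluates conditions (13) and (15) side by side. The Python sketch below uses illustrative assumptions that are not in the article: output is $x(a) = \sum_j a_j$, disutility is $v(a) = a^2/2$, utility of money is $-e^{-\theta s}$, and the efficient effort `a_star` is taken as given, so that the equilibrium share is $x(a^*)/n = a^*$. The function names are hypothetical.

```python
import math


def cara(s, theta):
    """Assumed exponential utility of money."""
    return -math.exp(-theta * s)


def scapegoat_lottery(x, w, n):
    """Deviation lottery of contract (12) as (payoff, probability) pairs."""
    return [((x + w) / (n - 1), (n - 1) / n), (-w, 1 / n)]


def massacre_lottery(x, w, n):
    """Deviation lottery of contract (14) as (payoff, probability) pairs."""
    return [(-w, (n - 1) / n), (x + (n - 1) * w, 1 / n)]


def gap(lottery, w, theta, n=3, a_star=1.0, grid=2001):
    """Left-hand side of (13) or (15), depending on the lottery supplied.

    Assumptions: x(a) = sum of efforts, v(a) = a^2 / 2, a_star given.
    """
    stay = cara(a_star, theta) - 0.5 * a_star ** 2   # share x*/n equals a_star
    best = -math.inf
    for k in range(grid - 1):                         # efforts in [0, a_star)
        a = a_star * k / (grid - 1)
        x = a + (n - 1) * a_star
        eu = sum(p * cara(s, theta) for s, p in lottery(x, w, n))
        best = max(best, eu - 0.5 * a ** 2)
    return stay - best


if __name__ == "__main__":
    for w in (0.0, 0.5, 1.0):
        print(w, gap(scapegoat_lottery, w, 1.0), gap(massacre_lottery, w, 1.0))
```

With three agents, risk aversion 1, and a liability bound of 0.5, the scapegoat condition fails in this example while the massacre condition holds. With two agents the two lotteries coincide, so the gaps are equal.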


    5.   Concluding remarks  

  We have seen that for the multiagent team, it is easier to find a first-best contract if agents are risk averse than if they are risk neutral. Although no budget-balancing first-best contract exists when agents are risk neutral, when they are risk averse such a contract does exist. The contract is similar to the non-budget-balancing contract that Holmstrom suggests for risk-neutral agents, because team output less than the efficient level triggers a punishment, but with risk-averse agents the punishment can take the form of a lottery rather than of the destruction of the output. If agents are sufficiently risk averse, or the lottery is sufficiently risky, each agent is unwilling to deviate from the efficient effort level, given that he believes his fellow agents are choosing the efficient level. The deviation lottery can take a number of forms, including the scapegoat lottery, in which one agent is punished and the others take his share, and the massacre lottery, in which one agent is rewarded by being granted the shares of all the others. Under some parameter values any of the deviation lotteries performs equally well, but the massacre lottery is able to attain the first best for less risk-averse agents or more tightly bounded punishments.

  References   

   BERNHEIM, B. AND WHINSTON, M. "Common Agency." Econometrica, Vol. 54 (1986), pp. 923-942.

   HOLMSTROM, B. "Moral Hazard in Teams." Bell Journal of Economics, Vol. 13 (1982), pp. 324-340.

   NALEBUFF, B. AND STIGLITZ, J. "Prizes and Incentives: Towards a General Theory of Compensation and Competition." Bell Journal of Economics, Vol. 14 (1983), pp. 21-43.

   ROTHSCHILD, M. AND STIGLITZ, J. "Increasing Risk I: A Definition." Journal of Economic Theory, Vol. 2 (1970), pp. 225-243.

   SULTAN, R. Pricing in the Electrical Oligopoly. Vol. 1. Cambridge: Harvard University Press, 1974.


   6 The massacre contract is comparable to rewarding the winner of a tournament and the scapegoat contract is comparable to punishing the loser. Nalebuff and Stiglitz (1983) find that in tournaments with many players and production uncertainty, a punishment for the loser is superior to a prize for the winner. Their explanation is based on the combination of production uncertainty and a large number of players in their model, and does not rely on risk aversion. Tournaments are different from teams, because in tournaments the punishment is actually inflicted, and its direct disutility must be balanced against the effort it induces.