A simple model of herd behavior


2023年12月28日发(作者:卫星云图天气预报)

A Simple Model of Herd BehaviorAuthor(s): Abhijit V. BanerjeeSource:

The Quarterly Journal of Economics,

Vol. 107, No. 3, (Aug., 1992), pp. 797-817Published by: The MIT PressStable URL: /stable/2118364Accessed: 27/05/2008 02:04Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at/page/info/about/policies/. JSTOR's Terms and Conditions of Use provides, in part, that unlessyou have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and youmay use content in the JSTOR archive only for your personal, non-commercial contact the publisher regarding any further use of this work. Publisher contact information may be obtained at/action/showPublisher?publisherCode= copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printedpage of such is a not-for-profit organization founded in 1995 to build trusted digital archives for scholarship. We enable thescholarly community to preserve their work and the materials they rely upon, and to build a common research platform thatpromotes the discovery and use of these resources. For more information about JSTOR, please contact support@://

THE

QUARTERLY JOURNAL

OF

Vol. CVII

ECONOMICS

August 1992

Issue 3

A SIMPLE MODEL OF HERD BEHAVIOR*

ABHIJIT

V.

BANERJEE

We analyze a sequential decision model in which each decision maker looks at

the decisions made by previous decision makers in taking her own decision. This is

rational for her because these other decision makers may have some information

that is important for her. We then show that the decision rules that are chosen by

optimizing individuals will be characterized by herd behavior; i.e., people will be

doing what others are doing rather than using their information. We then show that

the resulting equilibrium is inefficient.

I. INTRODUCTION

There are innumerable social and economic situations

in

which we are influenced in our decision making by what others

around us are doing. Perhaps the commonest examples are from

everyday life: we often decide on what stores and restaurants to

patronize or what schools to attend on the basis

of how

popular

they seem to be. But it has been suggested by Keynes [1936], for

example, that this is also

how investors in asset markets often

behave (the famous "beauty contest" example).' In the literature

on fertility choices it has frequently been suggested that various

fertility decisions (how many children to have, whether or not to

use contraception, etc.) are heavily influenced by what other people

in the same area are doing.2 It has also been suggested that the

same kind of factor also influences the decision to adopt new

*I thank Aniruddha

Dasgupta, Mathias

Dewatripont,

Bob

Gibbons,

Tim

Guinnane, Sandy Korenman, Eric Maskin, Andreu Mas-Colell, Barry

Nalebuff,

Klaus Nehring, Avner Shaked, Lin Zhou, and two anonymous referees for helpful

comments and suggestions.

1. See Scharfstein and Stein [1990] for some evidence suggesting that this is

indeed how managers behave.

2. See Cotts Watkins [1990].

?)

1992 by the President and Fellows of Harvard College and the Massachusetts

Institute of

Technology.

The Quarterly Journal of Economics, August 1992

798 QUARTERLY JOURNAL OF ECONOMICS

technologies.3 Voters are known to be influenced by opinion polls to

vote in the direction that the poll predicts will win; this is another

instance of going with the flow.4 The same kind of influence is also

at work when, for example, academic researchers choose to work

on a topic that is currently "hot."

The aim of this paper is to develop a simple model in which we

can study the rationale behind this kind of decision making as well

as its implications. We set up a model in which paying heed to what

everyone else is doing is rational because their decisions may reflect

information that they have and we do not. It then turns out that a

likely consequence of people trying to use this information is what

we call herd behavior-everyone

doing what everyone else is doing,

even when their private information

suggests

doing something

quite different.

But this suggests

that the very act of trying to use the

information contained in the decisions made by others makes each

person's decision less responsive to her own information and hence

less informative to others. Indeed, we find that in equilibrium the

reduction of informativeness

may be so severe that in an ex ante

welfare sense society may actually be better off by constraining

some of the people to use only their own information.

A common real world

example may make our basic argument

clearer.5 Most of us have been in a situation where we have to

choose between

two restaurants that are both more or less

unknown

to us. Consider now a situation

where there is a

population of 100 people who are all facing such a choice.

There are two restaurants A and B that are next to each other,

and it is known that the prior

probabilities are

51

percent

for

restaurant A being the better and 49 percent for restaurant B being

better. People arrive at the restaurants in

sequence, observe the

choices made by the people before

them, and decide on one or the

other of the restaurants. Apart from knowing the prior probabili-

ties, each of these people also got a signal which says either that A

is better or that B is better (of course the signal could be

wrong). It

is also assumed that each person's signal is of the same quality.

Suppose that of the 100 people, 99 have received signals that B

is better but the one person whose signal favors A gets to choose

first. Clearly, the first person will go to A. The second person will

now know that the first person had a signal that favored

A, while

3. See, for example, Kislev and Shchori-Bachrach [1973].

4. See, for example, Cukierman

[1989].

5. I am indebted to an anonymous referee for this very transparent

example.

A SIMPLE MODEL OF HERD BEHAVIOR

799

her own signal favors B. Since the signals are of equal quality, they

effectively cancel out, and the rational choice is to go by the prior

probabilities and go to A.

The second person thus chooses A regardless of her signal.

Her choice therefore provides no new information

to the next

person in line: the third person's situation is thus exactly the same

as that of the second person, and she should make the same choice

and so on. Everyone ends up at restaurant A even

if, given the

aggregate information, it is practically certain that B is better.

To see what went wrong, notice that if instead the second

person had been someone who always followed her own signal, the

third person would have known that the second

person's signal had

favored B. The third person would then have chosen

B,

and so

would have everybody else.

The second person's decision to ignore her own information

and join the herd therefore inflicts a

negative externality on the

rest of the population. If she had used her own information, her

decision would have provided information to the rest of the

population, which would have encouraged them to use their own

information as well. As it is, they all join the herd.

The identification of this externality,

which we call "herd

externality," and the investigation of what it implies, is the main

contribution

of this

paper. The model we present is extremely

simple and does not aspire to capture any specific institutional

detail. There is a set of options represented by a line

segment, and

within this set there is one correct

option. The aim of the game is to

find the correct

option.

All those who find the

correct option get z,

while all others get 0. There is a population of N

people who take

their decisions in a fixed order; each person

moves knowing the

choices made by those before her but not the

information these

choices were based on. Each individual

may either be uninformed,

in which case she has no

signal, or informed, in which case she has

a signal about what the

right option is. This signal, however, may

not be correct. It is correct only with

probability 1; otherwise it is

completely uninformative.

Everybody is rational in the Bayesian

sense,

and the

equilibrium

we look at is a Bayesian-Nash

equilibriums

Because of the extreme simplicity of this model, using quite

6. This formulation is somewhat more

complicated and somewhat less natural

than the two-restaurant

setting discussed above, but it turns out that deriving a

general

result about the

equilibrium decision rule which holds for all parameter

values is actually simpler in this

setting.

800

QUARTERLY JOURNAL OF ECONOMICS

elementary arguments we are able to derive a number of rather

striking results that derive directly from the presence of the herd

externality. These results are summarized below.

1. The equilibrium pattern of choices may be

(and for a large

enough population, will be) inefficient in the ex ante welfare sense.

Before people know the order in which they are

choosing, they may

all agree to prevent the first few decision makers from

observing

the choices made by anyone else. This is a direct

consequence of the

herd externality and suggests why herd behavior

may be undesir-

able from the social point of view.

2. The probability that no one in the population

chooses the

correct option is bounded

away

from zero for

any size of the

population. Indeed, by making the

probability

3

small, we can

make this probability as

large as we like. This contrasts with the

case where the decision makers choose without

looking at each

other (i.e., they follow their own

information). Since the informa-

tion they have is

independent, as long as the population is large

enough, someone must choose the right option in this case.

3. Since the herd

externality is of the positive feedback type (if

we join the crowd, we induce others to do the

same), the equilib-

rium pattern of choices will be

very volatile across several plays of

the same game. The

signals (which are partly random and need not

be correct) that the first few decision

makers have will determine

where the first crowd

forms, and from then on, everybody joins the

crowd. This

may shed some light on observations

of "excess

volatility" made in the context of many asset markets7 and the

frequent and apparently unpredictable

changes in fashions.

The emphasis on the herd

externality also distinguishes

our

work from two other

explanations of clustering behavior that have

been

suggested before. One is an explanation based on

strong

complementarities: some things are more worthwhile when

others

are

doing related things.8 Examples of such complementarities are

fashions in

consumption (see, for example, the analysis of fashions

in Karni and Schmeidler

[1989]) and network externalities

in

production (see Arthur [1989], Farrell

and Saloner [1985], and

Katz and

Shapiro [1985]). Whether or not it is as important in

7. The idea that informational externalities may

explain observations of excess

volatility is also discussed in Banerjee [1988].

8. As far as we know, there is actually no formal

model that tries to explain

herd behavior in these

terms;

what exists in the literature is the idea that if

complementarities are sufficiently large, then people will do what

the crowd is doing

even if left to themselves

they would have done something else. Under suitable

conditions this could clearly lead to herd behavior.

A SIMPLE MODEL OF HERD BEHAVIOR

801

other contexts where we observe herd behavior is an open question.

In any case, there is no contradiction between this view and ours;

our point is that many aspects of herd behavior can be explained

quite plausibly

without

invoking

these kinds

of gains

from

association.

A different explanation of herd behavior, which, like the

present work is based on informational

asymmetries,

was sug-

gested in an interesting

recent paper by Scharfstein

and Stein

[1990]. The key difference between their explanation and the one

suggested here is that their explanation is

based on an agency

problem; in their model the agents get rewards for convincing

a

principal that they are right. This distortion

in incentives plays an

important role in generating herd behavior in their model.

By

contrast, in our model agents capture

all of the returns generated

by their choice so that there is no distortion

in incentives.9

In any case, this

approach is not inconsistent

with our

approach. This kind of principal agent problem (trying

to con

someone else into believing that you know something)

seems

common enough, especially in the context of asset markets.

On the

other hand, in many of the other potential instances

of herd

behavior, such as fertility choices, adoption

of innovations, voting

etc., there is no obvious principal agent problem.

It is therefore

useful to establish that inefficient herd behavior

can arise even

when the individuals themselves capture the rewards

from their

decisions.

The strategy of this paper is as follows: the

basic model is

presented in Section

II and analyzed in Section III. The results are

discussed in Section IV. Several extensions

and modifications of the

basic model are presented in Section V. We conclude

in Section VI.

The essential ideas in this paper are generally quite straightfor-

ward. Nonetheless, a fully rigorous treatment

of the results derived

here will certainly be very cumbersome

and repetitive. To prevent

the paper from being too unreadable,

we have omitted some proofs

9. There are two other important differences between the

two models. In the

Scharfstein and Stein model unlike in ours,

all

agents get

a

signal. However, only

some of these signals are potentially informative (though not necessarily correct):

the rest are duds; and the agents cannot distinguish between the

two

types of signals.

In our model this would amount to assuming that agents do not know whether they

have a signal or not. The decision to herd and not use one's signal

is therefore

somewhat less significant (since they actually may not have a signal)

than it is in our

model (where they know they have a signal). On the other hand,

it is possible to

generate instances of herd behavior

in their model even if there are only two agents.

In the model given here the equilibrium is always (second-best) optimal

if there are

two agents.

802

QUARTERLY JOURNAL OF ECONOMICS

and presented others informally. More rigorous proofs

were pre-

sented in an earlier version of this paper

[Banerjee,

1989], which is

available from the author.

II. THE BASIC MODEL

There is a population of agents of size N each of whom

maximizes the identical risk-neutral utility function VNM10

de-

fined on the space of asset returns. For convenience we shall

just

assume that this utility is the same as the monetary

amount

received by the person.

There is a set of assets indexed by numbers in [0,1] . Call the ith

asset a(i). The physical return

to the ith asset to the nth person

investing in that asset is z(i)

E

R. Let us assume that there is a

unique i* such that z(i)

= 0 for all i

?

i* and

z(i*)

=

z, where z

> 0.

This is essentially the assumption that the excess return

on one

asset to the people investing

in it is strictly greater than that on all

other assets.

Of course, everybody, given these payoffs, would want

to invest

in i*. The trouble is no one knows which one it is. We assume

uniform priors so there is not even a likely candidate

for i*.

However, some people have an idea of which one it

might be.

Formally, there is a probability

x

that each person receives a signal

telling her that the true

i* is i'. The signal need not, of course, be

true, and the probability that

it is false is 1 -

P.

If it is false, then

we assume that it is uniformly distributed

on [0,1] and therefore

gives no information about what i* really is.

The decision making in this model is sequential; one person

chosen at random takes her decision first (she

cannot decide to

delay

her decision). The next person, once again chosen at random,

takes her decision next but she is allowed

to observe the choice

made by the previous person and can benefit

from the information

contained in it. However, she is not allowed

to find out whether or

not the person before her actually got a signal.11

The rest of the game proceeds in the same way, with each new

decision maker making her decision

on the basis of the history of

the past decisions and

their own signal if they have one. After

everybody

has made her choice, all the alternatives that have

been

10. It will be evident that for much

of what we

say

it is irrelevant whether we

take our utility function to be

risk neutral or risk averse. The only point where we

need the assumption is when we measure ex

ante welfare.

11. And a fortiori she cannot observe the signal observed by her predecessor.

A SIMPLE MODEL OF HERD BEHAVIOR

803

chosen are tested, and if any of these turn out to work, those who

have chosen it receive their rewards. If no one has chosen an option

that works, the truth remains undiscovered, and no one gets

rewards.

It will be assumed that the structure of the game and Bayesian

rationality are common knowledge. Each person's strategy is a

decision rule that tells us for each possible history what that person

will choose. We are looking for a Bayesian Nash equilibrium in

these strategies. The nature of the equilibrium play, however,

turns out to depend on certain critical tie-breaking assumptions.

Some of these assumptions may be dispensed with by strengthen-

ing the equilibrium concept, but it seems more natural to introduce

these as explicit assumptions. These assumptions are listed below;

the relevance

of these

assumptions

will become clear in the

appropriate context. It should also be possible to see that each of

these assumptions is made to minimize the possibility of herding.

ASSUMPTION A. Whenever a decision maker has no signal and

everyone else has chosen

i =

0,

she always chooses i

= 0.

B. When decision makers are indifferent between

ASSUMPTION

following their own signal and following someone else's choice,

they always follow their own signal.12

ASSUMPTION C. When a decision maker is indifferent between

following more than one of the previous decision makers, she

chooses to follow the one who has the highest value of i.

III.

THE EQUILIBRIuM DECISION RULE

The first decision maker's decision will clearly depend on

whether or not she has a signal. If she has a signal, she will

certainly follow

her

signal. While

if she has no

signal, by

our

Assumption

A she will choose i

=

0. This choice minimizes

12. This assumption is not entirely innocuous.

If the third decision maker

decides to follow the second independently of her informational situation, the fact

that she follows provides no information to the next decision maker. The next

decision maker who has a signal, now faces the same choice; she is indifferent

between following her own signal and joining the second and the third decision

maker. If she too ignores her own signal, the next decision once again has the same

choice and so on. There could, for example, be an equilibrium in which almost

everybody follows either the first decision maker or the first deviant. Since this

involves even more herding than the equilibrium we describe above, its welfare

properties will typically be worse; and in this sense by making the above tie-

breaking assumption, we are choosing to focus

on the best of the set of

possible

equilibria.

804

QUARTERLY JOURNAL OF ECONOMICS

misinformation: the only case where this

will cause confusion is

when i*

= 0, but since this happens with probability 0, we can

ignore this

possibility.13

If the second decision maker has no signal, then she will

of

course imitate the first decision maker and

invest in the same

asset. However, if she has a signal and the

first person has not

chosen i

=

0, she has a problem. She knows that the

first decision

maker had a signal and this signal is as likely to be right

as her own

signal. She is therefore

indifferent between following the first

decision maker's signal and following her own signal.

In this

situation our Assumption

B becomes relevant.'4 By invoking this

assumption, we determine that the second person will,

in this case,

follow her own signal.

The third decision maker can observe four possible

histories:

one or both of her predecessors may have chosen

i = 0, and if

neither of them had chosen i

=

0, they could have still either agreed

or disagreed. If they both chose

i

=

0, the third person should

follow them if she has no signal and follow

her signal otherwise. In

all the other cases, if she does not have a signal, she

should follow

the person who has not chosen i

= 0. If both the others have chosen

i

?

0 but have not agreed with each other,

of course this does not

determine a course of action. Since she is indifferent, however,

we

can invoke our third tie-breaking rule, Assumption C,

which tells

us that she should follow the person

with the highest i.

On the other hand, if the third decision

maker does have a

signal i', she will

follow her own signal, unless both people before

her have chosen the same option and this option

is neither i

=

0 nor

i

=

i'. When both of her predecessors have chosen

i =

0, this is

obvious. When only one of them has

chosen something different

from i

=

0 and

i

=

i' and the other has chosen i

=

0, this is a

consequence of our Assumption

B.

And,

of course, when the third

person's signal matches

the choices made by one or both of

her

predecessors, she must choose to

follow her own signal since this

could not happen unless her signal

was correct.

The last point is much more general than

this specific situa-

tion and deserves to be emphasized.

Whenever some person's

signal matches

the choice made by one of her predecessors,

she

should always follow

her

signal.

This follows from the fact that the

probability that two people should get

the same signal and yet both

13. Of course, there is nothing special about the point

0. The person could just

as well choose some other point i as long as everybody

knows what i is.

14. Once again, this assumption is not entirely innocuous.

A SIMPLE MODEL

OF HERD BEHAVIOR

805

be wrong is zero. To

deal with the

remaining case, we prove the

following simple lemma.

LEMMA

1. If the first and the

second decision

makers have both

chosen the same F

X

0, the third decision

maker should choose

to follow them.

Proof of Lemma 1. Note

that

prob[i*

=

TIH]

=

{a332(1

-

A) + a23(1-

prob[i*

=

i' H]

=

{a2,8(1

-

p)(1-

)(1

-

oa}/prob[H]

o)/prob[H],

where H represents

the event in which

the first two people have

both chosen

I

and the third person

has the signal i'.

Clearly, the first term is greater than the

second. The third

person should therefore choose

i.

Q.E.D.

This has a simple

intuitive

explanation:

the third

person

knows that the first

person must have a signal,

since otherwise she

would have chosen i

=

0. The first person's choice

is therefore at

least as good as the

third person's signal.

Further, the first person

has someone who has

followed her. This is some

extra support for

the first person's choice, since it is

more likely to

happen when the

first person is right than when

she is wrong. It is therefore

always

better to follow the first

person.

The same intuition

tells us what should

happen in any

situation when several

options other than i

=

0 have been chosen

but only one of them

has been chosen by two

people. Assume that

the next person does not have a

signal that matches

any of the

options that have

already been chosen (if she has

a signal that

matches someone else's

choice, she should, of

course, follow her

own signal). In the situation

where this option is not the

one with

the highest i, it is of course

clear (by our

Assumption C) that both

those people must have the

same signal and therefore

they must be

right. In the situation

where it is the one who has the

highest i, the

argument in the previous

paragraph applies, and therefore once

again this option is the best. In

either case,

therefore, the next

person should choose the

option that has been chosen

by

two

people.

Once one option has been chosen

by two people, the next

person

should

always

follow that

option

unless her

signal

matches

one of the options that have

been already

chosen;

in that

case she

806

QUARTERLY JOURNAL OF ECONOMICS

should follow her own signal. A combination of this decision

rule

with Assumption C tells us that the next person will observe

one of

three alternative histories.

i) One option (other than i = 0) has been chosen by

more than

one person, and this is the one that has the

highest i.

ii) One option (other than i

=

0) has been chosen by more than

one person, and this is not the one with the

highest i.

iii) Two options (other than i

=

0) have been chosen by more

than one person, one of which is the one with the highest i.

In the second and the third scenarios, it is clear that the

option

which is not the highest value of i is the correct option, and all

subsequent decision makers should choose it. The argument in the

first case is very similar to the argument for Lemma 1. The

next

decision maker should decide to follow the option that has

already

been chosen by more than one person.

The same argument can now be extended to all

subsequent

decision makers. This yields the following

proposition

which

summarizes all the

arguments

we make

above.15

PROPOSITION

1. Under

Assumptions

A, B, and C, the unique

(Nash) equilibrium decision rule that everyone will adopt is

decision rule D16

given below.

1. The first decision maker follows her

signal if she has one

and chooses i

=

0 otherwise.

2. For k

>

1, if

the kth decision maker has a signal, she will

choose to follow her own signal either if and

only

if

(a) holds or

if (a) does not hold, (b) holds, where (a) and

(b) are given below.

(a) Her signal matches some option that has

already

been

chosen.

(b) No option other than i

=

0 has been chosen

by more than

one person.

3. Assume that the kth decision maker has a

signal. If any

option (among those already chosen) other than the one with

15. A formal proof is available from the author on request.

16. Actually this is not the full decision rule. Technically we have to

specify the

optimal decision in other parts of the decision tree. In

particular, we have

to

specify

the decision rule in the following scenario: one

person finds that his option matches

someone else's and chooses that

option.

In the

equilibrium specified it

is known that

he would not choose that option unless his signal matches that option.

Therefore,

everybody else follows

him

expecting that

no one else will deviate.

However,

now

someone else deviates and chooses another of the

options that

have

already

been

chosen.

It is true that this can happen only with

probability zero, but when it

happens,

we need to say what the subsequent decision makers will choose.

However, it turns

out that agents are

actually indifferent between sticking to the original path and

deviating to the new one, and we can make any assumption we like.

A SIMPLE MODEL OF HERD BEHAVIOR

807

the highest i has been chosen by more than one person, the kth

decision maker will choose this option, unless

her signal

matches one of the other options that has already been chosen.

In this case she chooses the latter option.

4. Assume that the kth decision maker has a signal. If the

option with the highest i (among those already chosen) has

been chosen by more than one person and no other option

(except i

=

0) has been chosen by more than one person, she

will choose this option unless her signal matches one of the

options already chosen. In this case she chooses the latter

option.

5. Assume that the kth decision maker does not have

a signal.

Then she will choose i

=

0 if

and only if that is what everyone

else has chosen. Otherwise, she chooses the option with the

highest value of i that has already been chosen unless one of

the other options (excluding i

=

0) has been chosen by more

than one person. In this case, she chooses the latter

option.

The only part of the statement of this proposition that

is not

explained above is the uniqueness. Because each person's payoff is

completely independent of the choices made by everyone coming

after her in the decision process, there are no

strategic elements

here, and we can solve this game by moving forward in the game

tree. The uniqueness

of the solution is therefore

automatically

guaranteed.

The equilibrium decision rule as stated above is

somewhat

complicated and potentially confusing. In Figure I the decision rule

for decision maker k

(for k

>

2) is presented in a schematic form

that may be easier to follow.

IV. DISCUSSION OF RESULTS

A. Description of the Equilibrium

The equilibrium decision rule in the above model is character-

ized by extensive herding; agents abandon their own signals and

follow others even when

they are not really sure that the other

person is right. The first person always follows her own signal if

she has

one,

and so does the second

person,

but we cannot

guarantee that even the third person follows her own signal. If the

first person chooses i

?

0 and the second person follows her, the

third person will always follow them. All

subsequent

decision

makers will also choose the same

option.

808 QUARTERLY JOURNAL OF ECONOMICS

Has no signal

Has

a signal

ik

signal

Everybody

else has

chosen

i

0

all options

already

chosen

except

i =

have been

chosen by

1 person

only one

option

other than

i = 0 has

been

chosen

by more

than

1 person

two

options

other than

i = 0 have

been chosen

by more than

1 person

some

other

person has

chosen

i =

ik

no one

else

has

chosen

i =

ik

and

no option

other than

i = 0 has

been

chosen

by

more

than

1

person

no other

person

has

chosen

i =

ik

but one

option other

than i = 0

has been

chosen

hy

more

than

1

person

no other

person

has

chosen

=k

but

two

options

other than

0 have

history

heen

chosen

hy

more

than

1

person

choose

i

= 0

choose the

highest

of

the

options

already

chosen

choose

that

option

choose the

lower

of

the

two

options

choose

i =

ik

choose

i =

ik

choose

that

option

choose

the lower

of the

two

options

choice

FIGURE I

The kth decision maker's choice problem (k

>

2)

Herding can also happen when the first and second person,

and for that matter the third and fourth person, choose different

options. After k different options have been chosen,

for

any positive

k,

if the next decision maker does not have a signal, she will choose

the option with the highest value of i (among those already

chosen). Following this, all subsequent decision makers

will choose

the same option unless one of their signals matches one of the

options already chosen. This can happen only if the correct option

has already been chosen. So, there will be herding at an incorrect

option unless the first decision maker to have a signal

or someone

coming after

her but before the first subsequent decision maker

without a signal, made the correct choice.

We can actually calculate the expression for the probability

that no one in the population chooses the right option, however

large the population. A simple calculation establishes

that this

probability is

[1

-

(1

-

)]-1(1

-

&(1

-

A).

This probability is clearly decreasing in both

a

and

3

which

makes intuitive sense. Further, if

3

is sufficiently

small, this

probability will be very close to one.

To see why this is worth a remark, note that if all the decision

makers took their decisions without observing the choices made by

others, some people will always end up choosing the correct option

(in fact, for a large enough population,

the proportion of the

A SIMPLE MODEL OF HERD BEHAVIOR

809

population who will choose the correct option will almost certainly

be close to

(up).

It is also important to note that just the fact that people

observe the decisions made by others does not guarantee that there

will be herd behavior. Consider, for example, the following modi-

fied version of the Normal Learning Model: there is a Normal

(Gaussian) distribution with known variance but unknown mean.

The distribution of the mean is known. In each period a different

agent gets a signal that is a random drawing from this distribution.

The agent then chooses her best guess for the mean of the

distribution (he is minimizing a loss function that is quadratic

in

distance from the mean) given

her signal and the choices made by

her predecessors. It is quite easy to show that this sequence of

choices converges to the mean for almost every sequence of signals.

Therefore, the result we get contrasts sharply with the result from

this superficially quite similar model.

The key reason why we get a different result is that in our

model the choices made by agents

are not always sufficient

statistics for the information they have.

If the choices are always

sufficient statistics, future agents always know what information

their predecessors had acted upon, and therefore there is no herd

externality and no inefficiency. It is when the choices made by some

agents affect the information

that subsequent decision makers

have that there is a potential for herd externality.

The fact that the choices in our model do not have this

sufficient statistic property clearly has to do with how we specify

the payoffs. It is evident that, for a wide range of signals, the agents

in our model will always choose the same option; this lack of

invertibility is what causes the sufficient property to fail. We

conjecture that typically

whenever the space of choices and the

space

of

signals

are of

comparable

dimension and the

payoff

function is continuous, we shall have this kind of invertibility. So,

for example, in the model presented above, we are more likely to

have invertibility if the agents could (and would want to) vary the

size of their investment

in different informational

settings.

On the

other hand, there are many real reasons why it may not be possible

to vary one's choice enough to register all the information one

has-machines,

for example, come in only a small number of sizes.

B. Welfare Properties

The welfare question here is motivated by the herd external-

ity. While joining the herd is optimal for the current agent given

810

QUARTERLY JOURNAL OF ECONOMICS

the play of the game, it reduces the chance that future agents may

discover the truth. So if we consider an ex ante measure of welfare,

imagining that in some primeval state before the game begins all

agents have an equal chance of being at any position in the

sequence of arrivals, we may be able to show that social welfare is

lower in the equilibrium we have described than in other plays of

the game.

This is in fact what we find. Consider another play of the game

in which agents follow the decision rule D* given below.

1. If an agent has a signal, she follows that signal, unless

someone before her has already followed someone else. In that case

she follows suit.

2. If an agent does not have a signal, she picks some option that

has not been picked by anyone else, unless someone before her has

already followed someone else.

In that case she follows suit.

This play of the game is set up to ensure that the right choice

always gets revealed as long as there are enough people

in the

population. With a very large population it is easy to see

that this

would mean that an arbitrarily large fraction of the population will

always make the right choice. Formally, the probability that at

least two people have not received the true signal by the time we get

to the nth person is

1

-

(1

-

ap)n

1

-

(n

-

1)(1

-

z43)n-2o43.

For any

e>

0,

it is easy to see that we can choose an

n(E)

large

enough that this probability is at least

1

-

e.

Now this tells us that

a lower bound for the ex ante expected utility for agents following

this rule is

z[N

-

n(E)](1-

E)-N,

where N is the size of the population. Notice that by making N

large we can make this arbitrarily close to N(1

-

e).

By contrast, the probability that no one will discover the right

choice in the herding equilibrium we described before is

H

[1

-

OL1

-

1)]-1(1

-

&)(1

-

1),

and therefore the ex ante

expected utility is bounded above by

zN[1

-

H].

Since we can choose

e

to be as small as we would like to

by making

A SIMPLE MODEL OF HERD

BEHAVIOR

811

N large, it is easy to see that we can make the

above expression

larger than this expression. There is at least

one decision rule that

for large enough N does better than the

equilibrium we described.

It may be objected here that

we have simply described a

strategy and not explained how it will be

implemented and the

equilibrium may yet be constrained Pareto

optimal. However, this

is not true. This decision rule may be

implemented by using a

number of different incentive schemes.17 One

way that works is to

punish heavily anybody who is a

follower at any option that turns

out to be the incorrect one, while

equally rewarding everybody who

chooses the right option. Given these

rewards, no one will choose to

be a follower unless

they were absolutely sure that the option that

they were choosing was the right one. But this

is, of course, exactly

what we want.

Indeed, even if D* cannot be

implemented,

some of its

advantages can be captured simply by not

allowing the first n

agents to observe anybody else's choice when

they are making their

own choice. The rest of the

population is then allowed to choose

sequentially, with each person observing the choices made

by all

her predecessors.

The first n people will, of course,

choose to follow their own

signals

if

they have one and choose at random if

they have no

signal. The only way that more than one of

these people will choose

the same

option is if they are both right.

Thus, as long as at least

two of the first n

people have chosen the same

option, the rest of

the population will realize that

this must be the correct

option and

choose it.

But an argument exactly

paralleling the one given above in the

case of D* can now be

used to establish that for a large

enough

population, by choosing n suitably, we can

make the fraction of the

population who do not choose the correct

option arbitrarily small.

In other

words, in terms of ex ante welfare, the

economy may be

better off if the early decision

makers are not allowed to observe the

choices made

by the other decision makers than in our

original

equilibrium.

In other

words, destroying information (in this lim-

ited sense), can be

socially beneficial.18

17. This question is

discussed in more detail in an earlier version of

this paper

which is available from the author.

18. This is, of course,

only in terms of ex ante welfare; in terms of

ex post

welfare this is

certainly not true.

812

QUARTERLY JOURNAL OF ECONOMICS

V. EXTENSIONS AND MODIFICATIONS

A. Alternative Payoff Structures

In the model we analyzed in the last section,

everyone who

chooses the right outcome gets the same reward irrespective of how

many others chose this option before and after them. This may be

approximately the correct model of the rewards we get for choosing

the right restaurant, but in many other real world examples the

rewards will depend on the number of people who have chosen this

option and our rank among them. The total amount of the reward

may be fixed, or at least it may not increase as fast as the number of

people who choose the correct option. And in many instances there

are extra rewards for

being first or second to choose the correct

option.

To the extent that we have ignored these

possibilities, our

analysis may overstate the extent to which there will be herding.

Both of these possibilities suggest that it pays more to choose the

correct option when most others have chosen

something else. This

kind of reward for originality clearly discourages herd behavior.

To make this idea precise, consider a model

that is identical to

the previous one except that in this case the first person to choose

the right option gets a bigger output than all the

others, who are all

assumed to get the same amount. A real world institution where

exactly these rewards are not too implausible is academia; the

emphasis is on being first to do something. If you are not first, the

rank does not matter very much.

A little reflection should

persuade the reader that this model is

not really that different from our basic model. It is true that the

relative incentives for being first versus being second are greater

now, which should discourage agents from choosing to be second

but people without signals and people with

signals that exactly

match that of someone else, will still be followers. Consequently, it

is possible that the number of people behind someone becomes so

large and the evidence of their numbers

so convincing,

that

informed agents will decide to ignore their information and the

attraction of the large first prize and join the herd.

Formally, we have Proposition

2.

2. If the return to the first

person choosing

the

right

PROPOSITION

option

is z1 and the return to

everybody

else who chooses that

option

is

Z2,

and 0

<

Z2

/Z1

<

1,

then the

unique equilibrium

A SIMPLE MODEL OF HERD BEHAVIOR

813

decision rule19 under Assumptions A,

B,

and C, is described by

an integer k * > 0 and the following rules.

1. The first decision maker follows her signal if she has one

and chooses i

=

0 otherwise.

2. For k > 1, if the kth decision maker has a signal, she will

choose to follow her own signal either if and only if (a) holds or

if (a) does not hold, (b) holds, or if (b) does not hold, (c) holds,

where (a), (b), and (c) are given below.

(a) Her signal matches some option that has already been

chosen.

(b) No option other than i

=

0 has been chosen by more than

one person.

(c) The option with the highest

i (among those

already

chosen) has not been chosen by more than k* people for some

k*

>

1,

and no other option other than i

=

0 has been chosen

by more than one person.

3. Assume that the kth decision maker has a signal. If any

option (among those already chosen) other than

the one with

the highest i has been chosen by more than one person, the kth

decision maker will choose this option, unless

her

signal

matches one of the other options that have already been

chosen. In this case she chooses the latter option.

4. Assume that the kth decision maker has a signal. If the

option with the highest i (among those already chosen) has

been chosen by more than k* people and no other option

(except i

=

0) has been chosen by more than one person, she

will choose this option unless her signal matches one of the

options already chosen.

In this case she chooses the latter

option.

5. Assume that the kth decision maker does not have a signal.

Then she will choose i

=

0 if and only if that is what everyone

else has chosen. Otherwise she chooses the option with the

highest value of i that has already been chosen unless one of

the other options (excluding

i

=

0) has been chosen by more

than one person. In this case she chooses the latter option.

The proof of this result is omitted since it is a straightforward

extension of arguments used in Section III. In fact, it should be

evident that the result in Section III is a special case of this result

19. Once again we have not actually provided the complete decision rule

here;

for this would need to say what happens at all the unreached information sets.

However, this can be done in a straightforward way.

814

QUARTERLY JOURNAL OF ECONOMICS

for the case where

Z2

=

zj.

What is striking about that special case is

that k*, which we leave as an undetermined integer in this result,

is

one in that special case.

What is more,

if

Z2/Zl

is

not much smaller than one, it is easy to

see that the value of k* will still be one. As

Z2/Zl

goes to zero, k* of

course increases, but at least for values of

Z2/z1

relatively close to

one, there will be

a substantial degree of herding. Therefore, some

of the basic welfare intuitions are going to be quite similar to the

previous case. For N large enough, it will still be true that

the

decision rule

D* will

do better than this rule though the margin of

gain will be less substantial.

While this exercise suggests that the results we got from our

basic model are robust, it also suggests a criticism

of our approach.

What we have done is to take an exogenously given payoff structure

and then argue that as long as the payoff structure falls within

a

certain class we will get socially inefficient herding.

It may be

argued that this is misleading because

if the social costs of herding

were large enough, they would automatically

bring into place

mechanisms

that will modify the payoff structures and reduce

herding. In fact, a message of

the first part of this section is that

one can always reduce and even eliminate herding by having very

high rewards for originality. One might then take

the presence of

the institution of patent laws (which rewards being first)

as proof

that society can always find ways of preventing

inefficient herding.

In our opinion, however, this would be going

too far. We feel

that in many of the cases

we consider there are substantial

informational and transactions costs as

well as institutional

con-

straints which prevent the use

of the

appropriate

incentives to

eliminate herding. In some cases,

like the restaurant

example

we

considered

in the

introduction,

it is difficult to think of how one

could put any incentive scheme

into place. But even in cases where

there is a mechanism for rewarding originality,

as

patent protec-

tion for example, the degree

to which the incentives can be changed

may be quite limited.

Anecdotal evidence suggests,

for

example,

that the amount of protection patents provide

varies a lot from case

to case.

If decreasing returns (average payoffs decline

as the number of

people

who choose it increases) tends to reduce herding,

one would

expect increasing returns,

which rewards

doing

what a lot of others

are doing, to increase the tendency

to herd. This is indeed what we

find. In an earlier version of

this

paper (which

is available from the

author),

we

present

a detailed

analysis

of this case which shows

A SIMPLE MODEL OF HERD BEHAVIOR

815

among other things that if the increasing returns are very strong,

the unique equilibrium decision rule chosen by backward

program-

ming in the game tree involves everyone choosing the same option.

B. Alternative Information Structures

Another possible criticism of our basic model is that it is based

on rather demanding informational assumptions. We assumed that

each person knows the entire history of choices made

by people

before her. This is clearly a strong assumption that

may be valid

in

some cases (like academic research) but not in others.

Examples

where one would like to make a weaker assumption include the

restaurant example suggested in the introduction. In that case we

can usually observe how many people have chosen each restaurant

but not in which order they have made these choices.

However, inspection of the equilibrium decision rules suggests

that in fact they do not make use of

any information about the

order of choice so that the same results

apply as long as the

distribution of choices is observable. It should be noted, however,

that this is possible only under an

assumption like our Assumption

C, which is specified

in terms of a location

("choose the highest

among the options already chosen") rather than in terms of some

order information

("follow

the last decision

maker").

C. Endogenizing the Order of Choice

In this model we have assumed that the order of

choice is

exogenously

fixed: a more natural assumption

(but one which

complicates matters immensely) is to assume that choose when to

move taking into account the fact that waiting is

costly. When this

cost is high, we shall have a game that is similar to the one we

analyzed; when it is low, however, a new and interesting

set of

possibilities arises. A key question here is whether, if the waiting

costs are low enough, all the agents with

signals choose before all

the agents without

signals and whether this results in an efficient

outcome. It turns

out, somewhat surprisingly perhaps, that it is

possible to have situations with low waiting costs where some of

the uninformed move before some of the

informed and the outcome

is inefficient.20 However, the overall analysis is rather complicated,

and we do not

yet have very precise ideas about what happens in

the

general

case.

20. To see the intuition behind this

result, note that it is the marginal and not

the absolute value of information that matters in the decision of

whether or not to

wait.

816

QUARTERLY JOURNAL OF ECONOMICS

VI. CONCLUSIONS

We conclude with remarks on deficiencies of our model and

possible directions of research.

The most serious departure of our model from reality is

probably our assumption that signals to the agents are essentially

free; a more realistic analysis would combine the question

of

incentives for obtaining these signals with the kinds of consider-

ations we discuss. However, it would seem that dropping this

assumption will encourage people to try to "free ride" on other

people's ideas and this would only exacerbate the herding problem.

In this direction, at least, our results seem robust.

Our assumption that there is a continuum

of options and

payoffs exhibit a discontinuity at the true value, is defensible but

somewhat

unorthodox.

Notice, however, that this assumption

would be quite standard if there were only a large but finite

number of options (then there would no discontinuity, for exam-

ple). Preliminary investigations

show that the results we get for

the case where there are a large but finite number of options are

much more complicated but quite similar. We therefore

feel

justified in working with this much more tractable model which we

see as an approximation to the other case.

However, it may still be objected that what is missing from our

model is the fact that options which are in some sense close to the

true option are often better than the other

options. By assuming

that all options other than the right one get the same return, we

have not allowed for this possibility. This is clearly an important

direction for future research.

We have implicitly assumed that the

agents

in this model

cannot actually trade in signals. This

may

be

partly due to the

problem of enforcing contracts describing the exchange of an idea,

and partly due to the presence of transactions costs. Also, if there

are some (perhaps very small) gains to having others choose what

you are choosing, everybody will have the incentive to claim that

they had

a

signal and

that it matched the

option they had chosen.

However, since

the absence of this kind of trade

has serious welfare

consequences,

it is

probably worthwhile to examine this assump-

tion more

closely.

Also, since people gain information by choosing later than

others, there may be strategic aspects of timing that may be worth

investigating. Some current work by the author attempts to extend

this analysis in this direction.

A SIMPLE MODEL OF HERD BEHAVIOR

817

The assumption

that there are only two types of decision

makers-those who have a signal and those who

do not-though

quite strong, can be easily relaxed to allow for

decision makers with

signals of different quality. As long as there are

only a few different

types whose signals differ substantially in

quality, the results are

quite similar.

PRINCETON

UNIVERSITY

REFERENCES

Arthur, W. B., "Competing

Technologies,

Increasing Returns, and Lock-In by

Historical Events," Economic Journal, XCIX

(1989), 116-3 1.

Banerjee, A. V., "The Economics of

Rumors," mimeo, 1992 (revised version).

, "Herd Behavior and the Rewards

for Originality," mimeo, Princeton

Univer-

sity, 1989.

Cotts Watkins, S., "From Local

to National Communities: The Transformation

of

Demographic Regions in Western Europe

1870-1960," Population and Devel-

opment Review, XVI (1990), 241-72.

Cukierman, A., "Asymmetric Information and the

Electoral Momentum of Public

Opinion Polls," mimeo, Princeton

University, 1989.

Farrell, J., and G. Saloner,

"Standardization, Compatibility and Innovation,"

Rand

Journal of Economics, XVI

(1985), 70-83.

Karni, E., and D. Schmeidler, "Fixed

Preferences and Changing Tastes,"

mimeo,

1989.

Katz, M., and C. Shapiro, "Network

Externalities, Competition and Compatibility,"

American Economic Review, LXXV

(1985), 424-40.

Keynes, J. M., The General Theory of

Employment, Interest and Money (London:

Macmillan, 1936).

Kislev, Y., and V. Shchori-Bachrach,

"The Process of an Innovation

Cycle,"

American Journal of Agricultural

Economics, LV (1973), 28-37.

Scharfstein, D., and J. Stein, "Herd Behavior and

Investment," American Eco-

nomic Review, LXXX

(1990), 465-79.


本文发布于:2024-09-22 20:32:52,感谢您对本站的认可!

本文链接:https://www.17tex.com/fanyi/40235.html

版权声明:本站内容均来自互联网,仅供演示用,请勿用于商业和其他非法用途。如果侵犯了您的权益请与我们联系,我们将在24小时内删除。

标签:天气预报   作者
留言与评论(共有 0 条评论)
   
验证码:
Copyright ©2019-2024 Comsenz Inc.Powered by © 易纺专利技术学习网 豫ICP备2022007602号 豫公网安备41160202000603 站长QQ:729038198 关于我们 投诉建议