Part of the Politics and Economics series |
Electoral systems |
---|
![]() |
![]() ![]() |
Arrow's impossibility theorem is a key result in social choice showing that no rank-order method for collective decision-making can behave rationally or coherently. [1] Specifically, any such rule violates independence of irrelevant alternatives, the principle that a choice between and should not depend on the quality of a third, unrelated option . [2] [3]
The result is most often cited in election science and voting theory, where is called a spoiler candidate. [4] In this context, Arrow's theorem can be restated as showing that no ranked voting rule [note 1] can eliminate the spoiler effect. [5] [6] [7]
Some methods are more susceptible to spoilers than others. Plurality and instant-runoff in particular are highly sensitive to spoilers, [8] [4] often manufacturing them even in situations where they are not forced. [9] [10] By contrast, Condorcet methods minimize the possibility of spoilers. [11] In other words, a ranked voting system can always be made limiting them to rare [12] [13] situations called Condorcet paradoxes. [9] As a result, the practical consequences of the theorem are debatable, with Arrow noting " Most [ranked] systems are not going to work badly all of the time. All I proved is that all can work badly at times." [6] [14]
Rated methods are not affected by Arrow's theorem or IIA failures. [5] [7] [15] Arrow initially asserted the information provided by these systems was meaningless, [16] and therefore could not be used to prevent paradoxes, leading him to overlook them. However, he and other authors [17] would later recognize this to have been a mistake, [18] with Arrow admitting systems based on cardinal utility (such as score and approval voting) are not subject to his theorem. [19] [20] [21]
Arrow's theorem falls under the branch of welfare economics and ethics called social choice theory. This field deals with aggregating preferences and beliefs to make fair or accurate decisions. [17] The goal of social choice is to identify a social choice function—a procedure that determines which of two outcomes or options is better, according to all members of a society—that satisfies the properties of rational behavior. [1]
Such a social ordering function can be any way to aggregate information or preferences from a group; this procedure can be a market, a voting system, a country's constitution, or even a moral or ethical framework. [22] Arrow's theorem therefore generalizes the voting paradox discovered earlier by Condorcet, proving such paradoxes exist for any possible collective decision-making procedure that relies only on orderings of different options. [22]
In social choice theory, preferences are modeled using binary relations. If A and B are different candidates or alternatives, then the notation denotes that A is preferred to B. Preferences are required to be transitive, meaning that for every three candidates A, B, and C, if and , then , and complete, meaning that for every two candidates A and B, at least one of or must be true. If both or are true, then the preference is said to be indifferent between A and B.
The idea of a social choice function, or of a ranked-choice voting system, is to take the preferences of everyone in society, and to combine them all to get one preference that represents the preferences of society. This is modeled in Arrow's theorem as a mathematical function taking in a list of preferences and outputting one preference.
Arrow's theorem is named after economist and Nobel laureate Kenneth Arrow, who demonstrated it in his doctoral thesis and popularized it in his 1951 book. [1]
Arrow's work is remembered as much for its pioneering methodology as its direct implications. Arrow's axiomatic approach provided a framework for proving facts about all conceivable social choice rules at once, contrasting with the earlier approach of investigating such rules one by one. [23]
Among the most important axioms of rational choice is independence of irrelevant alternatives, which says that when deciding between A and B, one's opinion about a third option C should not affect their decision. [1] Arrow's theorem shows that for social choice functions, this is not possible without relying on further information, such as rated ballots (rejected by some strict behaviorists). [24]
Arrow's theorem assumes that any reasonable ranked-choice voting system satisfies the principles of:
IIA is sometimes explained by a short joke by philosopher Sidney Morgenbesser:
Arrow's original statement of the theorem included an assumption of monotonicity (positive association), i.e. increasing the rank of an outcome should not make them lose. [27] However, this assumption is not needed or used in his proof, except to derive the weaker Pareto efficiency axiom, and can therefore be dropped. [25] While originally considered an obvious requirement of any practical system, ranked choice-runoff fails monotonicity. [28] Arrow later corrected his statement of the theorem to include runoffs and other voting rules that violate monotonicity. [25] [28]
Condorcet's example is already enough to see the impossibility of a fair ranked voting system, given stronger conditions for fairness than what Arrow's theorem assumes. [29] Suppose we have three candidates, A, B, and C, and that there are three voters with preferences as follows:
Voter | First preference | Second preference | Third preference |
---|---|---|---|
Voter 1 | A | B | C |
Voter 2 | B | C | A |
Voter 3 | C | A | B |
If C is chosen as the winner, it can be argued that any fair voting system would say B should win instead, since two voters (1 and 2) prefer B to C and only one voter (3) prefers C to B. However, by the same argument A is preferred to B, and C is preferred to A, by a margin of two to one on each occasion. Thus, even though each individual voter has consistent preferences, the preferences of society are contradictory: A is preferred over B which is preferred over C which is preferred over A.
Due to this example, some authors credit Condorcet with having given an intuitive argument that presents the core of Arrow's theorem. [29] However, Arrow's theorem is substantially more general and even applies to many "unfair" systems for making decisions, which give some voters more influence than others or favor some candidates. [29]
Let A be a set of alternatives. A preference on A is a complete and transitive binary relation on A (sometimes called a total preorder), that is, a subset R of A × A satisfying:
The element (a, b) being in R is interpreted to mean that alternative a is preferred to alternative b. This situation is often denoted or aRb. Denote the set of all preferences on A by Π(A).
Let N be a positive integer. An ordinal (ranked) social welfare function is a function [30]
which aggregates voters' preferences into a single preference on A. An N- tuple (R1, …, RN) ∈ Π(A)N of voters' preferences is called a preference profile.
Arrow's impossibility theorem: If there are at least three alternatives, then there is no social welfare function satisfying all three of the conditions listed below: [31]
Proof by decisive coalition
|
---|
Arrow's proof used the concept of decisive coalitions. [32] Definition:
Our goal is to prove that the decisive coalition contains only one voter, who controls the outcome—in other words, a dictator. The following proof is a simplification taken from Amartya Sen [33] and Ariel Rubinstein. [34] The simplified proof uses an additional concept:
Thenceforth assume that the social choice system satisfies unrestricted domain, Pareto efficiency, and IIA. Also assume that there are at least 3 distinct outcomes. Field expansion lemma — if a coalition is weakly decisive over for some , then it is decisive. Proof
Let be an outcome distinct from . Claim: is decisive over . Let everyone in vote over . By IIA, changing the votes on does not matter for . So change the votes such that in and and outside of . By Pareto, . By coalition weak-decisiveness over , . Thus . Similarly, is decisive over . By iterating the above two claims (note that decisiveness implies weak-decisiveness), we find that is decisive over all ordered pairs in . Then iterating that, we find that is decisive over all ordered pairs in . Group contraction lemma — If a coalition is decisive, and has size , then it has a proper subset that is also decisive. Proof
Let be a coalition with size . Partition the coalition into nonempty subsets . Fix distinct . Design the following voting pattern (notice that it is the cyclic voting pattern which causes the Condorcet paradox):
(Items other than are not relevant.) Since is decisive, we have . So at least one is true: or . If , then is weakly decisive over . If , then is weakly decisive over . Now apply the field expansion lemma. By Pareto, the entire set of voters is decisive. Thus by the group contraction lemma, there is a size-one decisive coalition—a dictator. |
Proof by pivotal voter
|
---|
Proofs using the concept of the pivotal voter originated from Salvador Barberá in 1980. [35] The proof given here is a simplified version based on two proofs published in Economic Theory. [31] [36] We will prove that any social choice system respecting unrestricted domain, unanimity, and independence of irrelevant alternatives (IIA) is a dictatorship. The key idea is to identify a pivotal voter whose ballot swings the societal outcome. We then prove that this voter is a partial dictator (in a specific technical sense, described below). Finally we conclude by showing that all of the partial dictators are the same person, hence this voter is a dictator. For simplicity we have presented all rankings as if there are no ties. A complete proof taking possible ties into account is not essentially different from the one given here, except that one ought to say "not above" instead of "below" or "not below" instead of "above" in some cases. Full details are given in the original articles. ![]() Say there are three choices for society, call them A, B, and C. Suppose first that everyone prefers option B the least: everyone prefers A to B, and everyone prefers C to B. By unanimity, society must also prefer both A and C to B. Call this situation profile 0. On the other hand, if everyone preferred B to everything else, then society would have to prefer B to everything else by unanimity. Now arrange all the voters in some arbitrary but fixed order, and for each i let profile i be the same as profile 0, but move B to the top of the ballots for voters 1 through i. So profile 1 has B at the top of the ballot for voter 1, but not for any of the others. Profile 2 has B at the top for voters 1 and 2, but no others, and so on. Since B eventually moves to the top of the societal preference as the profile number increases, there must be some profile, number k, for which B first moves above A in the societal rank. We call the voter k whose ballot change causes this to happen the pivotal voter for B over A. Note that the pivotal voter for B over A is not, a priori, the same as the pivotal voter for A over B. In part three of the proof we will show that these do turn out to be the same. Also note that by IIA the same argument applies if profile 0 is any profile in which A is ranked above B by every voter, and the pivotal voter for B over A will still be voter k. We will use this observation below. In this part of the argument we refer to voter k, the pivotal voter for B over A, as the pivotal voter for simplicity. We will show that the pivotal voter dictates society's decision for B over C. That is, we show that no matter how the rest of society votes, if pivotal voter ranks B over C, then that is the societal outcome. Note again that the dictator for B over C is not a priori the same as that for C over B. In part three of the proof we will see that these turn out to be the same too. ![]() In the following, we call voters 1 through k − 1, segment one, and voters k + 1 through N, segment two. To begin, suppose that the ballots are as follows:
Then by the argument in part one (and the last observation in that part), the societal outcome must rank A above B. This is because, except for a repositioning of C, this profile is the same as profile k − 1 from part one. Furthermore, by unanimity the societal outcome must rank B above C. Therefore, we know the outcome in this case completely. Now suppose that pivotal voter moves B above A, but keeps C in the same position and imagine that any number (even all!) of the other voters change their ballots to move B below C, without changing the position of A. Then aside from a repositioning of C this is the same as profile k from part one and hence the societal outcome ranks B above A. Furthermore, by IIA the societal outcome must rank A above C, as in the previous case. In particular, the societal outcome ranks B above C, even though Pivotal Voter may have been the only voter to rank B above C. By IIA, this conclusion holds independently of how A is positioned on the ballots, so pivotal voter is a dictator for B over C. ![]() In this part of the argument we refer back to the original ordering of voters, and compare the positions of the different pivotal voters (identified by applying parts one and two to the other pairs of candidates). First, the pivotal voter for B over C must appear earlier (or at the same position) in the line than the dictator for B over C: As we consider the argument of part one applied to B and C, successively moving B to the top of voters' ballots, the pivot point where society ranks B above C must come at or before we reach the dictator for B over C. Likewise, reversing the roles of B and C, the pivotal voter for C over B must be at or later in line than the dictator for B over C. In short, if kX/Y denotes the position of the pivotal voter for X over Y (for any two candidates X and Y), then we have shown
Now repeating the entire argument above with B and C switched, we also have
Therefore, we have
and the same argument for other pairs shows that all the pivotal voters (and hence all the dictators) occur at the same position in the list of voters. This voter is the dictator for the whole election. |
Arrow's impossibility theorem still holds if Pareto efficiency is weakened to the following condition: [37]
Arrow's theorem establishes that no ranked voting rule can always satisfy independence of irrelevant alternatives, but it says nothing about the frequency of spoilers. This led Arrow to remark that "Most systems are not going to work badly all of the time. All I proved is that all can work badly at times." [38] [39]
Attempts at dealing with the effects of Arrow's theorem take one of two approaches: either accepting his rule and searching for the least spoiler-prone methods, or dropping his assumption of ranked voting to focus on studying rated voting rules. [40]
The first set of methods economists have studied are the majority-rule methods, which limit spoilers to rare situations where majority rule is self-contradictory, and uniquely minimize the possibility of a spoiler effect among rated methods. [41] Arrow's theorem was preceded by the Marquis de Condorcet's discovery of cyclic social preferences, situations where majority rule is logically inconsistent. Condorcet believed voting rules should satisfy both independence of irrelevant alternatives and the majority rule principle, i.e. if most voters rank Alice ahead of Bob, Alice should defeat Bob in the election. [42]
Unfortunately, as Condorcet proved, this rule can be self-contradictory ( intransitive), because there can be a rock-paper-scissors cycle with three or more candidates defeating each other in a circle. [43] Thus, Condorcet proved a weaker form of Arrow's impossibility theorem long before Arrow, under the stronger assumption that a voting system in the two-candidate case will agree with a simple majority vote. [42]
Condorcet methods avoid the spoiler effect in non-cyclic elections, where candidates can be chosen by majority rule. Political scientists have found such cycles to be empirically rare, suggesting they may be of limited practical concern. [44] Spatial voting models also suggest such paradoxes are likely to be infrequent [45] [46] or even non-existent. [40]
Duncan Black showed his own remarkable result, the median voter theorem. The theorem proves that if voters and candidates are arranged on a left-right spectrum, Arrow's conditions are compatible, and all of them will be met by any rule satisfying Condorcet's majority-rule principle. [40]
More formally, Black's theorem assumes preferences are single-peaked: a voter's happiness with a candidate goes up and then down as the candidate moves along some spectrum. For example, in a group of friends choosing a volume setting for music, each friend would likely have their own ideal volume; as the volume gets progressively too loud or too quiet, they would be increasingly dissatisfied. [40]
If the domain is restricted to profiles where every individual has a single-peaked preference with respect to the linear ordering, then social preferences are acyclic. In this situation, Condorcet methods satisfy a wide variety of highly-desirable properties. [40] [47]
Unfortunately, the rule does not generalize from the political spectrum to the political compass, a result called the McKelvey-Schofield Chaos Theorem. [48] However, a well-defined median and Condorcet winner do exist if the distribution of voters on the ideological spectrum is rotationally symmetric. [49] In realistic cases, when voters' opinions follow a roughly-symmetric distribution such as a normal distribution or can be accurately summarized in one or two dimensions, Condorcet cycles tend to be rare. [45] [50]
The Campbell-Kelly theorem shows that Condorcet methods are the most spoiler-resistant class of ranked voting systems: whenever it is possible for some ranked voting system to avoid a spoiler effect, a Condorcet method will do so. [41] In other words, replacing a ranked-voting method with its Condorcet variant (i.e. elect a Condorcet winner if they exist, and otherwise run the method) will sometimes prevent a spoiler effect, but never cause a new one. [41]
In 1977, Ehud Kalai and Eitan Muller gave a full characterization of domain restrictions admitting a nondictatorial and strategyproof social welfare function. These correspond to preferences for which there is a Condorcet winner. [51]
Holliday and Pacuit devised a voting system that provably minimizes the number of candidates who are capable of spoiling an election, albeit at the cost of occasionally failing vote positivity (though at a much lower rate than seen in instant-runoff voting). [50]
As shown above, the proof of Arrow's theorem relies crucially on the assumption of ranked voting, and is not applicable to rated voting systems. As a result, systems like score voting and graduated majority judgment pass independence of irrelevant alternatives. [39] These systems ask voters to rate candidates on a numerical scale (e.g. from 0–10), and then elect the candidate with the highest average (for score voting) or median ( graduated majority judgment). [52]
While Arrow's theorem does not apply to graded systems, Gibbard's theorem still does: no voting game can be straightforward (i.e. have a single, clear always-best strategy), [53] so the informal dictum that "no voting system is perfect" still has some mathematical basis. [54]
Arrow's framework assumed individual and social preferences are orderings or rankings, i.e. statements about which outcomes are better or worse than others. Taking inspiration from the strict behaviorism popular in psychology, some philosophers and economists rejected the idea of comparing internal human experiences of well-being. [55] Such philosophers claimed it was impossible to compare the strength of preferences across several people who disagreed; Sen gives as an example that it would be impossible to know whether Nero's choice to begin the Great Fire of Rome was right or wrong, because while it killed thousands of Romans, it had the positive effect of allowing Nero to expand his palace. [56]
Arrow originally agreed with these positions and rejected cardinal utility, leading him to focus his theorem on preference rankings. [55] [57] However, he later reversed this opinion, admitting scoring methods can provide useful information that allows them to evade his theorem. [38] Similarly, Amartya Sen first claimed interpersonal comparability is necessary for IIA, but later argued it would only require "rather limited levels of partial comparability" to hold in practice. [56]
Balinski and Laraki dispute the necessity of any genuinely cardinal information for rated voting methods to pass IIA. They argue the availability of a common language with verbal grades is sufficient for IIA by allowing voters to give consistent responses to questions about candidate quality. In other words, they argue most voters will not change their beliefs about whether a candidate is "good", "bad", or "neutral" simply because another candidate joins or drops out of a race. [52]
John Harsanyi noted Arrow's theorem could be considered a weaker version of his own theorem [58] and other utility representation theorems like the VNM theorem, which generally show that rational behavior requires consistent cardinal utilities. [59] Harsanyi [58] and Vickrey [60] each independently derived results showing such interpersonal comparisons of utility could be rigorously defined as individual preferences over the lottery of birth. [61] [62]
Kaiser and Oswald conducted an empirical review of four decades of research including over 700,000 participants who provided self-reported measures of utility, with the goal of identifying whether people "have a sense of an actual underlying scale for their innermost feelings". [63] They found that responses to such questions were consistent with all expectations of a well-specified quantitative measure. Furthermore, they were highly predictive of important decisions (such as international migration and divorce), moreso than even standard socioeconomic variables such as income and demographics. [63] Ultimately, the authors concluded "this feelings-to-actions relationship takes a generic form, is consistently replicable, and is fairly close to linear in structure. Therefore, it seems that human beings can successfully operationalize an integer scale for feelings". [63]
These results have led to the rise of implicit utilitarian voting approaches, which model ranked-choice procedures as approximations of the utilitarian rule (i.e. score voting). [64]
Behavioral economists have shown individual irrationality involves violations of IIA (e.g. with decoy effects), [65] suggesting human behavior might cause IIA failures even if the voting method itself does not. [66] However, past research on similar effects has found their effects are typically small, [67] and such psychological spoiler effects can occur regardless of the electoral system in use. [66] Balinski and Laraki discuss techniques of ballot design that could minimize these psychological effects, such as asking voters to give each individual candidate a verbal grade (e.g. "bad", "neutral", "good", "excellent"). [52]
In addition to the above practical resolutions, there exist unusual (less-than-practical) situations where Arrow's conditions can be satisfied.
Supermajority rules can avoid Arrow's theorem at the cost of being poorly-decisive (i.e. frequently failing to return a result). In this case, a threshold that requires a majority for ordering 3 outcomes, for 4, etc. does not produce voting paradoxes. [68]
In spatial (n-dimensional ideology) models of voting, this can be relaxed to require only (roughly 64%) of the vote to prevent cycles, so long as the distribution of voters is well-behaved ( quasiconcave). This can be seen as providing some justification for the common requirement of a majority for constitutional amendments, which is sufficient to prevent cyclic preferences in most situations. [69]
Fishburn shows all of Arrow's conditions can be satisfied for uncountable sets of voters given the axiom of choice; [70] however, Kirman and Sondermann showed this requires disenfranchising almost all members of a society (eligible voters form a set of measure 0), leading them to refer to such societies as "invisible dictatorships". [71]
Arrow's theorem is not related to strategic voting, which does not appear in his framework, [32] [72] although the theorem does have important implications on strategic voting (being used as a lemma to prove Gibbard's theorem). The Arrovian framework of social welfare assumes all voter preferences are known and the only issue is in aggregating them. [72]
Contrary to a common misconception, Arrow's theorem deals with the limited class of ranked-choice voting systems, rather than voting systems as a whole. [72] [73]
Candidates C and D spoiled the election for B ... With them in the running, A won, whereas without them in the running, B would have won. ... Instant runoff voting ... does not do away with the spoiler problem entirely
In the present stage of the discussion on the problem of social choice, it should be common knowledge that the General Impossibility Theorem holds because only the ordinal preferences is or can be taken into account. If the intensity of preference or cardinal utility can be known or is reflected in social choice, the paradox of social choice can be solved.
The abandonment of Condition 3 makes it possible to formulate a procedure for arriving at a social choice. Such a procedure is described below
As with simple plurality elections, it is apparent the outcome will be highly sensitive to the distribution of candidates.
IRV is subject to something called the "center squeeze." A popular moderate can receive relatively few first-place votes through no fault of her own but because of vote splitting from candidates to the right and left. [...] Approval voting thus appears to solve the problem of vote splitting simply and elegantly. [...] Range voting solves the problems of spoilers and vote splitting
It is shown that the utilitarian welfare function satisfies all of Arrow's social choice postulates — avoiding the celebrated impossibility theorem by making use of information which is unavailable in Arrow's original framework.
Dr. Arrow: Well, I’m a little inclined to think that score systems where you categorize in maybe three or four classes probably (in spite of what I said about manipulation) is probably the best.
Dr. Arrow: Well, I’m a little inclined to think that score systems where you categorize in maybe three or four classes (in spite of what I said about manipulation) is probably the best.[...] And some of these studies have been made. In France, [Michel] Balinski has done some studies of this kind which seem to give some support to these scoring methods.
Is there such a thing as a perfect voting system? The respondents were unanimous in their insistence that there is not.
...the fictitious notion of 'original position' [was] developed by Vickery (1945), Harsanyi (1955), and Rawls (1971).
Part of the Politics and Economics series |
Electoral systems |
---|
![]() |
![]() ![]() |
Arrow's impossibility theorem is a key result in social choice showing that no rank-order method for collective decision-making can behave rationally or coherently. [1] Specifically, any such rule violates independence of irrelevant alternatives, the principle that a choice between and should not depend on the quality of a third, unrelated option . [2] [3]
The result is most often cited in election science and voting theory, where is called a spoiler candidate. [4] In this context, Arrow's theorem can be restated as showing that no ranked voting rule [note 1] can eliminate the spoiler effect. [5] [6] [7]
Some methods are more susceptible to spoilers than others. Plurality and instant-runoff in particular are highly sensitive to spoilers, [8] [4] often manufacturing them even in situations where they are not forced. [9] [10] By contrast, Condorcet methods minimize the possibility of spoilers. [11] In other words, a ranked voting system can always be made limiting them to rare [12] [13] situations called Condorcet paradoxes. [9] As a result, the practical consequences of the theorem are debatable, with Arrow noting " Most [ranked] systems are not going to work badly all of the time. All I proved is that all can work badly at times." [6] [14]
Rated methods are not affected by Arrow's theorem or IIA failures. [5] [7] [15] Arrow initially asserted the information provided by these systems was meaningless, [16] and therefore could not be used to prevent paradoxes, leading him to overlook them. However, he and other authors [17] would later recognize this to have been a mistake, [18] with Arrow admitting systems based on cardinal utility (such as score and approval voting) are not subject to his theorem. [19] [20] [21]
Arrow's theorem falls under the branch of welfare economics and ethics called social choice theory. This field deals with aggregating preferences and beliefs to make fair or accurate decisions. [17] The goal of social choice is to identify a social choice function—a procedure that determines which of two outcomes or options is better, according to all members of a society—that satisfies the properties of rational behavior. [1]
Such a social ordering function can be any way to aggregate information or preferences from a group; this procedure can be a market, a voting system, a country's constitution, or even a moral or ethical framework. [22] Arrow's theorem therefore generalizes the voting paradox discovered earlier by Condorcet, proving such paradoxes exist for any possible collective decision-making procedure that relies only on orderings of different options. [22]
In social choice theory, preferences are modeled using binary relations. If A and B are different candidates or alternatives, then the notation denotes that A is preferred to B. Preferences are required to be transitive, meaning that for every three candidates A, B, and C, if and , then , and complete, meaning that for every two candidates A and B, at least one of or must be true. If both or are true, then the preference is said to be indifferent between A and B.
The idea of a social choice function, or of a ranked-choice voting system, is to take the preferences of everyone in society, and to combine them all to get one preference that represents the preferences of society. This is modeled in Arrow's theorem as a mathematical function taking in a list of preferences and outputting one preference.
Arrow's theorem is named after economist and Nobel laureate Kenneth Arrow, who demonstrated it in his doctoral thesis and popularized it in his 1951 book. [1]
Arrow's work is remembered as much for its pioneering methodology as its direct implications. Arrow's axiomatic approach provided a framework for proving facts about all conceivable social choice rules at once, contrasting with the earlier approach of investigating such rules one by one. [23]
Among the most important axioms of rational choice is independence of irrelevant alternatives, which says that when deciding between A and B, one's opinion about a third option C should not affect their decision. [1] Arrow's theorem shows that for social choice functions, this is not possible without relying on further information, such as rated ballots (rejected by some strict behaviorists). [24]
Arrow's theorem assumes that any reasonable ranked-choice voting system satisfies the principles of:
IIA is sometimes explained by a short joke by philosopher Sidney Morgenbesser:
Arrow's original statement of the theorem included an assumption of monotonicity (positive association), i.e. increasing the rank of an outcome should not make them lose. [27] However, this assumption is not needed or used in his proof, except to derive the weaker Pareto efficiency axiom, and can therefore be dropped. [25] While originally considered an obvious requirement of any practical system, ranked choice-runoff fails monotonicity. [28] Arrow later corrected his statement of the theorem to include runoffs and other voting rules that violate monotonicity. [25] [28]
Condorcet's example is already enough to see the impossibility of a fair ranked voting system, given stronger conditions for fairness than what Arrow's theorem assumes. [29] Suppose we have three candidates, A, B, and C, and that there are three voters with preferences as follows:
Voter | First preference | Second preference | Third preference |
---|---|---|---|
Voter 1 | A | B | C |
Voter 2 | B | C | A |
Voter 3 | C | A | B |
If C is chosen as the winner, it can be argued that any fair voting system would say B should win instead, since two voters (1 and 2) prefer B to C and only one voter (3) prefers C to B. However, by the same argument A is preferred to B, and C is preferred to A, by a margin of two to one on each occasion. Thus, even though each individual voter has consistent preferences, the preferences of society are contradictory: A is preferred over B which is preferred over C which is preferred over A.
Due to this example, some authors credit Condorcet with having given an intuitive argument that presents the core of Arrow's theorem. [29] However, Arrow's theorem is substantially more general and even applies to many "unfair" systems for making decisions, which give some voters more influence than others or favor some candidates. [29]
Let A be a set of alternatives. A preference on A is a complete and transitive binary relation on A (sometimes called a total preorder), that is, a subset R of A × A satisfying:
The element (a, b) being in R is interpreted to mean that alternative a is preferred to alternative b. This situation is often denoted or aRb. Denote the set of all preferences on A by Π(A).
Let N be a positive integer. An ordinal (ranked) social welfare function is a function [30]
which aggregates voters' preferences into a single preference on A. An N- tuple (R1, …, RN) ∈ Π(A)N of voters' preferences is called a preference profile.
Arrow's impossibility theorem: If there are at least three alternatives, then there is no social welfare function satisfying all three of the conditions listed below: [31]
Proof by decisive coalition
|
---|
Arrow's proof used the concept of decisive coalitions. [32] Definition:
Our goal is to prove that the decisive coalition contains only one voter, who controls the outcome—in other words, a dictator. The following proof is a simplification taken from Amartya Sen [33] and Ariel Rubinstein. [34] The simplified proof uses an additional concept:
Thenceforth assume that the social choice system satisfies unrestricted domain, Pareto efficiency, and IIA. Also assume that there are at least 3 distinct outcomes. Field expansion lemma — if a coalition is weakly decisive over for some , then it is decisive. Proof
Let be an outcome distinct from . Claim: is decisive over . Let everyone in vote over . By IIA, changing the votes on does not matter for . So change the votes such that in and and outside of . By Pareto, . By coalition weak-decisiveness over , . Thus . Similarly, is decisive over . By iterating the above two claims (note that decisiveness implies weak-decisiveness), we find that is decisive over all ordered pairs in . Then iterating that, we find that is decisive over all ordered pairs in . Group contraction lemma — If a coalition is decisive, and has size , then it has a proper subset that is also decisive. Proof
Let be a coalition with size . Partition the coalition into nonempty subsets . Fix distinct . Design the following voting pattern (notice that it is the cyclic voting pattern which causes the Condorcet paradox):
(Items other than are not relevant.) Since is decisive, we have . So at least one is true: or . If , then is weakly decisive over . If , then is weakly decisive over . Now apply the field expansion lemma. By Pareto, the entire set of voters is decisive. Thus by the group contraction lemma, there is a size-one decisive coalition—a dictator. |
Proof by pivotal voter
|
---|
Proofs using the concept of the pivotal voter originated from Salvador Barberá in 1980. [35] The proof given here is a simplified version based on two proofs published in Economic Theory. [31] [36] We will prove that any social choice system respecting unrestricted domain, unanimity, and independence of irrelevant alternatives (IIA) is a dictatorship. The key idea is to identify a pivotal voter whose ballot swings the societal outcome. We then prove that this voter is a partial dictator (in a specific technical sense, described below). Finally we conclude by showing that all of the partial dictators are the same person, hence this voter is a dictator. For simplicity we have presented all rankings as if there are no ties. A complete proof taking possible ties into account is not essentially different from the one given here, except that one ought to say "not above" instead of "below" or "not below" instead of "above" in some cases. Full details are given in the original articles. ![]() Say there are three choices for society, call them A, B, and C. Suppose first that everyone prefers option B the least: everyone prefers A to B, and everyone prefers C to B. By unanimity, society must also prefer both A and C to B. Call this situation profile 0. On the other hand, if everyone preferred B to everything else, then society would have to prefer B to everything else by unanimity. Now arrange all the voters in some arbitrary but fixed order, and for each i let profile i be the same as profile 0, but move B to the top of the ballots for voters 1 through i. So profile 1 has B at the top of the ballot for voter 1, but not for any of the others. Profile 2 has B at the top for voters 1 and 2, but no others, and so on. Since B eventually moves to the top of the societal preference as the profile number increases, there must be some profile, number k, for which B first moves above A in the societal rank. We call the voter k whose ballot change causes this to happen the pivotal voter for B over A. Note that the pivotal voter for B over A is not, a priori, the same as the pivotal voter for A over B. In part three of the proof we will show that these do turn out to be the same. Also note that by IIA the same argument applies if profile 0 is any profile in which A is ranked above B by every voter, and the pivotal voter for B over A will still be voter k. We will use this observation below. In this part of the argument we refer to voter k, the pivotal voter for B over A, as the pivotal voter for simplicity. We will show that the pivotal voter dictates society's decision for B over C. That is, we show that no matter how the rest of society votes, if pivotal voter ranks B over C, then that is the societal outcome. Note again that the dictator for B over C is not a priori the same as that for C over B. In part three of the proof we will see that these turn out to be the same too. ![]() In the following, we call voters 1 through k − 1, segment one, and voters k + 1 through N, segment two. To begin, suppose that the ballots are as follows:
Then by the argument in part one (and the last observation in that part), the societal outcome must rank A above B. This is because, except for a repositioning of C, this profile is the same as profile k − 1 from part one. Furthermore, by unanimity the societal outcome must rank B above C. Therefore, we know the outcome in this case completely. Now suppose that pivotal voter moves B above A, but keeps C in the same position and imagine that any number (even all!) of the other voters change their ballots to move B below C, without changing the position of A. Then aside from a repositioning of C this is the same as profile k from part one and hence the societal outcome ranks B above A. Furthermore, by IIA the societal outcome must rank A above C, as in the previous case. In particular, the societal outcome ranks B above C, even though Pivotal Voter may have been the only voter to rank B above C. By IIA, this conclusion holds independently of how A is positioned on the ballots, so pivotal voter is a dictator for B over C. ![]() In this part of the argument we refer back to the original ordering of voters, and compare the positions of the different pivotal voters (identified by applying parts one and two to the other pairs of candidates). First, the pivotal voter for B over C must appear earlier (or at the same position) in the line than the dictator for B over C: As we consider the argument of part one applied to B and C, successively moving B to the top of voters' ballots, the pivot point where society ranks B above C must come at or before we reach the dictator for B over C. Likewise, reversing the roles of B and C, the pivotal voter for C over B must be at or later in line than the dictator for B over C. In short, if kX/Y denotes the position of the pivotal voter for X over Y (for any two candidates X and Y), then we have shown
Now repeating the entire argument above with B and C switched, we also have
Therefore, we have
and the same argument for other pairs shows that all the pivotal voters (and hence all the dictators) occur at the same position in the list of voters. This voter is the dictator for the whole election. |
Arrow's impossibility theorem still holds if Pareto efficiency is weakened to the following condition: [37]
Arrow's theorem establishes that no ranked voting rule can always satisfy independence of irrelevant alternatives, but it says nothing about the frequency of spoilers. This led Arrow to remark that "Most systems are not going to work badly all of the time. All I proved is that all can work badly at times." [38] [39]
Attempts at dealing with the effects of Arrow's theorem take one of two approaches: either accepting his rule and searching for the least spoiler-prone methods, or dropping his assumption of ranked voting to focus on studying rated voting rules. [40]
The first set of methods economists have studied are the majority-rule methods, which limit spoilers to rare situations where majority rule is self-contradictory, and uniquely minimize the possibility of a spoiler effect among rated methods. [41] Arrow's theorem was preceded by the Marquis de Condorcet's discovery of cyclic social preferences, situations where majority rule is logically inconsistent. Condorcet believed voting rules should satisfy both independence of irrelevant alternatives and the majority rule principle, i.e. if most voters rank Alice ahead of Bob, Alice should defeat Bob in the election. [42]
Unfortunately, as Condorcet proved, this rule can be self-contradictory ( intransitive), because there can be a rock-paper-scissors cycle with three or more candidates defeating each other in a circle. [43] Thus, Condorcet proved a weaker form of Arrow's impossibility theorem long before Arrow, under the stronger assumption that a voting system in the two-candidate case will agree with a simple majority vote. [42]
Condorcet methods avoid the spoiler effect in non-cyclic elections, where candidates can be chosen by majority rule. Political scientists have found such cycles to be empirically rare, suggesting they may be of limited practical concern. [44] Spatial voting models also suggest such paradoxes are likely to be infrequent [45] [46] or even non-existent. [40]
Duncan Black showed his own remarkable result, the median voter theorem. The theorem proves that if voters and candidates are arranged on a left-right spectrum, Arrow's conditions are compatible, and all of them will be met by any rule satisfying Condorcet's majority-rule principle. [40]
More formally, Black's theorem assumes preferences are single-peaked: a voter's happiness with a candidate goes up and then down as the candidate moves along some spectrum. For example, in a group of friends choosing a volume setting for music, each friend would likely have their own ideal volume; as the volume gets progressively too loud or too quiet, they would be increasingly dissatisfied. [40]
If the domain is restricted to profiles where every individual has a single-peaked preference with respect to the linear ordering, then social preferences are acyclic. In this situation, Condorcet methods satisfy a wide variety of highly-desirable properties. [40] [47]
Unfortunately, the rule does not generalize from the political spectrum to the political compass, a result called the McKelvey-Schofield Chaos Theorem. [48] However, a well-defined median and Condorcet winner do exist if the distribution of voters on the ideological spectrum is rotationally symmetric. [49] In realistic cases, when voters' opinions follow a roughly-symmetric distribution such as a normal distribution or can be accurately summarized in one or two dimensions, Condorcet cycles tend to be rare. [45] [50]
The Campbell-Kelly theorem shows that Condorcet methods are the most spoiler-resistant class of ranked voting systems: whenever it is possible for some ranked voting system to avoid a spoiler effect, a Condorcet method will do so. [41] In other words, replacing a ranked-voting method with its Condorcet variant (i.e. elect a Condorcet winner if they exist, and otherwise run the method) will sometimes prevent a spoiler effect, but never cause a new one. [41]
In 1977, Ehud Kalai and Eitan Muller gave a full characterization of domain restrictions admitting a nondictatorial and strategyproof social welfare function. These correspond to preferences for which there is a Condorcet winner. [51]
Holliday and Pacuit devised a voting system that provably minimizes the number of candidates who are capable of spoiling an election, albeit at the cost of occasionally failing vote positivity (though at a much lower rate than seen in instant-runoff voting). [50]
As shown above, the proof of Arrow's theorem relies crucially on the assumption of ranked voting, and is not applicable to rated voting systems. As a result, systems like score voting and graduated majority judgment pass independence of irrelevant alternatives. [39] These systems ask voters to rate candidates on a numerical scale (e.g. from 0–10), and then elect the candidate with the highest average (for score voting) or median ( graduated majority judgment). [52]
While Arrow's theorem does not apply to graded systems, Gibbard's theorem still does: no voting game can be straightforward (i.e. have a single, clear always-best strategy), [53] so the informal dictum that "no voting system is perfect" still has some mathematical basis. [54]
Arrow's framework assumed individual and social preferences are orderings or rankings, i.e. statements about which outcomes are better or worse than others. Taking inspiration from the strict behaviorism popular in psychology, some philosophers and economists rejected the idea of comparing internal human experiences of well-being. [55] Such philosophers claimed it was impossible to compare the strength of preferences across several people who disagreed; Sen gives as an example that it would be impossible to know whether Nero's choice to begin the Great Fire of Rome was right or wrong, because while it killed thousands of Romans, it had the positive effect of allowing Nero to expand his palace. [56]
Arrow originally agreed with these positions and rejected cardinal utility, leading him to focus his theorem on preference rankings. [55] [57] However, he later reversed this opinion, admitting scoring methods can provide useful information that allows them to evade his theorem. [38] Similarly, Amartya Sen first claimed interpersonal comparability is necessary for IIA, but later argued it would only require "rather limited levels of partial comparability" to hold in practice. [56]
Balinski and Laraki dispute the necessity of any genuinely cardinal information for rated voting methods to pass IIA. They argue the availability of a common language with verbal grades is sufficient for IIA by allowing voters to give consistent responses to questions about candidate quality. In other words, they argue most voters will not change their beliefs about whether a candidate is "good", "bad", or "neutral" simply because another candidate joins or drops out of a race. [52]
John Harsanyi noted Arrow's theorem could be considered a weaker version of his own theorem [58] and other utility representation theorems like the VNM theorem, which generally show that rational behavior requires consistent cardinal utilities. [59] Harsanyi [58] and Vickrey [60] each independently derived results showing such interpersonal comparisons of utility could be rigorously defined as individual preferences over the lottery of birth. [61] [62]
Kaiser and Oswald conducted an empirical review of four decades of research including over 700,000 participants who provided self-reported measures of utility, with the goal of identifying whether people "have a sense of an actual underlying scale for their innermost feelings". [63] They found that responses to such questions were consistent with all expectations of a well-specified quantitative measure. Furthermore, they were highly predictive of important decisions (such as international migration and divorce), moreso than even standard socioeconomic variables such as income and demographics. [63] Ultimately, the authors concluded "this feelings-to-actions relationship takes a generic form, is consistently replicable, and is fairly close to linear in structure. Therefore, it seems that human beings can successfully operationalize an integer scale for feelings". [63]
These results have led to the rise of implicit utilitarian voting approaches, which model ranked-choice procedures as approximations of the utilitarian rule (i.e. score voting). [64]
Behavioral economists have shown individual irrationality involves violations of IIA (e.g. with decoy effects), [65] suggesting human behavior might cause IIA failures even if the voting method itself does not. [66] However, past research on similar effects has found their effects are typically small, [67] and such psychological spoiler effects can occur regardless of the electoral system in use. [66] Balinski and Laraki discuss techniques of ballot design that could minimize these psychological effects, such as asking voters to give each individual candidate a verbal grade (e.g. "bad", "neutral", "good", "excellent"). [52]
In addition to the above practical resolutions, there exist unusual (less-than-practical) situations where Arrow's conditions can be satisfied.
Supermajority rules can avoid Arrow's theorem at the cost of being poorly-decisive (i.e. frequently failing to return a result). In this case, a threshold that requires a majority for ordering 3 outcomes, for 4, etc. does not produce voting paradoxes. [68]
In spatial (n-dimensional ideology) models of voting, this can be relaxed to require only (roughly 64%) of the vote to prevent cycles, so long as the distribution of voters is well-behaved ( quasiconcave). This can be seen as providing some justification for the common requirement of a majority for constitutional amendments, which is sufficient to prevent cyclic preferences in most situations. [69]
Fishburn shows all of Arrow's conditions can be satisfied for uncountable sets of voters given the axiom of choice; [70] however, Kirman and Sondermann showed this requires disenfranchising almost all members of a society (eligible voters form a set of measure 0), leading them to refer to such societies as "invisible dictatorships". [71]
Arrow's theorem is not related to strategic voting, which does not appear in his framework, [32] [72] although the theorem does have important implications on strategic voting (being used as a lemma to prove Gibbard's theorem). The Arrovian framework of social welfare assumes all voter preferences are known and the only issue is in aggregating them. [72]
Contrary to a common misconception, Arrow's theorem deals with the limited class of ranked-choice voting systems, rather than voting systems as a whole. [72] [73]
Candidates C and D spoiled the election for B ... With them in the running, A won, whereas without them in the running, B would have won. ... Instant runoff voting ... does not do away with the spoiler problem entirely
In the present stage of the discussion on the problem of social choice, it should be common knowledge that the General Impossibility Theorem holds because only the ordinal preferences is or can be taken into account. If the intensity of preference or cardinal utility can be known or is reflected in social choice, the paradox of social choice can be solved.
The abandonment of Condition 3 makes it possible to formulate a procedure for arriving at a social choice. Such a procedure is described below
As with simple plurality elections, it is apparent the outcome will be highly sensitive to the distribution of candidates.
IRV is subject to something called the "center squeeze." A popular moderate can receive relatively few first-place votes through no fault of her own but because of vote splitting from candidates to the right and left. [...] Approval voting thus appears to solve the problem of vote splitting simply and elegantly. [...] Range voting solves the problems of spoilers and vote splitting
It is shown that the utilitarian welfare function satisfies all of Arrow's social choice postulates — avoiding the celebrated impossibility theorem by making use of information which is unavailable in Arrow's original framework.
Dr. Arrow: Well, I’m a little inclined to think that score systems where you categorize in maybe three or four classes probably (in spite of what I said about manipulation) is probably the best.
Dr. Arrow: Well, I’m a little inclined to think that score systems where you categorize in maybe three or four classes (in spite of what I said about manipulation) is probably the best.[...] And some of these studies have been made. In France, [Michel] Balinski has done some studies of this kind which seem to give some support to these scoring methods.
Is there such a thing as a perfect voting system? The respondents were unanimous in their insistence that there is not.
...the fictitious notion of 'original position' [was] developed by Vickery (1945), Harsanyi (1955), and Rawls (1971).