[EM] Why I think IRV isn't a serious alternative
Abd ul-Rahman Lomax
abd at lomaxdesign.com
Mon Dec 1 09:53:49 PST 2008
At 12:42 AM 11/26/2008, Kevin Venzke wrote:
>--- En date de : Mar 25.11.08, Abd ul-Rahman
>Lomax <abd at lomaxdesign.com> a écrit :
> > If we must have a
> > single ballot, and a single winner, period, Range Voting is
> > actually a trick: it is the only relatively objective method
> > of assessing the expected voter satisfaction with an
> > outcome, turned into an election method. It's ideal
> > because it's designed that way. (The only fly in the
> > ointment is the charges about strategic voting, but I've
> > been arguing that this is based on a total misconception of
> > what we are doing when we vote.)
>
>I don't understand how you reconcile the two ideas here. Range is
>"objective" and "ideal because it's designed that way" based on the
>idea that voters have internal utilities and, if they vote them exactly,
>under Range voting, the best candidate according to overall utility
>will be elected every time.
I've responded to this in a prior post, the first
part of it. I did not make the claim that Range
Voting was "objective." It is a voting method and
does not automatically choose the best candidate
according to overall utility, neither in
simulated elections with practical methods, not
even if we assume "fully sincere voting."
It simply gets closer than any other single
ballot method. Of the methods that have been
tested through the simulation process, the only
method that beats Range, if I'm correct, is Range
with a top two runoff. Not a single ballot
method, not even considered a voting method by
many definitions. Arrow, in particular, made
"deterministic" one of the preconditions for his theorem to apply.
In the simulation process, absolute personal
utilities must be converted to Range votes, for,
with practical election methods, it's probably
impossible to vote absolute utilities, even if
people wanted to. The conversion process usually
takes the preference list for candidates
considered real options and looks only at the
range of utilities among those candidates. While
a voter *might* decide to extend the "coverage"
of the voter's vote, this is not what we normally
do when we are asked to make a choice. We
restrict our consideration to the practical
possibiities. This normalization process, to some
degree, makes the same "mistake" as preferential
methods do, it equates what may be a small
preference with what may be, with another, a
critical or crucial preference. Hence Range
Voting which depends on this normalization can
fail to find the truly optimal winner. It pleases
some people with little preference at the cost of
greater absolute preference satisfaction for others.
If readers have been following this, they would
know the answer to this question: but what if
voters exaggerate their preferences when they
vote? That is, of course, a possible objection to
Range Voting or to the study of real election
results based on, say, polling data. The
simulations avoid this entirely, by *assuming* an
underlying preference profile.
This underlying preference profile could get
quite sophisticated, but making it more accurate,
as to how real voters make voting decisions,
would probably not change the results
significantly. Good election methods should be
able to handle the kinds of voter preference
profiles that are used in the simulations, those
profiles aren't biased toward Range Voting. (And
the profiles make sense in restricted
environments; that real people use more complex
criteria in determining their preferences doesn't
change the usefulness of the approach. The goal
is not to simulate *people* and predict real
election results from, say, an analysis of
positions, popularity polls, etc. Rather it is to
study how election methods perform under reasonably realistic assumptions.
So let me be specific, though we will get back to
this. Range Voting isn't perfect. It does not
always choose the best winner. When it does not,
it is sometimes possible to detect the situation
and fix it; that is why Range Voting with Top Two
runoff gets lower regret results than Range
alone. It detects the relatively rare situations
that cause Range to fail to find the ideal
winner. And in my opinion, Runoff Range is not as
accurate in the simulations as it would be in
real life; that's because real runoffs test
*relative absolute preference strength,* free of
strategic considerations (when write-ins are
involved, this isn't totally true.) This effect
is something which has been largely overlooked.
Or even totally, as far as I know. It's original
with me, but it's likely someone else has written
about it somewhere. I don't consider it rocket
science, just something obvious that's been overlooked.
So TTR Range is better than Range. Range is not
perfect. Period. It is simply better, as studied
through this approach, than any deterministic
single-ballot method that is commonly studied.
(To get better, one has to find a way to
facilitate and encourage "fully sincere voting."
That may be an extraordinarily difficult problem,
but auctions can do it. Isn't that called a Clarke tax?
(Consider this proposal: One your ballot, you
indicate what percentage of your income you will
pay as tax for the next year. It is not an
absolute amount, because that leads to
plutocracy, or at least that is claimed.
(Libertarians might prefer a dollar amount, but
it's the idea I'm exploring here, not the
details, I'm just trying to propose some
reasonable ones. It's even possible that some
more complex tax structure would be involved,
where you would vote a *bracket,* designed to
make the difficult of "voting" in this way the
same for all voters.) These would be, we can
presume, absolute utilities, converted to income
percentage values. So a voter with little
preference would not, presumably, be willing to
pay much, and a voter with a large preference
would, again, be willing to pay more. It's an
auction, so the amount actually paid would
presumably be lower than the votes. The maximum
vote might be limited. How much would I have been
willing to pay to avoid the election of Bush? In
2000, a decent amount. In 2004, probably the
maximum vote allowed (there might be a limit at
100% of income, or some lower or higher figure).
In 2008, to avoid McCain, less, probably. Bush
was *really* bad, so bad that prominent
conservatives were complaining, and so concerned
that McCain would have continued poor Bush
policies that they recommended Obama. But I don't
think McCain would have been anywhere near as bad
as Bush was. In 2008, though, a lower vote was
probably necessary to elect Obama, because he was
so popular. I was wondering when he was going to
bring out the loaves and fishes, with many
others.... Obama is so popular that he's
dangerous. Not as a specific criticism of him,
but generically. It is even more important now
that we make it possible for the public to get good and trustworthy advice....)
>At the same time you want to defend Range against the charge of
>susceptibility to strategic voting, essentially by denying that the
>Range voter is supposed to be mapping his true, absolute preferences
>onto the ratings.
That's right. Range results shift, and they shift
to increase regret, when voters vote
"strategically." This is well know, but it's an
error to consider this a reason to avoid Range.
If, with realistic voting profiles, they shifted
results to make Range worse than other methods,
it would be one thing. But they do not, if the
simulations were accurate. Nobody has challenged
Warren's results. Yee diagrams are a similar
approach. Nobody has contradicted the basic work,
even though it is all published and source code
is available, etc. Instead, we see criticism that
often misunderstands the basic nature of the study.
The voter *does* map his true, absolute
preferences onto the ratings. Kevin, you are not
being careful. There is, in theory, a one to one,
continuous, monotonic transformation of absolute
voter utilities (not "preferences") to Range
votes (neglecting roundoff error, which makes the
function a step function, still monotonic.)
That's why we can, with a reasonable definition,
still call these votes "sincere," since they do
not violate the derived preference profile of the
voter. All that they do is to, possibly, equate
the vote of some candidate pairs when there is a
non-zero preference strength between them, and
that is larger than the Range resolution. In
other words, the voter thinks that, given all the
conditions, the voter exercises more effective
voting power elsewhere and -- presumably -- will
not regret the abstention from voting in that
candidate pair. This is quite what we routinely
do with real world choices under analogous
conditions. We do not bid on things in auctions
based merely on cost, when we have a limited
amount to bid. I won't go into describing such an
auction, but we factor in the probability of
success, and we put our limited auction dollars
into the preference pairs that seem more likely
to be a good investment. But we never try to pay
*more* for a candidate that we prefer *less.
The transfer function, thus, is monotonic but not
linear. With "sincere absolute preferences
normalized to the voting Range," we'd get a
limited regret-minimized result, and there may be
some voters who decide to vote this way. I'd not
advise it, though it's relatively harmless. It
improves the overall outcome at small cost to the
voter. (By definition, if the cost is large, the
sincere preference is large.) In real world
collective decision making on a small scale, we
often do exactly this: we reveal our sincere
preferences in absolutes, where possible, or, at
least, in terms of preference strength across our personal profile.
So is Range Voting "vulnerable" to strategic
voting? What does that mean? In practice, it is
used as a voting system criterion and a black
mark against Range Voting. But the "harm" done is
simply institutionalized by other voting systems,
and the possible improvement through accurate
expression of absolute preference strengths
(within the limitations) is made impossible.
Which, of course, harms the results even more. In
order to avoid the bete noir, strategic voting,
which has been redefined to include any failure
to accurately disclose a sincere preference,
voting systems students would avoid the only
method which *minimizes* the effect of such, and
allows it to operate only where it is relatively harmless.
The simulations answer the questions of "how
much" and "how often," which cannot be answered
through the prior approach, the use of voting
systems criteria. Approval Voting fails the
Majority Criterion, according to the usual
interpretations (which had to be modified to
apply to Approval Voting, and, clearly, the
modifications were designed to *cause* Approval
voting to fail, because the students thought,
intuitively, that it failed. That's what I mean
by subjective analysis! See James
Armytage-Green's study and application of the
Majority Criterion to Range Voting).
Okay, how often? It would be extraordinarily rare
in real public elections, because it requires a
significant number of voters to vote for both
frontrunners, which is the opposite of standard
Approval strategy and is, normally, a foolish
vote if the voter does have a significant
preference between them. And if the voter doesn't
have a significant preference, well, there you go. See the second question!)
And how much damage from failure? Little. When
the Majority Criterion fails, we have multiple
majorities. The majority has Approved another
candidate as well as their favorite, and,
together with other votes, this less-preferred
candidate has broader support. Is that bad? Many
would argue it's good, and this again points out
the subjective nature of the use of voting
systems criteria to judge election methods.
It's easy to show that there are reasonable
situations, in real life, where the Majority
Criterion, if followed by a voting system, would
prevent the voting system from choosing a winner
which the vast majority of people -- not voting
systems experts attached to some other method and
holding on for dear life to preposterous
arguments -- would agree is a better outcome than
the majority first preference.
Quite simply, to make sound decisions from
preference profiles, we need to know preference
strength. Preferential ballots can, sometimes,
approximate this (Borda does that, and works
better if there is a broad spectrum of candidates
on the ballot, thus creating, in the real
preference pairs, an approximation of preference
strength), and usually the majority preference
will also be the ratings winner. But a Range ballot directly expresses this.
How about an ideal ballot, no matter what the
method? Set aside the complexity issue, it would
obviously be a Range ballot that allows the
expression of small but significant preferences.
To be ideal, it must allow the expression of
equal preference and not force one choice or the
other. If there were some way to do it, it would
allow expression of absolute utilities, but I'll
set that aside and assume that it only allows
expression of preference on the range of 0-1
vote. (In fractional increments, of course.)
This ballot contains far more information than a
pure preferential ballot. *How* the information
is used is another story. The ballot could be
used for any preferential voting method, for
example. The simulations essentially do this. But
once preference strength information is
available, and through averaging, even if many or
most voters vote Approval style, as in some cases
strategic considerations suggest, we should
remember that Approval is a Range method and the
votes in Approval will tend to average out to
what more "sincere" Range votes would be, as
voters make different decisions about where to
put their approval cutoff. Because the voters are
spread across the spectrum, Approval Voting can
measure underlying preferences just as an analog
to digital converter which can only indicate
whether a signal is higher or lower than a
reference voltage can measure the signal voltage
at far, far higher resolution than 0 and 1 would
imply. An input signal can be compared with many
random references and the number of *hits* (0 and
1) results can then estimate the input signal
value. Here the "input signal" is the average
approval cutoff position in the candidate spectrum.
In other words, since we are going for
amalgamation anyway, that individual voters can't
express fine preferences, but only 0 or 1 for
each candidate, has less effect on the outcome
than it does on the individual voter behavior.
If we really wanted to know how election methods
other than Range were performing, we'd use a
Range ballot. This may actually be practical in
small or even large nongovernmental
organizations. Short of using a Range ballot,
we'd use a Range poll, as an exit poll (preferably), or as a pre-election poll.
And, of course, we'd compare these results with
simulation predictions and eventually we might
build up enough data to confirm or improve the
models. The problem with this approach for the
short term is that the simulations can study
thousands of elections and thus see the behavior
of a voting system under many different
conditions, whereas the actual experimental
approach, with a lot of effort, only shows one
instance. Useful, still, but probably best used
to improve the models used in simulations.
>If the idea of a sincere set of ratings irrespective of context is "a
>total misconception of what we are doing when we vote," then what useful
>theory is Range based on? What makes it "objective" and "ideal" if not
>what I stated above?
It does not depend on relatively subjective
considerations like which voting systems criteria
are more important than the others. It defines an
optimal winner in a manner that, once understood,
most people would agree is reasonable, and that
matches how we make decisions, when we are sane
and collectively functional, in real life. The
approach defines the "ideal winner" in a way that
works when we know absolute utilities, and there
is no reason to expect that it would stop working
simply because the utility profile can't be
directly determined. (It works when we can see it
and test it, why would it stop working when we can't?)
There is no usable definition of "a sincere set
of ratings irrespective of context," because
there is no way for voters to even determine such
in a manner that they could vote them. The
ratings that we can determine, in practice, are
not absolute, independent ratings. Now, as I've
noted, it's not totally possible, but we have to
go far beyond an ordinary voting system to do it,
we probably need an auction of some kind, or
perhaps voting on lotteries, and I'm not going there now.
We can approach, this, though, in certain
respects, through runoff elections, and we can
approach it through binary choices in sequence.
In a binary choice, we can assume sincere
preferences are expressed, there is little reason
to do otherwise (I'm not ruling this out, there
might be under some usages, a turkey-raising
motive, but that's in a fixed and predictable
series of votes. Highly dangerous in standard
majority required process, particularly where no
candidates are eliminated -- i.e., with standard
Robert's Rules of Order decision process. I keep
mentioning this because, without it, I know that
many critics will think I'm describing some
bizarre creature, never used. Wrong. Actually, in
very common use, just not in public political
elections except certain ones which take place,
for example, at Town Meeting. Majority of a
quorum required to make any decision.)
>I also wonder, what, theoretically, does it look like when Range fails
>and gives a poor result. Is such a thing allowed?
Well, what's a "poor result"? My claim is that
the only way to objectively study this is to see
how the method performs when absolute utilities
are known. That's what the simulations do, the
"know" the absolute utilities by assuming them,
in some hopefully reasonable way. (Don't think
it's reasonable? The simulators are configurable.
Use a better simulation of underlying utilities!
Until then, Warren's work is the best we've got!)
Now, following Warren's work, Range would very
rarely give a truly "poor result." It would take
some very bizarre preference profiles. You can
construct them .... but for them to occur in real
life would be practically impossible. What it can
do, and the simulations show it, is to miss the
true ideal winner according to the normalization
of absolute utilities, and, as well, due to the
non-linear expression of utilities (but still
monotonic) by voters seeking to maximize the
effect of their vote. However, when it does this,
it would not then flip to a much worse candidate,
unless, again, the conditions were truly bizarre.
It would simply choose, in almost every
simulation (quite likely all or nearly all), the
next best winner, and the best would be in second place.
That's why top-two runoff Range is better at
picking the ideal candidate than Range alone.
When Range fails, it picks a candidate who is
second best, almost always, with any kind of
reasonable preference profile. And then sincere
preference in a runoff *may* pick the better
candidate. Not always. What's the better
candidate? Is it the majority preference or is it
the absolute utility maximizer, or the relative
utility maximizer? Or what. Until you can answer
this question, how we decide what is a poor or
suboptimal result, whatever claims you might make
about poor results are *purely subjective.* These
standards are different. Probably the "best" is
the absolute utility maximizer, because what goes
around comes around -- on average -- but the
runoff procedure tests for the majority
preference. Usually the same, but not always.
In real life, though, the majority will surprise
us. The runoff will have different turnout than
the original election, and some voters will
change their votes, and both of these effects
will favor the *fully sincere, absolute* Range
winner over the majority preference, because, in
this situation the majority preference must be
weak. So only if the normalization or strategic
voting have sufficiently distorted the Range
results, I'd predict, would the apparent Range
winner fail to win the runoff. You hold the
runoff, indeed, because of this possibility, the possibility of distortion.
Are you starting to see this, Kevin? I'm not
writing just for you, as I assume you know, and
much of this, I'd think, you already understand. Or did you? Or do you?
More information about the Election-Methods
mailing list