[EM] Elections methods performance criterion [was Arrow's axioms]

Ken Johnson kjinnovation at earthlink.net
Sat Mar 6 02:36:02 PST 2004


>Date: Fri, 5 Mar 2004 17:51:15 -0800 (PST)
>From: Forest Simmons <fsimmons at pcc.edu>
>...
>
>We can use a less fuzzy concept than "social utility" to be measured by CR
>ballots.  If we take the CR values to be probabilities of adequate
>representation, then the sum over all ballots of the CR for a candidate is
>the expected number of voters that will be adequately represented by the
>candidate.  We might better call this expectation "social representation"
>rather than "social utility."
>
>  
>
This interpretation is convenient because it automatically puts 
individuals' CRs on a common scale of 0 to 1, deftly sidestepping the 
problem of "interpersonal comparison of utilities" that Steve Eppley 
mentioned in his post. However you define it, I think the cardinal 
rating concept provides a useful basis for defining the objective of 
election systems (i.e., maximize "social utility" or "social 
representation"), even if CR is not itself a practical voting method. 
This definition provides a quantifiable performance standard by which 
the merits of alternative voting systems can be characterized and compared.

Cardinal ratings are, in my view, a more useful definitional standard 
than rank-based standards, because of their greater expressive power. 
The ratings imply a transitive rank ordering, and ratings distinguish 
degrees of preference (strong or weak). In addition, there is a more 
direct connection between CRs and the fundamental concept of social 
utility or representation.

Although it may not be possible, in practice, to ascertain voters' 
sincere CRs, one could predict the outcome of any particular voting 
method given some presumed sincere CR profile and could quantify the 
outcome's average CR over a statistical ensemble of all possible CR 
profiles. I ran some simple numerical simulations to test out this idea. 
Results are tabulated below. (Note: There is also a separate 
election-methods thread relating to simulations that Kevin Venzke has 
been doing. I'm not sure how this relates to his results, but my model 
is probably much more rudimentary.)

For each test case I assumed a certain number of voters and candidates 
and I ran a Monte Carlo simulation of 100,000 elections with random 
(uniformly distributed) CRs. Following is a list of the election methods 
that I tested:

(1) Sincere CR, "SiCR". This is not a practically realizable method 
because voters will strategize, but it is the standard by which the 
other methods are evaluated.
(2) Strategic CR, "StCR". This only represents the most trivial voting 
strategy: exaggerate. Individuals' CRs are linearly scaled so that the 
lowest and highest values are at 0 and 1.
(3) Sincere Approval Voting, "SiAV". Sincere CRs are rounded to 0 or 1. 
Voters can potentially approve all candidates or approve none.
(4) Strategic Approval Voting, "StAV". Same as SiAV, except applied to 
exaggerated CRs. In other words, the approve/disapprove CR cutoff level 
is the average of the highest and lowest sincere CRs.
(5) Majority. This is a Condorcet method with "Smith/MinMax" cycle 
resolution. If there is no Condorcet winner, consideration is limited to 
the candidates who win the most two-way contests, and the one who 
suffers the least-worst defeat against any other candidate is selected. 
(This may not be the best representative of Condorcet methods, I just 
picked it for testing purposes.)
(6) Plurality.
(7) Random.

All ties are resolved by random choice.

Trial runs are tabulated below. The "SiCR" row represents the SiCR 
winner's CR, averaged over all voters. The "mean" is an average, and the 
"range" is the minimum and maximum, over 100,000 simulated elections. 
The remaining rows represent the amount by which the CR values decrease, 
relative to SiCR, for each method (e.g. "SiCR-StCR" represents the mean 
and range of the difference between the SiCR winner's and the StCR 
winner's average CRs). A small value in these rows is good - it 
indicates that the method's performance is close to the SiCR standard.

As expected, the relative performance of Plurality is noticeably 
degraded when there are more than two candidates. With three candidates, 
Approval appears to have slightly better performance than Majority. When 
there are ten candidates, Majority performs better. This is probably due 
to the large number of ways 10 candidates can be ranked (3628800 
possible strict preference rankings, plus indifference rankings - this 
compares to 1024 possible approval ratings). Of course, the "crisis of 
choice" created by this huge number of ranking options may, to some 
extent, be detrimental. If voters cannot discern their preferences on a 
sufficiently fine scale, they may vote indifference between many 
candidates, or may express many preferences randomly; and this might 
somewhat dilute the theoretical performance advantage of Majority when 
there are many candidates.

Is Sincere CR a useful performance metric for evaluating election 
methods, and are there other potentially better comparison criteria that 
people have used?

Ken Johnson


num_voter=10
num_candidate=2
SiCR:             mean 0.551700332840196     range [0.229040232984872 
0.863446203564651]
SiCR-StCR:        mean 0.0104330518092656    range [0 0.311396666442433]
SiCR-SiAV:        mean 0.00758379245390728   range [0 0.234348255440491]
SiCR-StAV:        mean 0.0105441802090586    range [0 0.311396666442433]
SiCR-Majority:    mean 0.0105302529155023    range [0 0.281387064795814]
SiCR-Plurality:   mean 0.0105613367781385    range [0 0.268875109478149]
SiCR-Random:      mean 0.0517924800564136    range [0 0.567096745384255]
 
num_voter=10
num_candidate=3
SiCR:             mean 0.577627811423126     range [0.300739113037274 
0.891234944164729]
SiCR-StCR:        mean 0.00643898594501008   range [0 0.208600214629151]
SiCR-SiAV:        mean 0.0112306660540103    range [0 0.2492314143065]
SiCR-StAV:        mean 0.010234125996482     range [0 0.261924538759141]
SiCR-Majority:    mean 0.0118451864266665    range [0 0.268048027870865]
SiCR-Plurality:   mean 0.0175346182631857    range [0 0.316315966603937]
SiCR-Random:      mean 0.076927965299296     range [0 0.569005124951544]
 
num_voter=100
num_candidate=3
SiCR:             mean 0.524253354378763     range [0.441475239831965 
0.631821479584611]
SiCR-StCR:        mean 0.00212956123814931   range [0 0.068479744242475]
SiCR-SiAV:        mean 0.0033425211375264    range [0 0.0803792567385083]
SiCR-StAV:        mean 0.00304235085706283   range [0 0.0730502008335385]
SiCR-Majority:    mean 0.00394114964985563   range [0 0.0877621552632714]
SiCR-Plurality:   mean 0.00588016175577818   range [0 0.121080952692736]
SiCR-Random:      mean 0.0242776145281066    range [0 0.183426720826094]
 
num_voter=100
num_candidate=10
SiCR:             mean 0.544034472376144     range [0.481335227230615 
0.633085126414266]
SiCR-StCR:        mean 0.000404771738675042  range [0 0.0215636099063402]
SiCR-SiAV:        mean 0.00611560766074788   range [0 0.0818549382408317]
SiCR-StAV:        mean 0.00579899505919091   range [0 0.0811189374850716]
SiCR-Majority:    mean 0.0041345604560462    range [0 0.0694586480805467]
SiCR-Plurality:   mean 0.020864388384224     range [0 0.155759903041651]
SiCR-Random:      mean 0.044254682627219     range [0 0.20199827990111]






More information about the Election-Methods mailing list