<html><DIV>

<DIV>

<P>PREAMBLE AND ABSTRACT.</P>

<P>Fellow EM-listers: Most of us on this list, like me, have other lives - both at work and elsewhere - and often lack the time to prepare intelligent postings - even when we’ve got new ideas or insights to share, or would like to rebut ill-considered criticisms of our prior postings. Even when others’ postings get really silly or confusing, time constraints keep most of us helpless spectators. That’s been my situation for the past six weeks. </P>

<P>Most frustrating has been Mike and Richard’s recent furious exchange as to whether two different definitions of Pij - a probability value defined for an impending election - really must amount to the same thing. Would-be conclusions have been obfuscated by various irrelevant asides, including dubious pronouncements and basic queries about math and logic conventions and about probabilities.</P>

<P>Unfortunately, I lack time to answer the queries (but thanks to Anthony Simmons for a good show on the logic part) - despite the fact that I did do my math Ph.D. dissertation on logic, and now for a living (as an applied statistician) am always computing probabilities. But here in this posting I will do my best to set the record straight on the original Pij question.</P>

<P>Namely, I give an example and use it to illustrate conclusively and derive numerically that the two Pij definitions can refer to distinctly DIFFERENT probability values - which thereby yield different ‘strategic voting’. </P>

<P>I first review, restate and give names to the two definitions at issue. Next, I give an illustrative (if in some respects fanciful) example of an impending election. Then I derive and calculate the numerical probability values. Finally, I use the example’s situation and numbers to illustrate why (despite both authors’ confusing asides) Richard’s main argument is correct and Mike’s main counter is erroneous. </P>

<P> </P>

<P>DEFINITIONS AT ISSUE. </P>

<P>The two definitions are reviewed in a 17 Feb. posting by Richard, which begins by noting that earlier Mike wrote: </P>

<P>> >> > Pij is the probability that i & j will be the 2 frontrunners.</P>

<P>> ></P>

<P>> ></P>

<P>> >Slight correction/clarification: The precise meaning of Pij is usually</P>

<P>> >taken to mean the probability, given a tie exists for first place, that</P>

<P>> >i and j will be involved in that tie.</P>

<P>></P>

<P>> Different wording, same thing. If i & j are the 2 highest vote-getters,</P>

<P>> and there's a tie for 1st place, then it's between them. If they</P>

<P>> aren't the 2 highest vote-getters, and there's a tie for 1st place, then</P>

<P>> it isn't between them.</P>

<P>Richard then comments: </P>

<P>"They aren't the same. It's easy to think they'd be proportional, but keep in mind Bayes' Theorem for conditional probabilities..."</P>

<P>My side comment: </P>

<P>Indeed, as we shall see, the two definitions of Pij aren’t the same at all. However - as we shall also see - in order to understand and explain the difference, it is unnecessarily arcane to appeal to such devices as conditional probability or Bayes Theorem. Richard’s extended comment may have been prompted by the following two facts, true but unneeded here: (1) one way the two definitions of Pij differ is that the first-defined probability is ‘unconditional’ but the second is ‘conditional’; and (2) in some situations (not really here) Bayes Theorem aids computations with conditional probabilities. </P>

<P>Let’s restate and briefly name the two probability definitions (or essentially similar ones) at issue. Let i and j be any two candidates in an impending election. </P>

<P>(1) ‘lead’: Pij = probability that i and j will be the two leading candidates (i.e. each will get more votes than any of the candidates other than i and j). </P>

<P>(2) ‘tie’: Pij = probability, given that precisely two candidates tie for first place, that i and j will be those candidates. </P>

<P>In some postings, the ‘lead’ definition was ascribed to Mike; and the ‘tie’ definition was ascribed to Bart Ingles (who in turn ascribed it to a published reference).</P>

<P>Note that, for outcomes of a given election, we are discussing PRIOR probability values, NOT POST-facto values. After the fact, an election has but one actual outcome and its data, so post-facto outcome probabilities must be 0 or 1; but an impending election has a set of possible outcomes, each with its data - so prior probabilities need not all be 0 or 1. </P>

<P>EXAMPLE.</P>

<P>This example has two versions, one for lone-mark (‘plurality’) voting and one for unrestricted pass-fail (‘approval’) voting. </P>

<P>In the state of Catatonia, in order to avoid a steep fine, each potential voter appears at the local polling place by 9 AM on Election Day, and remains there until 9:15 AM. Prior to Election Day, each voter has received one ballot for each race, and may mark it before entering the polling place. No marking of ballots is allowed within the polling place. Upon exit, each voter may - but need not - cast any or all of the voter’s ballots received for the various races. </P>

<P>There are two political parties: the Hots and the Colds. Cold voters always cast their ballots. In order to decide whether to cast her (or his) ballots, all Hots listen to the result - broadcast in all polling places at 9:05 AM - of the latest State Lottery random drawing of six digits (000000 through 999999). If the drawn number is 900000 or more, all Hots cast all their ballots; if not, they cast no ballots. </P>

<P>Hot and Cold voters of both genders are very partisan. On each ballot received, each such voter will mark only suitable candidates, i.e. candidates of the same party and gender; conversely, so long as at least one suitable candidate exists, each such voter will mark at least one. Further, for every married couple (and Catatonia allows marriage only between opposite genders), if either spouse is Hot, then both are Hot; and if either is Cold, then both are Cold. </P>

<P>The electorate comprises a grand total of T voters, split as follows:</P>

<P>60% of all voters are married Hots.</P>

<P>34% of all voters are married Colds.</P>

<P>4% of all voters are single female Colds.</P>

<P>2% of all voters are single Independents.</P>

<P>The example’s specific election contest is for state governor, between four candidates: #1 is a female Hot; #2 is a male Hot; #3 is a female Cold; and #4 is a male Cold.</P>

<P>PROBABILITY VALUES. </P>

<P>It is easy to see that the partisan (i.e., Hot or Cold) 98% of the electorate will produce just two outcomes. These outcomes differ only according as the Hots do or do not cast ballots. </P>

<P>Apart from the Independent voters’ ballots, candidates #3 and #4 will always get respectively 21% x T and 17% x T votes. </P>

<P>Suppose - as will occur with probability 10% - the Hots do cast ballots. Then, candidates #1 and #2 are the leaders - each with at least 30% x T votes. (Meanwhile, # 3 and #4 each have at most 23% x T votes). Depending on how the Independents vote, #1 and #2 may tie for first place. </P>

<P>Suppose - as will occur with probability 90% - the Hots do not cast ballots. Then candidates #3 and #4 are the leaders and cannot tie: #3 has at least 21% x T votes and #4 has at most 19% x T votes. (Meanwhile #1 and #2 each have at most 2% x T votes).</P>

<P>Using the ‘lead’ definition: P12 = 10% and P34 = 90%. That is, probability is 10% that the two Hot candidates #1 and #2 will be the leaders, and is 90% that the Cold candidates #3 and #4 will be the leaders. </P>

<P>However, using the ‘tie’ definition, P12=100% and P34=0%. That is, considering just all cases with a first-place tie: in all of these cases the tie is between Hot candidates #1 and #2.</P>

<P>INTERPRETATIONS AND CONSEQUENCES. </P>

<P>The above example and numbers illustrate that each of the defined probabilities may be interpreted simply as a fraction (valued between 0 and 1 inclusive). Thus, with the ‘lead’ definition, Pij is just the fraction of all possible outcomes which have candidates #i and #j as the leaders. With the ‘tie’ definition, Pij is the fraction - of those outcomes which have a first-place tie - where the first-place tie is between candidates #i and #j. </P>

<P>The two fractions have different denominators. For the ‘lead’ definition, the denominator counts all possible outcomes, whereas the ‘tie’ definition’s denominator counts only certain of the outcomes - namely those which meet the extra condition of having a first-place tie. For this reason, usual terminology calls the ‘lead’ probability Pij ‘unconditional’ and the ‘tie’ probability Pij ‘conditional’.</P>

<P>What do the two definitions’ Pij values tell us about possible ‘strategic’ voting on the part of the only voters - namely the Independents - who are open to such voting? Clearly, the only way an Independent voter can have any winner-determining effect is to break a first-place tie (among all other voters). Here, this situation can possibly occur only if her (or his) ballot is marked for exactly one of the two candidates - namely the Hot candidates #1 and #2 - who could possibly be in a first-place tie. The ‘tie’ probability values (P12=100% and Pij=0% for all other choices of i and j with i<j) directly reflect this reality; but the ‘lead’ probability values do not. </P>

<P>CORRECT AND INCORRECT ARGUMENTS.</P>

<P>A 20 February posting by Richard quoted and critiqued the crux of Mike’s argument that necessarily the lead and tie definitions must yield the same Pij value. Richard wrote: </P>

<P>"MIKE OSSIPOFF wrote:</P>

<P>...</P>

<P>> But when I showed you why those 2 probabilities can't be different,</P>

<P>> I asked you which part of my argument you disagree with. You didn't</P>

<P>> answer ... </P>

<P>You didn't show they can't be different. Instead you wrote this:</P>

<P>> 1. We agree that if i & j are the 2 top vote-getters in the election,</P>

<P>> then, if there's a tie for 1st place, it will be between them.</P>

<P>></P>

<P>> 2. We agree that if i & j are not the 2 biggest vote-getters in the</P>

<P>> election, then, if there is a tie for 1st place, it will not be between</P>

<P>> them.</P>

<P>></P>

<P>> 3. That means that either both of the following 2 statements are true,</P>

<P>> or neither of them are true:</P>

<P>></P>

<P>> a) i & j are the 2 biggest vote-getters in the election.</P>

<P>> b) If there's a tie for 1st place, it will be between i & j.</P>

<P>></P>

<P>> 4. Since, after the election, those 2 statements are either both true</P>

<P>> or both false, then the probability, at a time before the election</P>

<P>> that a) will be true after the election must be the same as the probability,</P>

<P>> at that same time before the election, that b) will be</P>

<P>> true after the election."</P>

<P>As Richard goes on to comment, Mike’s argument is subtly erroneous. Namely, despite the truth of Mike’s statements 1 and 2, statement 3 in fact can in some cases be false: it does not follow from 1 and 2. </P>

<P>Detailed reasons for this failure of derivation are worth noting. Statement 1 asserts that ‘if a) then b)’. Statement 2 is intended to assert that ‘if not-a), then not-b)’: then statement 3 WOULD follow from statements 1 and 2. However, what statement 2) actually asserts is in effect that ‘if not-a) then not-c)’, where c) is not quite equivalent to b). Namely, b) asserts that ‘if there is a first-place tie, then #1 and #2 tie for first place’, whereas c) asserts that ‘there is a first-place tie and (moreover) #1 and #2 tie for first place’ - or, in effect, simply that ‘#1 and #2 tie for first place’. </P>

<P>In fact, contra 3, statement b) can be true and meanwhile statement a) can be false. </P>

<P>For instance, to simplify and concretize Richard’s argument, take our example with i=#1 and j=#2. For ALL our election outcomes, #1 and #2 are the only possible first-place-tied candidates, so for ALL outcomes statement b) is true: IF there is a first-place tie (at all), THEN #1 and #2 tie for first place. On the other hand, for the 90% of our example’s outcomes where Hots do not cast ballots, statement a) is false, because the leaders are not #1 and #2.. </P>

<P>To be sure, some people may be confused by the idea that, for our example, b) is always true (i.e. is true for all possible election outcomes). Well, b) is a conditional statement of the form ‘if X then Y’. Here X (‘there is a first-place tie’) is sometimes true and sometimes false, but anyhow Y (‘#1 and #2 tie for first place’) happens always to be true whenever X is true - which is all that is required for truth of a statement of form ‘if X then Y’.</P>

<P>[As Anthony Simmons has noted, this approach to truth of if-then statements not only very well describes ordinary usage but also is a precise convention of mathematics and logic.]</P>

<P>THAT’S ALL!! THANKS FOR YOUR HEED!! </P>

<P>Joe Weinstein</P>

<P>Long Beach CA 90807  USA</P>

<P> </P></DIV></DIV><br clear=all><hr>Get your FREE download of MSN Explorer at <a href="http://explorer.msn.com">http://explorer.msn.com</a><br></p></html>