Yikes! Okay, so I have to make some minor corrections. This is because there are also people overseas, such as federal employees and certain military personnel and their dependents who may not have been included in the census. Those people's counts are not affected by the net coverage error from the Census Bureau. So, they have to be added to the tables AFTER adjusting by percentages of "undercounts" and "overcounts." Also, the total is only about 350K people across all 50 states which is less than half of one divisor distributed across 50 states. So, the impact is
expected to be negligible. It could conceivably make a difference if any state has way more overseas people than another as a ratio interacting with two such states that have very close on-the-edge rounding going on. However, I checked the numbers and it makes no impact on the discussion or results. The counts of discrepancies for each method comparison remain the same.
In the interest of transparency and showing correct numbers, even if the conclusions are the same, I have recomputed the numbers and made Version 2.0 for the previous post.
At the end, you will also find an addendum that has a comparison of similar work.
Version 2.0:
Introduction
I went ahead and made a program to run the
Hungtinton-Hill method of apportionment. This is the standard that Congress uses and is described in a previous post. The numbers I got back reproduce the delegate allocations that are present in Congress which gives confidence in the programming. This method I call
Original Huntington-Hill.
I also ran 3 other methods.
Original Intuitive simply takes the divisor of the census population divided by 435. This is about
760367 761169.
I do not know why there are other numbers out there, but this one is correct and it subtracts the DC population as I had attempted to do before but thought it was weird because it did not reproduce other numbers I was seeing at the time. This one, 760367, happens to also be documented which I found and confirms it is the right one. This is a number that I had encountered before in documentation that I had mentioned in post
716. One must subtract out the DC Population and then add overseas counts for each state to derive this divisor. So in this method, if the decimal part of the state's number after dividing by the divisor is .5 or higher, it is rounded up. Originally Congress was also looking for the highest decimal remainders when something went wrong, like not enough delegates were assigned or too many in order to prioritize the last few delegate assignments. I did not bother to implement that.
Next, I took the percentages of just the so-called statistically significant undercounts and overcounts from the report and used those to create an Adjusted Population for those few states. I also added the overseas counts to the adjusted population. This is also summed up. Strangely, it is quite a bit more than the original population even though the documentation provided by the Census Bureau says they are similar. Therefore, I do not trust their numbers. In any case, I copied the Original Populations of states into this column for all states that had no significant difference. Their new sum, 331508865 331857563, was used to create a new divisor (by dividing by 435).
Computing the adjusted populations and summing, and dividing, allowed for two more methods.
Adjusted Huntington-Hill is running the standard Congressional method of apportionment but on the Adjusted Populations. Then, Adjusted Intuitive takes the Adjusted Populations and divides them by the new divisor. If the decimal part is above .5, it rounds up. As before, there is nothing additional.
Any method that is discrepant in the resulting delegate count appears in bold font for the entire row where it is. This way we can look manually at discrepancies and decide what happened. Possibly what ought to be and count up legitimate results.
Results
Original Population | 331108434 |
Adjusted Population | 331857563 |
Original divisor | 761168.813793103 |
Adjusted divisor | 762890.949425287 |
State | Original
Population | Adjusted
Population | Original
Huntington-Hill | Original
Intuitive | Adjusted
Huntington Hill | Adjusted
Intuitive |
---|
California | 39576757 | 39576757 | 52 | 52 (51.994717) | 52 | 52 (51.877345) |
Texas | 29183290 | 29753838 | 38 | 38 (38.340102) | 39 | 39 (39.001430) |
Florida | 21570527 | 22347080 | 28 | 28 (28.338690) | 29 | 29 (29.292627) |
New York | 20215751 | 19543938 | 26 | 27 (26.558827) | 26 | 26 (25.618259) |
Pennsylvania | 13011844 | 13011844 | 17 | 17 (17.094557) | 17 | 17 (17.055968) |
Illinois | 12822739 | 13080218 | 17 | 17 (16.846117) | 17 | 17 (17.145593) |
Ohio | 11808848 | 11635617 | 15 | 16 (15.514099) | 15 | 15 (15.252006) |
Georgia | 10725274 | 10725274 | 14 | 14 (14.090533) | 14 | 14 (14.058725) |
North Carolina | 10453948 | 10453948 | 14 | 14 (13.734073) | 14 | 14 (13.703070) |
Michigan | 10084442 | 10084442 | 13 | 13 (13.248627) | 13 | 13 (13.218720) |
New Jersey | 9294493 | 9294493 | 12 | 12 (12.210817) | 12 | 12 (12.183252) |
Virginia | 8654542 | 8654542 | 11 | 11 (11.370069) | 11 | 11 (11.344403) |
Washington | 7715946 | 7715946 | 10 | 10 (10.136971) | 10 | 10 (10.114088) |
Arizona | 7158923 | 7158923 | 9 | 9 (9.405171) | 9 | 9 (9.383940) |
Tennessee | 6916897 | 7263818 | 9 | 9 (9.087205) | 9 | 10 (9.521437) |
Massachusetts | 7033469 | 6879449 | 9 | 9 (9.240354) | 9 | 9 (9.017605) |
Indiana | 6790280 | 6790280 | 9 | 9 (8.920859) | 9 | 9 (8.900722) |
Missouri | 6160281 | 6160281 | 8 | 8 (8.093186) | 8 | 8 (8.074917) |
Maryland | 6185278 | 6185278 | 8 | 8 (8.126027) | 8 | 8 (8.107683) |
Wisconsin | 5897473 | 5897473 | 8 | 8 (7.747917) | 8 | 8 (7.730427) |
Colorado | 5782171 | 5782171 | 8 | 8 (7.596437) | 8 | 8 (7.579289) |
Minnesota | 5709752 | 5498726 | 8 | 8 (7.501295) | 7 | 7 (7.207748) |
South Carolina | 5124712 | 5124712 | 7 | 7 (6.732688) | 7 | 7 (6.717490) |
Alabama | 5030053 | 5030053 | 7 | 7 (6.608328) | 7 | 7 (6.593410) |
Louisiana | 4661468 | 4661468 | 6 | 6 (6.124092) | 6 | 6 (6.110268) |
Kentucky | 4509342 | 4509342 | 6 | 6 (5.924234) | 6 | 6 (5.910861) |
Oregon | 4241500 | 4241500 | 6 | 6 (5.572351) | 6 | 6 (5.559772) |
Oklahoma | 3963516 | 3963516 | 5 | 5 (5.207144) | 5 | 5 (5.195390) |
Connecticut | 3608298 | 3608298 | 5 | 5 (4.740470) | 5 | 5 (4.729769) |
Utah | 3275252 | 3192656 | 4 | 4 (4.302925) | 4 | 4 (4.184944) |
Iowa | 3192406 | 3192406 | 4 | 4 (4.194084) | 4 | 4 (4.184616) |
Nevada | 3108462 | 3108462 | 4 | 4 (4.083801) | 4 | 4 (4.074582) |
Arkansas | 3013756 | 3173593 | 4 | 4 (3.959379) | 4 | 4 (4.159956) |
Kansas | 2940865 | 2940865 | 4 | 4 (3.863617) | 4 | 4 (3.854896) |
Mississippi | 2963914 | 3090839 | 4 | 4 (3.893898) | 4 | 4 (4.051482) |
New Mexico | 2120220 | 2120220 | 3 | 3 (2.785479) | 3 | 3 (2.779191) |
Nebraska | 1963333 | 1963333 | 3 | 3 (2.579366) | 3 | 3 (2.573543) |
Idaho | 1841377 | 1841377 | 2 | 2 (2.419144) | 2 | 2 (2.413683) |
West Virginia | 1795045 | 1795045 | 2 | 2 (2.358274) | 2 | 2 (2.352951) |
Hawaii | 1460137 | 1367607 | 2 | 2 (1.918283) | 2 | 2 (1.792664) |
New Hampshire | 1379089 | 1379089 | 2 | 2 (1.811804) | 2 | 2 (1.807714) |
Maine | 1363582 | 1363582 | 2 | 2 (1.791432) | 2 | 2 (1.787388) |
Montana | 1085407 | 1085407 | 2 | 1 (1.425974) | 2 | 1 (1.422755) |
Rhode Island | 1098163 | 1045409 | 2 | 1 (1.442733) | 1 | 1 (1.370326) |
Delaware | 990837 | 939673 | 1 | 1 (1.301731) | 1 | 1 (1.231727) |
South Dakota | 887770 | 887770 | 1 | 1 (1.166325) | 1 | 1 (1.163692) |
North Dakota | 779702 | 779702 | 1 | 1 (1.024348) | 1 | 1 (1.022036) |
Alaska | 736081 | 736081 | 1 | 1 (0.967040) | 1 | 1 (0.964857) |
Vermont | 643503 | 643503 | 1 | 1 (0.845414) | 1 | 1 (0.843506) |
Wyoming | 577719 | 577719 | 1 | 1 (0.758989) | 1 | 1 (0.757276) |
Discussion
There are 8 rows bolded which indicates across 4 methods there are a maximum 8 discrepancies in comparing any two. However, comparing any of one method against the original may produce much less than 8 differences such as 3 or 4. Here I do 3 comparisons of methods.
1. Original Huntington-Hill (OHH) vs Original Intuitive (OI)
If you focus on the bolded rows but then scan downward along these two columns, you will find 4 discrepancies:
State | OHH | OI |
---|
New York | 26 | 27 (26.558827) |
Ohio | 15 | 16 (15.514099) |
Montana | 2 | 1 (1.425974) |
Rhode Island | 2 | 1 (1.442733) |
The standard method deducts 1 from both New York and Ohio as compared to the intuitive method. Meanwhile, the standard method adds one to both Montana and Rhode Island as compared to the intuitive method. You may observe that the decimal part of the delegate is in the .4 range for both Montana and Rhode Island, but the decimal part for both New York and Ohio is in the .5 range. This may be surprising, but as noted earlier the standard method we use is
biased in favor of smaller states.
The purpose of this comparison was to illustrate how our current process works in contrast to intuition.
2. Original Huntington-Hill (OHH) vs Adjusted Huntington-Hill (AHH)
If you focus on the bolded rows but then scan downward along these two columns, you will find 4 discrepancies:
State | OHH | AHH |
---|
Texas | 38 | 39 |
Florida | 28 | 29 |
Minnesota | 8 | 7 |
Rhode Island | 2 | 1 |
In the adjusted version, Florida gains 1, and Texas gains 1. Minnesota and Rhode Island both lose 1.
The purpose of this comparison is to illustrate how the actual process would work if the adjusted numbers were actually used in reality.
3. Original Intuitive (OI) vs Adjusted Intuitive (AI)
If you focus on the bolded rows but then scan downward along these two columns, you will find 6 discrepancies:
State | OI | AI |
---|
Texas | 38 | 39 |
Florida | 28 | 29 |
New York | 27 | 26 |
Ohio | 16 | 15 |
Tennessee | 9 | 10 |
Minnesota | 8 | 7 |
This comparison is most interesting because the lack of knowledge about the exact process of apportionment might lead many people to make this comparison. Even in discussions here, we observe people using the divisor as I also initially thought to do. So I believe we can hypothesize that this is how the conspiracy theory started. We can also observe that there are a total of 6 discrepancies which is a number that is in common with the conspiracy theory that there were an extra 6 Democrats and missing 6 Republicans.
What we do observe here is 3 states that are typically red and 3 that may be blue (is Ohio purple?) being impacted, with red states negatively and "blue" states positively. This does not necessarily translate to 3 Republicans and 3 Democrats.
And as noted, the Post-Enumeration Survey results themselves are of questionable quality. Whether they are better than the census is an open question.
Conclusion
None of the methods show an extra 6 Democrats and missing 6 Republicans.
A comparison of intuitive methods (that are not used in the process) shows 3 typically red state gains, 1 purple/blue state gain, and 2 blue state gains. This comparison may have snowballed into a more dramatic version magnifying the difference and assuming partisan results based on "blue" or "red" states. In reality, following apportionment, there is redistricting and voting and those results may differ from predictions based on state gains or losses.
In any case, the intuitive method is not used, but instead the Huntington-Hill method of apportionment. In that scenario, which is the actual Congressional process, there are fewer discrepancies with 2 red-state gains and 2 blue-state losses.
Those also do not necessarily translate to two Republicans and two Democrats not merely because of the process following apportionment, but also because the survey results that the adjustments are based on are not necessarily improvements to the census.
Addendum
Here is a
website that also does these types of computations. Its author seems to have political bias, unsure... Some of their results are different with the most egregious difference being Florida which they say would have gained an additional two seats. Therefore, it is worth it to delve into computation differences for this state. We can try to discover flaws in my or their computations.
They report Florida's numbers as the following:
State | Original Population | Adjusted Population |
---|
Florida | 21,870,527 | 22,631,621 |
I have computed Florida's census plus overseas count as 21,570,527. So these numbers differ by 300K. If you perform a Google search of "Florida 21,870,527" you will see they are the only site with this number describing its population, but if you do a Google search of "Florida 21,570,527" you will find many reputable sites listing the value with 5, not 8 as the third digit. Because all the other digits are exactly the same, it appears to be a typo.
My computation comes from the
census page which gives 21538187 plus the overseas count of 32,340 which I obtained
here.
21538187 + 32,340 = 21,570,527 not 21,870,527
Let us do three more sanity checks: If I had put in the wrong state population for Florida, off by 300K, then I would not have been able to sum up all the state populations, divide by 435, and reproduce the divisor of 761169. Second, all the other counts I computed for these 13 other states in the table in Results are exactly the same as the 13 other states' original populations on their website. Finally, they are claiming that this produces an error of - 761,094 for Florida but that is less than the divisor. So how could a change less than the divisor result in an error of 2 delegates?
I next will try to reproduce their second number (Adjusted Population).
The way that the undercount and overcount rates are imagined intuitively is not really how they are computed. I will first show the simple, straightforward, imagined way they might be used.
Note the undercount rate is 3.48% for Florida. So one may imagine that all one has to do is multiply the original census number by 1.0348 to get an adjusted census number and following that add the overseas count for Florida.
However, they actually added the overseas count (plus the erroneous 300K padding) first and then applied the intuitive correction factor.
21,870,527 x 1.0348 = 22,631,621.3396 ~= 22,631,621
It turns out that this is wrong for 3 reasons: (a). the correction should not be applied after adding overseas counts because this will magnify the effect, (b). they inadvertently padded Florida by 300K people, and (c) the intuitive way to apply the correction factor is incorrect.
I will now show how I computed Florida's numbers.
State | Original Population | Adjusted Population |
---|
Florida | 21,570,527 | 22,347,080 |
I have already explained how I computed the original population which is the census count for Florida plus the overseas count:
21538187 + 32,340 = 21,570,527
Next, note how the Census Bureau defines the so-called error rates that I posted earlier:
Census Bureau said:
A net coverage error rate is the difference between the census count and the PES estimate of the number of people in the United States expressed as a percentage of the PES estimate.
"a net coverage error rate" means an undercount rate or an overcount rate. The mention of PES estimate twice may be confusing, but it's the Adjusted Census count. Let's define some quick variables:
e = net coverage error rate, the undercount rate or overcount rate
NP = new population, adjusted census count based on the alleged error
OP = old population from the census
The wrong, but intuitive, way to think of it was
NP = OP x (1 - e); so in the case of Florida, there was a correction factor of (1 - (-3.48%)) or (1.0348)
BUT that isn't what they said. They are saying:
e = (OP - NP) / NP
The new population is the baseline of comparison, not the old population.
Solve this for NP, you get:
NP = OP / (1 + e)
For Florida, NP = 21538187 / (1 - 3.48%) = 21538187 / .9652 ~= 22,314,739.95 ~= 22,314,740
But lastly, one needs to add the overseas count
22,314,740 + 32,340 = 22,347,080