• Welcome to the new Internet Infidels Discussion Board, formerly Talk Freethought.

To Give You a Size of the Immense and Growing Size of Illegal Immigration

So let's try this rewording:
A census does a full count of a state's population as 3,011,524. (See how this is better? The census isn't rounding to the nearest hundred thousand or estimating.) According to a survey which might be wrong and have systemic issues, an estimate of an undercount for that state is 151,781 which is a midpoint of a confidence interval.

Do you disagree so far with the changes in wording?
And what is that confidence interval,

I only found it by chance browsing over a map that suddenly had a popup with a 90% confidence interval about the %. I don't know even why they would only do a 90% confidence interval and keep it hidden. Maybe it's just because they are trying to appeal to average Joe readers? ETA: I remember seeing std error with the race "undercounts" and "overcounts" and so I think I may have also seen that with the states. So you can probably construct confidence intervals from those, provided they are there.
Ok, that's what my gut was telling me about the numbers. This is a case of lying with statistics, it means nothing other than the whoever authorized it is Republican deep state.

and what p value is used in calculating it?

No idea. That is something I searched and searched for...their threshold of significance. Convention is 5% but that would be inappropriate for 50 states and it's probably a more complex method.
I'm not aware of any more complex method. The precision simply isn't there.
While reporters will almost certainly omit such things any proper poll will have this information. Why was it not listed???

There are a lot of documents there and I haven't gone through them all. It wasn't out in front, but also, I really don't think we need it to see that the original claims were inaccurate of 6 Republican congress critters missing and 6 extra Democrat congress critters.
You've already found enough, their goose is cooked.
 

Worst member of the House of Representatives​

The nominees:

(a) Lauren Boebert, R-Colo.
(b) Matt Gaetz, R-Fla.
(c) Ralph Norman, R-S.C.
(d) Marjorie Taylor Greene, R-Ga.
(e) Jim Jordan, R-Ohio
(f) James R. Comer, R-Ky.
(g) Mike Johnson, R-La.
(h) Thomas Massie, R-Ky.
(i) Bob Good, R-Va.
(j) Andy Biggs, R-Ariz.
(k) Scott Perry, R-Pa.
(l) Nancy Mace, R-S.C.

The winner: Too many to choose from.
You forgot Tommy Tuberville (Yeah, I know he's a senator). By far the most idiotic and downright stupidest person in Congress, bigly.
 

Worst member of the House of Representatives​

The nominees:

(a) Lauren Boebert, R-Colo.
(b) Matt Gaetz, R-Fla.
(c) Ralph Norman, R-S.C.
(d) Marjorie Taylor Greene, R-Ga.
(e) Jim Jordan, R-Ohio
(f) James R. Comer, R-Ky.
(g) Mike Johnson, R-La.
(h) Thomas Massie, R-Ky.
(i) Bob Good, R-Va.
(j) Andy Biggs, R-Ariz.
(k) Scott Perry, R-Pa.
(l) Nancy Mace, R-S.C.
It's kinda refreshing to see a list like that and for once Indiana is not on it.
Tom
 

Worst member of the House of Representatives​

The nominees:

(a) Lauren Boebert, R-Colo.
(b) Matt Gaetz, R-Fla.
(c) Ralph Norman, R-S.C.
(d) Marjorie Taylor Greene, R-Ga.
(e) Jim Jordan, R-Ohio
(f) James R. Comer, R-Ky.
(g) Mike Johnson, R-La.
(h) Thomas Massie, R-Ky.
(i) Bob Good, R-Va.
(j) Andy Biggs, R-Ariz.
(k) Scott Perry, R-Pa.
(l) Nancy Mace, R-S.C.
It's kinda refreshing to see a list like that and for once Indiana is not on it.
Tom
Being better than that lot isn’t much of a prize.
 
I'm not aware of any more complex method. The precision simply isn't there.

I finally found how they defined statistical significance:
The PES performed statistical testing for estimates of net coverage error rates and certain components of coverage. Statements of comparison were statistically significant at the 90 percent confidence level (α = 0.10) using a two-sided test. For any PES release, estimated net coverage error rates that were statistically significantly different from zero were identified by an asterisk (*).

For a two-tailed 90% confidence interval, you will have 5% on each end. 0 will be included in one side's 5% when they are saying it is significant. When 0 is not included in the 90% middle, they are saying it is statistically significant. In a way, it is like saying there is a 95% chance (or confidence) that it is an undercount or overcount because that whole side of the distribution including one tail will be on one side of zero. So it seems equivalent in a sense to a p-value of 5% or less for significance, i.e. the convention.

To add--their goal is not to provide new numbers but to provide insights for quality and potential improvements for the next census. So they go through multiple rounds of finding problems in their own work and then have to stop because the census numbers must be used by 2023. So in 2022 they release their survey results.

They want a reasonable confidence level that will minimize false positives AND false negatives. They might want to err on the side just a little bit of including some things that are not actual problems so that they do not remove too many real problems. They can find out which is which later upon additional work.

From that same paper, check their conclusion:
In conclusion, while every survey must deal with a multitude of errors, and the PES is no different, we implemented many measures to mitigate the impact of these errors on our estimates. The PES estimates provide helpful insights about the quality of the 2020 Census and should be useful when informing plans for the 2030 Census.
 

Worst member of the House of Representatives​

The nominees:

(a) Lauren Boebert, R-Colo.
(b) Matt Gaetz, R-Fla.
(c) Ralph Norman, R-S.C.
(d) Marjorie Taylor Greene, R-Ga.
(e) Jim Jordan, R-Ohio
(f) James R. Comer, R-Ky.
(g) Mike Johnson, R-La.
(h) Thomas Massie, R-Ky.
(i) Bob Good, R-Va.
(j) Andy Biggs, R-Ariz.
(k) Scott Perry, R-Pa.
(l) Nancy Mace, R-S.C.
It's kinda refreshing to see a list like that and for once Indiana is not on it.
Tom
Being better than that lot isn’t much of a prize.
I could walk to Mike Pence's house from here.
It's not a prize.
Tom
 
I'm not aware of any more complex method. The precision simply isn't there.

I finally found how they defined statistical significance:
The PES performed statistical testing for estimates of net coverage error rates and certain components of coverage. Statements of comparison were statistically significant at the 90 percent confidence level (α = 0.10) using a two-sided test. For any PES release, estimated net coverage error rates that were statistically significantly different from zero were identified by an asterisk (*).

For a two-tailed 90% confidence interval, you will have 5% on each end. 0 will be included in one side's 5% when they are saying it is significant. When 0 is not included in the 90% middle, they are saying it is statistically significant. In a way, it is like saying there is a 95% chance (or confidence) that it is an undercount or overcount because that whole side of the distribution including one tail will be on one side of zero. So it seems equivalent in a sense to a p-value of 5% or less for significance, i.e. the convention.
So we would expect to see 5 false positives from this.

However, the number of survey points in low population states (remember, they listed the number of data points for the nation, not a number for each state) says that the low-population states will have wide error bars, I have a hard time picturing them reaching this threshold with the reported error margins.

To add--their goal is not to provide new numbers but to provide insights for quality and potential improvements for the next census. So they go through multiple rounds of finding problems in their own work and then have to stop because the census numbers must be used by 2023. So in 2022 they release their survey results.
Yeah, someone took data that wasn't meant for this purpose and reported it as if it meant something.

Compare the 2020 response data to the 2010 data. There are more than 4x as many people not willing to cooperate in 2020 as in 2010. I would be very surprised if that is random.
 
Introduction

I went ahead and made a program to run the Hungtinton-Hill method of apportionment. This is the standard that Congress uses and is described in a previous post. The numbers I got back reproduce the delegate allocations that are present in Congress which gives confidence in the programming. This method I call Original Huntington-Hill.
I also ran 3 other methods.​
Original Intuitive simply takes the divisor of the census population divided by 435. This is about 760367. I do not know why there are other numbers out there, but this one is correct and it subtracts the DC population as I had attempted to do before but thought it was weird because it did not reproduce other numbers I was seeing at the time. This one, 760367, happens to also be documented which I found and confirms it is the right one. So in this method, if the decimal part of the state's number after dividing by the divisor is .5 or higher, it is rounded up. Originally Congress was also looking for the highest decimal remainders when something went wrong, like not enough delegates were assigned or too many in order to prioritize the last few delegate assignments. I did not bother to implement that.​
Next, I took the percentages of just the so-called statistically significant undercounts and overcounts from the report and used those to create an Adjusted Population for those few states. This is also summed up. Strangely, it is quite a bit more than the original population even though the documentation provided by the Census Bureau says they are similar. Therefore, I do not trust their numbers. In any case, I copied the Original Populations of states into this column for all states that had no significant difference. Their new sum, 331508865, was used to create a new divisor (by dividing by 435).​
Computing the adjusted populations and summing, and dividing, allowed for two more methods.​
Adjusted Huntington-Hill is running the standard Congressional method of apportionment but on the Adjusted Populations. Then, Adjusted Intuitive takes the Adjusted Populations and divides them by the new divisor. If the decimal part is above .5, it rounds up. As before, there is nothing additional.​
Any method that is discrepant in the resulting delegate count appears in bold font for the entire row where it is. This way we can look manually at discrepancies and decide what happened. Possibly what ought to be and count up legitimate results.​

Results

Original Population330759736
Adjusted Population331508865
Original divisor760367.209195402
Adjusted divisor762089.344827586

StateOriginal
Population
Adjusted
Population
Original
Huntington-Hill
Original
Intuitive
Adjusted
Huntington Hill
Adjusted
Intuitive
California39538223395382235252 (51.998853)5252 (51.881349)
Texas29145505297160533838 (38.330828)3939 (38.992873)
Florida21538187223147402828 (28.326033)2929 (29.281002)
New York20201249195294362627 (26.567754)2626 (25.626176)
Pennsylvania13002700130027001717 (17.100553)1717 (17.061910)
Illinois12812508130699871717 (16.850422)1717 (17.150203)
Ohio11799448116262171516 (15.518092)1515 (15.255714)
Georgia10711908107119081414 (14.087809)1414 (14.055974)
North Carolina10439388104393881414 (13.729403)1414 (13.698378)
Michigan10077331100773311313 (13.253242)1313 (13.223293)
New Jersey928899492889941212 (12.216458)1212 (12.188852)
Virginia863139386313931111 (11.351611)1111 (11.325959)
Washington770528177052811010 (10.133631)1010 (10.110732)
Arizona7151502715150299 (9.405327)99 (9.384073)
Tennessee6910840725776199 (9.088819)910 (9.523504)
Massachusetts7029917687589799 (9.245424)99 (9.022429)
Indiana6785528678552899 (8.924014)99 (8.903848)
Missouri6154913615491388 (8.094659)88 (8.076367)
Maryland6177224617722488 (8.124001)88 (8.105643)
Wisconsin5893718589371888 (7.751147)88 (7.733631)
Colorado5773714577371488 (7.593323)88 (7.576164)
Minnesota5706494549546888 (7.504919)77 (7.211055)
South Carolina5118425511842577 (6.731517)77 (6.716306)
Alabama5024279502427977 (6.607701)77 (6.592769)
Louisiana4657757465775766 (6.125668)66 (6.111825)
Kentucky4505836450583666 (5.925868)66 (5.912477)
Oregon4237256423725666 (5.572644)66 (5.560051)
Oklahoma3959353395935355 (5.207159)55 (5.195392)
Connecticut3605944360594455 (4.742372)55 (4.731655)
Utah3271616318902044 (4.302679)44 (4.184575)
Iowa3190369319036944 (4.195827)44 (4.186345)
Nevada3104614310461444 (4.083046)44 (4.073819)
Arkansas3011524317136144 (3.960618)44 (4.161403)
Kansas2937880293788044 (3.863765)44 (3.855034)
Mississippi2961279308820444 (3.894538)44 (4.052286)
New Mexico2117522211752233 (2.784868)33 (2.778574)
Nebraska1961504196150433 (2.579680)33 (2.573850)
Idaho1839106183910622 (2.418708)22 (2.413242)
West Virginia1793716179371622 (2.359013)22 (2.353682)
Hawaii1455271136274122 (1.913906)22 (1.788164)
New Hampshire1377529137752922 (1.811663)22 (1.807569)
Maine1362359136235922 (1.791712)22 (1.787663)
Montana1084225108422521 (1.425923)21 (1.422701)
Rhode Island1097379104462521 (1.443222)11 (1.370738)
Delaware98994893878411 (1.301934)11 (1.231856)
South Dakota88666788666711 (1.166104)11 (1.163469)
North Dakota77909477909411 (1.024629)11 (1.022313)
Alaska73339173339111 (0.964522)11 (0.962343)
Vermont64307764307711 (0.845745)11 (0.843834)
Wyoming57685157685111 (0.758648)11 (0.756934)

Discussion

There are 8 rows bolded which indicates across 4 methods there are a maximum 8 discrepancies in comparing any two. However, comparing any of one method against the original may produce much less than 8 differences such as 3 or 4. Here I do 3 comparisons of methods.​
1. Original Huntington-Hill (OHH) vs Original Intuitive (OI)
If you focus on the bolded rows but then scan downward along these two columns, you will find 4 discrepancies:​

StateOHHOI
New York2627 (26.567754)
Ohio1516 (15.518092)
Montana21 (1.425923)
Rhode Island21 (1.443222)

The standard method deducts 1 from both New York and Ohio as compared to the intuitive method. Meanwhile, the standard method adds one to both Montana and Rhode Island as compared to the intuitive method. You may observe that the decimal part of the delegate is in the .4 range for both Montana and Rhode Island, but the decimal part for both New York and Ohio is in the .5 range. This may be surprising, but as noted earlier the standard method we use is biased in favor of smaller states.​
The purpose of this comparison was to illustrate how our current process works in contrast to intuition.​
2. Original Huntington-Hill (OHH) vs Adjusted Huntington-Hill (AHH)
If you focus on the bolded rows but then scan downward along these two columns, you will find 4 discrepancies:​

StateOHHAHH
Texas3839
Florida2829
Minnesota87
Rhode Island21

In the adjusted version, Florida gains 1, and Texas gains 1. Minnesota and Rhode Island both lose 1.​
The purpose of this comparison is to illustrate how the actual process would work if the adjusted numbers were actually used in reality.​
3. Original Intuitive (OI) vs Adjusted Intuitive (AI)
If you focus on the bolded rows but then scan downward along these two columns, you will find 6 discrepancies:​
StateOIAI
Texas3839
Florida2829
New York2726
Ohio1615
Tennessee910
Minnesota87

This comparison is most interesting because the lack of knowledge about the exact process of apportionment might lead many people to make this comparison. Even in discussions here, we observe people using the divisor as I also initially thought to do. So I believe we can hypothesize that this is how the conspiracy theory started. We can also observe that there are a total of 6 discrepancies which is a number that is in common with the conspiracy theory that there were an extra 6 Democrats and missing 6 Republicans.​
What we do observe here is 3 states that are typically red and 3 that may be blue (is Ohio purple?) being impacted, with red states negatively and "blue" states positively. This does not necessarily translate to 3 Republicans and 3 Democrats.​
And as noted, the Post-Enumeration Survey results themselves are of questionable quality. Whether they are better than the census is an open question.​

Conclusion

None of the methods show an extra 6 Democrats and missing 6 Republicans.
A comparison of intuitive methods (that are not used in the process) shows 3 typically red state gains, 1 purple/blue state gain, and 2 blue state gains. This comparison may have snowballed into a more dramatic version magnifying the difference and assuming partisan results based on "blue" or "red" states. In reality, following apportionment, there is redistricting and voting and those results may differ from predictions based on state gains or losses.​
In any case, the intuitive method is not used, but instead the Huntington-Hill method of apportionment. In that scenario, which is the actual Congressional process, there are fewer discrepancies with 2 red-state gains and 2 blue-state losses.
Those also do not necessarily translate to two Republicans and two Democrats not merely because of the process following apportionment, but also because the survey results that the adjustments are based on are not necessarily improvements to the census.
 
Last edited:
I'm not aware of any more complex method. The precision simply isn't there.

I finally found how they defined statistical significance:
The PES performed statistical testing for estimates of net coverage error rates and certain components of coverage. Statements of comparison were statistically significant at the 90 percent confidence level (α = 0.10) using a two-sided test. For any PES release, estimated net coverage error rates that were statistically significantly different from zero were identified by an asterisk (*).

For a two-tailed 90% confidence interval, you will have 5% on each end. 0 will be included in one side's 5% when they are saying it is significant. When 0 is not included in the 90% middle, they are saying it is statistically significant. In a way, it is like saying there is a 95% chance (or confidence) that it is an undercount or overcount because that whole side of the distribution including one tail will be on one side of zero. So it seems equivalent in a sense to a p-value of 5% or less for significance, i.e. the convention.
So we would expect to see 5 false positives from this.

Right. Offhand, I am unsure if it's 2.5 or 5. But there's an expected average amount that still has a variation about it. And that's just due to randomness, never mind all the systemic problems they identify with surveys of this nature. Measurement error, processing error, Analysis error, Recall Bias, etc etc.

However, the number of survey points in low population states (remember, they listed the number of data points for the nation, not a number for each state) says that the low-population states will have wide error bars, I have a hard time picturing them reaching this threshold with the reported error margins.

And it gets worse. If you set the confidence interval to 98% or 99% to try to get rid of the average 5 (or 2.5) errors merely from randomness, half the things you call significant go away, especially the smaller states.

To add--their goal is not to provide new numbers but to provide insights for quality and potential improvements for the next census. So they go through multiple rounds of finding problems in their own work and then have to stop because the census numbers must be used by 2023. So in 2022 they release their survey results.
Yeah, someone took data that wasn't meant for this purpose and reported it as if it meant something.

Exactly. One of the things I had brought up before is that the dimensions are cherry-picked to create a narrative. No one online who is promoting this idea of adjustments needed because of states is also considering the Census Bureau reports that had a different dimension of race undercounts and overcounts. Suppose there was a different person out there of the left-wing variety who did the mirror image, completely ignored the state statistics and talked only about the race side of it: whites were overcounted, Blacks were undercounted, Hispanics were undercounted...then they could also create a narrative that their side (Democrats) should have more votes.

But anyway, yeah, someone cherry-picked the dimension, then created material online focusing only on states...then someone else came along and changed it to magnify things. No one also looked at the very poor quality of the survey...or at least that it is not of high enough quality to replace the census.

Compare the 2020 response data to the 2010 data. There are more than 4x as many people not willing to cooperate in 2020 as in 2010. I would be very surprised if that is random.

Another thing to look at is the bottom of page 8 and top of page 9: Recall bias and proxies. You will be astonished. So they are interviewing people up to 2022 about where they were on April 1st, 2020. A time when people who have seasonal homes in the Northeast and Florida might usually be thinking of switching but also a time when there was a pandemic and there were lockdowns. How well do people recall that....But it gets worse. If those people are not available, they ask a proxy for that old information. It's already no longer a primary source with respect to time, but now it's also second-hand info, hearsay, about someone else from up to two years before. In 2010, there were only 3.7% proxies who answered the question, but in 2020, 7% were proxies!

From their own report:
Another feature that could affect the accuracy of the data was the length of time between Census Day and the PES Interview Day. Due to the COVID19 pandemic, some activities in the PES schedule— including the Person Interview operation—had to be delayed. The expanded interval between Census Day and Interview Day increased the potential for recall bias. Recall bias is the phenomenon where, as the interview occurs further from the target event, the respondent may be less likely to remember specific details about the event. These could be details such as specific move dates, the place where they were staying most of the time, or even the birth date of a child. If a respondent couldn’t pinpoint whether events like these occurred before, on, or after Census Day, this could create error in the response data.

Further, this phenomenon could become exacerbated when a proxy responded for the household.

Emphasis added.
 
We can also observe that there are a total of 6 discrepancies which is a number that is in common with the conspiracy theory that there were an extra 6 Democrats and missing 6 Republicans.
It should be clearly noted that Musk never called this a conspiracy theory. He instead carefully calls this an incentive why members of our political class will not change anything at the US southern border. They simply have an incentive not to fix our borders and to imply that anyone is a nut case is extremely insulting and disingenuous on your part..
 
What makes the discussion even further laughable:

Because on the one hand, our political institutions have become so paranoid and worried Russia will somehow invade our western hemisphere that we risk Armageddon with a false warning of incoming missiles.

Then our insane political establishment decides we must do absolutely nothing with the certain and continuing invasion on our southern border!


This woman seems to have done thorough research on the subject of nuclear war at present and the current state of this is both frightening and sobering. Because no matter who launches first all life will end.

One really does have to wonder how we become so obsessed with Russia invading way over in EuropeZ when we don't give 2 shits about our own border??
 
We can also observe that there are a total of 6 discrepancies which is a number that is in common with the conspiracy theory that there were an extra 6 Democrats and missing 6 Republicans.
It should be clearly noted that Musk never called this a conspiracy theory. He instead carefully calls this an incentive why members of our political class will not change anything at the US southern border. They simply have an incentive not to fix our borders and to imply that anyone is a nut case is extremely insulting and disingenuous on your part..
So, the interesting question would be this: if the Republicans had not killed the bi-partisan border security bill negotiated in the Senate, would the Democrats have killed it? According to your logic, they should have.
 
Yikes! Okay, so I have to make some minor corrections. This is because there are also people overseas, such as federal employees and certain military personnel and their dependents who may not have been included in the census. Those people's counts are not affected by the net coverage error from the Census Bureau. So, they have to be added to the tables AFTER adjusting by percentages of "undercounts" and "overcounts." Also, the total is only about 350K people across all 50 states which is less than half of one divisor distributed across 50 states. So, the impact is expected to be negligible. It could conceivably make a difference if any state has way more overseas people than another as a ratio interacting with two such states that have very close on-the-edge rounding going on. However, I checked the numbers and it makes no impact on the discussion or results. The counts of discrepancies for each method comparison remain the same.

In the interest of transparency and showing correct numbers, even if the conclusions are the same, I have recomputed the numbers and made Version 2.0 for the previous post.

At the end, you will also find an addendum that has a comparison of similar work.


Version 2.0:

Introduction


I went ahead and made a program to run the Hungtinton-Hill method of apportionment. This is the standard that Congress uses and is described in a previous post. The numbers I got back reproduce the delegate allocations that are present in Congress which gives confidence in the programming. This method I call Original Huntington-Hill.
I also ran 3 other methods.​
Original Intuitive simply takes the divisor of the census population divided by 435. This is about 760367 761169. I do not know why there are other numbers out there, but this one is correct and it subtracts the DC population as I had attempted to do before but thought it was weird because it did not reproduce other numbers I was seeing at the time. This one, 760367, happens to also be documented which I found and confirms it is the right one. This is a number that I had encountered before in documentation that I had mentioned in post 716. One must subtract out the DC Population and then add overseas counts for each state to derive this divisor. So in this method, if the decimal part of the state's number after dividing by the divisor is .5 or higher, it is rounded up. Originally Congress was also looking for the highest decimal remainders when something went wrong, like not enough delegates were assigned or too many in order to prioritize the last few delegate assignments. I did not bother to implement that.​
Next, I took the percentages of just the so-called statistically significant undercounts and overcounts from the report and used those to create an Adjusted Population for those few states. I also added the overseas counts to the adjusted population. This is also summed up. Strangely, it is quite a bit more than the original population even though the documentation provided by the Census Bureau says they are similar. Therefore, I do not trust their numbers. In any case, I copied the Original Populations of states into this column for all states that had no significant difference. Their new sum, 331508865 331857563, was used to create a new divisor (by dividing by 435).​
Computing the adjusted populations and summing, and dividing, allowed for two more methods.​
Adjusted Huntington-Hill is running the standard Congressional method of apportionment but on the Adjusted Populations. Then, Adjusted Intuitive takes the Adjusted Populations and divides them by the new divisor. If the decimal part is above .5, it rounds up. As before, there is nothing additional.​
Any method that is discrepant in the resulting delegate count appears in bold font for the entire row where it is. This way we can look manually at discrepancies and decide what happened. Possibly what ought to be and count up legitimate results.​

Results

Original Population331108434
Adjusted Population331857563
Original divisor761168.813793103
Adjusted divisor762890.949425287

StateOriginal
Population
Adjusted
Population
Original
Huntington-Hill
Original
Intuitive
Adjusted
Huntington Hill
Adjusted
Intuitive
California39576757395767575252 (51.994717)5252 (51.877345)
Texas29183290297538383838 (38.340102)3939 (39.001430)
Florida21570527223470802828 (28.338690)2929 (29.292627)
New York20215751195439382627 (26.558827)2626 (25.618259)
Pennsylvania13011844130118441717 (17.094557)1717 (17.055968)
Illinois12822739130802181717 (16.846117)1717 (17.145593)
Ohio11808848116356171516 (15.514099)1515 (15.252006)
Georgia10725274107252741414 (14.090533)1414 (14.058725)
North Carolina10453948104539481414 (13.734073)1414 (13.703070)
Michigan10084442100844421313 (13.248627)1313 (13.218720)
New Jersey929449392944931212 (12.210817)1212 (12.183252)
Virginia865454286545421111 (11.370069)1111 (11.344403)
Washington771594677159461010 (10.136971)1010 (10.114088)
Arizona7158923715892399 (9.405171)99 (9.383940)
Tennessee6916897726381899 (9.087205)910 (9.521437)
Massachusetts7033469687944999 (9.240354)99 (9.017605)
Indiana6790280679028099 (8.920859)99 (8.900722)
Missouri6160281616028188 (8.093186)88 (8.074917)
Maryland6185278618527888 (8.126027)88 (8.107683)
Wisconsin5897473589747388 (7.747917)88 (7.730427)
Colorado5782171578217188 (7.596437)88 (7.579289)
Minnesota5709752549872688 (7.501295)77 (7.207748)
South Carolina5124712512471277 (6.732688)77 (6.717490)
Alabama5030053503005377 (6.608328)77 (6.593410)
Louisiana4661468466146866 (6.124092)66 (6.110268)
Kentucky4509342450934266 (5.924234)66 (5.910861)
Oregon4241500424150066 (5.572351)66 (5.559772)
Oklahoma3963516396351655 (5.207144)55 (5.195390)
Connecticut3608298360829855 (4.740470)55 (4.729769)
Utah3275252319265644 (4.302925)44 (4.184944)
Iowa3192406319240644 (4.194084)44 (4.184616)
Nevada3108462310846244 (4.083801)44 (4.074582)
Arkansas3013756317359344 (3.959379)44 (4.159956)
Kansas2940865294086544 (3.863617)44 (3.854896)
Mississippi2963914309083944 (3.893898)44 (4.051482)
New Mexico2120220212022033 (2.785479)33 (2.779191)
Nebraska1963333196333333 (2.579366)33 (2.573543)
Idaho1841377184137722 (2.419144)22 (2.413683)
West Virginia1795045179504522 (2.358274)22 (2.352951)
Hawaii1460137136760722 (1.918283)22 (1.792664)
New Hampshire1379089137908922 (1.811804)22 (1.807714)
Maine1363582136358222 (1.791432)22 (1.787388)
Montana1085407108540721 (1.425974)21 (1.422755)
Rhode Island1098163104540921 (1.442733)11 (1.370326)
Delaware99083793967311 (1.301731)11 (1.231727)
South Dakota88777088777011 (1.166325)11 (1.163692)
North Dakota77970277970211 (1.024348)11 (1.022036)
Alaska73608173608111 (0.967040)11 (0.964857)
Vermont64350364350311 (0.845414)11 (0.843506)
Wyoming57771957771911 (0.758989)11 (0.757276)

Discussion

There are 8 rows bolded which indicates across 4 methods there are a maximum 8 discrepancies in comparing any two. However, comparing any of one method against the original may produce much less than 8 differences such as 3 or 4. Here I do 3 comparisons of methods.​
1. Original Huntington-Hill (OHH) vs Original Intuitive (OI)
If you focus on the bolded rows but then scan downward along these two columns, you will find 4 discrepancies:​

StateOHHOI
New York2627 (26.558827)
Ohio1516 (15.514099)
Montana21 (1.425974)
Rhode Island21 (1.442733)

The standard method deducts 1 from both New York and Ohio as compared to the intuitive method. Meanwhile, the standard method adds one to both Montana and Rhode Island as compared to the intuitive method. You may observe that the decimal part of the delegate is in the .4 range for both Montana and Rhode Island, but the decimal part for both New York and Ohio is in the .5 range. This may be surprising, but as noted earlier the standard method we use is biased in favor of smaller states.​
The purpose of this comparison was to illustrate how our current process works in contrast to intuition.​
2. Original Huntington-Hill (OHH) vs Adjusted Huntington-Hill (AHH)
If you focus on the bolded rows but then scan downward along these two columns, you will find 4 discrepancies:​

StateOHHAHH
Texas3839
Florida2829
Minnesota87
Rhode Island21

In the adjusted version, Florida gains 1, and Texas gains 1. Minnesota and Rhode Island both lose 1.​
The purpose of this comparison is to illustrate how the actual process would work if the adjusted numbers were actually used in reality.​
3. Original Intuitive (OI) vs Adjusted Intuitive (AI)
If you focus on the bolded rows but then scan downward along these two columns, you will find 6 discrepancies:​
StateOIAI
Texas3839
Florida2829
New York2726
Ohio1615
Tennessee910
Minnesota87

This comparison is most interesting because the lack of knowledge about the exact process of apportionment might lead many people to make this comparison. Even in discussions here, we observe people using the divisor as I also initially thought to do. So I believe we can hypothesize that this is how the conspiracy theory started. We can also observe that there are a total of 6 discrepancies which is a number that is in common with the conspiracy theory that there were an extra 6 Democrats and missing 6 Republicans.​
What we do observe here is 3 states that are typically red and 3 that may be blue (is Ohio purple?) being impacted, with red states negatively and "blue" states positively. This does not necessarily translate to 3 Republicans and 3 Democrats.​
And as noted, the Post-Enumeration Survey results themselves are of questionable quality. Whether they are better than the census is an open question.​

Conclusion

None of the methods show an extra 6 Democrats and missing 6 Republicans.
A comparison of intuitive methods (that are not used in the process) shows 3 typically red state gains, 1 purple/blue state gain, and 2 blue state gains. This comparison may have snowballed into a more dramatic version magnifying the difference and assuming partisan results based on "blue" or "red" states. In reality, following apportionment, there is redistricting and voting and those results may differ from predictions based on state gains or losses.​
In any case, the intuitive method is not used, but instead the Huntington-Hill method of apportionment. In that scenario, which is the actual Congressional process, there are fewer discrepancies with 2 red-state gains and 2 blue-state losses.
Those also do not necessarily translate to two Republicans and two Democrats not merely because of the process following apportionment, but also because the survey results that the adjustments are based on are not necessarily improvements to the census.

Addendum

Here is a website that also does these types of computations. Its author seems to have political bias, unsure... Some of their results are different with the most egregious difference being Florida which they say would have gained an additional two seats. Therefore, it is worth it to delve into computation differences for this state. We can try to discover flaws in my or their computations.​
They report Florida's numbers as the following:​
StateOriginal PopulationAdjusted Population
Florida21,870,52722,631,621
I have computed Florida's census plus overseas count as 21,570,527. So these numbers differ by 300K. If you perform a Google search of "Florida 21,870,527" you will see they are the only site with this number describing its population, but if you do a Google search of "Florida 21,570,527" you will find many reputable sites listing the value with 5, not 8 as the third digit. Because all the other digits are exactly the same, it appears to be a typo.​
My computation comes from the census page which gives 21538187 plus the overseas count of 32,340 which I obtained here.​
21538187 + 32,340 = 21,570,527 not 21,870,527​
Let us do three more sanity checks: If I had put in the wrong state population for Florida, off by 300K, then I would not have been able to sum up all the state populations, divide by 435, and reproduce the divisor of 761169. Second, all the other counts I computed for these 13 other states in the table in Results are exactly the same as the 13 other states' original populations on their website. Finally, they are claiming that this produces an error of - 761,094 for Florida but that is less than the divisor. So how could a change less than the divisor result in an error of 2 delegates?​
I next will try to reproduce their second number (Adjusted Population).​
The way that the undercount and overcount rates are imagined intuitively is not really how they are computed. I will first show the simple, straightforward, imagined way they might be used.​
Note the undercount rate is 3.48% for Florida. So one may imagine that all one has to do is multiply the original census number by 1.0348 to get an adjusted census number and following that add the overseas count for Florida.​
However, they actually added the overseas count (plus the erroneous 300K padding) first and then applied the intuitive correction factor.​
21,870,527 x 1.0348 = 22,631,621.3396 ~= 22,631,621​
It turns out that this is wrong for 3 reasons: (a). the correction should not be applied after adding overseas counts because this will magnify the effect, (b). they inadvertently padded Florida by 300K people, and (c) the intuitive way to apply the correction factor is incorrect.​
I will now show how I computed Florida's numbers.
StateOriginal PopulationAdjusted Population
Florida21,570,52722,347,080
I have already explained how I computed the original population which is the census count for Florida plus the overseas count:​
21538187 + 32,340 = 21,570,527​
Next, note how the Census Bureau defines the so-called error rates that I posted earlier:​
Census Bureau said:
A net coverage error rate is the difference between the census count and the PES estimate of the number of people in the United States expressed as a percentage of the PES estimate.
"a net coverage error rate" means an undercount rate or an overcount rate. The mention of PES estimate twice may be confusing, but it's the Adjusted Census count. Let's define some quick variables:​
e = net coverage error rate, the undercount rate or overcount rate​
NP = new population, adjusted census count based on the alleged error​
OP = old population from the census​
The wrong, but intuitive, way to think of it was​
NP = OP x (1 - e); so in the case of Florida, there was a correction factor of (1 - (-3.48%)) or (1.0348)​
BUT that isn't what they said. They are saying:​
e = (OP - NP) / NP​
The new population is the baseline of comparison, not the old population.​
Solve this for NP, you get:​
NP = OP / (1 + e)​
For Florida, NP = 21538187 / (1 - 3.48%) = 21538187 / .9652 ~= 22,314,739.95 ~= 22,314,740​
But lastly, one needs to add the overseas count​
22,314,740 + 32,340 = 22,347,080​
 
We can also observe that there are a total of 6 discrepancies which is a number that is in common with the conspiracy theory that there were an extra 6 Democrats and missing 6 Republicans.
It should be clearly noted that Musk never called this a conspiracy theory.

The discussion has morphed from illegal immigrants being far too many causing mayhem and crimes in red states to illegal immigrants being used in census counts and looking the other way while republicans send them off in planes to the places they say they don't want them because that will increase the census counts TO FINALLY a discussion of raw census counts that pretty much ignores the whole illegal immigrant issue but is still alleged to be unfair because of GIANT differences in census vs error adjustments that turn out to be smaller differences with very disputable, low-quality things being called errors. So, essentially, what you are talking about is different but the same theme.

He instead carefully calls this an incentive why members of our political class will not change anything at the US southern border. They simply have an incentive not to fix our borders and to imply that anyone is a nut case is extremely insulting and disingenuous on your part..

Do you think math implies people are nutcases?
 
Then our insane political establishment decides we must do absolutely nothing with the certain and continuing invasion on our southern border!

There's a simple solution. Stop giving millions of jobs to undocumented employees.
No budget busting Wall necessary. No massive ramping up of border guard budgets. Start prosecuting illegal employers.
Tom
 
Sorry, I just kind of dropped off there with the addendum. To continue...

That website states the following summary:
The following map shows how the 2020 Congressional Apportionment would have changed if each State’s total population were accurately reported by the 2020 Census. California, Illinois, Ohio, Michigan, West Virginia, Pennsylvania, and New York would each still have lost a seat, but Minnesota and Rhode Island would each have lost a seat as well. North Carolina, Montana, and Oregon would each still have gained a seat, but Texas and Florida would each have gained three seats instead of two and one, respectively, while Colorado would not have gained any.

Emphasis is added to the portions they are describing as different if adjustments were to be applied. They are counting 5 delegate changes or discrepancies. Minnesota -1, Rhode Island -1, Texas +1, Florida +2, Colorado -1. If you look at the tables, you can see with adjusted numbers Colorado has an edge case of 7.57... delegates. Because they padded Florida numbers with 300K people, Florida ended up winning an extra delegate it ought not to have won and it takes it from the edge case of Colorado. So, if you put that back with Colorado, there will only be 4 discrepancies: Minnesota -1, Rhode Island -1, Texas +1, Florida +1, which is the same result as I had gotten in the AHH vs OHH comparison. Even though they may have run the error correction after adding overseas counts, those did not impact their conclusions. Their primary issue was the padding of 300K.
 
If there is an accurate enough account to show the census undercounted why don’t we use *that* count instead of the census? Am I missing something here? Sorry, I haven’t followed all the posts in this thread
Because the current method is to reapportion based on the US Census. Reapportionment only occurs every 10 years, when a new Census is done.

There's good sense in not having reapportionment done too frequently - it would be very unstable. Hypothetically, we could do a good estimate every year, and apportion as needed... but that would be a bad idea. Apportionment affects the districting within states, and those districts are what the representatives represent (theoretically anyway). The districts are supposed to be drawn such that they're all roughly equal in terms of the number of people being represented. If it happened every year, it would be constantly changing, we'd find ourselves having to fire the occasional congresscritter, or add new ones without sufficient preparation. It would be more problem than it's worth.

On the other hand... given the rate of movement between states that we see in modern times, as well as the impacts of immigration and emigration... I think that every 10 years is probably too long.
 
...
I was under the impression it was 750,000.
...

You might think that you take the total population and divide it by 435, the set number of reps since 1913. This would give about 330M / 435 ~= 761K. However, it doesn't work that way. It might even be close most of the time. One might go ahead and imagine apportionment methods, but it's actually more complicated than intuitive arithmetic and the process has a history that is centuries long.

Problems began to appear early with intuitive methods. Like rounding off the numbers to the nearest whole number. Divide 19 reps among 3 states each having 6.33 delegates. Do you round up, round down/truncate? If you round up, they are all 6, not 19, but there are more complicated scenarios that you can google which gave rise to very insane stuff.

First, you need to meet a Constitutional mandate that each state has to have at least 1 rep. So, all these reps are allocated in the complicated apportionment process. That leaves 385 reps to be distributed among the 50 states in some way based on populations. They do this 1 rep allocation at a time. They assign each state a priority which is the ratio of the state population (P) to the geometric mean of its assigned reps so far in the process and the number it would get if it got this one (n+1). So priority = P / sqt(n * (n+1)). In the 51st round, California will take that spot because all denominators are the same for all states and California has the highest P. Texas might be the next one since California's denominator has increased dramatically in comparison to its last one. The process continues until all 435 spots are assigned.

This method is called the Huntington-Hill method and was alleged to be unbiased when it was implemented in the 40s. However, analysis since then has shown it is biased toward smaller states.

That all said, I'm not going to go through that process to try to prove anything. That would be a bit of work to do.

Suppose we tried to assess this imperfectly by examining the intuitive ways to do apportionment. The 2020 US Census population was 331,464,948. Divided among 435 delegates, we'd get an average of 761,988. I've seen different figures near this floating around the Internet so let's hope we do not observe numbers on the edge.

Suppose we did look at Arkansas. Its 2020 census population was 3,011,524. The midpoint of the undercount confidence interval is 151,781. So add the numbers together and you get 3,163,305. Now, divide each by 761,988. You go from about 3.95 delegates to about 4.15 delegates. Both of these easily round to 4. And if you check the apportionments for Arkansas, you will find that they do have 4 currently. There was no loss of representation. No missing Republican congress critter.

Florida is supposed to be the questionable one. 21,538,187. The midpoint of the confidence interval for undercount is 3.48% which I will convert to people: 749,529. If we add that midpoint of interval to the census population, we get 22,287,716. The original count divided by the 761K gives 28.27 delegates. The adjusted count divided by that gives 29.25. So intuitively we would think Florida missed addition of a delegate, provided that we accept the estimated undercount as a true undercount. Further, if we look up Florida info, we can see they currently only have 28 delegates. So, it's good reason to believe they'd probably have 29 if they had a different census count, though we cannot say if it is valid without understanding the root cause for the discrepancy between census and sampling.

Now, apportionment is only the first step. Because the count of delegates changed in this hypothetical, they'd next have to do redistricting. We know there might be shenanigans there and so we cannot claim that the result of redistricting is necessarily fairly representative of what ought to be so far as addition of an unbiased district.

Given the extreme partisan gerrymandering in Florida, in all fairness the district ought to undo some of that...and likewise the new district ought to be reflective of changing populations in some way.

(assuming that there ought to be a new delegate).
Very informative
 
Back
Top Bottom