candycorn
Diamond Member
How people who should know better abuse math to bolster the âelection fraudâ lie
Imagine that you run a small deli. Over the years, youâve noticed that thereâs a rhythm to when you sell sandwiches during the week, with a peak generally landing about 1:30 p.m. or so and then fading over the next few hours.
Curious about whether you can precisely predict daily sales, allowing you to manage supplies and staffing, you decide youâre going to track sandwich sales for a week. The result looks like this.
Youâre not a math whiz, so you pass the data to your cousin Fritz, who has a PhD. Your question to Fritz is simple: How many sandwiches should you expect to sell each hour of a weekday?
In short order, he tells you something unexpected. The data you gave him isnât sandwich sales at all. Instead, itâs phony data, derived from an algorithm aimed at masking deli fraud. And he can prove it.
See, if you take the sales from two days â say, Monday and Tuesday â and average the values, you can then create a sixth-order polynomial that describes the hourly pattern. Fritzâs PhD allows him to do the math himself, he assures you, but he passes along a formula that he derived from Excel. There it is: the precise formula for determining how many sandwiches (the y value) you will sell each hour (the x value). Thatâs math, hard at work, way beyond your ken â but precise.
But wait! If you take that same formula and compare it to the sales each day of the week, something alarming happens. The formula predicts the number of sandwiches being sold very well. Suspiciously well. If you look at the R-value of the correlation between the sales each hour and compare it to the formula, you get numbers that are very close to 1, meaning itâs a perfect correlation. And in a human-based system like sandwich sales, that shouldnât happen!
Below, we used the average to do the R-value calculation, but you get the point. For each day, the predicated sales â here, the average â is extremely close to perfectly correlated to the actual sales. Ergo: This could be a function only of a computer-based effort to forge sales data.
You find this surprising for quite a few reasons. The first is that you tallied the sales yourself, so you know theyâre correct. The second is that, even if Fritz were right that the numbers were artificial, why does he assume thereâs some deli-fraud algorithm out there thatâs responsible? The third is that, even without a PhD, you see a problem with Fritzâs analysis. Heâs comparing an average derived from two of the values with all five of the values. Doesnât it seem obvious that the result would be a strong correlation?
The answer, of course, is yes. Being surprised that sandwich sales over the course of the day is correlated to an average of the number of sandwiches sold over the course of two days is like being surprised that a coin comes up heads about half the time you flip it.
Or, more to the point, like being surprised that an estimate of voter turnout based on four counties in Michigan correlates strongly to voter turnout in nine counties in Michigan â including the four used to generate the âsixth-degree polynomialâ (that complicated formula) in the first place.
This, however, is what the analysis of Douglas Frank, PhD, offers. Frankâs analysis of voter data in Michigan has led him to determine with seeming authority that the election results in that state were rigged, tailored to match the precise formula he himself derived from the stateâs results. Claims like Frankâs analysis of Michigan have earned him the attention of MyPillow chief executive Mike Lindell, whose efforts to prove that voter fraud occurred in 2020 has led him to elevate all sorts of unfounded allegations about last yearâs presidential election. Frankâs analysis has convinced others, too, with the conservative polling firm Rasmussen Reports elevating a write-up of his allegations over the weekend.
Rasmussen highlighted a different part of Frankâs assessment, the idea that about 66,000 Michigan voters cast ballots in last yearâs election who werenât in voter rolls in October. As The Washington Postâs Lenny Bronner quickly pointed out, Michigan has same-day voter registration, so those 66,000 voters are almost certainly just people who actually werenât registered in October but who voted anyway.
The firm, which consistently showed more favorable approval data for Donald Trump over the course of his presidency than other pollsters, has repeatedly elevated dubious and unfounded fraud claims over the past few months. Thatâs aligned with a broader shift in its public-facing presence to be more aggressive toward critics from the mainstream media. (Last year, it accused me of ârepublishing a defamatory falsehood [and] committing fraudâ for pointing out that its 2018 general-election polling showed Republicans with a one-point lead over the Democrats in an election where Democrats won more votes in House races national by a nearly 10-point margin.) Responding to Bronnerâs tweet, Rasmussen offered the equivalent of a âjust asking questionsâ shrug.
It should know better than to take Frankâs analysis at face value. This is a polling firm, after all, a company whose business is statistical analysis. Yet, there it was, sharing Frankâs claims uncritically.
Frank has been working with an attorney named Matthew DePerno, who has been sharing graphs from Frankâs presentation on Twitter with a bit of colorful commentary.
So what do those graphs show? What our third sandwich chart shows: that a prediction of how many votes would be cast in a Michigan county by age derived from the number of votes cast in a Michigan county by age correlates with the number of votes cast in a Michigan county by age. Frank does a lot of hand-waving on the side, like that discrepancy between the October voter roll and votes cast and by including comparisons of Census Bureau population estimates â which appear to be five-year averages of the population from 2015 to 2019 â are lower than the number of registered voters in some places. (Frank does point out that this could be a function of outdated voter rolls, but he doesnât dwell on it.)
The heart of his analysis, though, is that R-value correlation between his predicted turnout and the actual turnout. How did he generate his prediction?
âWhat I actually did is I averaged four counties, the four largest counties, and used that key to predict all nine,â he explains. A few seconds later, he marvels that âthe accuracy of my prediction is just ridiculously good. It shouldnât be that good.â
Well, it should, because you are predicting data based on the data itself. If it werenât a really close correlation, thatâs when things would get funky.
Incidentally, that Frank is using a âsixth-order polynomialâ doesnât mean heâs doing some incredibly complicated calculation. It just means that heâs trying to fit his prediction as closely as possible to the existing data, thereby increasing the correlations.[/I]
OAN's math wiz turned out to be a convicted felon who makes his living assembling children's swing sets.
I think he works with the ditz in the OP as well.
Dominion says OAN's 'expert mathematician' who claimed to prove election fraud had a job 'setting up swing sets'
While Ed Solomon claims to have uncovered an election conspiracy, Dominion says he doesn't have a math degree.
