The Perils of Polling Part I: Can Polling Really Tell Us Who Will Win The 2020 American Presidential Election?
More than 100 years ago, writer H.G. Wells said: “Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write.”
Polling of public opinion has become an industry of its own, one of our more thriving growth industries.
Today we ask people what they think about the 2020 presidential election: President Trump's strategy and tactics for putting a halt to illegal immigration; the ever-growing budget deficit and its implications for America's future; the meaning of Democratic Socialism; the implications of Bernie Sanders "Medicare For All" proposal and dozens of other topics now front-page news.
An irresistible urge to poll people on every conceivable topic has seized media companies corporations, politicians, healthcare institutions and the like.
We ask workers what they find wrong with their jobs, with perhaps no better result than to activate their natural human instinct for dissatisfaction.
We ask CEOs and CFOs what they see for the immediate economic future, knowing that they cannot know─and when there's been little time for sober thought.
We ask people their opinion on raising the minimum wage, increasing corporate taxes, levying a wealth tax on multimillionaires and billionaires, government control of the means of production, providing government "freebies" in a multitude of areas, growing income inequality (when we know full well few of those sampled/polled have the prerequisite economic literacy to fully understand what this ultimately means to their future purchasing power), Social Security, pensions and all the rest.
We're bombarded almost daily with the result of polls. But─and this is a big "but"─in the public's “opinion” on almost any issue will be a function of three factors, namely, the questions asked, the responses and the analysis.
Many books and articles have been devoted to the subject of misuses and abuses of polling and sample surveys, but in today's fast-paced media world, the science of correctly collecting, summarizing, analyzing and using data seems bothersome. Ignorance is bliss!
Lawmakers seem more interested in the results of polls, not in how they were obtained. Until recently, it didn't occur to them that the results of polls and how they're designed, implemented and analyzed are inseparable.
A Neglected Truth About Opinion Polls
Neil Postman, in a marvelous book entitled Technopoly, delightfully exposed an inherent weakness in interpreting survey results:
Pollsters ask questions that will elicit yes or no answers. Polling ignores what people know about the subjects they're queried on."
In a culture that isn't obsessed with measuring and ranking things, this omission would probably be regarded as bizarre.
But let us imagine what we'd think of opinion polls if the questions came in pairs, indicating what people 'believe' and what they 'know' about the subject.
If I may make up some figures, let us suppose we read the following:
The latest poll indicates that 72 percent of the American public believes we should withdraw economic aid from Nicaragua…
Of those who expressed this opinion, 28 percent thought Nicaragua was in Central Asia, 18 percent thought it was an island near New Zealand, and 27.4 percent believed that ‘Africans should help themselves,' obviously confusing Nicaragua with Nigeria...
Moreover, of those polled, 61.8 percent didn't know that Americans give economic aid to Nicaragua, and 23 percent didn't know what ‘economic aid’ means…"
Postman sadly concluded: "Were pollsters inclined to provide such information, the prestige and power of polling would be considerably reduced."
Making Postman's observation into a modern day example: While Medicare For All polls well (56%) in the abstract, support plummets to 37% when voters are told it means the end of private insurance on which 180 million Americans currently depend.
Polls Show Support for Embattled Public Sector Workers – A Suspicious Finding
Several years ago, a New York Times/CBS poll reported a majority of people opposed efforts to weaken collective bargaining rights and were against cutting the pay or benefits of public employees to reduce state budget deficits.
Similarly, a national Gallup survey suggested that a majority of Americans oppose measures like the one once proposed in Wisconsin that restricts collective bargaining rights for public employees–a result near-identical to the New York Times/CBS News poll.
It'd be interesting to apply Postman's comments relating to what those sampled knew about the issues involved. Readers can draw their own conclusions.
We should say, however, that one sharp commentator on the New York Times website summed it up this way: “Americans prefer to have their taxes raised rather than decrease benefits to public employees… that would certainly be news if it were true.”
How to Guarantee Wrong Conclusions from Polls or Surveys
Let's get right to it. Classically trained statisticians call it the statistical universe or population.
What is a statistical universe? The statistical universe consists of all things about which conclusions are to be drawn.
For example, in a study of price fluctuations in the price of heating oil in New York City from 2015 to 2018, the statistical universe would include every price change which had occurred during the specified time interval.
If the scope of the study were expanded to cover a larger territory or a longer period of time, the statistical universe is correspondingly enlarged.
Obviously the term statistical universe is an elastic one and varies in its precise connotation every statistical undertaking.
Take-home message: In general, the statistical universe may be defined as a totality embracing every item which might've been brought under observation had a complete enumeration been effected.
The Need for Sampling
Lack of time and money render it impossible to make a complete survey of most statistical universes. Thankfully, it's not necessary to survey the entire statistical universe.
Why? Because hard-working, brilliant statisticians discovered how to get the same information from carefully selected, relatively small samples.
Making the correct inferences on small size samples taken from large or sometimes infinite statistical universes is the subject matter basic and advanced statistics.
If the sampling process is properly carried out, an analysis of the samples makes it possible to infer information about the statistical universe within the limits of unavoidable chance errors of sampling–the so-called margin of error.
Unfortunately, many courses in basic statistics fail to emphasize one critical point, namely, if the statistical universe is improperly defined, the powerful techniques of inferential statistics (making inferences on the basis of samples) are of little or no value.
The term for making inferences from an improperly defined statistical universe is called non-sampling error.
Statistical techniques designed to measure sampling error are valueless if the group conducting the poll/survey has committed the biggest statistical error of them all─non-sampling error.
An Example of a Poorly Defined Statistical Universe
Undoubtedly, the most widely publicized illustration of a poorly defined statistical universe is the one concerning the Literary Digest's error in predicting the winner of the presidential election of 1936. (Indeed, this was the example most often cited by W. Edwards Deming – a statistical sampling guru and quality management pioneer).
During the 1936 election campaign between Democrat Franklin D. Roosevelt and Republican Alfred M. Landon, the Literary Digest magazine sent mock ballots to a large list of people whose names appeared in telephone directories and automobile registration records. (Their lists also included their own magazine subscribers, country club members and the like).
Over 10 million mock ballots were sent out; 2.4 million ballots were returned by respondents; 7.6 million were not returned.
On the basis of 2.4 million returned ballots, the Digest predicted Landon would win by a comfortable margin–indeed, a landslide.
As it turned out, however, Roosevelt received 61% of the votes cast, a proportion representing one of the largest majorities in American presidential history.
How Could They Be So Wrong?
Polls only represent the people who are in a statistical universe and who respond to them. Despite the sample's huge size, this election became a textbook case of a biased sample: all the sample's component groups were heavily Republican.
Let's get more specific. There were two important reasons for the erroneous prediction– namely:(1) an incorrectly defined statistical universe and;(2) non-response bias.
Everyone with telephones and automobiles in 1936 were in a higher economic group than those people without these two " luxuries.”. There was a bias inherent in the statistical universe.
A large percentage of the voting population would not show up in telephone directories, automobile registrations, and club memberships.
The statistical universe was improperly defined─it tended to be biased in favor of higher income groups. Higher income groups tended to be Republican.
In the 1936 election there was a strong relationship between income and party preference. Lower income groups tended to be Democratic.
Bias in the Sample Selection Process
Classically trained statisticians define a bias as a persistent error in one direction. What does this mean?
No matter who you sampled from the Literary Digest' statistical universe, there was a high probability a relatively affluent person would be selected.
To repeat: the statistical universe selected was slanted towards middle and upper-class voters and excluded most lower-income voters. And, in reality, there were a great many low income voters in 1936.
Nine million people were unemployed in 1936.
“With regard to economic status, the Literary Digest poll was far from being a representative cross-section of the population. Then, as now, voters are generally known to vote with their pocketbooks.”
It should be mentioned–indeed, emphasized–that George Gallup was able to predict a victory for Roosevelt using a much smaller sample of about 50,000 people.
Gallup's statistical universe consisted of a representative cross-section on the population.
The Literary Digest poll sample size was 2.4 million people. This illustrates that a poorly defined statistical universe cannot be compensated for by increasing the size of the sample, which in fact just compounds the mistakes.
The second problem with the Literary Digest poll was that out of the 10 million people whose names were on the original mailing list, only 2.4 million responded to the survey.
It was then a fact that individuals of higher educational and higher economic status were more likely to respond to mail questionnaires than those of lower economic and educational status.
Therefore, the non-response group–7.6 million people–contained a high percentage of the lower economic status group. The 2.4 million people who responded to the questionnaire tended to be from a higher educational and economic status group.
A case study involving the Roosevelt/ Landon poll from the University of Pennsylvania's Wharton school describes the situation as follows:
“When the response rate is low (as it was in this case, 24%), a survey is said to suffer from non-response bias. This is a special type of selection bias where reluctant and non-responsive people are excluded from the sample.”
A Quick Look At The Erroneous Predictions Of The 2016 Presidential Election
The 2016 presidential polls was a startling upset. Yet several savvy practicing statisticians/polling organizations called attention to the fact in an era in which few people answer landline calls, pollsters would have difficulty in reaching selected target audiences. In short, non-response error could severely bias polling results.
Indeed, several major polling organizations predicted total response ratios could be in 9 to 10% range – probably closer to 9% or less.
That could make them increasingly likely to under-represent key voting blocs. Non-response bias could impact/distort the results.
Summary and Conclusions
It's a major error to take a sample from a statistical universe which differs considerably from the “ideal” statistical universe about which you want to draw valid conclusions.
Considerable time must be spent in defining the ideal statistical universe and every attempt must be made to draw a representative sample from that universe.
Always be on the lookout for a poorly defined statistical universe. Fancy calculations relating to sampling errors, interval estimates and highfalutin statistical tests are meaningless if the statistical universe from which conclusions are drawn is incorrect.
In recent years, a growing number of people use only a cell phone and do without a separate landline in their home. Cell-phone-only persons in many surveys (but certainly not all) are generally not included–and some people are skeptical about the results.
Finally, beware of non-response bias. Said the Wharton school case study: “People who respond to surveys are different from people who don't, not only in the obvious way (their attitude toward surveys) but also in more subtle and significant ways.”
Part II of this article continues the discussion of the colossal blunders that can be made by improperly defining the statistical universe.