Undercoverage bias occurs when a part of the population is excluded from your sample. As a result, the sample is no longer representative of the target population. Non-probability sampling designs are susceptible to this type of research bias.
Example: Undercoverage bias
You are conducting research by randomly calling landline numbers. Because of your sampling method, individuals who only have mobile phones are not sampled. In this case, they are not merely undercovered, but not covered at all.
Undercoverage is a type of selection bias.
Undercoverage bias is the systematic distortion of a study’s findings due to the way the sample was selected.
Ideally, researchers should draw a sample that, like a snapshot, adequately captures characteristics that are both present in the target population and relevant for the research. In other words, researchers aim to collect a representative sample.
In some cases, researchers may sample too few units from a specific segment of the population. If the segment is small in comparison to others in the population, this may not impact the research findings much. However, if the segment is larger, it can lead to a sample that doesn’t accurately capture the characteristics of the population.
In more extreme cases, researchers may completely fail to include a part of the population, which can distort the findings completely.
Keep in mind that two things must both happen for undercoverage bias to occur:
Example: Undercoverage bias
Online surveys exclude people who don’t have internet access. Previous research shows that internet access also relates to demographics like socioeconomic status and age.
Depending on your research objective, this may impact your results. If your research is on online shopping habits, opting for an online survey is a legitimate choice. If however, you are interested in people’s voting intentions, running an online survey excludes a significant part of the population, leading to undercoverage bias.
If your sampling frame excludes a large part of your target population, you need to step back and consider how the excluded units may systematically differ from those included in your sample.
Although undercoverage bias and nonresponse bias may seem similar, they are actually quite different.
In other words, undercoverage means that some units never make it into the sample or are inadequately represented. Nonresponse means that some units are included in the sample, but their responses are missing.
There are two main sources of undercoverage bias:
Non-probability sampling designs like convenience sampling are almost always biased. When researchers recruit study participants based on proximity or ease of access, their results can’t be representative of the population. The reason is that not all members of the population of interest have an equal chance of being selected for the survey.
For example, if you stand at a shopping mall and select shoppers as they walk by to fill out a survey, you are neglecting to survey everyone not at the mall that day.
Probability samples are not immune to undercoverage bias either. Simple random sampling can also yield biased results if the sampling frame is incomplete.
For example, if you use email lists or phone lists as a sampling frame, error may be introduced due to the makeup of the list. In other words, the individuals included in the frame may differ from those who are not. As a result, a segment of the population is not sampled at all or is underrepresented in the sample.
Undercoverage varies in severity depending on the population studied.
Example: Incomplete sampling frame
You are researching the self-reported health status of the employees at a big law firm. The human resources department agrees to give you a list of all employee email addresses. You use this list to draw a sample and ask them to fill in your survey.
However, the list is outdated. More recent recruits to the firm are not in your sampling frame because they weren’t on the email address list. If these new employees are not sampled, your survey results will be affected by undercoverage bias.
If there are only a couple of recent hires, this may not be a problem, and the discrepancy between the target population and the sample will be small. If the law firm recently hired many new employees, the discrepancy will be bigger, and so will the extent of the undercoverage bias.
There are a few steps you can take to shield your research from undercoverage bias:
Use probability sampling. If the goal of your research is to draw a representative sample, probability samples will allow you to safely generalize your findings to a large population better than non-probability samples.
Nikolopoulou, K. (2023, March 24). What Is Undercoverage Bias? | Definition & Example. Scribbr. Retrieved October 16, 2023, from https://www.scribbr.com/research-bias/undercoverage-bias/