Uncovering digital trace data biases:tracking undercoverage in web tracking data
Digital trace data is an increasingly popular alternative to surveys, often considered as the gold standard. This study critically assesses the use of web tracking data to study online media exposure. Specifically, we focus on a critical error source of this type of data, tracking undercoverage: researchers’ failure to capture data from all the devices and browsers that individuals utilize to go online. Using data from Spain, Portugal, and Italy, we explore undercoverage in online panels and simulate biases in online media exposure estimates. We show that undercoverage is highly prevalent when using commercial panels, with more than 70% of participants affected. Additionally, the primary determinant of undercoverage is the type and number of devices used, rather than individual’s characteristics. Moreover, through a simulation study, we demonstrate that web tracking estimates are often substantially biased. Methodologically, the paper showcases how auxiliary survey data can help study web tracking errors.
| Item Type | Article |
|---|---|
| Departments | Methodology |
| DOI | 10.1080/19312458.2024.2393165 |
| Date Deposited | 13 Aug 2024 13:30 |
| URI | https://researchonline.lse.ac.uk/id/eprint/124537 |
Explore Further
-
picture_as_pdf -
subject - Published Version
-
- Available under Creative Commons: Attribution-NonCommercial-No Derivative Works 4.0