Generalizing from Survey Experiments Conducted on Mechanical Turk: A Replication Approach

Leeper, T. J., Coppock, A. & Mullinix, K. J. (2022). Generalizing from Survey Experiments Conducted on Mechanical Turk: A Replication Approach. [Dataset]. Code Ocean. https://doi.org/10.24433/co.2960428.v1
Copy

The extent to which studies conducted with non-representative convenience samples are generalizable to broader populations depends critically on the level of treatment effect heterogeneity. Recent inquiries have found a strong correspondence between average treatment effects estimated in nationally representative experiments and in replication studies conducted with convenience samples. In this paper, we consider three possible explanations: low levels of effect heterogeneity, high levels of effect heterogeneity that are unrelated to selection into the convenience sample, or just good luck. We reanalyze 27 original-replication study pairs (encompassing 101,745 individual survey responses) to assess the extent to which subgroup effect estimates generalize. While there are exceptions, the overwhelming pattern that emerges is one of treatment effect homogeneity, providing a partial explanation for strong correspondence across both unconditional and conditional average treatment effect estimates.

Available at: 10.24433/co.2960428.v1

Access level: Open

Licence: CC0 1.0


Export as

EndNote BibTeX Reference Manager Refer Atom Dublin Core JSON Multiline CSV
Export

Downloads