Multi-label prediction for political text-as-data

Erlich, A., Dantas, S. G., Bagozzi, B. E., Berliner, D.

& Palmer-Rubin, B. (2022). Multi-label prediction for political text-as-data. Political Analysis, 30(4), 463 - 480. https://doi.org/10.1017/pan.2021.15

Copy

Political scientists increasingly use supervised machine learning to code multiple relevant labels from a single set of texts. The current "best practice"of individually applying supervised machine learning to each label ignores information on inter-label association(s), and is likely to under-perform as a result. We introduce multi-label prediction as a solution to this problem. After reviewing the multi-label prediction framework, we apply it to code multiple features of (i) access to information requests made to the Mexican government and (ii) country-year human rights reports. We find that multi-label prediction outperforms standard supervised learning approaches, even in instances where the correlations among one's multiple labels are low.

Item Type	Article
Copyright holders	© 2021 The Authors
Departments	LSE > Academic Departments > Government
DOI	10.1017/pan.2021.15
Date Deposited	2 July 2021
Acceptance Date	21 April 2021
URI	https://researchonline.lse.ac.uk/id/eprint/110971

Explore Further

Berliner, Daniel

https://www.scopus.com/pages/publications/85107965219 (Scopus publication)
https://www.lse.ac.uk/government/people/academic-staff/daniel-berliner (Author)
https://www.cambridge.org/core/journals/political-... (Official URL)

Berliner, D., Erlich, A., Dantas, S., Bagozzi, B. & Palmer-Rubin, B. (2021). Replication Data for: Multi-label Prediction for Political Text-as-Data. [Dataset]. Harvard Dataverse. https://doi.org/10.7910/dvn/sovpa4

Multi-label prediction for political text-as-data

Explore Further

Export as