Historical analysis of legal opinions with a sparse mixed-effects latent variable model
We propose a latent variable model to enhance historical analysis of large corpora. This work extends prior work in topic modelling by incorporating metadata, and the interactions between the components in metadata, in a general way. To test this, we collect a corpus of slavery-related United States property law judgements sampled from the years 1730 to 1866. We study the language use in these legal cases, with a special focus on shifts I opinions on controversial topics across different regions. Because this is a longitudinal data set, we are also interested in understanding how these opinions change over the course of decades. We show that the joint learning scheme of our sparse mixed-effects model improves on other state-of-the-art generative and discriminative models on the region and time period identification tasks. Experiments show that our sparse mixed-effects model is more accurate quantitatively and qualitatively interesting, and that these improvements are robust across different parameter settings.
| Item Type | Working paper |
|---|---|
| Copyright holders | © 2012 Association for Computational Linguistics |
| Departments | LSE > Academic Departments > Economics |
| Date Deposited | 16 Oct 2013 |
| URI | https://researchonline.lse.ac.uk/id/eprint/53577 |
Explore Further
- https://www.aclweb.org/anthology/P/P12/P12-1078.pdf (Publisher)
- https://www.scopus.com/pages/publications/84878178053 (Scopus publication)
- http://aclweb.org/ (Official URL)