Measuring human capital in the United States using copyright title pages, 1790-1870

Rapone, Tancredi (2022) Measuring human capital in the United States using copyright title pages, 1790-1870 [Working paper]

This paper uses optical character recognition (OCR) to analyze book production in the US from 1790 to 1870, using copyright title pages taken from the online archives of the Library of Congress. We construct national time series of book production over this period, which show an upturn in per-capita terms in 1830, around the starting point of the US industrial revolution. We break down book production into eight topics using keywords: science, religion, novel, invention, diffusion, business, philosophy and textbook. On this basis we show that the composition of book production by topic is stable over time, except for textbooks and novels, which show a persistent increase over the whole period in both relative and absolute terms. This dates the beginning of the growth in US human capital to before 1870, when the first reliable data on schooling and literacy begin. We thus offer mild support for an interpretation of US growth over the 19th century based on the expansion of knowledge and capabilities, while conceding that the link between the content of books and industrialization is tenuous.
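
As a rough illustration of the keyword approach described in the abstract, the sketch below tags OCR'd title-page text by topic and computes yearly counts and a per-capita rate. The eight topic names come from the abstract; the keyword lists, the function names, and the sample inputs are hypothetical and are not taken from the paper, whose actual keywords and matching rules may differ.

```python
import re
from collections import Counter, defaultdict

# Topic names follow the abstract; the keywords are illustrative assumptions,
# not the lists used in the paper.
TOPIC_KEYWORDS = {
    "science": {"science", "chemistry", "astronomy"},
    "religion": {"sermon", "gospel", "bible"},
    "novel": {"novel", "tale", "romance"},
    "invention": {"invention", "machine", "patent"},
    "diffusion": {"manual", "guide", "treatise"},
    "business": {"commerce", "bookkeeping", "merchant"},
    "philosophy": {"philosophy", "logic", "ethics"},
    "textbook": {"grammar", "arithmetic", "speller"},
}


def tag_topics(title_text):
    """Return the set of topics with at least one keyword in the title text."""
    words = set(re.findall(r"[a-z]+", title_text.lower()))
    return {topic for topic, kws in TOPIC_KEYWORDS.items() if words & kws}


def counts_by_year(records):
    """records: iterable of (year, title_text) pairs.

    Returns (total titles per year, per-year Counter of topic hits).
    A title can count toward several topics, or toward none.
    """
    totals = Counter()
    by_topic = defaultdict(Counter)
    for year, text in records:
        totals[year] += 1
        for topic in tag_topics(text):
            by_topic[year][topic] += 1
    return totals, by_topic


def per_capita(totals, population, scale=100_000):
    """Titles per `scale` inhabitants; `population` maps year -> headcount."""
    return {year: scale * n / population[year] for year, n in totals.items()}
```

Exact word matching is the simplest choice here; noisy OCR output would likely call for stemming or fuzzy matching, which this sketch leaves out.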
