Page 1 of 1

Bot that finds u.s. public domain works published before 1964 on HathiTrust

Posted: December 24th, 2023, 5:50 am
by jlso4u
https://botsin.space/@SecretlyPublicDomain

"Most of the books published in the US before 1964 never had their copyright renewed, putting them in the public domain. Here are some of them."

It appears HathiTrust itself takes the findings shared seriously, because many of the items linked are changed from "Limited (search only)" to "Full view" within a few days of being posted by this account.

Has anyone here tried using this to find material to record, or doing some kind of bulk processing of works posted to make them easier to look through?

(Previous discussion mentioning this tool: viewtopic.php?t=97196 )

Re: Bot that finds u.s. public domain works published before 1964 on HathiTrust

Posted: December 24th, 2023, 10:31 am
by redrun
Right! In case it saves some time in parsing the thread:
There's a pair of datasets you can match between, to determine that a book is probably public domain. It seems like it would be quite useful for the folks at HathiTrust and Gutenberg, as a part of their vetting process (like cereal can be a part of a balanced breakfast).

The bot you linked will post at random from the dataset, but anyone looking to search a particular work might use this tool instead:
https://cce-search.nypl.org/

We can't use it as a definitive source for LV, but it is pretty neat to have it for folks "upstream" of us in the process.