Bot that finds u.s. public domain works published before 1964 on HathiTrust

Comments about LibriVox? Suggestions to improve things? News?
Post Reply
jlso4u
Posts: 23
Joined: December 23rd, 2023, 9:16 am

Post by jlso4u »

https://botsin.space/@SecretlyPublicDomain

"Most of the books published in the US before 1964 never had their copyright renewed, putting them in the public domain. Here are some of them."

It appears HathiTrust itself takes the findings shared seriously, because many of the items linked are changed from "Limited (search only)" to "Full view" within a few days of being posted by this account.

Has anyone here tried using this to find material to record, or doing some kind of bulk processing of works posted to make them easier to look through?

(Previous discussion mentioning this tool: viewtopic.php?t=97196 )
redrun
LibriVox Admin Team
Posts: 2940
Joined: August 11th, 2022, 8:32 pm
Contact:

Post by redrun »

Right! In case it saves some time in parsing the thread:
There's a pair of datasets you can match between, to determine that a book is probably public domain. It seems like it would be quite useful for the folks at HathiTrust and Gutenberg, as a part of their vetting process (like cereal can be a part of a balanced breakfast).

The bot you linked will post at random from the dataset, but anyone looking to search a particular work might use this tool instead:
https://cce-search.nypl.org/

We can't use it as a definitive source for LV, but it is pretty neat to have it for folks "upstream" of us in the process.
I'll be out for a bit on this last weekend of April, but still checking in as I get the chance. I will try to follow up on Monday, with anything I can't do on the go.
Post Reply