I'm guessing that this is the month where the project was catalogued (factoring in how many hours it was?). Which might mean that the March 2021 spike is the end of longer projects that people began in various lockdowns? Just a guess.
Yes - I've just changed the first post to say that. Or spikes could be a clean up month
But it is just another way of looking at what we achieving, as in a project catalogued count 1 small bible chapter counts the same as the complete KJV.
And if anyone wished, they could work out how long it would take to read the whole collection
103 h 55 min Holy Bible Complete American Standard Version (2021-12-31)
100 h 45 min Bible Complete King James Version (2020-01-21)
99 h 53 min World English Bible Complete (2017-07-06)
62 h 12 min London Labour And The London Poor Volume Ii By Henry Mayhew (2019-05-16)
60 h 4 min Summa Theologica 11 Pars Secunda Secundae Treatise On The Cardinal Virtues Prudence Justice Fortitude Te
mperance By Saint Thomas Aquinas (2019-10-01)
58 h 15 min The Book Of Household Management By Isabella Beeton (2009-11-14)
58 h 2 min Old Testament World English Bible (2009-01-23)
57 h 51 min The Life Of Jesus Critically Examined By David Friedrich Strauss (2012-01-28)
56 h 51 min De Civitate Dei Libri Xxii By Saint Augustine Of Hippo (2019-09-03)
56 h 51 min David Copperfield Nl By Charles Dickens (2012-04-26)
54 h 16 min The Count Of Monte Cristo Version 3 By Alexandre Dumas (2013-08-09)
54 h 11 min Enneads By Plotinus (2018-06-11)
53 h 21 min London Labour And The London Poor Volume I By Henry Mayhew (2021-07-31)
52 h 59 min De Kermis Der Ijdelheid Door William Makepeace Thackeray (2011-11-15)
51 h 52 min Aus Meinem Leben Dichtung Und Wahrheit Von Johann Wolfgang Von Goethe (2009-09-16)
51 h 26 min Le Comte De Monte Cristo By Alexandre Dumas (2008-05-13)
51 h 25 min London Labour And The London Poor Volume Iii By Henry Mayhew (2021-05-29)
51 h 10 min Dombey En Zoon By Charles Dickens (2013-01-03)
50 h 57 min Newspaper Articles By Mark Twain (2010-11-27)
50 h 48 min Het Verlaten Huis By Charles Dickens (2013-06-23)
50 h 0 min Maarten Chuzzlewit By Charles Dickens (2012-06-08)
49 h 48 min The Count Of Monte Cristo By Alexandre Dumas 2 (2013-03-22)
49 h 43 min The Count Of Monte Cristo By Alexandre Dumas (2007-11-30)
49 h 30 min Onze Wederzijdsche Vriend By Charles Dickens (2013-04-15)
48 h 59 min Romische Geschichte Buch 5 By Theodor Mommsen (2008-10-10)
BengtW wrote: ↑February 10th, 2022, 12:45 am
Put some data together if anyone is interested.
Hi BengtW
Is that data publicly available somewhere? If so it would be good to update the graph on Wikiedia's LibrVox page, which currently shows monthly data only up to 2011. That happens to list completed projects, but recorded hours is a better metric.
I agree it is a better measure and it is great we are getting it again Thanks Bengt
I find the graph a little hard to read (old age is creeping up). Michael - if you were going to add it to the wiki would a bar graph be better ? Or a monthly one with a yearly one ? Or something ? You can make visual displays from spreadsheets. Or just use the same format as the present number of projects - or any other idea you have
Bengt has been uploading all the LV files to a youtube channel and he has been able to give me some very handy info about missing fields and files which have helped improve our database. I said I missed having a recording total time and he managed to give us one
The data is available although a bit complicated to get at. The runtime is available in the API but not the date of release. I have my own mirror database to support the YouTube channel to keep track of the relationship between librivox projects and YouTube videos and in it i have recently added the date as I get it anyway when parsing out chapter and other meta information from the project page. If you want I can provide a JSON export or something. What would you need?
Many thanks for that (and sorry for the delay in replying). I don't actually need anything that complicated to create an image for Wikipedia, as I can do that using the raw data you posted above. All I needed was to know where the data came from so that I can specify the source when I upload a new image. I can simply include a link to this thread and say that the data have been extracted from the underlying LibriVox database.
I'll PM you about another potential project, though ...
Last edited by MichaelMaggs on April 5th, 2022, 5:29 am, edited 1 time in total.
BengtW wrote: ↑March 29th, 2022, 12:14 pm
The data is available although a bit complicated to get at. The runtime is available in the API but not the date of release. I have my own mirror database to support the YouTube channel to keep track of the relationship between librivox projects and YouTube videos and in it i have recently added the date as I get it anyway when parsing out chapter and other meta information from the project page. If you want I can provide a JSON export or something. What would you need?
Would you have an easy way to pull out a list of all the text sources on archive.org in the catalogue? I was wondering the other day where the most common clusters of dates are for the books we have read and it occurred to me that the archive.org API makes it quite easy to pull that data out (it's not super accurate but it would work) if you had a list of URLs...