Page 1 of 1

Subsets of librivox books in Spanish, French and Italian

Posted: May 23rd, 2016, 8:19 am
by Basquetteur
Hi

I am a recent collaborator contributing only with making covers. I inadvertently started polluting the thread of monthly statistics with newbie questions about numbers of books, topics, languages, versions, etc... I got quite a number of very interesting replies and suggestions. I had looked also to the librivox.nl site where the subset of librivox audiobooks in Dutch are nicely put together. Following somewhat that kind of model I decided to try to mimick something similar and simpler for Spanish, and possibly other languages. In the end, I found that the favourite tool in archive.org seems relatively suitable for that (only you have to use several users).

So I have done something for Spanish here:

https://archive.org/details/fav-miguel_gran?&sort=-downloads

and then for French, here:

https://archive.org/details/fav-migrenier?sort=-downloads

and for Italian, here:

https://archive.org/details/fav-michgra?sort=-downloads


The sets can be sorted by author, title and views. (It can also be sorted by date favorited, but that is less useful, I think).

The sets should be rather complete. If anyone spots a missing book or a mistake, that would be fine to signal.

My Italian, and my French, are certainyl perfectible, so the headers can easily be corrected, improved and supplemented. Also including possibly the header of the Spanish subset, my mother tongue.

I think I might do in the future something similar for German books, and possibly Chinese, Japanese and Latin.

I suppose theoretically it could be done for the thirty something languages in librivox other than English.

Cheers!

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: May 24th, 2016, 8:07 am
by kayray
Hey this is really great! Thanks! I'll bookmark this thread so I can send people here. Do please make a German collection :)

Do you plan to update the collections as we create more books in these languages?

Re: Subsets of librivox books in Spanish, French and Italian

Posted: May 24th, 2016, 1:35 pm
by Basquetteur
kayray,

Yes I do plan to update these three pages when new books in these languages appear.

I will consider doing the German version as I think is the second language in terms of books. And there is already the one for dutch books.

Cheers,

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 10th, 2016, 5:18 am
by Basquetteur
Hi,
As a follow up and to supplement the information.

I think for German audiobooks in Librivox there is already the index here:

http://virtualhorst.de/lvsammlungen/index.php

The books can be sorted by title, author, reader and also the newest additions.

So I think I am less inclined now to do something similar as I did for Spanish, French and Italian (i.e. use the favourite tool in archive.org) for German, as there is already this database.

So all in all, there is possibility of looking at the subsets of Librivox audiobooks in Dutch, Deutsch, Français, Italiano and Español, with the different tools compiled in this thread.

And there is also Ekzemplaro's database: http://ekzemplaro.org/librivox/catalog/

Those possiblities above are besides the catalog in librivox.org and the one in archive.org, which have their own search facilities.

As I indicated earlier, I might do Chinese and Japanese indexes along the lines the Spanish, French and Italian I have done already myself.

Regards,

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 12th, 2016, 11:05 am
by Basquetteur
Hi,

Herewith the link to the set of librivox audiobooks in Japanese:

https://archive.org/details/fav-mikakorn?sort=-downloads

So there are so far easy links for French, Spanish, Italian and Japanese books.

And a complete database in librivox.nl for books in Dutch, and also the German database.

Regards,

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 12th, 2016, 1:47 pm
by ekzemplaro
Hello Basquetteur san,

Good job.
Muchas gracias.

Cheers,
Masa

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 12th, 2016, 9:44 pm
by Hokuspokus
Basquetteur wrote: I think for German audiobooks in Librivox there is already the index here:

http://virtualhorst.de/lvsammlungen/index.php
This list contains only poems and short works from the various collections. So if you feel like doing a collection/link list for the German books, it would be very useful!

I like your idea very much. Thank you!

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 13th, 2016, 6:00 am
by Basquetteur
Hi,


First to ekzemplaro: thank you very much for your appreciation. If you so wish I could add a text in Japanese to the header such as "Collection of free audiobooks from LibriVox in Japanese". This way the header of the page could be bilingual Japanese English. Of course I do not know anything about Japanese so I can only offer to do a simple "copy and paste" (and assume it will properly caught by archive.org) if you supply me with a Japanese text.

Hokuspokus: thankyou very much also for your helpful comments. In the case of German it is relatively tedious to favourite each bookas it is quite numerous and it also becomes a challenge to keep it up to date as it would be the subset with more frequent updates.

I have been looking at the advanced search queries in archive.org and I think it could be possible to extract the books in German. Here are two rough approximations which are not correct:

one misses apparently all multilingual books:

https://archive.org/search.php?query=%28Deutsch+OR+Deutsch%29+AND+collection%3A%28librivoxaudio%29+AND+mediatype%3A%28audio%29

It yields 317 audiobooks.

The other query harvests books containing German, but also books also in English (as it include anything containing "German") so it is too wide. Here it is:

https://archive.org/search.php?query=German+AND+collection%3Alibrivoxaudio

This one yields 647 books.

So a query taking the best of the two and yielding something in between might be the correct search query. I think it is a question of introducing the right terms in it.

It is also quite possible that both the two queries miss some audiobooks. Again this could be corrected introducing appropriate words (in this case "OR" terms, I suppose).

The advance search is reached as in here

https://archive.org/advancedsearch.php?q=

in order to restrict the query is best to put
collection=librivoxaudio (this is a bit tricky as the dropdown box lists separately the words in uppercase then in lowercase, and there is no term for "librivox" or Librivox")
and mediatype:audio

Regards

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 14th, 2016, 2:50 am
by Basquetteur
Hi,

The following query yields 527 books. It still needs further refinement:

German AND collection:librivoxaudio AND NOT title:popenjoy AND NOT title:awful AND NOT title:none AND NOT title:dramatic AND NOT title:chemistry AND NOT title:childish AND NOT title:little AND NOT title:German AND NOT title:vocation AND NOT title:character AND NOT title:woman AND NOT title:Fairy AND NOT title:Chapters AND NOT title:Double AND NOT title:poems AND NOT title:Canada's AND NOT title:the AND NOT title:upanishad AND NOT title:Entertainments AND NOT title:מרים

The exclusions AND NOT in many cases affect one single book, unfortunately for the building of the query.

It is possible to paste the text above in the search box.

It is also a link, like:
https://archive.org/search.php?query=German+AND+collection%3Alibrivoxaudio+AND+NOT+title%3Apopenjoy+AND+NOT+title%3Aawful+AND+NOT+title%3Anone+AND+NOT+title%3Adramatic+AND+NOT+title%3Achemistry+AND+NOT+title%3Achildish+AND+NOT+title%3Alittle+AND+NOT+title%3AGerman+AND+NOT+title%3Avocation+AND+NOT+title%3Acharacter+AND+NOT+title%3Awoman+AND+NOT+title%3AFairy+AND+NOT+title%3AChapters+AND+NOT+title%3ADouble+AND+NOT+title%3Apoems+AND+NOT+title%3ACanada%27s+AND+NOT+title%3Athe+AND+NOT+title%3Aupanishad+AND+NOT+title%3AEntertainments+AND+NOT+title%3A%D7%9E%D7%A8%D7%99%D7%9D&sort=-downloads

Regards,

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 14th, 2016, 3:03 am
by Basquetteur
Hi

sorry to come back again.

I think that query still includes some 6 or 7 books that should not be included. So that is only this few too much.

The other point is to check if to this query it should be added some terms to include some of the multilingual and short works compilations in case they are not captured already.

I see that the long query disrupts the presentation of that message in the forum. Sorry. Simply by clicking, it works.

Regards

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 15th, 2016, 2:16 am
by Basquetteur
Hi,

sorry to come back again

I have further redined the query for books in German


The query is now like this

(German OR gedichte) AND collection:librivoxaudio AND mediatype:audio AND NOT description:Sculpturally AND NOT description:Egypt AND NOT title:taboo AND NOT title:tank AND NOT title:they AND NOT title:highness AND NOT title:three AND NOT title:cinderella AND NOT title:newspaper AND NOT title:la AND NOT title:hazard AND NOT title:mother AND NOT title:weird AND NOT title:nephew AND NOT title:lover's AND NOT title:Lāčplēsis AND NOT title:battle AND NOT title:how AND NOT title:howards AND NOT title:belgian AND NOT title:popenjoy AND NOT title:awful AND NOT title:none AND NOT title:dramatic AND NOT title:chemistry AND NOT title:childish AND NOT title:little AND NOT title:German AND NOT title:vocation AND NOT title:character AND NOT title:woman AND NOT title:Fairy AND NOT title:Chapters AND NOT title:Double AND NOT title:poems AND NOT title:Canada's AND NOT title:the AND NOT title:upanishad AND NOT title:Entertainments AND NOT title:מרים

The link is

https://archive.org/search.php?query=%28German%20OR%20gedichte%29%20AND%20collection%3Alibrivoxaudio%20AND%20mediatype%3Aaudio%20AND%20NOT%20description%3ASculpturally%20AND%20NOT%20description%3AEgypt%20AND%20NOT%20title%3Ataboo%20AND%20NOT%20title%3Atank%20AND%20NOT%20title%3Athey%20AND%20NOT%20title%3Ahighness%20AND%20NOT%20title%3Athree%20AND%20NOT%20title%3Acinderella%20AND%20NOT%20title%3Anewspaper%20AND%20NOT%20title%3Ala%20AND%20NOT%20title%3Ahazard%20AND%20NOT%20title%3Amother%20AND%20NOT%20title%3Aweird%20AND%20NOT%20title%3Anephew%20AND%20NOT%20title%3Alover%27s%20%20AND%20NOT%20title%3AL%C4%81%C4%8Dpl%C4%93sis%20AND%20NOT%20title%3Abattle%20AND%20NOT%20title%3Ahow%20AND%20NOT%20title%3Ahowards%20AND%20NOT%20title%3Abelgian%20AND%20NOT%20title%3Apopenjoy%20AND%20NOT%20title%3Aawful%20AND%20NOT%20title%3Anone%20AND%20NOT%20title%3Adramatic%20AND%20NOT%20title%3Achemistry%20AND%20NOT%20title%3Achildish%20AND%20NOT%20title%3Alittle%20AND%20NOT%20title%3AGerman%20AND%20NOT%20title%3Avocation%20AND%20NOT%20title%3Acharacter%20AND%20NOT%20title%3Awoman%20AND%20NOT%20title%3AFairy%20AND%20NOT%20title%3AChapters%20AND%20NOT%20title%3ADouble%20AND%20NOT%20title%3Apoems%20AND%20NOT%20title%3ACanada%27s%20AND%20NOT%20title%3Athe%20AND%20NOT%20title%3Aupanishad%20AND%20NOT%20title%3AEntertainments%20AND%20NOT%20title%3A%D7%9E%D7%A8%D7%99%D7%9D



I have converted this link into a tiny url here:

http://tinyurl.com/librivox-german-books

It yields 520 books.

It captures some 20 more books with "gedichte" and eliminates some 15 books or so with english terms in the title. It also elimiantes two boos by taking terms int he summary (filed "description"). This is for Arachne (elminating it by picking the word Egypt in th edescription) and Auguste rodin by picking the word Sculpturally in the description.

Regards,

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 15th, 2016, 2:19 am
by Basquetteur
Hi,

This query above would also capture books in German in the future.
I might do it for chinese, russian, arabic, latin.

Regards,

Basquetteur

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 15th, 2016, 5:23 am
by ekzemplaro
Hello Basquetteur san,
If you so wish I could add a text in Japanese to the header such as "Collection of free audiobooks from LibriVox in Japanese".
Here's the Japanese header.
リブリボックスの日本語オーディオブック
Cheers,
Masa

Re: Subsets of librivox books in Spanish, French and Italian

Posted: June 15th, 2016, 5:49 am
by Basquetteur
ekzemplaro wrote:Hello Basquetteur san,
If you so wish I could add a text in Japanese to the header such as "Collection of free audiobooks from LibriVox in Japanese".
Here's the Japanese header.
リブリボックスの日本語オーディオブック
Cheers,
Masa
THank you very much Masa. I will put in the Header.

Cheers

Basquetteur