Subsets of librivox books in Spanish, French and Italian

Comments about LibriVox? Suggestions to improve things? News?
Post Reply
Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » May 23rd, 2016, 8:19 am

Hi

I am a recent collaborator contributing only with making covers. I inadvertently started polluting the thread of monthly statistics with newbie questions about numbers of books, topics, languages, versions, etc... I got quite a number of very interesting replies and suggestions. I had looked also to the librivox.nl site where the subset of librivox audiobooks in Dutch are nicely put together. Following somewhat that kind of model I decided to try to mimick something similar and simpler for Spanish, and possibly other languages. In the end, I found that the favourite tool in archive.org seems relatively suitable for that (only you have to use several users).

So I have done something for Spanish here:

https://archive.org/details/fav-miguel_gran?&sort=-downloads

and then for French, here:

https://archive.org/details/fav-migrenier?sort=-downloads

and for Italian, here:

https://archive.org/details/fav-michgra?sort=-downloads


The sets can be sorted by author, title and views. (It can also be sorted by date favorited, but that is less useful, I think).

The sets should be rather complete. If anyone spots a missing book or a mistake, that would be fine to signal.

My Italian, and my French, are certainyl perfectible, so the headers can easily be corrected, improved and supplemented. Also including possibly the header of the Spanish subset, my mother tongue.

I think I might do in the future something similar for German books, and possibly Chinese, Japanese and Latin.

I suppose theoretically it could be done for the thirty something languages in librivox other than English.

Cheers!

Basquetteur

kayray
LibriVox Admin Team
Posts: 11888
Joined: September 26th, 2005, 9:10 am
Location: Union City, California
Contact:

Post by kayray » May 24th, 2016, 8:07 am

Hey this is really great! Thanks! I'll bookmark this thread so I can send people here. Do please make a German collection :)

Do you plan to update the collections as we create more books in these languages?
Kara
http://kayray.org/
--------
"Mary wished to say something very sensible into her Zoom H2 Handy Recorder, but knew not how." -- Jane Austen (& Kara)

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » May 24th, 2016, 1:35 pm

kayray,

Yes I do plan to update these three pages when new books in these languages appear.

I will consider doing the German version as I think is the second language in terms of books. And there is already the one for dutch books.

Cheers,

Basquetteur

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 10th, 2016, 5:18 am

Hi,
As a follow up and to supplement the information.

I think for German audiobooks in Librivox there is already the index here:

http://virtualhorst.de/lvsammlungen/index.php

The books can be sorted by title, author, reader and also the newest additions.

So I think I am less inclined now to do something similar as I did for Spanish, French and Italian (i.e. use the favourite tool in archive.org) for German, as there is already this database.

So all in all, there is possibility of looking at the subsets of Librivox audiobooks in Dutch, Deutsch, Français, Italiano and Español, with the different tools compiled in this thread.

And there is also Ekzemplaro's database: http://ekzemplaro.org/librivox/catalog/

Those possiblities above are besides the catalog in librivox.org and the one in archive.org, which have their own search facilities.

As I indicated earlier, I might do Chinese and Japanese indexes along the lines the Spanish, French and Italian I have done already myself.

Regards,

Basquetteur

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 12th, 2016, 11:05 am

Hi,

Herewith the link to the set of librivox audiobooks in Japanese:

https://archive.org/details/fav-mikakorn?sort=-downloads

So there are so far easy links for French, Spanish, Italian and Japanese books.

And a complete database in librivox.nl for books in Dutch, and also the German database.

Regards,

Basquetteur

ekzemplaro
Posts: 2030
Joined: December 31st, 2011, 7:17 am
Location: Tochigi,Japan
Contact:

Post by ekzemplaro » June 12th, 2016, 1:47 pm

Hello Basquetteur san,

Good job.
Muchas gracias.

Cheers,
Masa

Hokuspokus
LibriVox Admin Team
Posts: 8015
Joined: October 24th, 2007, 12:17 pm
Location: Germany
Contact:

Post by Hokuspokus » June 12th, 2016, 9:44 pm

Basquetteur wrote: I think for German audiobooks in Librivox there is already the index here:

http://virtualhorst.de/lvsammlungen/index.php
This list contains only poems and short works from the various collections. So if you feel like doing a collection/link list for the German books, it would be very useful!

I like your idea very much. Thank you!

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 13th, 2016, 6:00 am

Hi,


First to ekzemplaro: thank you very much for your appreciation. If you so wish I could add a text in Japanese to the header such as "Collection of free audiobooks from LibriVox in Japanese". This way the header of the page could be bilingual Japanese English. Of course I do not know anything about Japanese so I can only offer to do a simple "copy and paste" (and assume it will properly caught by archive.org) if you supply me with a Japanese text.

Hokuspokus: thankyou very much also for your helpful comments. In the case of German it is relatively tedious to favourite each bookas it is quite numerous and it also becomes a challenge to keep it up to date as it would be the subset with more frequent updates.

I have been looking at the advanced search queries in archive.org and I think it could be possible to extract the books in German. Here are two rough approximations which are not correct:

one misses apparently all multilingual books:

https://archive.org/search.php?query=%28Deutsch+OR+Deutsch%29+AND+collection%3A%28librivoxaudio%29+AND+mediatype%3A%28audio%29

It yields 317 audiobooks.

The other query harvests books containing German, but also books also in English (as it include anything containing "German") so it is too wide. Here it is:

https://archive.org/search.php?query=German+AND+collection%3Alibrivoxaudio

This one yields 647 books.

So a query taking the best of the two and yielding something in between might be the correct search query. I think it is a question of introducing the right terms in it.

It is also quite possible that both the two queries miss some audiobooks. Again this could be corrected introducing appropriate words (in this case "OR" terms, I suppose).

The advance search is reached as in here

https://archive.org/advancedsearch.php?q=

in order to restrict the query is best to put
collection=librivoxaudio (this is a bit tricky as the dropdown box lists separately the words in uppercase then in lowercase, and there is no term for "librivox" or Librivox")
and mediatype:audio

Regards

Basquetteur

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 14th, 2016, 2:50 am

Hi,

The following query yields 527 books. It still needs further refinement:

German AND collection:librivoxaudio AND NOT title:popenjoy AND NOT title:awful AND NOT title:none AND NOT title:dramatic AND NOT title:chemistry AND NOT title:childish AND NOT title:little AND NOT title:German AND NOT title:vocation AND NOT title:character AND NOT title:woman AND NOT title:Fairy AND NOT title:Chapters AND NOT title:Double AND NOT title:poems AND NOT title:Canada's AND NOT title:the AND NOT title:upanishad AND NOT title:Entertainments AND NOT title:מרים

The exclusions AND NOT in many cases affect one single book, unfortunately for the building of the query.

It is possible to paste the text above in the search box.

It is also a link, like:
https://archive.org/search.php?query=German+AND+collection%3Alibrivoxaudio+AND+NOT+title%3Apopenjoy+AND+NOT+title%3Aawful+AND+NOT+title%3Anone+AND+NOT+title%3Adramatic+AND+NOT+title%3Achemistry+AND+NOT+title%3Achildish+AND+NOT+title%3Alittle+AND+NOT+title%3AGerman+AND+NOT+title%3Avocation+AND+NOT+title%3Acharacter+AND+NOT+title%3Awoman+AND+NOT+title%3AFairy+AND+NOT+title%3AChapters+AND+NOT+title%3ADouble+AND+NOT+title%3Apoems+AND+NOT+title%3ACanada%27s+AND+NOT+title%3Athe+AND+NOT+title%3Aupanishad+AND+NOT+title%3AEntertainments+AND+NOT+title%3A%D7%9E%D7%A8%D7%99%D7%9D&sort=-downloads

Regards,

Basquetteur

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 14th, 2016, 3:03 am

Hi

sorry to come back again.

I think that query still includes some 6 or 7 books that should not be included. So that is only this few too much.

The other point is to check if to this query it should be added some terms to include some of the multilingual and short works compilations in case they are not captured already.

I see that the long query disrupts the presentation of that message in the forum. Sorry. Simply by clicking, it works.

Regards

Basquetteur

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 15th, 2016, 2:16 am

Hi,

sorry to come back again

I have further redined the query for books in German


The query is now like this

(German OR gedichte) AND collection:librivoxaudio AND mediatype:audio AND NOT description:Sculpturally AND NOT description:Egypt AND NOT title:taboo AND NOT title:tank AND NOT title:they AND NOT title:highness AND NOT title:three AND NOT title:cinderella AND NOT title:newspaper AND NOT title:la AND NOT title:hazard AND NOT title:mother AND NOT title:weird AND NOT title:nephew AND NOT title:lover's AND NOT title:Lāčplēsis AND NOT title:battle AND NOT title:how AND NOT title:howards AND NOT title:belgian AND NOT title:popenjoy AND NOT title:awful AND NOT title:none AND NOT title:dramatic AND NOT title:chemistry AND NOT title:childish AND NOT title:little AND NOT title:German AND NOT title:vocation AND NOT title:character AND NOT title:woman AND NOT title:Fairy AND NOT title:Chapters AND NOT title:Double AND NOT title:poems AND NOT title:Canada's AND NOT title:the AND NOT title:upanishad AND NOT title:Entertainments AND NOT title:מרים

The link is

https://archive.org/search.php?query=%28German%20OR%20gedichte%29%20AND%20collection%3Alibrivoxaudio%20AND%20mediatype%3Aaudio%20AND%20NOT%20description%3ASculpturally%20AND%20NOT%20description%3AEgypt%20AND%20NOT%20title%3Ataboo%20AND%20NOT%20title%3Atank%20AND%20NOT%20title%3Athey%20AND%20NOT%20title%3Ahighness%20AND%20NOT%20title%3Athree%20AND%20NOT%20title%3Acinderella%20AND%20NOT%20title%3Anewspaper%20AND%20NOT%20title%3Ala%20AND%20NOT%20title%3Ahazard%20AND%20NOT%20title%3Amother%20AND%20NOT%20title%3Aweird%20AND%20NOT%20title%3Anephew%20AND%20NOT%20title%3Alover%27s%20%20AND%20NOT%20title%3AL%C4%81%C4%8Dpl%C4%93sis%20AND%20NOT%20title%3Abattle%20AND%20NOT%20title%3Ahow%20AND%20NOT%20title%3Ahowards%20AND%20NOT%20title%3Abelgian%20AND%20NOT%20title%3Apopenjoy%20AND%20NOT%20title%3Aawful%20AND%20NOT%20title%3Anone%20AND%20NOT%20title%3Adramatic%20AND%20NOT%20title%3Achemistry%20AND%20NOT%20title%3Achildish%20AND%20NOT%20title%3Alittle%20AND%20NOT%20title%3AGerman%20AND%20NOT%20title%3Avocation%20AND%20NOT%20title%3Acharacter%20AND%20NOT%20title%3Awoman%20AND%20NOT%20title%3AFairy%20AND%20NOT%20title%3AChapters%20AND%20NOT%20title%3ADouble%20AND%20NOT%20title%3Apoems%20AND%20NOT%20title%3ACanada%27s%20AND%20NOT%20title%3Athe%20AND%20NOT%20title%3Aupanishad%20AND%20NOT%20title%3AEntertainments%20AND%20NOT%20title%3A%D7%9E%D7%A8%D7%99%D7%9D



I have converted this link into a tiny url here:

http://tinyurl.com/librivox-german-books

It yields 520 books.

It captures some 20 more books with "gedichte" and eliminates some 15 books or so with english terms in the title. It also elimiantes two boos by taking terms int he summary (filed "description"). This is for Arachne (elminating it by picking the word Egypt in th edescription) and Auguste rodin by picking the word Sculpturally in the description.

Regards,

Basquetteur

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 15th, 2016, 2:19 am

Hi,

This query above would also capture books in German in the future.
I might do it for chinese, russian, arabic, latin.

Regards,

Basquetteur

ekzemplaro
Posts: 2030
Joined: December 31st, 2011, 7:17 am
Location: Tochigi,Japan
Contact:

Post by ekzemplaro » June 15th, 2016, 5:23 am

Hello Basquetteur san,
If you so wish I could add a text in Japanese to the header such as "Collection of free audiobooks from LibriVox in Japanese".
Here's the Japanese header.
リブリボックスの日本語オーディオブック
Cheers,
Masa

Basquetteur
Posts: 294
Joined: January 23rd, 2016, 1:17 am
Location: Belgium - Bélgica - Belgique- België
Contact:

Post by Basquetteur » June 15th, 2016, 5:49 am

ekzemplaro wrote:Hello Basquetteur san,
If you so wish I could add a text in Japanese to the header such as "Collection of free audiobooks from LibriVox in Japanese".
Here's the Japanese header.
リブリボックスの日本語オーディオブック
Cheers,
Masa
THank you very much Masa. I will put in the Header.

Cheers

Basquetteur

Post Reply