Any interest in LibriVox's representation on wikipedia?

Comments about LibriVox? Suggestions to improve things? News?
TimoleonWash
Posts: 680
Joined: February 28th, 2014, 1:38 am
Location: USA, California, San Diego

Post by TimoleonWash » March 25th, 2015, 8:33 pm

knotyouraveragejo wrote:All books that have a wikipedia page for the book have a link to that wikipedia page on their LibriVox catalog page. If there isn't one, then there wasn't a specific wikipedia entry for the book at the time the project was cataloged.

A lot of the books we record do not have a wikipedia entry so obviously have no link on the catalog page.
Ah! I get it now. I've never scrolled down on a LibriVox.org page to see the links in the lower left. Thanks for the explanation. What a great idea.
If you create and your creation is destroyed, create anyway. (paraphrasing Mother Teresa) . . Ƹ̵̡Ӝ̵̨̄Ʒ

mahne
Posts: 333
Joined: September 30th, 2014, 4:50 am
Location: Germany

Post by mahne » March 26th, 2015, 7:41 am

Cori wrote:
TimoleonWash wrote:Paradise Regained, (v1)=b2210 is linked. (v2)=b7483 NOT FOUND on librivox page and therefore not linked to from wikipedia, apparently only one version is available.
b7483 is titled Paradise Regain'd. I'd suggest using "Paradise Regain" as the Wikipedia search term, and then both will pop up correctly (and since Milton didn't write a book by that exact name, there won't be any confusion in future.)
The template also lets you specify a different search term for LibriVox than the title shown on the Wikipedia page (stitle/dtitle). I have changed your Wikipedia entry for Paradise Regained accordingly; please have a look and see if that's what you wanted.

Cheers
mahne
all covers made by mahne, constructive criticism appreciated

ZamesCurran
Posts: 472
Joined: March 2nd, 2015, 9:08 am
Location: Bloomfield, NJ

Post by ZamesCurran » April 23rd, 2015, 3:34 pm

(Ok, I've just read through the 5 pages of this thread, I think I'm caught up and ready to contribute...)

I think updating the pages for the backlog of existing recordings can be made fairly simple with the proper application of what we have available.

Well, first thing I should note is that my day job is creating data-driven websites, so I'll be very willing to help out with running the back-end of the website (though I don't know much about PHP; I'm a .NET/C# guy).

That said, the simplest version of my suggestion throws most of the work on ekzemplaro. (I could do it myself, but we need to be working on a common database, and he already has one)

We set up a web page which will scan the database for the "next" (say) 10 entries where the wiki flag is -1 ("next" needs some definition, so that on the first day of the project, when all the volunteers hit that page at the same time, they don't all get the same list)

For those ten entries, the page will display
1) a link to the Wikipedia search page for that book.

<a target="_blank" href="https://en.wikipedia.org/w/index.php?search=Areopagitica+John+Milton&title=Special%3ASearch&go=Go">Link for Areopagitica</a>
2) As text, the Wikipedia tag for this book

*{{Librivox book | title=Areopagitica | author=John Milton}}
3) A button/buttons which would update the wiki flag in the database.

Then a volunteer merely has to click the first link, calling up the Wikipedia search page. From there, click on the proper page, and scroll to the bottom. Verify the existence of the link, editing the page by copy'n'pasting the tag if needed. Then click the appropriate button to update the database.
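The three pieces of that page can be sketched as follows. This is a minimal illustration in Python, not the page James later built: the record shape (a dict with a `wiki` key) and the function names are assumptions made for the example, and random sampling is only one possible answer to the question of what "next" should mean.

```python
import random
from urllib.parse import urlencode

PENDING = -1  # wiki flag value for entries nobody has checked yet


def wikipedia_search_url(title, author):
    """Build the Wikipedia search link from item 1."""
    query = urlencode({
        "search": f"{title} {author}",
        "title": "Special:Search",
        "go": "Go",
    })
    return "https://en.wikipedia.org/w/index.php?" + query


def librivox_tag(title, author):
    """Build the wiki template text from item 2."""
    return "*{{Librivox book | title=%s | author=%s}}" % (title, author)


def next_batch(books, size=10, rng=random):
    """Pick up to `size` pending books at random, so volunteers who load
    the page at the same moment are unlikely to get the same list.
    `books` is a hypothetical list of dicts with a "wiki" key."""
    pending = [b for b in books if b["wiki"] == PENDING]
    return rng.sample(pending, min(size, len(pending)))


print(wikipedia_search_url("Areopagitica", "John Milton"))
print(librivox_tag("Areopagitica", "John Milton"))
```

Random selection does not guarantee that two volunteers never see the same book; it only makes collisions unlikely when the backlog is large.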

ekzemplaro -- If we are going to use your database for this, at the minimum, we'll need an API for updating the wiki flag. With that, I could do the rest.
Truth,
James
---------------------

Kangaroo692
Posts: 1948
Joined: August 21st, 2014, 9:34 am
Location: Probably the holodeck :)

Post by Kangaroo692 » April 23rd, 2015, 4:11 pm

That sounds pretty cool.

ekzemplaro
Posts: 2031
Joined: December 31st, 2011, 7:17 am
Location: Tochigi,Japan

Post by ekzemplaro » April 24th, 2015, 4:34 am

Hello James san,

Thank you for your offer.
ZamesCurran wrote:ekzemplaro -- If we are going to use your database for this, at the minimum, we'll need an API for updating the wiki flag. With that, I could do the rest.
Here's a prototype.
#
COMMAND=http://ekzemplaro.org/librivox/wikipedia/wiki_status_get.php
#
curl -F id=b1080 $COMMAND
curl -F id=b1127 $COMMAND
curl -F id=b1172 $COMMAND
#
The result is as follows
{"_id":"b1080","_rev":"1-c8877bc371c66893bd84e5d53d965cfb","status":"3","reporter":"TimoleonWash"}
{"_id":"b1127","_rev":"1-ecd375d92452b055f09577e3d62e0cc7","status":"3","reporter":"Kangaroo692"}
{"_id":"b1172","_rev":"1-59c225c909d0c0f6f260d48bc49a68b1","status":"3","reporter":"mahne"}
As of now the API doesn't return '-1'. I'll soon fix this.
I'll also prepare an update API.
If something is not clear, please let me know.
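For anyone consuming these replies from a script, here is a small Python sketch of reading one of them. The function name is made up for the example, and it assumes only the fields visible in the sample output above. Note that the sample carries status as the string "3", so the sketch converts it to an integer.

```python
import json


def parse_status(line):
    """Parse one JSON reply and return (book_id, status, reporter)."""
    record = json.loads(line)
    # The sample replies carry status as a string ("3"); int() also
    # accepts a value that is already a number.
    return record["_id"], int(record["status"]), record.get("reporter")


sample = '{"_id":"b1080","_rev":"1-c8877bc371c66893bd84e5d53d965cfb","status":"3","reporter":"TimoleonWash"}'
print(parse_status(sample))
```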

Cheers,
Masa

Cori
LibriVox Admin Team
Posts: 12163
Joined: November 22nd, 2005, 10:22 am
Location: Britain

Post by Cori » April 24th, 2015, 12:50 pm

I'd recommend not doing more than 10 a day per person. Even that might be a bit much over several days. I'm not sure what triggers the "adding spam" threshold at Wikipedia. But I don't want people's hard work reverted so would advise caution. Once again, just because we have 'permission' and our own macro does not mean that every Wikipedia admin will approve of our changes. And there's little or no 'right of appeal'. Gently does it.
There's honestly no such thing as a stupid question -- but I'm afraid I can't rule out giving a stupid answer : : To Posterity and Beyond!

ekzemplaro
Posts: 2031
Joined: December 31st, 2011, 7:17 am
Location: Tochigi,Japan

Post by ekzemplaro » April 24th, 2015, 8:14 pm

Hello,
ekzemplaro wrote:As of now the API doesn't return '-1'. I'll soon fix this.
Finished.
The title and author ID are also returned.
{"_id":"b9602","_rev":"1-8aceff852e11456a1e37590239c639f6","status":-1,"reporter":null,"title":"Marie: An Episode in the Life of the Late Allan Quatermain","authors":"171"}
I also prepared a page to show the status.
http://ekzemplaro.org/librivox/wikipedia/
During the test period this page is not linked from the catalog page.
ekzemplaro wrote:I'll also prepare an update API.
I'm still working on this.

Cheers,
Masa

ZamesCurran
Posts: 472
Joined: March 2nd, 2015, 9:08 am
Location: Bloomfield, NJ

Post by ZamesCurran » April 25th, 2015, 12:45 am

OK, here's my piece.

http://librivox.noveltheory.com/Librivox/Wiki

It's still a work in progress. Notably, the buttons do nothing (except remove the box with a cool effect).

Another odd effect is that the Wikipedia "Edit" links don't work while the page is in a frame. To actually edit the page, you'll need to right-click the link and choose "Open in New Tab".


ekzemplaro -- A couple questions about the db_catalog.json file.
In at least one case, the status is not numeric. (I spotted one as "2X")
In at least one case, the readers list includes a null (e.g. readers:[1,2,3,null] )
Also, the records have properties with plural names ("Authors" and "Genres") while only one is given for each. Are they going to stay that way, or do I have to plan for an array coming?
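Until the data is cleaned up, a consumer of the file could guard against both oddities. The Python sketch below is illustrative only: the key names `status` and `readers` follow the examples quoted above, but the exact record layout of db_catalog.json is an assumption, and `clean_record` is a made-up helper name.

```python
def clean_record(record):
    """Return a copy of a catalog record with defensive fixes applied."""
    cleaned = dict(record)
    # Drop stray nulls such as readers:[1,2,3,null].
    cleaned["readers"] = [r for r in (record.get("readers") or []) if r is not None]
    # Keep a status only if it parses as an integer; a value like "2X"
    # becomes None so it can be flagged for review instead of crashing.
    try:
        cleaned["status"] = int(record.get("status"))
    except (TypeError, ValueError):
        cleaned["status"] = None
    return cleaned


print(clean_record({"status": "2X", "readers": [1, 2, 3, None]}))
```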
Truth,
James
---------------------

ekzemplaro
Posts: 2031
Joined: December 31st, 2011, 7:17 am
Location: Tochigi,Japan

Post by ekzemplaro » April 25th, 2015, 4:43 am

Hello James san,
Good. You are approaching the goal.

Update API is ready.
#
COMMAND=http://ekzemplaro.org/librivox/wikipedia/wiki_status_update.php
#
#curl -F id=b1000 -F status=2 -F reporter=ekzemplaro $COMMAND
curl -F id=b1000 -F status=-1 -F reporter=null $COMMAND
#
You can check the effect at http://ekzemplaro.org/librivox/wikipedia/.
As of now the change doesn't affect the catalog. So please feel free to test.
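For callers who would rather script this than type it, here is a hedged Python sketch that wraps the same curl call. The helper names are invented for the example, the integer check on status is an added safeguard rather than something the API is known to enforce, and the format of the server's reply is not documented in this thread.

```python
import subprocess

UPDATE_URL = "http://ekzemplaro.org/librivox/wikipedia/wiki_status_update.php"


def update_command(book_id, status, reporter):
    """Build the curl argument list for one status update."""
    if not isinstance(status, int):
        raise ValueError("status must be an integer")
    return [
        "curl",
        "-F", f"id={book_id}",
        "-F", f"status={status}",
        "-F", f"reporter={reporter}",
        UPDATE_URL,
    ]


def send_update(book_id, status, reporter):
    """Run curl and return whatever the server prints."""
    cmd = update_command(book_id, status, reporter)
    return subprocess.run(cmd, capture_output=True, text=True, check=True).stdout


# Same request as the uncommented curl line above (not sent here).
print(update_command("b1000", -1, "null"))
```

Passing the arguments as a list avoids shell quoting problems if a reporter name ever contains spaces.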
ZamesCurran wrote:In at least one case, the status is not numeric. (I spotted one as "2X")
I'll fix this. All should be numeric.
ZamesCurran wrote:In at least one case, the readers list includes a null (e.g. readers:[1,2,3,null] )
OK. I'll investigate this.
ZamesCurran wrote:Also, the records have properties with plural names ("Authors" and "Genres") while only one is given for each. Are they going to stay that way, or do I have to plan for an array coming.
The original data at LibriVox is an array. But my catalog shows one book per line, so I use only Authors[0] and Genres[0]. As for db_catalog.json, I don't plan to change this spec.
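In other words, each list-valued field is reduced to its first element. A rough Python sketch of that reduction follows; the field names come from the question above, while the input shape and the `flatten` helper are assumptions for illustration, not the actual catalog code.

```python
def flatten(record, fields=("Authors", "Genres")):
    """Replace each list-valued field with its first element (or None)."""
    flat = dict(record)
    for name in fields:
        values = record.get(name) or []
        flat[name] = values[0] if values else None
    return flat


print(flatten({"Authors": ["171", "99"], "Genres": []}))
```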

Cheers,
Masa

ekzemplaro
Posts: 2031
Joined: December 31st, 2011, 7:17 am
Location: Tochigi,Japan

Post by ekzemplaro » April 26th, 2015, 5:48 am

Hello James san,
ekzemplaro wrote:ZamesCurran wrote:
In at least one case, the status is not numeric. (I spotted one as "2X")

I'll fix this. All should be numeric.
Done.
ekzemplaro wrote:ZamesCurran wrote:
In at least one case, the readers list includes a null (e.g. readers:[1,2,3,null] )

OK. I'll investigate this.
I found several examples:
b8348 7240, 9063, 9084, 9331, 9342, null
b7998 4742, 6319, 7730, 8591, 9155, 9326, 9331, 9423, null
b7825 2033, 4998, 8181, 8207, 8828, 8842, 9185, null
b7807 19, 6687, 7150, 7874, 8468, 8796, 8894, 8925, 9023, 9123, null
b7733 4145, 4511, 4738, 4742, 8012, 9063, 9086, 9092, 9105, 9123, null
b7593 3374, 3657, 4511, 4742, 5694, 6454, 6963, 6974, 8879, 9049, 9096, null
b7279 1280, 1724, 4009, 5220, 5319, 5424, 6070, 6138, 6181, 6686, 6880, 7240, 7756, 8534, 8631, 8677, 8687, null
I guess the problem resides inside the LibriVox database. And this means this can't be fixed.

Cheers,
Masa

Availle
LibriVox Admin Team
Posts: 19957
Joined: August 1st, 2009, 11:30 pm

Post by Availle » April 26th, 2015, 5:52 am

ekzemplaro wrote: I found several examples:
b8348 7240, 9063, 9084, 9331, 9342, null
b7998 4742, 6319, 7730, 8591, 9155, 9326, 9331, 9423, null
b7825 2033, 4998, 8181, 8207, 8828, 8842, 9185, null
b7807 19, 6687, 7150, 7874, 8468, 8796, 8894, 8925, 9023, 9123, null
b7733 4145, 4511, 4738, 4742, 8012, 9063, 9086, 9092, 9105, 9123, null
b7593 3374, 3657, 4511, 4742, 5694, 6454, 6963, 6974, 8879, 9049, 9096, null
b7279 1280, 1724, 4009, 5220, 5319, 5424, 6070, 6138, 6181, 6686, 6880, 7240, 7756, 8534, 8631, 8677, 8687, null
I guess the problem resides inside the LibriVox database. And this means this can't be fixed.

Cheers,
Masa
If you tell me what the numbers mean, I can have a look at this in our database.
Cheers, Ava.
Resident witch of LibriVox, inspired by
Granny Weatherwax: "I ain't Nice."

--
AvailleAudio.com

ZamesCurran
Posts: 472
Joined: March 2nd, 2015, 9:08 am
Location: Bloomfield, NJ

Post by ZamesCurran » April 26th, 2015, 7:39 am

Availle wrote:If you tell me what the numbers mean, I can have a look at this in our database.
Here are the ones I've found:
  • Yellow Sheet (LibriVox NaNoWriMo novel 2007), id_1475
  • War and Peace, Book 13: 1812, id_3207
  • What is Property? An Inquiry into the Principle of Right and of Government, id_4861
  • Guide to the Study of the Christian Religion, id_5524
  • Windsor Castle, Book 2, id_6532
  • Mystery of the 'Ocean Star' - A Collection of Maritime Sketches, id_6641
  • American Rivals of Sherlock Holmes, id_6848
  • Rilla of Ingleside (version 2), id_7036
  • Short Poetry Collection 130, id_7148
  • At Ease on Lethe Wharf, id_7768
  • Short Science Fiction Collection 049, id_8020
  • Sylvie and Bruno (version 2 Dramatic Reading), id_8264
  • Vindication of the Rights of Woman, id_83
  • Wodehouse in the Strand - Short Story Collection, id_8310
  • Novelle per un Anno, vol. 12: Il Viaggio, id_8610
  • All That Matters, id_8798
  • Tales of a Poultry Farm, id_8819
  • Ancient Ballads and Legends of Hindustan, id_8909
  • Say and Seal, id_9187
  • Jungle Book, id_9359
  • Wine of Wizardry, id_9393
  • Fairy Tales and Stories, id_9437
  • Pilot's Handbook of Aeronautical Knowledge FAA-H-8083-25A, id_9521
  • Magna Carta, id_9650
  • Island Queen, id_9693
The "id_" number appears to be the LibriVox internal ID number (it's the number used on the iTunes & RSS links).

I spot-checked "Yellow Sheet" and all the readers listed on the catalog page are included. The null is an extra entry.


Also, Masa/ekzemplaro appears to be accessing the LibriVox database directly. Is there an API for doing that?
Truth,
James
---------------------

Kangaroo692
Posts: 1948
Joined: August 21st, 2014, 9:34 am
Location: Probably the holodeck :)

Post by Kangaroo692 » April 26th, 2015, 7:45 am

James, let me know if there is a way for me to help with my little HTML and CSS know-how. Thanks,

Availle
LibriVox Admin Team
Posts: 19957
Joined: August 1st, 2009, 11:30 pm

Post by Availle » April 26th, 2015, 7:51 am

ZamesCurran wrote:
Availle wrote:If you tell me what the numbers mean, I can have a look at this in our database.
Here are the ones I've found:
  • Sylvie and Bruno (version 2 Dramatic Reading), id_8264
  • Say and Seal, id_9187
  • Jungle Book, id_9359
  • Wine of Wizardry, id_9393
  • Fairy Tales and Stories, id_9437
  • Pilot's Handbook of Aeronautical Knowledge FAA-H-8083-25A, id_9521
  • Magna Carta, id_9650
  • Island Queen, id_9693
I have checked all entries you listed against our database. All readers are present in the database. The ones above are projects that are not finished yet, some of them are "open", so yes, there will be readers missing.

Edit: There may indeed be an issue with this; I misunderstood. The "null" does seem to be an additional entry in some of the sections. Admin will investigate.
Also, Masa/ekzemplaro appears to be accessing the librivox database directly. Is there an API for doing that?
Yes, there is an API, the thread about it is here:
viewtopic.php?f=24&t=44129
Note that the API is as is.
We know about its shortcomings but are not in a position to do anything about it at this point.
Cheers, Ava.
Resident witch of LibriVox, inspired by
Granny Weatherwax: "I ain't Nice."

--
AvailleAudio.com

knotyouraveragejo
LibriVox Admin Team
Posts: 18040
Joined: November 18th, 2006, 4:37 pm

Post by knotyouraveragejo » April 26th, 2015, 9:31 am

ekzemplaro wrote: I guess the problem resides inside the LibriVox database. And this means this can't be fixed.

Masa
No, it does not mean it can't be fixed. Once an error in the data is pointed out, we can usually fix it. We are working on fixing these.
Jo
My Librivox Solos
Books are the carriers of civilization. Without books, history is silent, literature dumb, science crippled, thought and speculation at a standstill. - Barbara Tuchman
