[NOT LV] Gutenberg PROBLEMS - driving me mad!

Report & help check download problems, corrupted files, badly-named files, bad links etc. (NOT for style & reading complaints)
Post Reply
lurcherlover
Posts: 1044
Joined: November 10th, 2016, 3:54 am
Location: LONDON UK

Post by lurcherlover » May 11th, 2021, 4:40 am

I'm having a lot of trouble - making it very difficult - if not impossible to get the text from Gutenberg. I used to just copy it to a word processor or make a .pdf file to put on my tablet. Now I find that the text is corrupted and some of it either comes as a stream of vertical print for many pages, or it misses out huge sections, or both. This is what caused the problem with chapter 5 (of the book I am recording). If I go to the version without images then it now takes me to a box connected to the Apple shop and asks for passwords etc. Gutenberg has changed a system that used to work well, to a system which is a real mess. I've spent ten times more time trying to get accurate texts than recording and editing.

I should point out that I record away from the computer on a separate recording device in another room a long way from the computer, and so I need to either print the text or put it on a tablet.

Any idea how or what has happened? At this rate the book could take ten years to complete and I won't be around in a year or two let alone ten!

TriciaG
LibriVox Admin Team
Posts: 50892
Joined: June 15th, 2008, 10:30 pm
Location: Toronto, ON (but Minnesotan to age 32)

Post by TriciaG » May 11th, 2021, 5:00 am

You don't say which format you're using, but since you say something about the ones with and without images, it isn't the html.

The plain html shouldn't cause such problems.

As a temporary fix until you figure out the problem, I've downloaded the html and made it into an RTF document: https://drive.google.com/file/d/1KY6EMil1vkmBHaJ30XScpNPIDH2Q6UvE/view?usp=sharing
By John Muir: Our National Parks
Bulwer-Lytton novel: The Caxtons
New York scenes, 1897: Darkness and Daylight
Boring works 30-70 minutes long: Insomnia Collection 5

Availle
LibriVox Admin Team
Posts: 19921
Joined: August 1st, 2009, 11:30 pm
Contact:

Post by Availle » May 11th, 2021, 5:03 am

I don't know what text version you are using, but maybe try plain text instead of mobi or epub?

I'm sorry you're frustrated but if the problem persists, this is not something we can fix here - gutenberg has a whole page on how to contact them here:
https://www.gutenberg.org/about/contact_information.html
Cheers, Ava.
Resident witch of LibriVox. "I ain't Nice."

--
AvailleAudio.com

lurcherlover
Posts: 1044
Joined: November 10th, 2016, 3:54 am
Location: LONDON UK

Post by lurcherlover » May 11th, 2021, 5:11 am

Availle wrote:
May 11th, 2021, 5:03 am
I don't know what text version you are using, but maybe try plain text instead of mobi or epub?

I'm sorry you're frustrated but if the problem persists, this is not something we can fix here - gutenberg has a whole page on how to contact them here:
https://www.gutenberg.org/about/contact_information.html
I am using the plain text as I always have, without problems before.

TriciaG
LibriVox Admin Team
Posts: 50892
Joined: June 15th, 2008, 10:30 pm
Location: Toronto, ON (but Minnesotan to age 32)

Post by TriciaG » May 11th, 2021, 5:18 am

I have no problem connecting with the plain text.

I wonder if something has changed on your device.
By John Muir: Our National Parks
Bulwer-Lytton novel: The Caxtons
New York scenes, 1897: Darkness and Daylight
Boring works 30-70 minutes long: Insomnia Collection 5

lurcherlover
Posts: 1044
Joined: November 10th, 2016, 3:54 am
Location: LONDON UK

Post by lurcherlover » May 11th, 2021, 5:22 am

TriciaG wrote:
May 11th, 2021, 5:00 am
You don't say which format you're using, but since you say something about the ones with and without images, it isn't the html.

The plain html shouldn't cause such problems.

As a temporary fix until you figure out the problem, I've downloaded the html and made it into an RTF document: https://drive.google.com/file/d/1KY6EMil1vkmBHaJ30XScpNPIDH2Q6UvE/view?usp=sharing
Thanks, that seems to have sorted it out. Brilliant! I did try and use other text programs but with no success. I had been recording a chapter and editing and uploading in a matter of two or three hours, but the new edits I had to make because of this and then the texts for new chapters has given me about another ten hours of work. I had to collapse in bed last night at 7.00 pm as my eyes had given up and I felt my brain had gone somewhere. Even our dog was growling at me, nearly as loudly as my wife ...

lurcherlover
Posts: 1044
Joined: November 10th, 2016, 3:54 am
Location: LONDON UK

Post by lurcherlover » May 11th, 2021, 5:27 am

TriciaG wrote:
May 11th, 2021, 5:18 am
I have no problem connecting with the plain text.

I wonder if something has changed on your device.
No, nothing has changed. The problem is not connecting, but trying as I usually do to copy chapters to a word processor, and if not printing out, converting that to a .pdf file to put on my tablet. I would get text as lines all the way down many pages, and huge blank pages.

But hopefully it is OK now with the download from TriciaG.

lurcherlover
Posts: 1044
Joined: November 10th, 2016, 3:54 am
Location: LONDON UK

Post by lurcherlover » May 11th, 2021, 5:53 am

Actually, sorry, bad news. As soon as I try and copy from the rtf file into anything else, the same thing happens. A string of letters down the page. I think Gutenberg have put some spoiler in every other page to make it impossible to copy into other programs. I only managed to have success with chapter 9 by copying bits of text at a time, and eventually tying it all up, which takes ages. I will try and contact Gutenberg, but it looks like it might be impossible to finish this book with my setup away from the computer and where I need copies.

Availle
LibriVox Admin Team
Posts: 19921
Joined: August 1st, 2009, 11:30 pm
Contact:

Post by Availle » May 11th, 2021, 6:09 am

What type of "word processor" are you using? Could it be that somehow the text width of the pages are set too narrow?

I don't have a tablet, but can't you just use the rtf file on it?
Cheers, Ava.
Resident witch of LibriVox. "I ain't Nice."

--
AvailleAudio.com

ColleenMc
LibriVox Admin Team
Posts: 2384
Joined: April 9th, 2017, 5:57 pm

Post by ColleenMc » May 11th, 2021, 6:31 am

What brand of tablet do you have?

Colleen
Colleen McMahon

No matter where you go, there you are. -- Buckaroo Banzai

ColleenMc
LibriVox Admin Team
Posts: 2384
Joined: April 9th, 2017, 5:57 pm

Post by ColleenMc » May 11th, 2021, 6:31 am

Or I guess I should say what OS does it use....Apple or android or something else?
Colleen McMahon

No matter where you go, there you are. -- Buckaroo Banzai

lurcherlover
Posts: 1044
Joined: November 10th, 2016, 3:54 am
Location: LONDON UK

Post by lurcherlover » May 11th, 2021, 6:41 am

OK, I've solved it and downloaded into the word processor all of the files. It's ****** computers and software that's the problem. Lot's of hours of extra fruitless work.

I used to down load from Gute and copy into the word processor. If there were images (who needs images!) - I just used to delete them and all the text was fine. Now (and I'm sure it's new software with Gute) - if I delete the images (and there were several in different chapters but not all chapters) it completely wrecks the text. It's a pain and a waste of space to have to keep the images, but I see no other way around it. Someone messed up big time writing Gute's new software.

Thanks for all the help and advice. In the end it should have been obvious to me that it was the images, and only the chapters with images, that were causing the problems.

ColleenMc
LibriVox Admin Team
Posts: 2384
Joined: April 9th, 2017, 5:57 pm

Post by ColleenMc » May 11th, 2021, 12:46 pm

Glad you were able to get it sorted!

Colleen
Colleen McMahon

No matter where you go, there you are. -- Buckaroo Banzai

Post Reply