Books to study English?

Everything except LibriVox (yes, this is where knitting gets discussed. Now includes non-LV Volunteers Wanted projects)
tovarisch
Posts: 2936
Joined: February 24th, 2013, 7:14 am
Location: New Hampshire, USA

Post by tovarisch »

What I think? :hmm: If you have a pre-conceived notion of what results should show, you can apply various techniques to drive your statistics to be close to what you expect. :P

You are planning to use that method to give some kind of rating to various texts, and to suggest the ones with lower score for learning, yes? The proof that your scoring method is close to reality would be if it ranks texts in the same way they are used in English-speaking countries in the elementary, middle, high schools and colleges as material for studying English for native speakers. The texts don't have to be PD, you know... Once you have proved that your method works, you can run it over the entire LV collection (or Project Gutenberg's) :wink:
tovarisch
  • reality prompts me to scale down my reading, sorry to say
    to PLers: do correct my pronunciation please
stillwaiting
Posts: 7
Joined: January 17th, 2018, 3:05 pm

Post by stillwaiting »

tovarisch : I think that's a very good idea. Thank you. It would even make sense to use some machine learning to build a classifier, using learning materials for schools as a training set. Let's see if I could do this. (... Unless this is an overkill :) )
Peter Why
Posts: 5801
Joined: November 24th, 2005, 3:54 am
Location: Chigwell (North-East London, U.K.)

Post by Peter Why »

I've only read the first two on your list, so I can't say how accurate those grades might be. I would agree that Alice is more complex than the Wizard of Oz. You might try to find a really simple first reader text to act as your baseline ... "The cat sat on the mat" level.

And I agree with you that the Flesch-Kincaid test can't give much indication of the complexity of a text, though sentence length is a good pointer in that direction.

Peter
"I think, therefore I am, I think." Solomon Cohen, in Terry Pratchett's Dodger
annise
LibriVox Admin Team
Posts: 38542
Joined: April 3rd, 2008, 3:55 am
Location: Melbourne,Australia

Post by annise »

The ontario readers are age level graded to 3rd and fourth and the ambleside book lists too (see Wiki)

Anne
stillwaiting
Posts: 7
Joined: January 17th, 2018, 3:05 pm

Post by stillwaiting »

So, I was able to make-up the metric for book complexity. Basically, I use an average between 2 metrics:
1. one taken from http://readable.io

2. one is the average complexity of the words (compared against the table of top 50K English books), normalization is based on min and max values over the books that I already have.

The result over the existent books can be found here: https://youcanreadit.com/books-to-read-and-learn-english/ , I think it looks pretty plausible. Wdyt?
CSCO
Posts: 393
Joined: April 6th, 2010, 10:48 am
Location: Toyokawa, Japan

Post by CSCO »

If only I could read The Scarlet Letter without using English dictionary!
!!!!!!.!!!!!!.!!!!.!!!!!!!!!..!!!.!!!!!!!!!!!...!!!!!!!!!.!!!!!!.!!!!.!!!!!!.!!!!
No way. He stole away a pretty thing, you know.
That's your heart.
!!!!.!!!!!!.!!!!.!!!!!!!!!..!!!.!!!!!!!!!!!...!!!!!.!!!!!!.!!!!!!!!.!!!!!!.!!!!!!
Post Reply