Source of the Corpus used

Dear fellow learners,

I would like to know how was the frequency list made? Generally all frequency list / dictionaries have the data on which the list is made mentioned. I saw David mention somewhere that it is based on NLP. I do not understand it.

If anybody can give me some more context, that would be lovely!

Thank you for your time!

Hi @bharatingermany,

Here is what I can tell you from the deep dive I’ve done in the forum in the past:

I know that they pull frequency lists from opensubtitles[.org]:

And after Google it “NLP” refers to a type of programing language.

I hope some of this is helpful. :slight_smile:

1 Like