In a previous post, I expounded on language learning and laid out what I think is the fastest way to learn any language. One of the components of quickly acquiring languages is to prioritize the words that you learn. Learning the most common words first will reap huge benefits for your comprehension. There are several word frequency lists out there, most of them I found were compiled from newspapers, but Mike “Pomax” Kamermans over at nihongoresources.com had a brilliant idea to use Japanese novels as material instead. His algorithm compiled over 65 million words. No word frequency list can be perfect, but I think this one is about as close as you can get.
I simply took the first 3000 words from his data and made some tweaks so the words are easier to utilize for studying. I removed punctuation and numbers, and compiled the words into 2 page pdf files that are easy to print so you can cross off words when you learn them. I’ve also included the text file of those 3000 words in case you want to do any textual searches.
PDF files: For Printing
Text file: For Searching
A little bit of number crunching on the data turned out some very interesting facts.
The first 100 words on the list make up 57.2% of the text that was processed.
The first 500? 70.3%.
The first 1000? 76.2%
The first 3000? 85.4%
The first 10,000? 94.1%
But don’t let this data fool you completely. Mike (the man who generated the list himself) said in an email…
Usually the most frequently used words don’t need explicit learning because they are found all over the place, and the medium-presence words are more important, because they convey important things. Frequent words are usually common because they contain little information, so you have a trade-off between ‘used a lot’ and ‘give critical information’.
You can find the complete list of more than 65,000 words including punctuation, word frequency, and parts of speech at http://pomax.nihongoresources.com/index.php?entry=1222520260.
Here’s a link my article How to Learn Any Language in 6 Months