cb's Japanese Text Analysis Tool

#34
egoplant Wrote:
cb4960 Wrote: Reports from cb's Japanese Text Analysis Tool v3.0 based on 5000+ novels (27 May 2012):
Download via MediaFire

Includes word frequency report via Mecab, word frequency report via JParser, differences between the Mecab report and JParser report, kanji frequency report, and readability report.
How accurate would these reports be? If a word is in the top 10,000 on this list, is it worth learning? For example, a word like 世界中 (around the world) is 5366th on the list, but it isn't even marked as common on jisho.org. I know jisho isn't perfect either, but do you think this list would be good to study from? What I would like to do, when I learn a new kanji, is go to this list and find a few new words that include it to help me remember the kanji's pronunciations and meanings.
It is accurate in the sense that 世界中 is probably the 5366th most frequent word in the corpus of novels that I used. Can't comment on jisho.org.
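The lookup egoplant describes (learn a kanji, then pull a few frequent words that contain it) could be sketched roughly like this in Python. This is only an illustration under assumptions: the report layout (one word per line, `count<TAB>word`, sorted from most to least frequent) is a guess rather than the tool's documented format, `read_report` and `words_containing` are hypothetical helper names, and the sample counts are made up for the demo.

```python
import csv
import io


def read_report(handle):
    """Yield (count, word) pairs from a frequency report.

    ASSUMPTION: each line is "count<TAB>word" and lines are already
    sorted from most to least frequent. Adjust the column indexes if
    the real report is laid out differently.
    """
    for fields in csv.reader(handle, delimiter="\t"):
        if len(fields) >= 2:
            yield int(fields[0]), fields[1]


def words_containing(kanji, rows, limit=5):
    """Return up to `limit` (rank, word) pairs for words containing `kanji`."""
    matches = []
    for rank, (_count, word) in enumerate(rows, start=1):
        if kanji in word:
            matches.append((rank, word))
            if len(matches) == limit:
                break
    return matches


# Made-up sample data, only to show the shape of the input.
sample = "900\t世界\n500\t中\n120\t世界中\n80\t界隈\n"
result = words_containing("界", read_report(io.StringIO(sample)), limit=3)
print(result)
```

For a real report, the `io.StringIO(sample)` argument would be replaced by a file opened with `open(path, encoding="utf-8")`. Because the rank comes from line position, the words returned are the most frequent ones in the corpus that contain the kanji, which matches the "accurate for this corpus of novels" caveat above.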