Back

Help find the software/plugin

#1
Hi all! I'm pretty sure I've seen people discussing this functionality on the forum, but can't remember what it was exactly. I'm looking for a program that would analyze a text file (book excerpt) and give you a ranked list of words based on their frequency.

Either that, or something similar that would let you find A) the most frequently encountered words in the book, B) words from any frequency list or corpus, which are present in the book.

Is this possible at all? I'm starting from scratch with anki again, so I don't have the "known" database for morphman, if this plugin can be used for what I'm looking for. Thanks!
Reply
#2
It's "cb's Japanese Word Frequency List Generator" in this thread: http://forum.koohii.com/showthread.php?tid=7457
Reply
#3
Thank you! That's exactly what I was looking for Smile
Reply
May 16 - 30 : Pretty Big Deal: Save 31% on all Premium Subscriptions! - Sign up here
JapanesePod101
#4
Lindley Wrote:Either that, or something similar that would let you find A) the most frequently encountered words in the book, B) words from any frequency list or corpus, which are present in the book.
a) MorphMan internally stores where it found each occurrence of a morpheme and thus the data is there to have counts/frequencies, but it doesn't currently expose this in any useful way. I would just use cb's plugin for now.

b) You can make MorphMan DBs for the various lists and the book, then compare via set difference, intersection, etc. I often do this to see how many nouns/verbs I need to learn before I could reasonable try reading/watching something.
Reply