Hello,
I have just released version 5.4 of cb's Japanese Text Analysis Tool.
Download cb's Japanese Text Analysis Tool v5.4 via SourceForge
What Changed?
● Added the "Frequency Group" and "Frequency Rank" fields to both the Word Frequency Report and Kanji Frequency Report.
Frequency Group: All words in the analysis that share the exact same frequency (Field 1) will be assigned to a numbered Frequency Group, with group 1 containing the most common word(s), group 2 containing the next most common word(s), and so on.
Frequency Rank: For a given word, the Frequency Rank is the total number of words in the analysis that are more frequent than the given word, plus 1. For example, if the given word has a Frequency Rank of 500, then there are 499 other words in the analysis that are more frequent than the given word. Words that share the same frequency therefore share the same Frequency Rank.
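To make the two definitions concrete, here is a small Python sketch that derives both values from a table of word counts. It is only an illustration of the definitions above, not the tool's actual code; the function name frequency_groups_and_ranks and the example counts are made up for this post.

```python
from collections import Counter


def frequency_groups_and_ranks(counts):
    """Map each word to (Frequency Group, Frequency Rank).

    `counts` is a dict of word -> number of occurrences (Field 1).
    Hypothetical helper, written only to illustrate the definitions.
    """
    # Distinct frequencies, most common first. The position of a
    # frequency in this list (starting at 1) is its Frequency Group.
    distinct = sorted(set(counts.values()), reverse=True)
    group_of = {freq: i + 1 for i, freq in enumerate(distinct)}

    # How many words occur with each frequency.
    words_at = Counter(counts.values())

    # Frequency Rank = (number of strictly more frequent words) + 1.
    rank_of = {}
    more_frequent = 0
    for freq in distinct:
        rank_of[freq] = more_frequent + 1
        more_frequent += words_at[freq]

    return {word: (group_of[f], rank_of[f]) for word, f in counts.items()}


# Made-up counts: two words tied at 50, three tied at 30, one at 7.
example = {"の": 50, "は": 50, "に": 30, "を": 30, "が": 30, "本": 7}
result = frequency_groups_and_ranks(example)
for word, (group, rank) in result.items():
    print(word, example[word], group, rank)
```

With these counts, the two words at 50 land in group 1 with rank 1, the three words at 30 land in group 2 with rank 3 (two words are more frequent), and the last word lands in group 3 with rank 6. The group number counts distinct frequencies, while the rank counts words.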
New Word Frequency Report format:
Field 1: Number of times word was encountered
Field 2: Word
Field 3: Frequency Group
Field 4: Frequency Rank
Field 5: Percentage (Field 1 / Total number of words)
Field 6: Cumulative percentage
Field 7: Part-of-speech
New Kanji Frequency Report format:
Field 1: Number of times kanji was encountered
Field 2: Kanji
Field 3: Frequency Group
Field 4: Frequency Rank
Field 5: Percentage (Field 1 / Total number of kanji)
Field 6: Cumulative percentage
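For anyone who wants to recompute Fields 5 and 6 on their own data, here is a rough Python sketch. It assumes the cumulative percentage is a running total taken in descending frequency order and that both values are expressed on a 0-100 scale; neither detail is spelled out above, so treat them as assumptions. The function name percentage_columns and the sample counts are made up.

```python
def percentage_columns(counts):
    """Return rows of (count, item, percentage, cumulative percentage).

    `counts` is a dict of item -> number of occurrences (Field 1).
    Assumption: rows are ordered from most to least frequent and the
    cumulative value is the running sum of the percentages so far.
    """
    total = sum(counts.values())
    rows = []
    running = 0
    for item, n in sorted(counts.items(), key=lambda kv: -kv[1]):
        running += n
        rows.append((n, item, 100.0 * n / total, 100.0 * running / total))
    return rows


# Made-up kanji counts for illustration only.
rows = percentage_columns({"日": 5, "本": 3, "語": 2})
for count, item, pct, cum in rows:
    print(count, item, round(pct, 2), round(cum, 2))
```

The last row's cumulative percentage always comes out to 100, which is a quick sanity check when reading a report.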
Innocent Novel analysis (Sample_Output_151003.zip) can be found at the link above.
cb4960
Edited: 2015-10-10, 9:57 pm