kanji koohii FORUM
JapaneseTextAnalysisTool User-based Readability Report - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Learning resources (http://forum.koohii.com/forum-9.html)
+--- Thread: JapaneseTextAnalysisTool User-based Readability Report (/thread-12336.html)



JapaneseTextAnalysisTool User-based Readability Report - cophnia61 - 2014-11-17

From the JapaneseTextAnalysisTool's readme:

Quote:User-based Readability Report:
------------------------------
Using a list of words that the user already knows, this report can help to determine readability
of a text based on the percentage of words in the text that the user already knows.

Name: user_based_readability_report.txt

Format:
Field 1: Readability expressed as a percentage (0-100) of the total number
of non-unique known words vs. the total number of non-unique words.
Field 2: Total number of non-unique words
Field 3: Total number of non-unique known words
Field 4: Total number of non-unique unknown words
Field 5: Readability expressed as a percentage (0-100) of the total number
of unique known words vs. the total number of unique words.
Field 6: Total number of unique words
Field 7: Total number of unique known words
Field 8: Total number of unique unknown words
Field 9: Filename

Report is sorted based on Readability (Field 1).
What is the difference between unique and non-unique words?


JapaneseTextAnalysisTool User-based Readability Report - Vempele - 2014-11-17

Unique: No matter how many times a word appears, it's still only one word.
Non-unique: A word appearing n times is n words.


JapaneseTextAnalysisTool User-based Readability Report - cophnia61 - 2014-11-17

Vempele Wrote:Unique: No matter how many times a word appears, it's still only one word.
Non-unique: A word appearing n times is n words.
Thank you!