![]() |
|
looking for an ordered onyomi index - Printable Version +- kanji koohii FORUM (http://forum.koohii.com) +-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html) +--- Forum: Remembering the Kanji (http://forum.koohii.com/forum-7.html) +--- Thread: looking for an ordered onyomi index (/thread-12258.html) |
looking for an ordered onyomi index - Nadavu - 2014-10-09 Hi guys, disclaimer: I did my best to search the forum (and the net..) before starting a new thread, but all I could find were dead links to lists that are not exactly what I am looking for. I'm looking for an index that lists kanji according to onyomi, and ideally subgroups kanjis of a particular onyomi reading according to radicals/primitives. so, for example, if I was to search for the reading リン I would get: 輪倫侖棆淪綸 隣鱗燐麟 林淋琳 etc. Please note that kanji should be recognized to have more than one onyomi, i.g., the kanji 強 would belong to both the キョウ list and the ゴウ list. Is there such an index in existence? Thanks! looking for an ordered onyomi index - aldebrn - 2014-10-11 This should be easy to make, but how exactly do you want to group "kanjis of a particular onyomi reading according to radicals/primitives"? RTK ordering is easy, anything else will require a specification on database to use. looking for an ordered onyomi index - Vempele - 2014-10-11 Grouped by phonetic primitive, sorted by how good the primitive is at predicting on'yomi (I don't remember the exact details of the sorting): http://forum.koohii.com/showthread.php?pid=195510#pid195510 looking for an ordered onyomi index - Nadavu - 2014-10-11 Vempele Wrote:Grouped by phonetic primitive, sorted by how good the primitive is at predicting on'yomi (I don't remember the exact details of the sorting): http://forum.koohii.com/showthread.php?pid=195510#pid195510Oh my, this is truly frightening in its comprehensiveness. Thanks a lot! looking for an ordered onyomi index - lauri_ranta - 2014-10-12 I wrote a ruby script that uses data from kanjidic and http://cjkdecomp.codeplex.com: require "nokogiri" yomi = Hash.new { |x, y| x[y] = [] } decomp = {} IO.readlines("cjk-decomp-0.4.0.txt").each { |l| l =~ /(.+):.+\((.*)\)/ decomp[$1] = $2.split(",") } Nokogiri.XML(IO.read("kanjidic2.xml")).css("character").each { |e| next if e.css("dic_ref[dr_type='heisig']").empty? kanji = e.css("literal").text e.css("reading[r_type='ja_on']").each { |x| yomi[x.text] << kanji } } yomi.each { |y, kanji| allcomponents = kanji.map { |x| decomp[x] }.flatten.compact allcomponents.select { |x| allcomponents.count(x) >= 2 }.uniq.each { |x| puts y + " " + kanji.select { |k| decomp[k] && decomp[k].include?(x) }.join } } I uploaded the output to http://19a5b0.s3-website-us-west-2.amazonaws.com/rtk-kanji-by-component-by-yomi.txt. The list is limited to RTK kanji, but it includes all yomi in kanjidic and not just jouyou yomi. I didn't try to remove groups of kanji like 綾綸 that share non-phonetic compounds. Kanji for リン: リン 倫輪綸 リン 淋琳 リン 燐隣鱗麟 リン 綾綸 looking for an ordered onyomi index - Nadavu - 2014-10-12 Thanks Lauri - that's exactly what I was looking for! looking for an ordered onyomi index - aldebrn - 2014-10-13 lauri_ranta Wrote:I wrote a ruby script that uses data from kanjidic and http://cjkdecomp.codeplex.comOh snap, I've never seen this decomposition dataset before! It looks pretty good, although I did notice a flaw in 留:d(卯,田). Do you have any experience that could let you compare CJKDecomp against CJKVI or KanjiVG, two venerable decomposition databases? |