kanji koohii FORUM
RTK to traditional/simplified hanzi spreadsheet - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: General discussion (http://forum.koohii.com/forum-8.html)
+--- Thread: RTK to traditional/simplified hanzi spreadsheet (/thread-4146.html)



RTK to traditional/simplified hanzi spreadsheet - bflatnine - 2009-10-08

Threw this together today because I felt it might come in handy. It's a list of all the 3007 kanji from RTK 1 and 3, plus their counterparts in traditional and simplified Chinese. I plan on adding the missing characters from the RTH (traditional) list to it once I find a way to convert from traditional hanzi to kanji. Any ideas or links to something that can do that?

The characters were all machine converted from RTK to traditional and then from traditional to simplified, but they should be fairly accurate (much more chance of error when you start with simplified hanzi). However, I haven't checked them yet. If you see any errors, let me know!


RTK to traditional/simplified hanzi spreadsheet - bflatnine - 2009-10-13

Bump. Does anybody know of a program or website that can convert traditional hanzi to kanji? Or one that can at least tell me something like 舊 is an old form of 旧?


RTK to traditional/simplified hanzi spreadsheet - epsilondelta - 2009-11-15

Old topic, but... I'm not sure if hanzi and kanji have a good enough 1:1 correspondence to be able to convert traditional hanzi to kanji, as you are asking.

However, Wiktionary usually cross-references traditional and simplified hanzi (and I assume most Chinese dictionary sites would do so). So just enter whatever character you're looking for into Wiktionary and hover with Rikaichan to check if either the simplified or the traditional variant is used as a kanji.


RTK to traditional/simplified hanzi spreadsheet - bflatnine - 2009-11-15

epsilondelta Wrote:Old topic, but... I'm not sure if hanzi and kanji have a good enough 1:1 correspondence to be able to convert traditional hanzi to kanji, as you are asking.
Why do you say that? The conversion seems to be pretty straightforward, not like the mess that is Simplified Chinese.

Quote:However, Wiktionary usually cross-references traditional and simplified hanzi (and I assume most Chinese dictionary sites would do so). So just enter whatever character you're looking for into Wiktionary and hover with Rikaichan to check if either the simplified or the traditional variant is used as a kanji.
Thanks for the tip. However, I wasn't looking for something I could do one character at a time. There are hundreds. Plus, that method doesn't seem like it would help for characters that are different in Japanese than in either form of Chinese. But I did find a site that converts 新字体 to 旧字体 and vice versa. Some of the characters come out as archaic variants like 眞 (modern standard version is 真). But those problems aren't too hard to spot.


RTK to traditional/simplified hanzi spreadsheet - Jarvik7 - 2009-11-15

You want to use http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=舊

Write a script or something to parse it if you want to do it in batches.


RTK to traditional/simplified hanzi spreadsheet - bflatnine - 2009-11-15

Jarvik7 Wrote:You want to use http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=舊

Write a script or something to parse it if you want to do it in batches.
I know about the Unihan database, but I've got no clue how to write such a script. Not all of us on this site are programmers. Wink

Anyway, I finished this project not too long after I posted this thread, and I made a (BROKEN LINK) post about it in the RTH forum and on my blog.


RTK to traditional/simplified hanzi spreadsheet - Jarvik7 - 2009-11-16

Scripting isn't really programming. You can teach yourself how to do basic parsing in a couple of hours at most.