thecite
Member
From: Adelaide
Registered: 2009-02-05
Posts: 781
I've got two big lists of kanji in separate Word documents. I want to compare them and see which kanji from list #1 aren't in list #2. Does anyone know how I'd go about doing this?
Any help would be appreciated, thanks.
Mushi
Member
From: USA
Registered: 2010-07-06
Posts: 252
Could you copy just the kanji from the two sources into two text files, one kanji per line, sort them, then do a line-by-line comparison?
It depends on your OS, but on Windows, for example, I believe you could save them from Notepad as UTF-8 text, sort each text file with sort.exe, then diff them textually with fc.exe (file compare) or with a text comparison tool like Beyond Compare.
When I've done similar things before, the only minor issue I've found is remembering to preserve the little UTF-8 file marker (the byte order mark) at the beginning of the text files when managing the output...
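If a scripting language is an option, the same comparison can be sketched in Python without the sort step. This is only a rough sketch under a few assumptions: the two lists have been saved as UTF-8 text files, "kanji" means characters in the main CJK Unified Ideographs block (U+4E00 to U+9FFF), and the names load_kanji, missing_from and missing.txt are made up for illustration. Reading with the "utf-8-sig" codec drops the UTF-8 file marker if Notepad wrote one.

```python
import sys
from pathlib import Path


def load_kanji(path):
    """Return the set of kanji found in a UTF-8 text file."""
    # "utf-8-sig" strips a leading byte order mark if one is present.
    text = Path(path).read_text(encoding="utf-8-sig")
    # Assumption: only the main CJK Unified Ideographs block counts as kanji,
    # so kana, Latin letters, punctuation and line breaks are all ignored.
    return {ch for ch in text if "\u4e00" <= ch <= "\u9fff"}


def missing_from(list1_path, list2_path):
    """Kanji that appear in the first file but not in the second."""
    # Set difference does the comparison; sorted() orders by code point.
    return sorted(load_kanji(list1_path) - load_kanji(list2_path))


if __name__ == "__main__" and len(sys.argv) == 3:
    missing = missing_from(sys.argv[1], sys.argv[2])
    # Hypothetical output file name: one missing kanji per line.
    Path("missing.txt").write_text("\n".join(missing) + "\n", encoding="utf-8")
    print(len(missing), "kanji are in the first list but not in the second")
```

Saved as, say, compare_kanji.py, it would be run as "python compare_kanji.py list1.txt list2.txt" (both file names are placeholders). Because it picks out kanji character by character, the input files don't need to have exactly one kanji per line.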
thecite
Member
From: Adelaide
Registered: 2009-02-05
Posts: 781
Thanks for your help!
Keep in mind that there are likely 1500+ characters that aren't in list #2, so any manual compilation would be an extreme hassle.
Anyway, I've got the UTF-8 .txt files, but how do I run them through sort.exe? I Googled it, but couldn't find any good explanations.
Last edited by thecite (2013 January 03, 12:30 pm)