I had a go at editing your script. It uses core.py and jmdict_freqs.txt from my GitHub link. Note that my code is Python 2 while your original was Python 3.
I figure you know enough to debug the script if it's not quite correct...
For the words it missed in your sample, looks like JMDICT only has 朝早く and not 朝早い. And JMDICT doesn't have the trailing な in its entries, so might be worth making the script remove those before looking up the word.
Also, ideally it would split the furigana over the correct kanji (since the information is there already in the source csv).
http://pastebin.com/RshFJE4P
I figure you know enough to debug the script if it's not quite correct...
For the words it missed in your sample, looks like JMDICT only has 朝早く and not 朝早い. And JMDICT doesn't have the trailing な in its entries, so might be worth making the script remove those before looking up the word.
Also, ideally it would split the furigana over the correct kanji (since the information is there already in the source csv).
http://pastebin.com/RshFJE4P
Edited: 2015-08-31, 7:23 am
