(2015-10-27, 1:09 pm)gdaxeman Wrote: Is there a way to extract all the sentences and translations from the EPWING dictionaries? (Plus the headwords they came from, for reference.) Reason: I thought about the possibility of using them to create a massive deck with one sentence per card, maybe using MorphMan to reorder them or something, for when I finish Core 10k a few months from now. I would also be interested in using the Text Analysis Tool on the content to gather some information about them as a whole, such as how many different words and kanji they use.
I have something close to that for the Kenkyusha. 411397 lines, 27.2 MB, every line that contains a period. Notably missing the headwords.
Edited: 2016-05-31, 5:53 am
