(2016-06-10, 9:25 pm)jimeux Wrote: I had a little trouble with this too. JMDict is the new format, so you should go with that. I think they were or perhaps still are updating EDICT2 in unison, but you can basically think of it as deprecated. There's a DTD for JMDict and a page explaining the relational data model they use themselves for JMdict somewhere on the site.
It took me quite a bit of work to parse it successfully into a DB, but then I was using a fairly low-level Java library. There are a lot of elements, so I'd study the DTD and the data model page before writing any code. Since JMdict is a relatively new format, the Bibliography, Link and Audit tags are currently completely empty, so I wouldn't prioritise those for now.
If you're making a J-E dictionary, you'll probably be interested in Kanjidic2, the Tanaka Corpus, Kradile-u (check license), KanjiVG, and some kind of radical list (I had to make my own).
You sound like someone who knows what's what, so I wonder if you'd be able to enlighten me on something related to this.
I have built a somewhat large vocabulary deck in Anki, 99% of which has the English definition taken from EDICT (not sure if it's 1 or 2), and I'm wondering if there is any kind of batch process I could perform on around 40K notes to output definitions taken from JMDict.
The only piece of software I know of that can perform this sort of function is Epwing2Anki, and unfortunately it is only compatible with EDICT, not JMDict. It uses a .sqlite version of EDICT to pull from I believe, and this may be a question for the author, but he unfortunately hasn't been around here in a while, so I'll ask you just in case, if that's OK. Would it be possible to TRICK or DECEIVE this piece of software into thinking it's pulling from EDICT when in fact you have sneakily replaced the edict.sqlite with jmdict.sqlite? Now, this is even assuming there is such a thing available, I don't know and obviously I should have researched this first before typing this out, but alas here we are.
I'm not a coder and am probably ignorant of a whole slew of reasons why this wouldn't work, perhaps even just the difference in format between the 2 dictionaries would make this task impossible, but what say you sir? If this is a no-go, would you know of any other way to go about such a task?
Edited: 2016-06-11, 12:07 pm

