nest0r Wrote: No, I haven't gotten it yet, so I don't know about the audio quality or whether there are actually 4000 sentences not in smart.fm. I can't bring myself to buy it knowing I'll always be looking at it and frowning, wishing I had an Anki deck I can use as a corpus on my other machines, like we have for other resources.
So I downloaded the lite version and inspected the files. It's about 378MB, so I assume the lite version already contains all the data, as suggested above. I've only just started looking at how everything is stored, so I may have missed something or made a mistake, but so far:
----
The A1.dat to A10.dat and B1.dat to B10.dat files seem to hold the words; from them I can extract the expression, reading, and an internal index. I've extracted 9619 words, including about 200 duplicates (maybe multiple meanings with the same reading?). In a simple comparison on the expression field, 3568 (of 9619) are not in Kore, and 20 (of 6000) items from Kore are not in this.
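A rough sketch of that kind of comparison, assuming both word lists have already been extracted into plain Python lists of expressions. The function name and the sample words are made up for illustration and are not taken from either data set:

```python
from collections import Counter

def compare_expressions(app_words, kore_words):
    """Compare two word lists using the expression field only."""
    counts = Counter(app_words)
    # Expressions that occur more than once in the app's list
    duplicates = sorted(w for w, n in counts.items() if n > 1)
    app_set, kore_set = set(app_words), set(kore_words)
    return {
        "duplicates": duplicates,
        "not_in_kore": sorted(app_set - kore_set),
        "not_in_app": sorted(kore_set - app_set),
    }

# Made-up sample data, for illustration only
app = ["犬", "猫", "鳥", "犬"]
kore = ["犬", "魚"]
result = compare_expressions(app, kore)
```

Matching on the expression alone treats words that share a spelling but differ in reading as the same item, so the counts are only approximate.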
----
sentences.idx and headwords.idx appear to map sentences/words to audio filenames (the audio is stored as MPEG-4 .m4a files). The names run from js00002a.m4a through js10089a.m4a for sentences and jw00002a.m4a through jw10089a.m4a for words. There's additional data in these files, but I'm not sure how to decode it yet.
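The naming pattern can be sketched as below. This assumes the numeric part is always five zero-padded digits followed by "a", which is inferred only from the first and last names above; whether a sentence and a word with the same number belong together is unverified. The helper names are hypothetical:

```python
import re

# Pattern inferred from names like js00002a.m4a and jw10089a.m4a (an assumption)
AUDIO_NAME = re.compile(r"^j([sw])(\d{5})a\.m4a$")

def audio_filename(kind, number):
    """Build a filename for kind 'sentence' or 'word' and a numeric index."""
    prefix = {"sentence": "js", "word": "jw"}[kind]
    return f"{prefix}{number:05d}a.m4a"

def parse_audio_filename(name):
    """Return (kind, number), or None if the name does not fit the pattern."""
    m = AUDIO_NAME.match(name)
    if m is None:
        return None
    kind = "sentence" if m.group(1) == "s" else "word"
    return kind, int(m.group(2))
```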
----
sentences.dict and headwords.dict are some sort of archive storing all the audio files in base64.
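One way to sanity-check the base64 guess on a single entry, assuming a base64 chunk has already been cut out of the .dict file (the archive layout itself is still unknown, so that step is not shown). MP4-family files such as .m4a usually carry the marker "ftyp" at byte offset 4, which is what this hypothetical helper looks for:

```python
import base64
import binascii

def looks_like_m4a(b64_text):
    """Decode a base64 string and check for the usual MP4 'ftyp' marker."""
    try:
        raw = base64.b64decode(b64_text, validate=True)
    except (binascii.Error, ValueError):
        return False  # not valid base64 at all
    # MP4-family files normally start with a 4-byte box size, then b"ftyp"
    return len(raw) >= 8 and raw[4:8] == b"ftyp"
```

A False result only means this particular chunk did not decode to something starting like an MP4 file; it says nothing about the rest of the archive.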
----
JIB_ej{A,B,S}.{idx,dict} contain all the English meanings (and some wacky indexing method), and it would appear other JIB_* files have data for various other languages (e.g., Chinese, Korean).
Either way, it certainly looks like extracting the data and correlating it all isn't unreasonably difficult (no encryption) [EDIT: actually, it appears some files are encrypted, or more likely, compressed in some way]. It's just a question of whether figuring out all the specifics and gluing it together is worth the effort.
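A quick way to tell common compression from something else is to look at the first bytes of a file. This is a minimal sketch that only recognises the standard gzip and zlib headers; the function name is hypothetical, and nothing here is confirmed about what these particular files actually use:

```python
import gzip
import zlib

def sniff_compression(head):
    """Guess the compression type from the first bytes of a file."""
    # gzip streams start with the magic bytes 1f 8b
    if head[:2] == b"\x1f\x8b":
        return "gzip"
    # zlib streams: first byte is usually 0x78 and the 2-byte header,
    # read as a big-endian number, is a multiple of 31
    if len(head) >= 2 and head[0] == 0x78 and (head[0] * 256 + head[1]) % 31 == 0:
        return "zlib"
    return "unknown"
```

For a real file, something like `sniff_compression(open(path, "rb").read(4))` would do. An "unknown" result does not prove encryption; it could be another compression scheme or a custom header.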
brianobush, could you explain what exactly you meant about the unlocking as you play? And what exactly is the difference between the "lite" (aka free) version and the deluxe one for $16?
EDIT: Ah. So the free/lite version comes with some lessons unlocked, the beginner upgrade brings it to 1750 words for $6, the advanced upgrade adds 8000 words for $10, and deluxe unlocks all 9750 for $16.
Edited: 2011-01-21, 5:27 pm