Finally got around to making a searchable deck containing all of JMdict with Tangorin-style entries:
LINK
Default card colour scheme shamelessly stolen from cophnia61.
I had thought that a deck this size would cause performance issues in Anki, but so far it seems fine.
The | characters in the 'find' field can be used to compensate for Anki's (afaik) inability to match word boundaries, e.g.:
deck:current |ふう| #match exact reading/writing
deck:current 人| にん| #ending with 人 read as にん
deck:current 船 (ship or boat) #containing 船 and either of the words 'ship' or 'boat'
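For anyone wondering why the | delimiters help, here's a rough Python sketch. It assumes the 'find' field is just every writing and reading wrapped in |, and that a plain search term behaves like a substring match on that field. `make_find_field` and `matches` are made-up names for illustration, not anything from the deck or from Anki itself.

```python
def make_find_field(forms):
    """Wrap every writing/reading in | so each one has explicit boundaries.

    Assumed layout, e.g. ["風", "ふう"] -> "|風|ふう|".
    """
    return "|" + "|".join(forms) + "|"


def matches(find_field, term):
    """Stand-in for a plain substring search over one field."""
    return term in find_field


fuu = make_find_field(["風", "ふう"])        # reading is exactly ふう
fuufu = make_find_field(["夫婦", "ふうふ"])  # reading merely contains ふう

# Without delimiters both entries match; with them only the exact one does.
assert matches(fuu, "ふう") and matches(fuufu, "ふう")
assert matches(fuu, "|ふう|") and not matches(fuufu, "|ふう|")

# A trailing | on its own anchors the end of a form ("ending with 人 read as にん").
cook = make_find_field(["料理人", "りょうりにん"])
human = make_find_field(["人間", "にんげん"])
assert matches(cook, "人|") and matches(cook, "にん|")
assert not matches(human, "人|")
```

The same trick works for prefixes by putting the | in front of the term instead.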
Please let me know if you find any errors/omissions and I'll post an update. I haven't started using it yet as it's still freshly squeezed, so it's quite possible there's some junk hiding in there somewhere. The issue with entries having multiple readings *and* writings was particularly irksome - hard to see what the best choice would be in all cases so I kind of copped out on it.
Where an automated choice had to be made as to what to put in Expression/Reading I've put "choices!" in FrontNotes as a temporary warning to edit the card when it comes up. This will hopefully be workable, though many of the choices are rather trivial (e.g. お尻 vs 御尻 vs オシリ), so could get annoying.
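Roughly what the flagging amounts to, as a Python sketch. The actual selection rule isn't spelled out here, so this assumes the simplest one (take the first-listed writing and reading) and flags the note whenever there was more than one candidate. `build_fields` and the sample forms are hypothetical.

```python
def build_fields(writings, readings):
    """Pick Expression/Reading and flag the note when the pick was a guess.

    Assumption: the first-listed writing and reading are used as defaults,
    and any entry with more than one candidate gets the warning.
    """
    expression = writings[0] if writings else readings[0]
    reading = readings[0]
    had_choice = len(writings) > 1 or len(readings) > 1
    return {
        "Expression": expression,
        "Reading": reading,
        "FrontNotes": "choices!" if had_choice else "",
    }


# Illustrative input only, not copied from the dictionary data.
flagged = build_fields(["お尻", "御尻", "オシリ"], ["おしり"])
plain = build_fields(["船"], ["ふね"])

assert flagged["Expression"] == "お尻"
assert flagged["FrontNotes"] == "choices!"
assert plain["FrontNotes"] == ""
```

Searching for choices! then lists every card that still needs a manual look.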
Edit: whoops, forgot to mention: card tags (JLPT levels etc) were taken from the 'Japanese corePLUS' deck, which doesn't seem to be on the shared decks list any more.
Edit2: better search examples above.
Having now used the deck a bit, I've noticed some possible improvements:
- separate parts of speech and other misc tags with |, e.g. |n|vs|, so that searching |n| finds all nouns and |exp| finds all expressions without getting false positives.
- improve the tags
- figure out wtf the '3 audio, 1 image' are, because I didn't include any (or so I thought ...)
- put 'japanese' in the note type so it works with kanji stats
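A quick Python illustration of the false-positive problem with the part-of-speech tags, assuming the tags end up in a single text field and that the search is a substring match. The three sample notes and the `join_tags` helper are made up; the tag codes are just JMdict-style abbreviations.

```python
def join_tags(tags):
    """Store tags as |a|b|c| so each tag can be matched exactly."""
    return "|" + "|".join(tags) + "|"


# Hypothetical notes: word -> part-of-speech tags
notes = {
    "勉強": ["n", "vs"],
    "はい": ["int"],
    "しかし": ["conj"],
}

# Space-separated storage: searching for "n" hits every tag containing an n.
naive_hits = [w for w, tags in notes.items() if "n" in " ".join(tags)]

# Pipe-delimited storage: searching for "|n|" hits only the exact tag.
exact_hits = [w for w, tags in notes.items() if "|n|" in join_tags(tags)]

assert naive_hits == ["勉強", "はい", "しかし"]
assert exact_hits == ["勉強"]
```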
Edit3: uploaded a new version with the above fixes. Still no idea wtf the image/audio are though.
It is now possible to search for e.g.
Dialect words: ben| (a mere 300 total, of which 161 are Kansai-ben)
Math terms: |math
Computing terms: |comp
Swears: |vulg
etc
Todo next: make tag sets for novels, anime eps, core decks, etc.
Edit4: Fixed minor furigana issue, and added a 'common' tag (29459 cards)
Edit 07-feb-2016 - New release:
- Updated JMdict to the latest version
- Got rid of the annoying 'choices!' thing & relegated it to a tag. Added some JavaScript to the card template to conditionally show the alternatives
- Cleaned up the tags a bit more, though unfortunately only a small proportion of the core6k and core10k cards matched due to differing kanji usage. Todo: better matching.
- A few more furigana fixes
- About 900 entries that were in the old version of the dict but not in the current version are tagged 'old1'. These can be deleted if you want; they seem mostly useless but not entirely, hence left in.
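The 'old1' tagging boils down to a set difference between the two dictionary versions. A minimal Python sketch, assuming entries can be compared by some stable ID (JMdict's ent_seq number would be the obvious candidate, but whether the deck was actually built that way isn't stated). `tag_removed_entries` and the ID values are invented for the example.

```python
def tag_removed_entries(old_ids, new_ids, tag="old1"):
    """Return {entry_id: tag} for IDs present in the old dump but not the new one."""
    removed = set(old_ids) - set(new_ids)
    return {entry_id: tag for entry_id in sorted(removed)}


# Made-up IDs standing in for two dictionary releases.
old_release = [1000010, 1000020, 1000030]
new_release = [1000010, 1000030, 1000040]

tagged = tag_removed_entries(old_release, new_release)
assert tagged == {1000020: "old1"}
```

Entries that only exist in the new release (1000040 here) are untouched; they'd just be added as new notes.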
Edited: 2016-02-07, 12:37 pm



