Back

Need help plugging gaps from Frequency lists

#1
Another one for code proficient users (definitely not me!)

I'd like to 'compare' my Anki vocab deck to a frequency list and then somehow generate a new list of Kanji compounds and words that aren't in my Anki deck. In other words, I'd like to fill in the gaps.

Presumable I need to export my Anki deck into a certain format (*.txt ?). Then choose a frequency list and also have it in *.txt format. Any idea how would I proceed from there?
Reply
#2
It's really easy with Morph Man

http://rtkwiki.koohii.com/wiki/Morph_Man

Install it in anki and follow these steps:

1. Tell Morph Man to create a database of known words. It will automatically parse all the words in cards you have studied.
2. Import cards from your frequency list.
3. Use Morph Man to order these cards by the amount of unknown words.
4. Delete all i+0 cards and study all i+1 cards.
5. Repeat steps 1, 3 and 4 until your done.

I haven't actually used Morph Man in a long time so it may work slightly differently now but I'm sure that will get you on the right track.
Reply
#3
Splatted Wrote:It's really easy with Morph Man
lol, you have no idea how bad I am with tech stuff. But Morph Man looks amazing. I'll try to get started with it and let you know how it's going. Appreciate the advice, thanks !
Reply
May 16 - 30 : Pretty Big Deal: Save 31% on all Premium Subscriptions! - Sign up here
JapanesePod101
#4
Splatted Wrote:It's really easy with Morph Man

http://rtkwiki.koohii.com/wiki/Morph_Man

Install it in anki and follow these steps:

1. Tell Morph Man to create a database of known words. It will automatically parse all the words in cards you have studied.
2. Import cards from your frequency list.
3. Use Morph Man to order these cards by the amount of unknown words.
4. Delete all i+0 cards and study all i+1 cards.
5. Repeat steps 1, 3 and 4 until your done.

I haven't actually used Morph Man in a long time so it may work slightly differently now but I'm sure that will get you on the right track.
I have been using morphman and I think it is great, but how do I know what cards are i+0 cards ? Are they the ones tagged as "comprehension" ? How do I delete them?
Reply
#5
I haven't used it much, but according to the docs, it is the K+n value. So a value of zero means you know every word.
Reply
#6
yogert909 Wrote:I haven't used it much, but according to the docs, it is the K+n value. So a value of zero means you know every word.
I think you misunderstood my question.

I know what i+0 means, I just want to know is there a quick and easy way to identify which cards in anki are i+0 or don't introduce any new words.
Reply
#7
In morphman you can set it up to output i+0 as one of the fields. Sort by that field in the browser.
Reply
#8
I still think understand your question, it's morphman that I'm a a little foggy with but hopefully I remember enough you get you going. As I recall, you have to configure both morphman as well as anki. Morphman has a value called N_k that outputs the number of unknown words in each card. There's another one called N_m that adds words that you know but aren't mature yet. Then inside of anki, you need to create a field to receive that value. And then you need to configure morphman's output to put the N_k value into the 'n+1' field in anki. Again this is off the top of my head, but I hope there's enough there to get you what you want.
Edited: 2014-11-14, 7:52 pm
Reply
#9
yogert909 Wrote:I still think understand your question, it's morphman that I'm a a little foggy with but hopefully I remember enough you get you going. As I recall, you have to configure both morphman as well as anki. Morphman has a value called N_k that outputs the number of unknown words in each card. There's another one called N_m that adds words that you know but aren't mature yet. Then inside of anki, you need to create a field to receive that value. And then you need to configure morphman's output to put the N_k value into the 'n+1' field in anki. Again this is off the top of my head, but I hope there's enough there to get you what you want.
Wow ok,

Do you know of anywhere where it details how to set this up?
I have watched the tutorial videos etc but that didn't really seem to help me.


Thanks,
Reply
#10
Unfortunately that's about as far as I can get you without getting my hands dirty and I'm at work now. Morphman's config was somewhat convoluted as I recall. Hopefully someone else chimes in with some additional advice. I was getting my information form the link that splatted posted. Specifically, the following line. With luck, all you would have to do is make sure "unknown" is mapped to some field in anki.
Quote:mature = "%(jpn)s"
* known = "%(jpn)s [%(N_m)s]"
* unknown = "%(eng)s [%(N_k)s]"
Reply
#11
In the config file
under this section, make you your config look something like this(with the u'XXXX' being the name of the field on your anki card. The comments should be pretty self explanatory, if you need any more help give me a holler. )
Code:
# only these can have model overrides
    'enabled':True,    # whether to analyze notes of a given model, modify their fields, and manipulate due time by Morph Man Index
        # field names to store various information
    'k+N':u'k+N',       # stores how many unknowns
    'm+N':u'm+N',       # stores how many unmatures
    'morphManIndex':u'morphManIndex',   # created an ordering to learn cards in. this is the value new card 'due' times are set to
    'focusMorph':u'focusMorph',         # holds the unknown for k+0 sentences but goes away once m+0
    'unknowns':u'unknowns',             # comma seperated list of morphemes that are unknown
    'unmatures':u'unmatures',               # likewise for unmatures
    'unknownFreq':u'unknownFreq',       # average of how many times the unknowns appear in your collection
        # analyze notes based on the morphemes in these fields
    'morph_fields': [u'Expression'],
        # tag names for marking the state of notes
            # the following three are mutually exclusive and erase eachother upon promotion/demotion
    'tag_comprehension':u'comprehension',   # set once all morphs for note are mature
    'tag_vocab':u'vocab',                   # set once all but 1 morph for note is known
    'tag_notReady':u'notReady',             # set for k+2 and above cards
    'tag_alreadyKnown':u'alreadyKnown',     # you can add this tag to a note to make anki treat it as if mature
        # controls for morpheme analysis
    'morph_blacklist': [ u'記号', u'UNKNOWN'],      # you probably don't care about punctuation and things mecab couldn't parse
        # try playing fields in this order when using batch media player
    'batch media fields': [ u'Video', u'Sound' ],
        # configure morph man index algorithm
    'optimal sentence length': 4,
        # lite update
After this, reset anki, run morphman analyzer again, and anything tagged "comprehension" or "alreadyKnown" or has the k+N field at 0 will be known.
Edited: 2014-11-15, 7:29 am
Reply
#12
@RandomQuotes

Thanks for the detailed reply. Based on that, cards in my deck that are tagged "comprehension" are the cards I want to delete. Thanks again.
Reply