Search Results

Post Author Forum Replies Views Posted [asc]
    Thread: Core 10k - optimized i+1 version
Post: Core 10k - optimized i+1 version

MaxHayden Wrote:pmnox Wrote:1) The code is written in Python. However, it's not written for readability. I'm not sure if it will be any use to you, but I can share a dropbox link to all files that I u...
MaxHayden Learning resources 348 90,923 2014-09-03, 3:40 pm
    Thread: Core 10k - optimized i+1 version
Post: Core 10k - optimized i+1 version

If you have a hanzi frequency list, you could use that as your sort order and use the script that pmnox posted that he used to sort this one.
MaxHayden Learning resources 348 90,923 2014-07-28, 10:15 am
    Thread: cb's Japanese Text Analysis Tool
Post: cb's Japanese Text Analysis Tool

Thanks for adding those features.
MaxHayden Learning resources 125 56,229 2014-07-19, 10:30 pm
    Thread: Anki Vocabulary Deck Recommendations
Post: Anki Vocabulary Deck Recommendations

I like pmnox's Core10k optimized deck. I think it's the best currently available. The main issue with the deck is that the "core" vocabulary lists were made with dated materials and dubious methodol...
MaxHayden Learning resources 24 6,018 2014-07-17, 8:50 am
    Thread: Core 10k - optimized i+1 version
Post: Core 10k - optimized i+1 version

I'm working on the list of words, but I'll speed it up. The similar words thing might be helpful since you can't learn similar words as quickly (you should suspend one of them until the first one is ...
MaxHayden Learning resources 348 90,923 2014-07-13, 7:15 am
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

jeffberhow Wrote:You're absolutely right about the m-estimator thing. I'll look into it; it looks similar to the Linear Algebra least squares idea. I may put the i+1 thing on the backburner for now. ...
MaxHayden Learning resources 58 14,534 2014-07-11, 1:04 pm
    Thread: Core 10k - optimized i+1 version
Post: Core 10k - optimized i+1 version

pmnox Wrote:My original plan was to learn first 10k words using flashcards. The remaining words would be suspended until I enable them as soon as I encounter them in writing. I tried to learn more wo...
MaxHayden Learning resources 348 90,923 2014-07-11, 10:02 am
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

jeffberhow Wrote:I'm thinking about taking sentences and averaging the words into a "sentence rank." The only thing that worries me about this is the i+1 issue when learning cards. Ranking sentences w...
MaxHayden Learning resources 58 14,534 2014-07-11, 9:56 am
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

Thanks.
MaxHayden Learning resources 58 14,534 2014-07-10, 1:56 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

vosmiura Wrote:VDLJ_Ver1_0_General-Learners.xlsx gives word rankings. It doesn't give the frequency though, so it won't be enough calculate the coverage % for N words. The academic one gives frequenc...
MaxHayden Learning resources 58 14,534 2014-07-09, 11:11 am
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

I'm not 100% clear on what you did. You ran cb's utility to get the frequency counts of words from Code Gueass, then you looked up those words in this frequency list. But how did you get rid of prop...
MaxHayden Learning resources 58 14,534 2014-07-08, 5:33 pm
    Thread: Core 10k - optimized i+1 version
Post: Core 10k - optimized i+1 version

pmnox Wrote:1) The code is written in Python. However, it's not written for readability. I'm not sure if it will be any use to you, but I can share a dropbox link to all files that I used to generate ...
MaxHayden Learning resources 348 90,923 2014-07-08, 12:53 pm
    Thread: Core 10k - optimized i+1 version
Post: Core 10k - optimized i+1 version

pmnox, Thanks for your quick reply and for your offer of help. Ideally I'd like to avoid as much duplication of work as possible while improving what we have. I'm generally open to ideas as to the ...
MaxHayden Learning resources 348 90,923 2014-07-07, 11:11 pm
    Thread: Core 10k - optimized i+1 version
Post: Core 10k - optimized i+1 version

So in another thread, it was pointed out that the Core lists used for this deck are not all that good in terms of frequency coverage. I contacted Dr. Tatsuhiko Matsushita at the University of Tokyo a...
MaxHayden Learning resources 348 90,923 2014-07-07, 5:13 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

MaxHayden Wrote:Since these lists exclude proper nouns, grammatical particles, and the like. I'm going to rerun my frequency coverage stats against them when I have time. I'm also going to try to id...
MaxHayden Learning resources 58 14,534 2014-07-07, 4:26 pm
    Thread: cb's Japanese Text Analysis Tool
Post: cb's Japanese Text Analysis Tool

MaxHayden Wrote:I spoke to Dr. Tatsuhiko Matsushita at the University of Tokyo. He says that word families aren't as useful for Japanese vocabulary learning because of the way the morphology works. ...
MaxHayden Learning resources 125 56,229 2014-07-07, 3:51 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

So I spoke to someone at the University of Tokyo. He says that word families don't work as well for Japanese as they do for English and that he thinks we should just learn from a larger lexeme/lemma ...
MaxHayden Learning resources 58 14,534 2014-07-06, 2:37 pm
    Thread: cb's Japanese Text Analysis Tool
Post: cb's Japanese Text Analysis Tool

I spoke to Dr. Tatsuhiko Matsushita at the University of Tokyo. He says that word families aren't as useful for Japanese vocabulary learning because of the way the morphology works. He recommends ju...
MaxHayden Learning resources 125 56,229 2014-07-06, 1:13 pm
    Thread: cb's Japanese Text Analysis Tool
Post: cb's Japanese Text Analysis Tool

Thank you for the info. If it isn't too difficult to add in the next release, please do add support for outputting part of speech (so that it can be filtered and/or sorted). I think this would be ve...
MaxHayden Learning resources 125 56,229 2014-07-04, 1:03 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

Thanks. That's extremely helpful. I took a look at them and see that there's a lot of information in B & I.
MaxHayden Learning resources 58 14,534 2014-07-01, 7:29 pm
    Thread: JLPT1 vocab list& running it against anki deck
Post: JLPT1 vocab list& running it against anki deck

Someone has already done the work for you: This supplemental deck made by a forum member includes the 1325 JLPT vocab that isn't in Core 10k.
MaxHayden Learning resources 5 1,690 2014-07-01, 1:26 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

In English, a lemma is the "headword" and its inflected forms (book and books) plus variant spellings (favor, favour). The English inflections are plural, third person singular present tense, past te...
MaxHayden Learning resources 58 14,534 2014-07-01, 12:21 pm
    Thread: cb's Japanese Text Analysis Tool
Post: cb's Japanese Text Analysis Tool

I have a question about how this tool works. Looking at various frequency lists including your novel frequency list, I'd like to know how many of the words that are showing up are proper nouns, gramm...
MaxHayden Learning resources 125 56,229 2014-06-30, 3:19 pm
    Thread: Intermediate Mass Vocabulary Building (aka 10k-40k)
Post: Intermediate Mass Vocabulary Building (aka 10k-40k...

So, couple of things, (all of this is based on citable research in language acquisition) First off, fluency is a matter of being able to use what you know *at high speed*. You can have a small vocab...
MaxHayden The Japanese language 79 14,736 2014-06-30, 2:46 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

Well, if enough people are interested, we could probably make an "improved Core" for those who aren't using smart.fm to learn the vocab. Regrouping by frequency and rerunning the sort isn't that comp...
MaxHayden Learning resources 58 14,534 2014-06-30, 2:14 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

Okay, so I got the word-frequency.txt file that the core10k deck uses and calculated the cumulative frequency coverage of various vocabulary sizes. The problem is that the list includes proper names ...
MaxHayden Learning resources 58 14,534 2014-06-29, 6:49 pm
    Thread: A Dictionary of Basic Japanese Grammar sorted?
Post: A Dictionary of Basic Japanese Grammar sorted?

There are grammatical frequency lists for English. I wonder if there is such a list for Japanese. If there is, it would help you prioritize the entries in each volume.
MaxHayden Learning resources 5 1,017 2014-06-28, 9:51 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

Well I guess we read the OP differently. I understood him as saying "If I learned the vocabulary in Core 10k etc." but you understood him as saying "If I just SRS the vocabulary in Core 10k etc." I....
MaxHayden Learning resources 58 14,534 2014-06-27, 4:04 pm
    Thread: For those who finished Core 2k/6k/10k
Post: For those who finished Core 2k/6k/10k

Well, those decks don't have grammar, but assuming you mean "how many words can you understand?", I think this is objectively answerable. The 10k deck is the 10k most frequent Japanese words. So if ...
MaxHayden Learning resources 58 14,534 2014-06-27, 12:53 pm
    Thread: Well, I finally tried out A n k i....
Post: Well, I finally tried out A n k i....

You have to *learn* the vocab via some other means. (That's my experience and it's what Anki is designed for b/c that's what the theory behind it says.) But once you have done the initial learning, ...
MaxHayden Learning resources 23 3,596 2014-06-27, 12:35 pm