Fabrice, thanks so much for your thoughtful and encouraging reply. I will try to find some other venue for this sinister plan
ファブリス Wrote:There is no point in doing that on Koohii since by my estimates less than 10% of the active users appreciate the simplicity and integration of the SRS on Reviewing the Kanji.
Out of curiosity, how many users is that, the 10% that use the SRS? I ask because
that is the number that a study like this would need to be large-ish, and also because I'm curious about just how many users Koohii sees (sorry, very nosey).
ファブリス Wrote:Anki is very well established and people are more comfortable using one system rather than review Kanji with Koohii SRS and vocab with something else. Thus it would make more sense if somehow you can develop a new algorithm as a plugin for Anki, or maybe fork its source code. From there on you should also find quite a lot of participatns through the Anki forums since it has a large userbase.
Great point: while Anki lets you practice all aspects of Japanese (readings, combinations, sentences) in one setting, Koohii restricts itself to just the kanji. I found it difficult to read and understand libanki (Anki's internal backend) source code, but I am looking at documentation for Anki add-ons and it might just be convenient enough to overlook the GPL license restriction. It would make sense to write each algorithm as an Anki add-on so people could experiment with them without having to sign up with an experiment.
ファブリス Wrote:Plus, I would bet a lot of Japanese learners using Anki don't even know about this site here, or purposely chose not to use Remembering the Kanji. The latter probably would be better to balance your "sample" user base, since imho using a SRS with RTK is not the same as using SRS to replace typical rote memorization. It's always been the goal on this site that the SRS is to test yourself and reinforce memorization, not to replace the act of learning the characters.
The key point you make is that there are two orthogonal components here, (1) spaced repetition and (2) the Heisig approach of visual/story-based "mnemonics". In fact, there might be another axis, (3) Heisig's ordering of the jouyou+ kanji list in a "sensible" manner. SRS can be used without Heisig's stories or ordering, and Heisig doesn't make any mention of SRS in the RTK book or iOS app. (And it is important for me to note, per @Kuzunoha13, that Heisig didn't invent either the story method or the idea of reordering the kanji, and that RTK doesn't have a monopoly on these techniques. Please forgive me for continuing to lazily ascribe these to Heisig/RTK in this post.)
Personally, I would like to first restrict the research project to studying an ordered list of kanji (Heisig's RTK, KanjiDamage, etc.) in a visual/story-friendly way (where other people's stories are offered but not required). People using SRS for rote memorization will probably not be using a sequence like RTK/KanjiDamage, so will self-select themselves out of the study. Once I have a baseline for the variability of spacing algorithms for a specific kanji list, the scope could be expanded to different kanji lists, or to on/kun readings, or to vocab lists, etc. So to start out, Koohii's specific combination of SRS+RTK is what I was unconsciously looking for when I first posted, and your comments helped me recognize these goals and make them more concrete.
You also reminded me that a lot of people are using SRS to complement rote memorization without either of Heisig's techniques, i.e., traditionally-ordered kanji lists and no conscious notion of stories about primitives. Studying the impact of various spacing techniques for rote memorization is indeed the bread-and-butter of the psychology literature, which in one way or other informs much of the conventional thinking in SRS circles, i.e., retrieval/storage strength theory, although I should point out that there's a good deal of controversy in the field and many experimental results fly in the face of SuperMemo/Anki conventional wisdom (Jeffrey Karpicke has done experiments where equal-spaced repetitions outperform increasingly-spaced repetition for long-term retention while seeming to sacrifice short-term performance: see
"Expanding Retrieval Practice Promotes Short-Term Retention, but Equally Spaced Retrieval Enhances Long-Term Retention" (2007)).
But this research-space (SRS for rote memorization) is not where I want this study to live, at least not initially. I think it
is worth studying the Anki users memorizing by rote because such a group of autodidacts would be very different from the subjects typically volunteering for psychology experiments informing the literature (college students from WEIRD countries, WEIRD=Western, educated, industrial, rich, and democratic), and their practice/test styles would be very different from the experimental methodologies of (1) memorizing random vocabulary or face/names or nonsense syllables, (2) get distracted for a few minutes to a few days, (3) get tested on a subset of what was "learned". (Truly insane.)
My restriction of the initial scope of this research to study spacing algorithms for RTK-like memorization is partly personal---I am learning with RTK---and partly logistical, in that there's only a single RTK (or KanjiDamage) list, popular and well-established, whereas there are many ways to organize and describe rote memorization lists (school-ordering, different primitive-based orderings, with and without readings and compound words, etc.).
(And just how I don't want this research to live in the academic psychology niche, I don't want to restrict it to evaluating just the SuperMemo/Anki workflows. These SRS tools are highly informed by the experiences of a very few number of extraordinary learners/coders, viz., Piotr Wozniak and Damien Elmes, in learning a wide variety of things. There is a
lot of structure in a specific list like RTK that these general-purpose SRS tools cannot exploit, things like the kanji-primitive dependency graph, the frequency of kanji that a specific user encounters "in the wild" outside his/her SRS reps, periodicities in the performance on specific kanji, etc. There are also arbitrary conventions like Anki's assumption that 1 day is fine-enough resolution to space new/missed items---and while this can be tweaked with add-ons, how much does that influence the underlying algorithm? What I want to say is that I am trying to find a middle space between the WEIRD methodologies favored in academia and the general-purpose SRS tools we all know and love for the narrow task of mastering kanji.)
Edited: 2014-07-15, 8:15 am