kanji koohii FORUM
cbJisho - E-J Dictionary Based on Word Frequency - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Learning resources (http://forum.koohii.com/forum-9.html)
+--- Thread: cbJisho - E-J Dictionary Based on Word Frequency (/thread-7127.html)

Pages: 1 2 3


cbJisho - E-J Dictionary Based on Word Frequency - louischa - 2013-06-24

@しーびー:
あなたは発奮させるすごい人ですね。御苦労様。


cbJisho - E-J Dictionary Based on Word Frequency - louischa - 2013-06-24

@CB:

I just examined the file from your not-yet-released program, and I have the following suggestion:

At character Heisig X, it is far too restrictive to try to generate words only composed of characters 1 to X; the Heisig order is too different from the natural character frequency order for that. Hadamitzky and Spahn do restrict their words to 1 - X in their book "Kanji & Kana", but they can do so only because they present the kanji in a frequency-based order. Naturally, then you allow for that condition to be broken, as for instance in line 18 of your output, at which none of the kanji with 冒 are to be found before it. That is discouraging to a learner and defeats the purpose of the list.

Have instead the user specify a range 1 - R among which words including X < R could use any kanji from 1 to R, not only from 1 to X.

This would allow people to do Heisig in steps, namely alternating between learning kanji and learning real Japanese words. For instance, a hypothetical learner could push up to Heisig 1000, then take a break from kanji learning and use your tool to generate words to reinforce all characters previously studied. Obviously R would need to be large enough to allow for generating a nice word repertoire.

Then that hypothetical learner would push for 500 more, re-use your tool this time with R = 1500, etc. This learning in steps would be better, and perhaps would prevent the phenomenon of people doing Heisig once, forgetting it all, and having to re-do it again.

In the event where no viable combination occurs with elements within 1 - R, then you just output nothing.

御苦労様。


cbJisho - E-J Dictionary Based on Word Frequency - Inny Jan - 2013-06-25

... and yet some people learn in a seemingly random order – as in, let’s say you want to read this NHK News:

震災の被害を受けた宮城県の子どもに雪のプレゼント

and as a matter of course, you want to use mnemonics for:
雪震被受災害宮城県子


cbJisho - E-J Dictionary Based on Word Frequency - cb4960 - 2013-06-25

@louischa:

I'll try to make an option to accommodate this.

@Inny Jan:

I'll try to make the program more general by allowing user provided lists of kanji to use. I'll just package a few default lists such as one for RTK kanji and another with kanji sorted by frequency.


cbJisho - E-J Dictionary Based on Word Frequency - Roketzu - 2015-04-21

I'm a little surprised this thread hasn't seen more action considering how useful a tool cbJisho is, but I'm happy to resurrect it in hopes of finding an answer to a particular question. I'm not even sure whether it's possible to do with cbJisho, and if not I'd appreciate any suggestions on other resources that might help.

What I'm trying to do is just search for 五字熟語 (or any form of X字熟語 really) and have the results sorted by frequency (cbJisho does this anyway so kind of a moot point). I've tried various search combinations but the results only ever give me kana mixed with kanji rather than kanji compounds alone. Is there any way to filter out the kana-containing results and only get those made of kanji? If not with cbJisho then would there be any other resource that could do this? Sort X字熟語 by frequency I mean. I'd appreciate any help, thanks!

http://subs2srs.sourceforge.net/cbJisho/help.html

This is the page with details on how the search system works, along with other helpful information!

Update: Well I managed to use DB Browser to export all 18411 entries made up of 5 characters, but of course the majority of these are a mixture of kana and kanji. I'm sure there is some way to just get kanji compounds but it might be beyond the powers of a plebian like myself.


cbJisho - E-J Dictionary Based on Word Frequency - cb4960 - 2015-04-21

Check the "RegEx" option and copy-paste the following into the search box:

Code:
[一-鿿]{5}



cbJisho - E-J Dictionary Based on Word Frequency - Roketzu - 2015-04-21

cb4960 Wrote:Check the "RegEx" option and copy-paste the following into the search box:

Code:
[一-鿿]{5}
I really wish you had been here earlier^^ I just spent the last hour+ copy/replacing every kana/alphabet/number in a text file of 18000+ lines to whittle it down to only those containing 5 kanji, then copying that back into the file with all the frequency information and putting it back into DB Browser and making it chug like crazy in the process, then typing _____ (5 underscores) into the kanji box in order to get the 3593 actual 五字熟語 I wanted! I'm happy I was able to actually do it, but man... I wish I knew that simple little command earlier! Oh well, now I do and don't have to repeat what I just did with 3/4/6 etc. Thanks a lot!