Help Request - Onyomi words for Kanken Spreadsheet

#1
**Edit: Project is done. Thanks for all the help. I've updated the RTK1 and 3 on Anki shared decks to include the words. I updated the Kanken spreadsheet with common onyomi words. I also put the full kanjidic list (all 6000+ kanji) up there. Also, here's a Google spreadsheet of Mext Common Yomi words, which is what I used to make the above and seems fairly useful. It's not too hard to convert that into a full-on vocabulary list.**

I recently posted a thread about creating an Anki RTK deck that has lots of field information allowing a user to modify how they review as time goes on.

Here's the Kanken + Heisig spreadsheet with the detailed fields I used in an Anki deck I uploaded. You'll notice I added a column for Joyo Kunyomi words and one for Kana Kunyomi words. With info like that in an Anki deck, it's easy to use those as a pseudo "Japanese Keyword".

What I would like to do is add two more columns: Joyo Onyomi words and Onyomi Kana words. These should contain up to three words max per common onyomi. Now, PREFERABLY, the words picked should themselves be as common as possible. If anyone has a ready-made reference list and can do some spreadsheet/algorithm magic on it to quickly create usable data, that'd be awesome.
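For anyone wanting to script this rather than do it in a spreadsheet, here's a minimal Python sketch of the "up to three most common words per onyomi" selection. It assumes you already have candidate rows of (kanji, onyomi, word) and a word-to-count frequency table; the function name, the data layout, and the counts in the usage note are all made up for illustration and don't come from any of the linked sheets.

```python
from collections import defaultdict


def top_words_per_reading(candidates, frequency, limit=3):
    """Pick up to `limit` words per (kanji, onyomi) pair, most frequent first.

    candidates: iterable of (kanji, onyomi, word) tuples (assumed layout).
    frequency:  dict mapping word -> count (assumed layout); words missing
                from the table are treated as count 0.
    """
    grouped = defaultdict(list)
    for kanji, onyomi, word in candidates:
        grouped[(kanji, onyomi)].append(word)

    result = {}
    for key, words in grouped.items():
        # Drop duplicate words while keeping their first-seen order.
        unique = list(dict.fromkeys(words))
        # Stable sort: ties keep their original order.
        unique.sort(key=lambda w: frequency.get(w, 0), reverse=True)
        result[key] = unique[:limit]
    return result
```

Called with candidates for 生 (セイ: 学生, 先生, 生活, 人生) and invented counts of 50, 90, 70 and 10, it keeps 先生, 生活, 学生 and drops 人生. The quality of the result depends entirely on the frequency table you feed it.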

Here's an onyomi spreadsheet I made up to help. It's based on a sheet Khatsuo posted, so it contains only kanji with "common onyomi". There are list markers to help with sorting, and kanji with two onyomi are noted.

**Edit: Here's a spreadsheet of Kanji words derived from the Anki plug-in. It can probably be used with a scan program to get the most common onyomi. This spreadsheet of word frequencies can be a good reference for picking the most common onyomi words.**
Edited: 2010-06-07, 4:17 pm
#2
Sorry about not setting the sheets to public access. That's fixed now, in case anyone wants to look.
#3
There's a big list of examples here:

http://www.mext.go.jp/b_menu/hakusho/nc/...01001.html

which may be useful if the data can be extracted/organised somehow...
#4
Caivano,

Thanks for the link. I looked at it last week, but didn't think about applying it seriously. However, I did get it into a useful format. I like the list as it's three example words per onyomi, which is exactly what I was thinking of using for RTK flashcards.

I also like that it lists a kanji's older variant. There were 100 of them in an image format that didn't copy over, but I guess I can manually put them in if I'm bored.

So what I'll do is update the Kanken spreadsheet on GoogleDocs to have an "Onyomi words" block and a matching kana block, plus a block for the "older version" of the kanji. After that, I'll put together an updated RTK Anki file with Kunyomi and Onyomi sample words.
#5
Ok, I updated the Kanken spreadsheet with common onyomi words. I also put the full list (all 6000+ kanji) up there. Now, it may not be 100% perfect, as I just did spreadsheet magic with all the problems that can bring. In addition, the Onyomi Kana words were done using the romaji.org converter, so there may be mistakes in some of those too. As always, user beware.

I also used Cangy's plug-in to add the on and kun sample words into my RTK deck (not as bad as I thought it would be).

So thanks to Khatsu, Caivano (for the Mext link) and Cangy.
Edited: 2010-05-31, 10:41 pm
#6
This looks like an incredible deck, thanks for all the work you put into it. However, is there a way for me to sort these kanji by Heisig order and overwrite my current Heisig deck while retaining my progress?
#7
Twinzen, that's what I was talking about with Cangy's plug-in.

http://ichi2.net/anki/wiki/ContribFugoun...ite-fields

The 'usage example' link was pretty detailed. However, remember that you must create a .txt file (made by doing a copy/paste from a spreadsheet). That .txt file must include the tab separators even for the 'ignored' lines.
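If you'd rather generate the .txt file with a script than copy/paste from a spreadsheet, a minimal Python sketch follows. It only guarantees that every line has the same number of tab separators, with empty cells left blank. Reading 'ignored' as "fields you leave empty" is my interpretation, and the function names and column count are hypothetical; check the plug-in's usage example for the exact layout it expects.

```python
def format_rows(rows, n_columns):
    """Return one tab-separated line per row, padded to n_columns cells.

    Short rows are padded with empty cells so every line carries exactly
    n_columns - 1 tab characters.
    """
    lines = []
    for row in rows:
        cells = [str(cell) for cell in row]
        if len(cells) > n_columns:
            raise ValueError(f"row has more than {n_columns} cells: {row!r}")
        for cell in cells:
            # A tab or newline inside a cell would shift the columns.
            if "\t" in cell or "\n" in cell:
                raise ValueError(f"cell contains a tab or newline: {cell!r}")
        cells += [""] * (n_columns - len(cells))
        lines.append("\t".join(cells))
    return lines


def write_import_file(path, rows, n_columns):
    """Write the padded rows to a UTF-8 text file, one row per line."""
    with open(path, "w", encoding="utf-8") as f:
        for line in format_rows(rows, n_columns):
            f.write(line + "\n")
```

For example, `write_import_file("import.txt", rows, 3)` writes three cells per line even when a row only supplies one or two of them.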

If you decide to do this, some other suggestions: save the Anki file under a new file name, and turn off autosave and autosync. Be prepared to mess up two or three times, or even worse. However, after doing it, you realize it's not so bad and will be willing to do it for other items of interest.

I still wish there was something a bit more intuitive for the average user to do bulk edit/replace of anki cards. However, the plug-in does work with a little effort.
#8
Nukemarine Wrote: What I would like to do is add two more columns: Joyo Onyomi words, Onyomi Kana words. These should contain up to three words max per common onyomi. Now, PREFERABLY, the words picked should themselves be common as possible. If anyone has a ready referenced list, and can do some spreadsheet/algorithm magic on it to quickly create usable data that'd be awesome.
there's http://ichi2.net/anki/wiki/ContribFugoun...anji-vocab

it's not separated by readings, but I guess an on/kun distinction at least could be made just by checking for 2 adjacent kanji
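A minimal Python sketch of that adjacent-kanji check, in case it helps. It treats any character in the main CJK Unified Ideographs block (U+4E00 to U+9FFF) as a kanji, which is an assumption that ignores rarer extension blocks and the 々 repeat mark, and the function names are made up. It's only a rough guess at on vs. kun: kun compounds like 手紙 will be flagged as on, and single-kanji on words will be missed.

```python
def is_kanji(ch):
    """True if ch falls in the main CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def has_adjacent_kanji(word):
    """True if the word contains at least two kanji side by side."""
    return any(is_kanji(a) and is_kanji(b) for a, b in zip(word, word[1:]))


def guess_reading_type(word):
    """Heuristic only: adjacent kanji suggests on, anything else kun."""
    return "on?" if has_adjacent_kanji(word) else "kun?"
```

So 学生 and 大学生 come out as "on?", while 食べる and 取り引き come out as "kun?".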

Nukemarine Wrote: I still wish there was something a bit more intuitive for the average user to do bulk edit/replace of anki cards. However, the plug-in does work with a little effort.
I've thought of rewriting it, but it'd be as a standalone command line script, so that's unlikely to help much with the user-unfriendliness

if you want a gui interface, star / comment on this ticket
Edited: 2010-06-03, 11:10 pm
#9
Cangy, thanks. It was that Fugounashi plug-in where I put the first spreadsheet of words. However, I ended up using the Mext list as it was already organized into 3 words per yomi.
#10
Project is done. Thanks for all the help. I've updated the RTK1 and 3 on Anki shared decks to include these words. I updated the Kanken spreadsheet with common onyomi words. I also put the full kanjidic list (all 6000+ kanji) up there. Also, here's a Google spreadsheet of Mext Common Yomi words, which is what I used to make the above and seems fairly useful. It's not too hard to convert that into a full-on vocabulary list.
#11
I am looking for a vocabulary list sorted by Kanken level. So given a level (let's say 8, since a friend of mine is taking 8), I would like to see all the vocabulary for that level. Something like this on renshuu.org.

As far as I can tell, the links here are useful but would require more processing to extract the vocabulary by level. This spreadsheet might be good to use if I cannot find anything (it is not Kanken-sorted, but would serve my purposes well enough).

Does something like this exist?
Edited: 2011-06-16, 1:23 am