Hello guys!
I wanted to announce my contribution to Anki users out there using AJATT's sentence method for studying Japanese. There is a "Tanuki" / "Tanooki" deck (not Corpus) that was released for students but contained half kanji words. I have worked with my sensei for 3 months now and we have completed kanjifying the deck. Please have a look and feel free to use it as a J<->J Sentence Deck! It should be very helpful to sentence learners (and users who only want to study in Japanese, as opposed to Core2000/6000 which gives an English meaning).
Below are the details of the deck. I hope it helps everyone!!! Best of luck in your studies and endeavors.
-------------------------------------------------
Title: Tanuki-Ultima
Size: 5722.49KB
Tanuki-Ultima
----------
6203 Japanese Sentences with Japanese Meanings + Vocab Words + Black background for Ipod/Droid!
Greetings everyone! This is the famous "Tanuki" or "Tanooki" deck that was generously donated by the japanese language community. The original deck required to be kanjified and I have worked with my sensei for 3-4 months to put the correct kanji into the deck. We are going to eat at Cocoichi Curry tonight to celebrate this achievement! We decided to contribute this to the Anki community in hopes that AJATT users of anki whom study Japanese to Japanese decks will appreciate this deck (i.e. study japanese and memorize the meaning in Japanese). We also love FF6, so use this name as the title of the deck! (See our other anki deck "RTK-Ultima" which is also really nice!"). We chose the name "Ultima" for FF6 Reference
Here are the "Ultima" fields of this deck:
* ID# 2332 (A number we created to identify the difficulty of the kanji)
* Kanji: 羊
* Onyomi: よう
* Kunyomi: ひつじ
* English: sheep
* Yomi: 音
** Vocabulary - Kanji: 羊毛
** Vocabulary – Kana: ようもう
** Japanese Definition – Kanji: 羊の毛。
** Japanese Definition – Kana: ひつじのけ。
** Example Sentence – Kanji: 羊毛で服を作る。
** Example Sentence – Kana: もうでふくをつくる。
Please try out this deck! We think it is awesome for AJATT/Anti-moon users! Its also with a black background and looks nice on a monitor or ipod!
Cheers!
Last edited by finalfantasy6forever (2011 March 07, 1:45 am)
Asriel
Member
From: 東京
Registered: 2008-02-26
Posts: 1343
finalfantasy6forever wrote:
There is a "Tanuki" / "Tanooki" deck (not Corpus) that was released for students but contained half kanji words.
I might be wrong, but isn't the Tanuki deck just a deckified version of the Tanuki corpus?
** Vocabulary - Kanji: 羊毛
** Vocabulary – Kana: ようもう
** Example Sentence – Kanji: 羊毛で服を作る。
** Example Sentence – Kana: もうでふくをつくる。
I hope this was just a forum typo, and that it's not in the deck too much D:
Anyway, major props to you guys. Definitely something I'm looking forward to get into! Seriously, this must have taken a lot of time, and it's pretty super awesome of you to contribute it to everyone.
@Asriel - Incomplete in what way? I was under the impression it was complete. Then again, I was also under the impression that the original did not need kanjification as it deliberately left out kanji in a progressive way?
And wow you're good at picking stuff up, just like with that hypothetical OCR'd text, but I don't think that example kana sentence is representative of a huge flaw so you perhaps don't need to worry. ;p It looks like that original example kana sentence in this deck has the よう in ようもう but it's 羊もう instead—thankfully there's multiple kana fields, so. I'm thinking it must be a product of the kanjification? The occasional overzealous bits. So maybe you'd come across extra/unkanafied kanji on occasion but not omissions of kana?
Just optimized the deck via Anki's tools. That made the browser much faster. ^_^
Also, maybe the OP was just referring to deck vs. corpus in the sense of an added differentiation from the Tanaka corpus, but I don't know, I've never seen the Tanuki deck before this one, just the Tanuki corpus. Are they different? Is the older Tanuki deck better/worse than the Tanuki corpus?
Last edited by nest0r (2011 March 07, 2:43 pm)
Asriel
Member
From: 東京
Registered: 2008-02-26
Posts: 1343
nest0r wrote:
@Asriel - Incomplete in what way? I was under the impression it was complete. Then again, I was also under the impression that the original did not need kanjification as it deliberately left out kanji in a progressive way?
Oh is that so? I say 'incomplete' simply because the kanji were left out. I didn't know it was deliberate. That's kinda cool, as a system...
And wow you're good at picking stuff up, just like with that hypothetical OCR'd text, but I don't think that example kana sentence is representative of a huge flaw so you perhaps don't need to worry. ;p It looks like that original example kana sentence in this deck has the よう in ようもう but it's 羊もう instead—thankfully there's multiple kana fields, so. I'm thinking it must be a product of the kanjification? The occasional overzealous bits. So maybe you'd come across extra/unkanafied kanji on occasion but not omissions of kana?
I was just thinking that if it was an automated process, it could be something that happened quite often, which might have been a problem. If not, then no worries -- just noticed it and thought I'd bring it up.
Also, maybe the OP was just referring to deck vs. corpus in the sense of an added differentiation from the Tanaka corpus, but I don't know, I've never seen the Tanuki deck before this one, just the Tanuki corpus. Are they different? Is the older Tanuki deck better/worse than the Tanuki corpus?
I actually don't know. All I have is a spreadsheet of the Tanuki Corpus from a long time ago. I'd love to take a look at it, but alas, I can't get to Anki on the computer I'm on. But with this new deck, I don't think the older things will be necessary anymore 
Thora
Member
From: Canada
Registered: 2007-02-23
Posts: 1691
FF6Forever, your contribution will be very useful for some people. Thanks for completing such a time consuming project!
(I'm curious, did you tutor have any comments about any of the idioms or the less common vocab?)
jettyke wrote:
and does this deck have that thing that 1 new word/expression per card?
Nukemarine shared a google spreadsheet which is apparently sorted per 2001KO and has the Core6000(?) vocab tagged. I don't know whether it was sorted on words or sentences. In any event, that version wasn't completed kanjified which might affect the sort. You could start with that order. Or perhaps Nukemarine could work his magic again with the revised data.
By the way how many kanji does this cover?
I don't know about kanji, but I can give you some idea how the Tanuki vocab compares to other vocab lists, if that's of interest to anyone. (Note that this is a comparison of words in the vocab field only. Tanuki has a lot of additional vocab in the sentences and definitions, including some idiomatic use and rare words.)
Tanuki: 7093 originally (in whatever version of Tanuki I had)
964 dups
The duplicates include:
- same word & sense (presumably used to test each kanji separately)
- same word, different sense (worth retaining)
~12 homographs: same kanji, different word (not included as dups and not included below).
Tanuki: 6117 unique vocab
Tanuki vocab in:
Core6K: 1990 (~6000 total)
Core7-10K: 870 (~3700)
JLPT: 2530 (~8000)
Kanji in Context: 2700 (~7300 all kanji)
Leeds Freq: 2870 (~14000)
The KIC number was surprising as I had assumed this was data based on KIC material (which has ~7300 kanji words.) By the way, these aren't precise numbers; I didn't check every minor variation (好きな v. 好き for eg). Tanuki includes some expressions as well as different verb conjugations as separate items.
Some general observations:
-The kanji words common to all these lists (using a combined core10K) is somewhere around 2500 words. KIC and Tanuki are aimed at teaching readings for many kanji so their vocab is 100% kanji words, whereas the other lists include kana words. There are about 4000 words common to the lists that include kana.
-KIC covers all Jouyou kanji (not certain about Tanuki).
-Frequency lists cover fewer kanji (in the same number of kanji words after a certain number), but can be biased in other ways. Somewhat surprisingly, that 20K Wiki Freq list doesn't include many words common to all other lists. (A parsing issue?)
-Tanuki includes some uncommon stuff (sentence words with non-Jouyou kanji and a few sentence words that don't even appear in J-E dictionaries.)
I love that it's called Tanuki. :-)