Back

Core2000/Core6000 > is there a J<->J version?

#1
Hi Guys,

I noticed Anki has a core 2000 and core 6000, and they look good! However, does anyone know if there is a version of the deck where the Japanese meaning is a field?

I use Sanseido's J<->J dictionary and it gives a simple definition for each of the vocab terms, but I have to manually do this for each card. Anyone do something similar? Do you have a script that can fetch the definitions?
Reply
#2
I found a lot of the examples really useful, but the meaning of the baseword is what I would like to change to Japanese. I wonder if there is a way to do this, or script it. Any ideas?
Reply
#3
Probably nothing already done, however there is a Tanuki list floating around (see the google spreadsheet page). You'll have to manually insert them though.
Reply
May 16 - 30 : Pretty Big Deal: Save 31% on all Premium Subscriptions! - Sign up here
JapanesePod101
#4
So i figured out that there is a linux program called 'wget'

It fetches out a page and downloads it locally...

but when i tried to use it:


wget -r12 -A.htm http://www.sanseido.net/User/Dic/Index.a...J=checkbox


It didnt work, and only saved the ASP pages.... whereas opening the url in firefox allows me to save it as htm...


I want to write a script that searches sanseido, and savves the html output locally, then grep's it for my vocab card in anki/csv and puts the web definition in the row.
Reply
#5
animehunter123 Wrote:wget -r12 -A.htm http://www.sanseido.net/User/Dic/Index.a...J=checkbox

It didnt work, and only saved the ASP pages.... whereas opening the url in firefox allows me to save it as htm...

I want to write a script that searches sanseido, and savves the html output locally, then grep's it for my vocab card in anki/csv and puts the web definition in the row.
it works fine for me with your url if you use wget with no other arguments

here's an example of what you're trying to do (though for sanseido you'll have to modify it a bit) https://sites.google.com/site/ankinihong...-sentences
Reply
#6
Nukemarine Wrote:Probably nothing already done, however there is a Tanuki list floating around (see the google spreadsheet page). You'll have to manually insert them though.
This has come up before, but might be worth mentioning again. That Tanuki list was originally part of a kanji learning program. The kanji compound words were written in half hiragana/half kanji as prompts and the student was meant to determine the correct kanji to replace the hiragana. Words in the definitions and sentences also contain a mix of hiragana and kanji.

My feeling is that if the list is going to be shared, either the kanji should be cleaned up or there should be a comment/warning on the Google spreadsheet (and perhaps in any reference to it.)

You may have already done some cleaning up, Nukemarine (I didn't look at it closely), but words are still 交ぜ書き. I suppose it would need to be sorted again after adding the missing kanji.

I think it's a project worth doing. It has ~7000 words with very short definitions and short example sentences, all in Japanese. They're intended more as hints, so they're perfect for SRS. It also can be converted into a kanji gap-filling deck because the sentences and definitions currently use hiragana in place of the target kanji.

I starting working on it to share (thinking it'd be quick b/c I'd done KIC), but soon grew tired of it. Anyone planning to use the list to learn new vocab, should at least be aware that it will involve dictionary checking and revision. [edit: I don't think I can easily incorporate the part I've done into the sheet, as I don't see a way to align them]
Edited: 2010-12-15, 6:47 am
Reply
#7
Thora,

Though I didn't clean it up, I have Cangy sorted Tanuki file with additional columns that say if it's a Core dupe. In addition, one can add a column that tells the person if there's another word with the exact same kana (likely a dupe of the variety you mention). This is easily done by sorting by kana, then doing a formula comparing the word directly above or below the kana (just copy and paste the results without the formula). The user can decide if it's worth keeping.

Tanuki is definitely not a beginner's word's list. However, it's great as a resource for later learning where the user likely knows enough to avoid the pitfalls you're warning against.
Edited: 2010-12-15, 7:01 am
Reply
#8
Tanuki is awesome, thanks for sharing it. Its very difficult but I love challenges. I wonder if there is a Tanuki book too because I want to buy it badly!


Also, thanks for that shellscript... it is amazing!!!! I wonder how to get it to work with Sanseido though... I like sanseido simple definitions for srsing. Sanseido seems... to be non html but... something like a jsp page so I wasnt able just to do... Any programmers have any advice?

Thanks guys!

wget -O - "http://www.csse.monash.edu.au/~jwb/cgi-bin/wwwjdic.cgi?9MIH$sentence" | \
grep '<li> ' | \
sed 's|.*<li> ||' | \
sed 's|<\/li>$|<br>|' | \
tr -d '\n' | \
tr '\t' ' ' | \
recode EUCJP..u8
echo
Reply