Search Results

Post Author Forum Replies Views Posted
    Thread: How many RTKv1 kanji are used as primitives in subsequent kanji?
Post: How many RTKv1 kanji are used as primitives in sub...

I got 480 if the components have to appear before the kanji they are part of and 486 if not: $ curl -s jptxt.net/kanji.txt|awk -F\; '/^[^#]/&&$12&&$12<=2200'|sort -t\; ...
lauri_ranta Remembering the Kanji 13 5,190 2014-12-21, 8:48 pm
    Thread: www.ravenbrook.com/tool/kanji-learning-tools
Post: www.ravenbrook.com/tool/kanji-learning-tools

http://jptxt.net/kanji.txt has RTK frame numbers, frequency ranks, and RTK keywords for the components of kanji: $ curl -s jptxt.net/kanji.txt|awk 'NR==3||/^八/' #1 kanji;2 kanjidic on;3 kanjidic ...
lauri_ranta Learning resources 9 3,798 2014-11-18, 4:56 am
    Thread: tool to automatically generate pics from vocab list?
Post: tool to automatically generate pics from vocab lis...

Here's a retarded shell script that uses the Google Images API: cat ~/wordlist.txt|while read l;do curl "$(curl -sG --data-urlencode "q=$l" 'https://ajax.googleapis.com/ajax/services/search/i...
lauri_ranta Learning resources 13 3,251 2014-10-12, 8:10 am
    Thread: looking for an ordered onyomi index
Post: looking for an ordered onyomi index

I wrote a ruby script that uses data from kanjidic and http://cjkdecomp.codeplex.com: require "nokogiri" yomi = Hash.new { |x, y| x[y] = [] } decomp = {} IO.readlines("cjk-decomp-0.4.0.txt").each {...
lauri_ranta Remembering the Kanji 6 1,986 2014-10-12, 7:36 am
    Thread: Asia Torrents Invites -- Active Users Only
Post: Asia Torrents Invites -- Active Users Only

MelonBerry Wrote: Are you going to be active and adhere to the rules? I only say this because I gave out 2 invites in the past on here, and those users used the website for a few things, and haven't to...
lauri_ranta Off topic 199 73,062 2014-09-27, 7:31 pm
    Thread: Asia Torrents Invites -- Active Users Only
Post: Asia Torrents Invites -- Active Users Only

I'm not that active here anymore, but does someone have an invite to jpopsuki?
lauri_ranta Off topic 199 73,062 2014-09-27, 6:48 pm
    Thread: Where to find Japanese subtitles ?
Post: Where to find Japanese subtitles ?

Dramas: http://jpsubbers.x10.mx/Japanese-Subtitles/ http://www.d-addicts.com/forum/subtitles.php#Japanese
lauri_ranta Learning resources 10 3,935 2014-07-12, 12:16 am
    Thread: Spreadsheet with componentes for RTK1
Post: Spreadsheet with componentes for RTK1

I also included a column like that in http://jptxt.net/kanji.txt: $ curl -s jptxt.net/kanji.txt|awk -F\; '$12&&$12<=2200{print $1";"$13";"$17}'|gshuf -n10 柱;pillar;...
lauri_ranta Remembering the Kanji 1 634 2014-07-12, 12:12 am
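For readers without awk, the filter in the snippet above can be approximated in Python. This is only a minimal sketch. It assumes, as the awk field numbers suggest, that kanji.txt is semicolon-delimited, that comment lines start with #, that field 12 is an RTK frame number, and that fields 1, 13, and 17 hold the kanji, keyword, and components. The helper names and the sample rows are made up for illustration and are not taken from the real file.

```python
import random


def rtk1_rows(lines, max_frame=2200):
    """Yield (kanji, keyword, components) for rows whose 12th field is 1..max_frame."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment/header lines
        fields = line.split(";")
        if len(fields) < 17:
            continue  # row too short to hold field 17
        try:
            frame = int(fields[11])  # awk's $12 (1-based) is index 11 here
        except ValueError:
            continue  # empty or non-numeric frame number
        if 0 < frame <= max_frame:
            yield fields[0], fields[12], fields[16]


def make_row(kanji, frame, keyword, components):
    """Build a hypothetical 17-field row; unused fields are left empty."""
    fields = [""] * 17
    fields[0], fields[11], fields[12], fields[16] = kanji, frame, keyword, components
    return ";".join(fields)


# Made-up sample data, NOT the real contents of kanji.txt.
sample = [
    "#1 kanji;...",
    make_row("A", "10", "keyword-a", "parts-a"),
    make_row("B", "", "keyword-b", "parts-b"),      # no frame number: dropped
    make_row("C", "2500", "keyword-c", "parts-c"),  # beyond frame 2200: dropped
]

rows = list(rtk1_rows(sample))
# Rough equivalent of `gshuf -n10`: pick up to 10 rows at random.
for kanji, keyword, components in random.sample(rows, min(10, len(rows))):
    print(f"{kanji};{keyword};{components}")
```

To run it against the real file, pass an open file object (or the downloaded lines) to rtk1_rows instead of the sample list.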
    Thread: Source of coreXk?
Post: Source of coreXk?

Thora Wrote: I recall from past threads that the Core 6000 sentences were licensed by iknow from Jack Halpern's CJK Dictionary Institute - http://www.cjk.org/cjk/index.htm As lauri_ranta explained, ...
lauri_ranta Learning resources 19 4,433 2014-05-27, 1:14 pm
    Thread: Source of coreXk?
Post: Source of coreXk?

codex Wrote: The previous posts in this thread seem to imply that some part (10,000 sentence pairs) of the Corpus was incorporated into the iKnow.com websites, and the Core series was eventually extrac...
lauri_ranta Learning resources 19 4,433 2014-05-26, 10:16 am
    Thread: Are there any good Japanese vocabulary word frequency lists out there?
Post: Are there any good Japanese vocabulary word freque...

CB's novel frequency lists from http://forum.koohii.com/showthread.php?p...#pid168616. They are based on a corpus of over 5000 novels, which includes new novels that are not available in the public do...
lauri_ranta The Japanese language 5 2,433 2014-05-25, 7:36 am
    Thread: Source of coreXk?
Post: Source of coreXk?

An older version of the Core 6000 data from the iKnow/Smart.fm API: https://sites.google.com/site/ankinihongo/home/kore Newer versions of the Core 6000 data based on JSON files downloaded from iknow...
lauri_ranta Learning resources 19 4,433 2014-05-21, 8:08 am
    Thread: Recommendations for TV Shows/Anime
Post: Recommendations for TV Shows/Anime

 
lauri_ranta General discussion 41 10,684 2014-05-12, 11:19 pm
    Thread: Common japanese idioms resources needed
Post: Common japanese idioms resources needed

I made a printable HTML file based on Katsuo's 2100 idioms spreadsheet: http://jptxt.net/printable-idioms.html. I hope it's ok that I reused the data. I added furigana, removed idioms with non-RTK1 ka...
lauri_ranta Learning resources 5 1,166 2014-02-26, 10:10 am
    Thread: Coverage of Core 2000 in strictly Core 6000 sentences
Post: Coverage of Core 2000 in strictly Core 6000 senten...

If I didn't make any errors, there are 2405 unique tokens recognized by MeCab in the first 2000 sentences, 5931 in the last 4000 sentences, and 6701 in all sentences. So there are 770 tokens that appe...
lauri_ranta Learning resources 8 2,060 2014-02-23, 7:23 pm
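The counts quoted in the snippet above are related by simple set arithmetic. As a hedged sketch, assuming the three figures are the sizes of the token sets for the first 2000 sentences, the last 4000 sentences, and their union, inclusion-exclusion gives the overlap, and the 770 figure matches the number of tokens found only in the first 2000 sentences. The variable names are illustrative.

```python
first_2000 = 2405   # unique tokens in the first 2000 sentences
last_4000 = 5931    # unique tokens in the last 4000 sentences
all_6000 = 6701     # unique tokens in all sentences (assumed to be the union)

# Inclusion-exclusion: |A ∩ B| = |A| + |B| - |A ∪ B|
shared = first_2000 + last_4000 - all_6000
# Tokens seen only in one part: |A ∪ B| minus the size of the other part
only_first = all_6000 - last_4000
only_last = all_6000 - first_2000

print(shared, only_first, only_last)  # 1635 770 4296
```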
    Thread: I don't get it... (Heisig's primitives)
Post: I don't get it... (Heisig's primitives)

http://forum.koohii.com/showthread.php?p...0#pid11360 has a list of RTK primitive names for the 214 traditional radicals, but it's missing some primitives that are not radicals. 135 out of the 214 tr...
lauri_ranta General discussion 9 2,680 2014-02-02, 11:58 am
    Thread: Amount of vocabulary
Post: Amount of vocabulary

WataruFord Wrote: The first 100 words on the list make up 57.2% of the text that was processed. The first 500? 70.3%. The first 1000? 76.2%. The first 3000? 85.4%. The first 10,000? 94.1%. Numbers lik...
lauri_ranta The Japanese language 85 18,799 2014-01-25, 3:27 am
    Thread: RTK (3) for Mortals
Post: RTK (3) for Mortals

I made a list of dictionary definitions for RTK keywords two years ago. It looks like this: steed 駿 a horse being ridden or available for riding assiduous 孜 showing great care and perseverance roost ...
lauri_ranta Remembering the Kanji 15 5,735 2014-01-15, 3:57 am
    Thread: Kanji & Primitives Spreadsheet?
Post: Kanji & Primitives Spreadsheet?

http://jptxt.net/kanji.txt has a column for the components of kanji with RTK keywords: $ grep -v '^#' Sites/jp/kanji.txt|awk -F\; '$12!=""&&$12<=2200{print $1";"$13";"...
lauri_ranta Remembering the Kanji 12 3,703 2014-01-13, 4:18 pm
    Thread: Batch translating list of words?
Post: Batch translating list of words?

jmignot Wrote: "I tried but it returned an error in OSX" The command worked for me when I copied and pasted it from the post. If your shell is csh, try using bash instead. csh can be the default shell if...
lauri_ranta Learning resources 10 1,829 2013-12-29, 6:12 pm
    Thread: Unique kanji in Core2k
Post: Unique kanji in Core2k

vebaev Wrote: "lauri_ranta Wrote: '- The first 2000 vocabulary items contain 975 kanji' Do you know what % of these are from Kyoiku kanji?" 74% (726 out of 975) of the kanji are 2010 kyōiku kanji. I used th...
lauri_ranta Learning resources 9 3,299 2013-12-29, 5:39 pm
    Thread: Batch translating list of words?
Post: Batch translating list of words?

If you use OS X or Linux, you can add one word per line in ~/Desktop/words.txt and run a command like this in a terminal: curl ringtail.its.monash.edu.au/pub/nihongo/edict.gz|gzip -d|iconv -f euc-jp ...
lauri_ranta Learning resources 10 1,829 2013-12-28, 8:10 pm
    Thread: Add furigana above kanji (anime subtitles)
Post: Add furigana above kanji (anime subtitles)

Yeah, replace for example Kana2rom.kana2rom(hira) with "("+hira+")".
lauri_ranta Learning resources 6 2,040 2013-12-23, 3:47 pm
    Thread: Add furigana above kanji (anime subtitles)
Post: Add furigana above kanji (anime subtitles)

I wrote a Ruby script that adds romaji readings after kanji: [Image: http://19a5b0.s3-website-us-west-2.amazo...213556.png] I'll probably never use it myself though. The readings are not that useful...
lauri_ranta Learning resources 6 2,040 2013-12-23, 3:09 pm
    Thread: Kanji Frequency in Wikipedia
Post: Kanji Frequency in Wikipedia

@Jiroukun If you use OS X or Linux, you can use a shell command like this: for f in *.txt;do mecab -F'%t %f[6]\n' "$f"|awk '$1~/[267]/&&$2{print $2}';done|sort|uniq -c|sort %...
lauri_ranta General discussion 78 75,001 2013-12-05, 3:45 am
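The counting half of that pipeline (awk filter, then sort and uniq -c) can be sketched in Python. This is an illustration under assumptions, not the poster's method. It assumes each input line has the form "<char-type> <base-form>", as the -F'%t %f[6]\n' format string would produce. It keeps lines whose first field contains 2, 6, or 7 and that have a second field, and sorts by descending count because the end of the original command is truncated. The function name and sample lines are hypothetical.

```python
import re
from collections import Counter


def count_base_forms(lines):
    """Count second fields on lines whose first field matches [267]."""
    counts = Counter()
    for line in lines:
        parts = line.split()
        # Mirrors awk's `$1~/[267]/&&$2`; awk would also drop a literal "0"
        # in the second field, which this sketch does not special-case.
        if len(parts) >= 2 and re.search(r"[267]", parts[0]):
            counts[parts[1]] += 1
    return counts


# Hypothetical lines in the assumed "<char-type> <base-form>" shape.
sample_output = ["2 食べる", "6 の", "3 。", "2 食べる", "EOS"]

counts = count_base_forms(sample_output)
for word, n in counts.most_common():
    print(n, word)
```

In practice the lines would come from running mecab on each text file, for example by piping its output into the script on standard input.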
    Thread: Core 6K vs Core 10k
Post: Core 6K vs Core 10k

ktcgx Wrote: I believe pmnox's 10k deck has 10k unique words, plus jlpt vocab that wasn't in the original 6k data, nor the extra 4k data. You should take a look at it. It's not true that there are onl...
lauri_ranta Learning resources 13 6,430 2013-12-01, 7:07 pm
    Thread: Japanese subs?
Post: Japanese subs?

If someone gets here searching for Japanese subtitles for dramas, see http://jpsubbers.x10.mx/Japanese-Subtitles/ or http://www.d-addicts.com/forum/subtitles.php#Japanese. You can download torrents f...
lauri_ranta Learning resources 32 13,915 2013-11-25, 11:33 pm
    Thread: Unique kanji in Core2k
Post: Unique kanji in Core2k

There are multiple versions of the Core 6000 data, but in the JSON files available from iknow.jp: - The first 2000 sentences contain 1033 kanji - All sentences contain 1638 kanji - The first 2000 voc...
lauri_ranta Learning resources 9 3,299 2013-11-10, 2:39 pm
    Thread: Audio files for reviewing vocabulary
Post: Audio files for reviewing vocabulary

I made audio files for reviewing vocabulary from JMdict/EDICT and Core 6000: http://jptxt.net/advanced-vocabulary.m4a (378 MB, 18 hours) http://jptxt.net/core-vocabulary.m4a (40 MB, 2 hours) Both fi...
lauri_ranta Learning resources 4 1,856 2013-11-09, 9:01 am
    Thread: Core 6K vs Core 10k
Post: Core 6K vs Core 10k

The "Core 10k" data was extracted from the Japanese Sensei Deluxe iPhone app by overture2112 in 2011, and it was originally posted in the japanese 先生 = core 10000? thread. The Core 10k name also comes...
lauri_ranta Learning resources 13 6,430 2013-11-03, 1:36 pm