| Post | Author | Forum | Replies | Views | Posted | ||
|
Thread: How many RTKv1 kanji are used as primitives in subsequent kanji? Post: How many RTKv1 kanji are used as primitives in sub... I got 480 if the components have to appear before the kanji they are part of and 486 if not: $ curl -s jptxt.net/kanji.txt|awk -F\; '/^[^#]/&&$12&&$12<=2200'|sort -t\; ... |
lauri_ranta | Remembering the Kanji | 13 | 5,190 | 2014-12-21, 8:48 pm | ||
|
Thread: www.ravenbrook.com/tool/kanji-learning-tools Post: www.ravenbrook.com/tool/kanji-learning-tools http://jptxt.net/kanji.txt has RTK frame numbers, frequency ranks, and RTK keywords for the components of kanji: $ curl -s jptxt.net/kanji.txt|awk 'NR==3||/^八/' #1 kanji;2 kanjidic on;3 kanjidic ... |
lauri_ranta | Learning resources | 9 | 3,798 | 2014-11-18, 4:56 am | ||
|
Thread: tool to automatically generate pics from vocab list? Post: tool to automatically generate pics from vocab lis... Here's a retarded shell script that uses the Google Images API: cat ~/wordlist.txt|while read l;do curl "$(curl -sG --data-urlencode "q=$l" 'https://ajax.googleapis.com/ajax/services/search/i... |
lauri_ranta | Learning resources | 13 | 3,251 | 2014-10-12, 8:10 am | ||
|
Thread: looking for an ordered onyomi index Post: looking for an ordered onyomi index I wrote a ruby script that uses data from kanjidic and http://cjkdecomp.codeplex.com: require "nokogiri" yomi = Hash.new { |x, y| x[y] = [] } decomp = {} IO.readlines("cjk-decomp-0.4.0.txt").each {... |
lauri_ranta | Remembering the Kanji | 6 | 1,986 | 2014-10-12, 7:36 am | ||
|
Thread: Asia Torrents Invites -- Active Users Only Post: Asia Torrents Invites -- Active Users Only MelonBerry Wrote:Are you going to be active and adhere to the rules? I only say this because I gave out 2 invites in the past on here, and those users used the website for a few things, and haven't to... |
lauri_ranta | Off topic | 199 | 73,062 | 2014-09-27, 7:31 pm | ||
|
Thread: Asia Torrents Invites -- Active Users Only Post: Asia Torrents Invites -- Active Users Only I'm not that active here anymore, but does someone have an invite to jpopsuki? |
lauri_ranta | Off topic | 199 | 73,062 | 2014-09-27, 6:48 pm | ||
|
Thread: Where to find Japanese subtitles ? Post: Where to find Japanese subtitles ? Dramas: http://jpsubbers.x10.mx/Japanese-Subtitles/ http://www.d-addicts.com/forum/subtitles.php#Japanese |
lauri_ranta | Learning resources | 10 | 3,935 | 2014-07-12, 12:16 am | ||
|
Thread: Spreadsheet with componentes for RTK1 Post: Spreadsheet with componentes for RTK1 I also included a column like that in http://jptxt.net/kanji.txt: $ curl -s jptxt.net/kanji.txt|awk -F\; '$12&&$12<=2200{print $1";"$13";"$17}'|gshuf -n10 柱;pillar;... |
lauri_ranta | Remembering the Kanji | 1 | 634 | 2014-07-12, 12:12 am | ||
|
Thread: Source of coreXk? Post: Source of coreXk? Thora Wrote:I recall from past threads that the Core 6000 sentences were licensed by iknow from Jack Halpern's CJK Dictionary Institute - http://www.cjk.org/cjk/index.htm As lauri_ranta explained, ... |
lauri_ranta | Learning resources | 19 | 4,433 | 2014-05-27, 1:14 pm | ||
|
Thread: Source of coreXk? Post: Source of coreXk? codex Wrote:The previous posts in this thread seem to imply that some part (10,000 sentence pairs) of the Corpus was incorporated into the iKnow.com websites, and the Core series was eventually extrac... |
lauri_ranta | Learning resources | 19 | 4,433 | 2014-05-26, 10:16 am | ||
|
Thread: Are there any good Japanese vocabulary word frequency lists out there? Post: Are there any good Japanese vocabulary word freque... CB's novel frequency lists from http://forum.koohii.com/showthread.php?p...#pid168616. They are based on a corpus of over 5000 novels, which includes new novels that are not available in the public do... |
lauri_ranta | The Japanese language | 5 | 2,433 | 2014-05-25, 7:36 am | ||
|
Thread: Source of coreXk? Post: Source of coreXk? An older version of the Core 6000 data from the iKnow/Smart.fm API: https://sites.google.com/site/ankinihongo/home/kore Newer versions of the Core 6000 data based on JSON files downloaded from iknow... |
lauri_ranta | Learning resources | 19 | 4,433 | 2014-05-21, 8:08 am | ||
|
Thread: Recommendations for TV Shows/Anime Post: Recommendations for TV Shows/Anime |
lauri_ranta | General discussion | 41 | 10,684 | 2014-05-12, 11:19 pm | ||
|
Thread: Common japanese idioms resources needed Post: Common japanese idioms resources needed I made a printable HTML file based on Katsuo's 2100 idioms spreadsheet: http://jptxt.net/printable-idioms.html. I hope it's ok that I reused the data. I added furigana, removed idioms with non-RTK1 ka... |
lauri_ranta | Learning resources | 5 | 1,166 | 2014-02-26, 10:10 am | ||
|
Thread: Coverage of Core 2000 in strictly Core 6000 sentences Post: Coverage of Core 2000 in strictly Core 6000 senten... If I didn't make any errors, there are 2405 unique tokens recognized by MeCab in the first 2000 sentences, 5931 in the last 4000 sentences, and 6701 in all sentences. So there are 770 tokens that appe... |
lauri_ranta | Learning resources | 8 | 2,060 | 2014-02-23, 7:23 pm | ||
|
Thread: I don't get it... (Heisig's primitives) Post: I don't get it... (Heisig's primitives) http://forum.koohii.com/showthread.php?p...0#pid11360 has a list of RTK primitive names for the 214 traditional radicals, but it's missing some primitives that are not radicals. 135 out of the 214 tr... |
lauri_ranta | General discussion | 9 | 2,680 | 2014-02-02, 11:58 am | ||
|
Thread: Amount of vocabulary Post: Amount of vocabulary WataruFord Wrote:The first 100 words on the list make up 57.2% of the text that was processed. The first 500? 70.3%. The first 1000? 76.2% The first 3000? 85.4% The first 10,000? 94.1% Numbers lik... |
lauri_ranta | The Japanese language | 85 | 18,799 | 2014-01-25, 3:27 am | ||
|
Thread: RTK (3) for Mortals Post: RTK (3) for Mortals I made a list of dictionary definitions for RTK keywords two years ago. It looks like this: steed 駿 a horse being ridden or available for riding assiduous 孜 showing great care and perseverance roost ... |
lauri_ranta | Remembering the Kanji | 15 | 5,735 | 2014-01-15, 3:57 am | ||
|
Thread: Kanji & Primitives Spreadsheet? Post: Kanji & Primitives Spreadsheet? http://jptxt.net/kanji.txt has a column for the components of kanji with RTK keywords: $ grep -v '^#' Sites/jp/kanji.txt|awk -F\; '$12!=""&&$12<=2200{print $1";"$13";"&... |
lauri_ranta | Remembering the Kanji | 12 | 3,703 | 2014-01-13, 4:18 pm | ||
|
Thread: Batch translating list of words? Post: Batch translating list of words? jmignot Wrote: "I tried but it returned an error in OSX" The command worked for me when I copied and pasted it from the post. If your shell is csh, try using bash instead. csh can be the default shell if... |
lauri_ranta | Learning resources | 10 | 1,829 | 2013-12-29, 6:12 pm | ||
|
Thread: Unique kanji in Core2k Post: Unique kanji in Core2k vebaev Wrote: "lauri_ranta Wrote: '- The first 2000 vocabulary items contain 975 kanji' do you know what % of these are from Kyoiku kanji" 74% (726 out of 975) of the kanji are 2010 kyōiku kanji. I used th... |
lauri_ranta | Learning resources | 9 | 3,299 | 2013-12-29, 5:39 pm | ||
|
Thread: Batch translating list of words? Post: Batch translating list of words? If you use OS X or Linux, you can add one word per line in ~/Desktop/words.txt and run a command like this in a terminal: curl ringtail.its.monash.edu.au/pub/nihongo/edict.gz|gzip -d|iconv -f euc-jp ... |
lauri_ranta | Learning resources | 10 | 1,829 | 2013-12-28, 8:10 pm | ||
|
Thread: Add furigana above kanji (anime subtitles) Post: Add furigana above kanji (anime subtitles) Yeah, replace for example Kana2rom.kana2rom(hira) with "("+hira+")". |
lauri_ranta | Learning resources | 6 | 2,040 | 2013-12-23, 3:47 pm | ||
|
Thread: Add furigana above kanji (anime subtitles) Post: Add furigana above kanji (anime subtitles) I wrote a Ruby script that adds romaji readings after kanji: [Image: http://19a5b0.s3-website-us-west-2.amazo...213556.png] I'll probably never use it myself though. The readings are not that useful... |
lauri_ranta | Learning resources | 6 | 2,040 | 2013-12-23, 3:09 pm | ||
|
Thread: Kanji Frequency in Wikipedia Post: Kanji Frequency in Wikipedia @Jiroukun If you use OS X or Linux, you can use a shell command like this: for f in *.txt;do mecab -F'%t %f[6]\n' "$f"|awk '$1~/[267]/&&$2{print $2}';done|sort|uniq -c|sort %... |
lauri_ranta | General discussion | 78 | 75,001 | 2013-12-05, 3:45 am | ||
|
Thread: Core 6K vs Core 10k Post: Core 6K vs Core 10k ktcgx Wrote:I believe pmnox's 10k deck has 10k unique words, plus jlpt vocab that wasn't in the original 6k data, nor the extra 4k data. You should take a look at it. It's not true that there are onl... |
lauri_ranta | Learning resources | 13 | 6,430 | 2013-12-01, 7:07 pm | ||
|
Thread: Japanese subs? Post: Japanese subs? If someone gets here searching for Japanese subtitles for dramas, see http://jpsubbers.x10.mx/Japanese-Subtitles/ or http://www.d-addicts.com/forum/subtitles.php#Japanese. You can download torrents f... |
lauri_ranta | Learning resources | 32 | 13,915 | 2013-11-25, 11:33 pm | ||
|
Thread: Unique kanji in Core2k Post: Unique kanji in Core2k There are multiple versions of the Core 6000 data, but in the JSON files available from iknow.jp: - The first 2000 sentences contain 1033 kanji - All sentences contain 1638 kanji - The first 2000 voc... |
lauri_ranta | Learning resources | 9 | 3,299 | 2013-11-10, 2:39 pm | ||
|
Thread: Audio files for reviewing vocabulary Post: Audio files for reviewing vocabulary I made audio files for reviewing vocabulary from JMdict/EDICT and Core 6000: http://jptxt.net/advanced-vocabulary.m4a (378 MB, 18 hours) http://jptxt.net/core-vocabulary.m4a (40 MB, 2 hours) Both fi... |
lauri_ranta | Learning resources | 4 | 1,856 | 2013-11-09, 9:01 am | ||
|
Thread: Core 6K vs Core 10k Post: Core 6K vs Core 10k The "Core 10k" data was extracted from the Japanese Sensei Deluxe iPhone app by overture2112 in 2011, and it was originally posted in the japanese 先生 = core 10000? thread. The Core 10k name also comes... |
lauri_ranta | Learning resources | 13 | 6,430 | 2013-11-03, 1:36 pm | ||
