kanji koohii FORUM
Rikaichan: RevTK Community Edition - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Learning resources (http://forum.koohii.com/forum-9.html)
+--- Thread: Rikaichan: RevTK Community Edition (/thread-5351.html)

Pages: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35


Rikaichan: RevTK Community Edition - IceCream - 2011-06-13

THIS!!!IS!!!AWESOME!!!!!!

♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♫


Rikaichan: RevTK Community Edition - Tori-kun - 2011-06-14

I ♡ cb4960, too!!! (fund raising???)


Rikaichan: RevTK Community Edition - fugu68 - 2011-06-14

Fantastic tool - only just discovered it! It should go in the wiki somewhere - Anki tools for Japanese page?

Thank you cb4960!!


Rikaichan: RevTK Community Edition - cb4960 - 2011-06-14

I have just uploaded version 12.1 of the Rikaichan: RevTK Community Edition extension.

Download version 12.1 via Mediafire

What Changed?

● Renamed "click-to-popup" mode to Super Sticky mode.

● Some changes to Super Sticky mode:
  □ Clicking outside of the popup will remove the popup. (Thanks nest0r)!
  □ Ctrl-Right Mouse button is now the preferred method to select a word because it won't
    follow links. I fixed it so it will no longer bring up the usual right-click context menu.
    You can still use Ctrl+Left/Middle mouse buttons if you want, but just don't use
     them on links.
  □ Ctrl-Left Mouse will no longer hilite the surrounding table element.

● This extension now has a new name (in case you didn't notice).

cb4960


Rikaichan: RevTK Community Edition - cb4960 - 2011-06-14

I also released version 1.1 of Anki's Real-Time Import plugin.

● Fixed a crash that occurred when selecting a deck from the deck browser.


Rikaichan: RevTK Community Edition - cb4960 - 2011-06-14

@ nest0r,

Still haven't tried AntConc. It took me this long just to try Yomichan (and subsequently steal it's best feature).

I'll look into the Rikaichan dictionary generation process when I get some time. Would definitely be interesting to add Stardic dictionaries.

@ Tori-kun,

I might be able to add an option to bring up the lookup bar at startup.

Edit: Oh, just noticed Rikaichan 2.03 came out. I will have to merge with that soon.

@ jino123, IceCream, Tori-kun, fugu68,

You're welcome. I especially appreciate IceCream's 412 hearts.


Rikaichan: RevTK Community Edition - nest0r - 2011-06-15

Fantastic, thanks! Works much better now. Back to lazy mouse-only-ness for lookups. ;p

Here's an example of what you can do with AntConc. Let's say you decide to give it a corpus of 18 volumes from a certain light novel series. Some of those hypothetical legally free .txt files, that've been run through AozoraRemover.

You tell it to find all n-grams (an option under the clusters tab) of a certain size/number and frequency and you'll get customizably sorted results such as this, depending on your settings:

43 逆に言えば
28 記憶を失う前の
23 という轟音と共に
8 その名の通り
14 この辺りで筆を置かせていただきます
6 この期に及んで

(Left column is # of total occurrences in those 18 texts.)

If you click on any of those words it'll take you to the concordance tab to show you the context, which you can sort by 1L or 1R, etc., so that the words to the left and/or right of these main terms are sorted. So I was thinking users could tell Rikaichan how many words of context to the left and right of the highlighted term they might want. This might be tricky and unnecessary, though. + Here's an overview of the collocates (as opposed to clusters/collocations) part, which has its own span selection for search terms: http://www.antlab.sci.waseda.ac.jp/software/AntConc_Help_3.1.2/Collocates/Overview_%28Collocates%29.htm

Here's a more general explanation for AntConc's collocational type stuff: http://ell.phil.tu-chemnitz.de/analysis/collocations.html

And it must be fate, I ran into this thesis on collocations earlier while looking for a recent/upcoming publication by Zhang Taoli on bilingualism: http://etheses.nottingham.ac.uk/622/1/final_thesis.pdf

Edit: Hmm it might be better to run novels through Gaiji Replacer before removing the formatting, except it doesn't do batch processing. ;p Edit: Or does AozoraRemover take care of that? I think I saw stuff about gaiji in its code.


Rikaichan: RevTK Community Edition - IceCream - 2011-06-15

hahah don't forget the ♫!!


Rikaichan: RevTK Community Edition - ta12121 - 2011-06-15

IceCream Wrote:THIS!!!IS!!!AWESOME!!!!!!

♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♥♫
Can't say anymore than this


Rikaichan: RevTK Community Edition - apirx - 2011-06-15

awesome work, thanks a lot


Rikaichan: RevTK Community Edition - cb4960 - 2011-06-15

@nest0r,

Thanks for the quick tutorial!

And it looks like I'll have to add batch support for Gaiji Replacer or gaiji support for AozoraRemover (or maybe both).


Rikaichan: RevTK Community Edition - nest0r - 2011-06-15

cb4960 Wrote:@nest0r,

Thanks for the quick tutorial!

And it looks like I'll have to add batch support for Gaiji Replacer or gaiji support for AozoraRemover (or maybe both).
That would be cool. ;p Maybe a two-way tool?

Also:

An even quicker AntConc tutorial. ^_^ Well, maybe not quicker.

Go here: http://www.antlab.sci.waseda.ac.jp/software.html and download the relevant file. If it's the Windows .exe, open it (no installation that I recall), and:

1) Global Settings→Language Encodings→UTF-8 or Shift_JIS

2) File→Export Settings→Default name (antconc_settings_320.ant) in same folder as .exe so that when the program starts it uses this and you don't have to keep selecting the language. You can always import settings files also.

3) File→Open File(s)/Open Dir.

4) With those files showing up in a column to your left, go to the Clusters tab.

5) Check ‘N-Grams’ box, and the Clusters tab will now say N-grams instead of Clusters.

6) Set N-Gram Size to Min. Size 1 and Max. Size 20 (For some reason you have to scroll up/down rather than type numbers.)

7) Min. N-Gram Frequency = 4

(Those above size and especially frequency settings can vary, but I find the size works best that way due to how Japanese is processed in AntConc. Probably want much lower for English or somewhat lower for larger numbers [like triple digits] of Japanese texts.)

8) Sort by Word or Frequency. You can always re-sort by clicking Sort. Sorting by Word is useful because of how n-grams are processed: http://www.antlab.sci.waseda.ac.jp/software/AntConc_Help/N-grams/Overview_(N-grams).htm (One set of n-grams for ‘This is a pen’ would be ‘This/This is/This is a/This is a pen’, so word-sorting lets you skim more easily to that final line.)

9) Click Start!

You can copy the results manually by clicking slightly to the side on the same line, or click directly for Concordance, or File→Save Output to Text File. You can also duplicate and undock the results window by clicking Save Window.

Also, because we're using default n-gram settings, it's only processing words, not punctuation, numbers, symbols, etc. You can enable those in the Global Settings but it skews the results quite a bit so I just leave it and when I see results that have gaps in them indicating where those tokens (question mark, number, etc.) used to be, I click it to see the context in the Concordance tab. In those cases you'll get no hits because it uses that punctuation-removed line as a search query, so you'll need to select just a section of it and re-search, e.g. if you click だけど それは in the search results it'll take you to the Concordance tab and say No Hits, but just re-search for だけど or それは and sort by 1L or 2R or whathaveyou.

Lastly, we're obviously only skimming the surface of what's possible with this tool.

Edit: I was earlier reading about ‘concgrams’ via that thesis paper I mentioned before, and I see that AntConc's author is thinking of incorporating those into a later version, so that's cool. Concgrams account for all the constituency and positional variation, e.g. ‘government expenditure’ can be ‘government's own expenditure’ or ‘expenditure of the government’, but normally this wouldn't get picked up by linear-only searches. There's some tools that do this automatically for texts, but they seem to be hard to get. You can manually achieve a similar effect in AntConc by searching for ‘government’ and adding the context word (in Advanced settings) ‘expenditure’ with a horizon (another instance of selecting the span, this time within how many words to the L or R of the search term). But I digress.

Guess I should've posted this elsewhere, sorry. Back to Rikaichan: RevTK Community Edition!


Rikaichan: RevTK Community Edition - IceCream - 2011-06-16

EDIT: nevermind, figured out the problem... you can't tag in japanese...

EDIT2: um, btw how does the audio work? because, it names the audio field with kana - kanji.mp3, but it doesn't put the actual audio anywhere, it seems... have i missed setting up something?

EDIT3: ah, one more thing, not sure if it was said before, but when a word has furigana, when you're trying to import the sentence to anki, only that single word is imported, not the sentence. Also, when you try to import a sentence near the left hand side of the page, it doesn't take it from the beginning of the sentence, just the beginning of the line... so you end up with half a word or a strangely cut sentence sometimes...


Rikaichan: RevTK Community Edition - cb4960 - 2011-06-16

IceCream Wrote:EDIT: nevermind, figured out the problem... you can't tag in japanese...
Can you use Japanese tags in Anki? If so, I'll fix my code to support this.

IceCream Wrote:EDIT2: um, btw how does the audio work? because, it names the audio field with kana - kanji.mp3, but it doesn't put the actual audio anywhere, it seems... have i missed setting up something?
I didn't mention it, but the Anki Real-Time Import feature doesn't support audio yet. It will someday though.

IceCream Wrote:EDIT3: ah, one more thing, not sure if it was said before, but when a word has furigana, when you're trying to import the sentence to anki, only that single word is imported, not the sentence.
Thanks for the report. I'll look into it.

IceCream Wrote:Also, when you try to import a sentence near the left hand side of the page, it doesn't take it from the beginning of the sentence, just the beginning of the line... so you end up with half a word or a strangely cut sentence sometimes...
Currently, the extension will stop a sentence at a line break. Still evaluating whether or not this change was for the better.


Rikaichan: RevTK Community Edition - IceCream - 2011-06-16

thanksthanks Big Grin

well, yeah you can tag in japanese in anki, but it really isn't an issue to tag in english, so...
the error thing just confused me for a while...

the audio thing, am i right in thinking you made some other thing sometime which pulled the audio out of a big folder or something? is that automatic and would it work instead? ah, i guess i should find and read that thing...

Well, it seems like it's dependant on the word wrapping atm. So, if i'm reading with the full screen, the line break is often in the right place. But, if i'm reading with half a screen, it chops it up a lot more often. it'd probably work better from the last full stop. Or comma, for that matter... (commas to end would also be fine, at least for me, since i prefer short sentences.)

EDIT: ah, no, it's not wordwrap, there's something weirder going on... i just checked it to make sure...
so, for example, if i hover over "のっけて” from the full line "「直してるんだ。列車が走れるようにな。枕木まくらぎを敷いて、レールをのっけてスパイクで止めるんだ」" what comes up as the sentence on the card is "を敷いて、レールをのっけてスパイクで止めるんだ」"
so, i'm not even sure what a line break is, i guess... i stand by the commas Smile

EDIT2: ohhh, i see... it's part of that furigana problem, i think. It's cutting the sentence if there's furigana over the previous word...
what is a link break anyway then...?


Rikaichan: RevTK Community Edition - keikei - 2011-06-17

hmm, I followed your instructions cb4960 but still can't get real-time import to work for some reason. Pressing the R key doesn't do anything

seems to be the UDP thingy, here's the error msg:

Traceback (most recent call last):
File "C:\cygwin\home\dae\Home\anki\win\build\pyi.win32\anki\outPYZ1.pyz/ankiqt.ui.main", line 1321, in
File "C:\cygwin\home\dae\Home\anki\win\build\pyi.win32\anki\outPYZ1.pyz/ankiqt.ui.main", line 749, in loadDeck
File "C:\cygwin\home\dae\Home\anki\win\build\pyi.win32\anki\outPYZ1.pyz/ankiqt.ui.main", line 2235, in syncDeck
File "C:\cygwin\home\dae\Home\anki\win\build\pyi.win32\anki\outPYZ1.pyz/anki.deck", line 3085, in close
File "C:\cygwin\home\dae\Home\anki\win\build\pyi.win32\anki\outPYZ1.pyz/anki.hooks", line 26, in runHook
File "C:\Users\kei\AppData\Roaming\.anki\plugins\Real-Time Import.py", line 267, in doDeckClosed
self.serverMgr.stop()
File "C:\Users\kei\AppData\Roaming\.anki\plugins\Real-Time Import.py", line 228, in stop
self.thread.stop()
AttributeError: ServerManager instance has no attribute 'thread'
----------------------------------
I can't figure out what's wrong Sad do you have any idea what it could be?


oh and, is there any way to save vocab without word type indicators ("v1, n" etc.) like the regular rikaichan does? I got it unclicked in the options but the indicators still show up in the text file.

thanks a lot in advance!


Rikaichan: RevTK Community Edition - cb4960 - 2011-06-17

keikei Wrote:I can't figure out what's wrong Sad do you have any idea what it could be?
Download the latest version of Real-Time Import to fix that issue.

keikei Wrote:oh and, is there any way to save vocab without word type indicators ("v1, n" etc.) like the regular rikaichan does? I got it unclicked in the options but the indicators still show up in the text file.
Thanks, I'll add it to the TODO list.


Rikaichan: RevTK Community Edition - cb4960 - 2011-06-17

I have just uploaded version 12.2 of the Rikaichan: RevTK Community Edition.

Download version 12.2 via Mediafire

I incorporated changes from the official Rikaichan v2.03.

cb4960


Rikaichan: RevTK Community Edition - keikei - 2011-06-18

I figured it out! (had the 1.1 plugin installed) just in case someone else runs into the same problem: it seems like you must include "$t" like here: $d$t$r$t$n.
looks like I deleted one of those by accident when I played around with the sentence feature.


Rikaichan: RevTK Community Edition - znebr47625 - 2011-06-19

I love you. My dream come true.


Rikaichan: RevTK Community Edition - Hashiriya - 2011-06-19

keep up the good work!


Rikaichan: RevTK Community Edition - nest0r - 2011-06-20

Request #48002906: The option to start off in Super Sticky Mode rather than having to trigger it? It's a trifle, but nevertheless.


Rikaichan: RevTK Community Edition - wrightak - 2011-06-20

I visited this thread a long time ago but now that I'm back it seems like you're extension does a lot more than it did before.

As far as I'm aware, it now does everything my extension does and more. Is this correct?

My extension: http://wrightak.blogspot.com/2010/08/modification-of-rikaichan.html

Did you look at my extension and copy the features? Or did I inadvertently write the code when the functionality already existed in your extension?


Rikaichan: RevTK Community Edition - nest0r - 2011-06-20

@wrightak - I think cb4960 incorporated your modifications to Rikaichan back when you last posted and we were discussing merging the two plugins.

This is from the first post in this thread:

▲ Additional save options in the Clipboard & Save options tab. See screenshot further down in this post. (This feature was originally merged from wrightak's rikaichan mod).

We've now got direct Anki card creation also, mutated from a feature FooSoft included in Yomichan.

Edit: Correction, it was quite a while after you commented and vanished (again ;p): http://forum.koohii.com/showthread.php?pid=133291#pid133291


Rikaichan: RevTK Community Edition - wrightak - 2011-06-20

Even though I am subscribed to this thread, for some reason I have been receiving no emails.

FFS guys, somebody could have dropped a comment on my blogpost. I've been doing this in my spare time and it would have been nice to know that someone else has duplicated the functionality.

Also, why doesn't cb4960 have a small web page or blog post dedicated to this? Or does he/she? This is great work and it's hard to keep up with a forum thread. Especially when I don't get any emails to inform me of updates.