Capture2Text - Japanese OCR Utility - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Learning resources (http://forum.koohii.com/forum-9.html)
+--- Thread: Capture2Text - Japanese OCR Utility (/thread-6769.html)
Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-26

Hello,

Capture2Text allows language learners to quickly snapshot a small portion of the screen, OCR it, and (by default) save the result to the clipboard. This is a great companion for learners who enjoy reading manga.

[Image: conceptual illustration]

Download the latest version via SourceForge (source code is included)

Please visit the Capture2Text homepage for more information.

Have Fun!
cb4960

Capture2Text - Japanese OCR Utility - Nukemarine - 2010-11-26

I love you for this, but only in that man on man prison sex sort of way.

Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-26

Nukemarine Wrote: I love you for this, but only in that man on man prison sex sort of way.

Oh good. You had me worried there for a second.

Capture2Text - Japanese OCR Utility - nest0r - 2010-11-26

Awesome! I am trying this out with vobsubs per recent comments in the subs2srs thread. So far, as long as I'm careful about dragging the blue area, it works pretty well. It's probably more accurate when I focus on specific words, per my strategy of extracting specific vocabulary.

I just tried it while a film is playing. The same rule applies as for getting screenshots to work: you must change the playback/output settings to something without a single asterisk next to it.

Thanks!

Edit: I meant I tried it via both subs2srs preview and then normal video playback.

Capture2Text - Japanese OCR Utility - nest0r - 2010-11-27

I wonder if it'd be possible to integrate this, or rather the OCR, with subs2srs for scanning idx/sub lines during deck generation. Probably not worth the effort, though.

Capture2Text - Japanese OCR Utility - jettyke - 2010-11-27

WHOOOAAA!!! I had been looking for exactly this kind of thing 2 weeks ago!!! I was starting to think that it was only my imagination that one day I would find such a thing.

Thanks a lot!

Capture2Text - Japanese OCR Utility - KanjiDevourer - 2010-11-27

That is amazing!
How can we append a linefeed-newline to the copied text?

Capture2Text - Japanese OCR Utility - rigol - 2010-11-27

Would very much like to use it, but 3 scanners found malware: http://virusscan.jotti.org/en-GB/scanresult/483d77a931ff8da39eebeb476bb6ea065efa68f6 - can anyone reassure me that these are false positives? I'm kind of reluctant to install it...

Capture2Text - Japanese OCR Utility - Tori-kun - 2010-11-27

@rigol: Definitely false alarm. The problem with those scanners is they detect prolly a part of the code accessing memory ("the selection" whatsoever) and think it's some injection of viral application. Feel free to use it, it's clean.

Edit: Maybe adding some Rikai-chan-like feature in it? (if possible.. that'd turn reading mangas at least easier!)

Capture2Text - Japanese OCR Utility - bombpersons - 2010-11-27

Awesome! This could be extremely useful, especially if the OCR software can be made more accurate!

Shame there's no linux version though (doesn't work in wine when I tried it) =(

*edit* If there are any linux users out there who want something like this, I put together a quick python script to get something similar to work. I tied it to a hotkey in my window manager, works reasonably well...

http://pastebay.com/110710

You need to have nhocr for the ocr, xsel for the clipboard, scrot to select an area on the screen, and imagemagick to convert the image from scrot.

Capture2Text - Japanese OCR Utility - rigol - 2010-11-27

Better safe than sorry, so thanks for the explanation, Tori
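For Linux readers who can't reach the pastebay link, here is a rough sketch of the kind of pipeline bombpersons describes (scrot to select a region, ImageMagick to convert it, nhocr to OCR it, xsel to fill the clipboard). This is not bombpersons' script. The function names are made up for illustration, and the nhocr options (`-line`, `-o`) plus the assumption that nhocr writes UTF-8 text are my best recollection of its command line, so check `nhocr` on your own system before relying on them.

```python
import subprocess
import tempfile
from pathlib import Path
from typing import List


def build_commands(png: str, pgm: str, out_txt: str) -> List[List[str]]:
    """Return the argv lists for the capture, convert and OCR steps (hypothetical helper)."""
    return [
        # scrot -s lets you drag a rectangle with the mouse and saves it to png
        ["scrot", "-s", png],
        # ImageMagick picks the output format from the .pgm extension
        ["convert", png, pgm],
        # ASSUMPTION: -line = single-line mode, -o = output text file
        ["nhocr", "-line", "-o", out_txt, pgm],
    ]


def capture_to_clipboard() -> str:
    """Select a screen region, OCR it, and copy the text to the X clipboard."""
    with tempfile.TemporaryDirectory() as tmp:
        png = str(Path(tmp) / "shot.png")
        pgm = str(Path(tmp) / "shot.pgm")
        out_txt = str(Path(tmp) / "result.txt")
        for cmd in build_commands(png, pgm, out_txt):
            subprocess.run(cmd, check=True)  # stop at the first failing step
        # ASSUMPTION: nhocr writes its result as UTF-8
        text = Path(out_txt).read_text(encoding="utf-8").strip()
    # xsel reads the new clipboard contents from stdin
    subprocess.run(
        ["xsel", "--clipboard", "--input"],
        input=text.encode("utf-8"),
        check=True,
    )
    return text
```

To use it the way bombpersons does, save it as a script that calls `capture_to_clipboard()` and bind that script to a hotkey in your window manager.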
Capture2Text - Japanese OCR Utility - overture2112 - 2010-11-27

rigol Wrote: Would very much like to use it, but 3 scanners found malware: http://virusscan.jotti.org/en-GB/scanresult/483d77a931ff8da39eebeb476bb6ea065efa68f6 - can anyone reassure me that these are false positives? I'm kind of reluctant to install it...

He provided the source for it, so you can always look over it to make sure it's not doing anything you don't like and then compile it yourself.

Capture2Text - Japanese OCR Utility - overture2112 - 2010-11-27

bombpersons Wrote: If there are any linux users out there who want something like this, I put together a quick python script to get something similar to work. I tied it to a hotkey in my window manager, works reasonably well...

Thanks

Capture2Text - Japanese OCR Utility - rigol - 2010-11-27

overture2112 Wrote: rigol Wrote: Would very much like to use it, but 3 scanners found malware: http://virusscan.jotti.org/en-GB/scanresult/483d77a931ff8da39eebeb476bb6ea065efa68f6 - can anyone reassure me that these are false positives? I'm kind of reluctant to install it... He provided the source for it, so you can always look over it to make sure it's not doing anything you don't like and then compile it yourself.

Yeah, but not everyone knows that much about source and compiling, so...

Capture2Text - Japanese OCR Utility - nest0r - 2010-11-27

I'm just surprised you'd think cb4960 would post software with a virus and that no one would notice, based on 3/19 scanners... Anyway. ;p

Capture2Text - Japanese OCR Utility - rigol - 2010-11-27

I didn't say "Dude, nice try distributing malware" or anything, it's just that my antivirus program goes crazy with the exe, so I merely asked for reassurance... Tori already explained what was happening, so can we please leave it at that?
Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-27

I did a virus scan on each DLL and executable:

ConvertImageFormat.exe
http://virusscan.jotti.org/en-GB/scanresult/7427efdf28fdd027ea8f33c3aef1612b6888018a

DevIL.dll
http://virusscan.jotti.org/en-GB/scanresult/bc09689c1a6bf13968bd6e1c9e5284d7ca7e632b

ILU.dll
http://virusscan.jotti.org/en-GB/scanresult/94ba1b1e68f9482eacb6c58cbca71cd8ee244e5e

msvcr90.dll
http://virusscan.jotti.org/en-GB/scanresult/bb75632f54882f015ccc8d9d696dcee6056698f4/da044fcf9aa6f8ff75e97cfb003ae25403bec414

cygwin1.dll
http://virusscan.jotti.org/en-GB/scanresult/e281624eee1fb8473a2b736e067b740a0f82dd75

nhocr.exe
http://virusscan.jotti.org/en-GB/scanresult/2d0f9972c630c75b1a8a1a77fd592911a4eb42f5

Capture2Text.exe
http://virusscan.jotti.org/en-GB/scanresult/3cd2ad2d9fa46947e2fd037595cbd860e3666c11

GdiPlus.dll
http://virusscan.jotti.org/en-GB/scanresult/0904c23cac2e76782e67fc9536c29bce3a25c31c/e577bffbfba157dcd4b13f5421bb25182d252d6c

-----

The only component that has trouble passing is Capture2Text.exe (3/19 give false positives based on heuristics). If you want to compile Capture2Text.exe yourself, it is very easy. Just download AutoHotKey and run the "Convert .ahk to .exe" tool on Capture2Text.ahk in the SourceCode folder.

Edit: Actually, you don't even need to compile the code. Just move the Capture2Text.ahk to the same directory as settings.ini and run it with AutoHotKey.

Capture2Text - Japanese OCR Utility - nest0r - 2010-11-27

rigol Wrote: I didn't say "Dude, nice try distributing malware" or anything, it's just that my antivirus program goes crazy with the exe, so I merely asked for reassurance... Tori already explained what was happening, so can we please leave it at that?

Well you kind of implied it, in my opinion, so that's why I'm giving you a very slight hard time, (with no malice).
If your own antivirus software went haywire (which you only just mentioned) and then you found only 3/19 results, for software from someone trusted on the forum, with other members having posted no problems but indicating they've used it, it's like you're questioning all of our integrity and savvy, in my opinion, and with little cause. It's like by seeking reassurance you preempted your own trust that allowed for the possibility of reassurance. I don't think you meant anything by it, honestly, I just think you were too paranoid. It's your computer, but since you posted here... ;p

Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-27

I have just posted version 1.01 of Capture2Text.

Download Capture2Text v1.01 via MediaFire (source code is included)

- Added ability to use linefeeds, carriage returns, and tabs in the PrependText and AppendText settings in settings.ini. (Thanks KanjiDevourer!) Use these tokens:
  ${cr} = Carriage return
  ${lf} = Linefeed
  ${tab} = Tab
- Removed the capture box showing up in the taskbar.
- Removed the PassThruKey settings in settings.ini. Now I disable those hotkeys when not in capture mode so they are no longer needed.
- Added ReplaceControlText to settings.ini. If SendToControl=1, it allows you to replace the control text instead of sending to it (doesn't always work, depends on the control).
- Added an "About" item to the tray menu.
- Cleaned up code and put the ScreenCapture routines in a separate file (ScreenCapture.ahk).

Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-27

KanjiDevourer Wrote: That is amazing!

In settings.ini, set AppendText=${cr}${lf}

Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-27

Tori-kun Wrote: Maybe adding some Rikai-chan-like feature in it? (if possible.. that'd turn reading mangas at least easier!)

That would be interesting.
For now, you can use a program that monitors the clipboard, like StarDict or EBwin.

Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-27

I have just posted version 1.02 of Capture2Text.

Download Capture2Text v1.02 via MediaFire (source code is included)

- You can now press the space bar to toggle which corner of the selection you move: the top-left corner or the bottom-right corner. This key can be changed in settings.ini.
- You can now press the left mouse button to complete a capture. This key can be changed in settings.ini.

Capture2Text - Japanese OCR Utility - Zarxrax - 2010-11-27

Very nice tool! Would it be possible to have it generate multiple possible candidates so you can select a different one if the primary OCR result is incorrect?

Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-27

Zarxrax Wrote: Very nice tool!

Unfortunately, the OCR tool that I'm using doesn't have such an option.

Capture2Text - Japanese OCR Utility - cb4960 - 2010-11-27

I have just posted version 1.03 of Capture2Text.

Download Capture2Text v1.03 via MediaFire (source code is included)

- Added Chinese OCR support. See the Dictionary setting in settings.ini to enable it.
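For anyone unsure what the `${cr}`, `${lf}` and `${tab}` tokens from the v1.01 notes do to the PrependText and AppendText settings, here is a small Python sketch of the substitution as described in the thread. Capture2Text itself is an AutoHotKey script, so this is only an illustration of the behavior, not cb4960's code; `expand_tokens` and `decorate` are names invented for the example.

```python
# Token table taken from the v1.01 release notes in this thread.
TOKENS = {
    "${cr}": "\r",   # carriage return
    "${lf}": "\n",   # linefeed
    "${tab}": "\t",  # tab
}


def expand_tokens(value: str) -> str:
    """Replace each ${...} token in a settings value with its control character."""
    for token, char in TOKENS.items():
        value = value.replace(token, char)
    return value


def decorate(ocr_text: str, prepend: str = "", append: str = "") -> str:
    """Wrap OCR output with the expanded PrependText and AppendText values (hypothetical helper)."""
    return expand_tokens(prepend) + ocr_text + expand_tokens(append)
```

With `AppendText=${cr}${lf}`, as cb4960 suggests to KanjiDevourer, `decorate("漢字", append="${cr}${lf}")` gives the OCR text followed by a Windows-style line ending, so successive captures pasted into an editor land on separate lines.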