Free japanese OCR - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Learning resources (http://forum.koohii.com/forum-9.html)
+--- Thread: Free japanese OCR (/thread-2480.html)

Free japanese OCR - Zarxrax - 2009-01-26

I just came across this great free tool to OCR Japanese text:
http://www.4shared.com/dir/1006463/575b200b/public.html
SmartOCRlite107

It's kind of fiddly to get working, because it requires .NET 1.1 to be installed (later versions are NOT sufficient). It can analyze any regular image file though. I tested it on a vocabulary list from a book, and it worked perfectly. I had to manually draw a box around each word though. I haven't tested it on paragraphs of text, but I think it would probably work well.

Free japanese OCR - Zarxrax - 2009-01-26

I gave it a try just now on a couple of full pages of text, and it did remarkably well in my opinion. I "scanned" the pages by photographing them with my digital camera, and they were quite noisy. Without any involvement on my part, it recognized all the text fine, with about 95% accuracy. Furigana seemed to confuse it a little though. This really beats typing everything out by hand!

Free japanese OCR - wrightak - 2009-01-26

Thanks for sharing!

Free japanese OCR - Jarvik7 - 2009-01-26

Wonder if it'll work under Wine (CrossOver).

Free japanese OCR - kazelee - 2009-02-01

Jarvik7 Wrote: Wonder if it'll work under Wine (CrossOver).

Tried it. Got nowhere. It loads but gives an error, even with .NET installed.

I'm using it in Windows now and I have to admit, I have no idea what I'm doing. Help?

Free japanese OCR - Zarxrax - 2009-02-01

Press the 3rd button on the toolbar to open a file. If you go to the little dropdown thing beside the button, you can choose from: file, folder, TWAIN device, clipboard. So anyways, you click that button and load up a source image for it to OCR. You should see the image come up now, and it will put a box around the parts that it recognizes as text and label each one with a yellow number. Now at this point there are a ton of options and things that you can play around with, and I don't understand how to use it so well myself...
but you can draw your own boxes around text that you want it to try to recognize. Now on the very right side it will show you what it recognized. If you notice something is wrong, you can right-click it, and I think it displays a list of alternatives. The drop-down box on this window will let you change the layout. Finally, click the save button to save the text to a file. I prefer using normal text files, otherwise it will try to do weird things with the formatting.

Free japanese OCR - kazelee - 2009-02-01

Thank you. I managed to get as far as opening a file and got lost at first.

Do you know if it will accept more than one page of a PDF? I opened one and a random page showed up. Perhaps I'm missing how to cycle through...

Free japanese OCR - Zarxrax - 2009-02-01

Sorry, dunno. A workaround might be to screenshot each page of the PDF and work with that?

Free japanese OCR - jcdietz03 - 2010-11-14

Trying to figure out the interface. I'll use this space as my rikaichan scratch pad if nobody minds.

Interface example:

認識 (recognition)
出力 (output)
プリセット (preset)
スマートリーディング (smart reading)
画像入力 (image input)
レイアウト (layout)
表 (table)
領域種別 (region type)
自動判別 (automatic detection)
文章 (text)
文章+絵 (text + picture)
見出し (heading)
表 (table)
絵 (picture)
区切り線 (separator line)
削除 (delete)
文字方向 (text direction)
自動判別 (automatic detection)
横書き (horizontal writing)
縦書き (vertical writing)
主に横書き (mainly horizontal writing)
主に縦書き (mainly vertical writing)
領域順序 (region order)
自動判別 (automatic detection)
追加 (add)
横書き領域優先 (horizontal regions first)
縦書き領域優先 (vertical regions first)
ブロックの強制結合 (force-merge blocks)
罫線によるセル区切りを自動判別 (automatically detect cell divisions from ruled lines)
横罫線でセル区切り (divide cells at horizontal ruled lines)
縦罫線でセル区切り (divide cells at vertical ruled lines)
セルを自動的に結合する (merge cells automatically)
認識による自動削除 (automatic deletion based on recognition)
指定領域だけを再二値化 (re-binarize only the specified region)

Free japanese OCR - toshiromiballza - 2012-09-23

The other OCR thread reminded me of this OCR program, which hasn't been mentioned here yet.

RealReader Lite: http://data-digital.sakura.ne.jp/RealReaderLitePrice.html

Free japanese OCR - Zarxrax - 2013-01-16

I checked out the RealReader Lite software. It seems to be a continuation of SmartOCRLite. I couldn't get version 8 to work right though, and version 7 just seems like SmartOCR with different colors in the interface.

In any case, does anyone know if there is a way to batch process files in this software? I am trying to OCR some DVD subtitles. It's very accurate, but I'm currently doing them one at a time, which is taking me forever.
I can load all of the files in at once, but I can't find a way to batch process them and save them as text.
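
The thread never settles whether the program itself can batch process files. What can be scripted is the step after it: if each subtitle image has been recognized and saved by hand as its own plain text file, the results can be stitched together in order. The following is only a minimal Python sketch under that assumption. It does not drive SmartOCR or RealReader in any way, and the folder layout, the file names, the `merge_text_files` helper and the UTF-8 default are all hypothetical (the encoding the program actually writes is not stated in the thread, so pass a different `encoding` if needed).

```python
import re
from pathlib import Path


def natural_key(name):
    """Sort key that compares digit runs as numbers, so 'sub10' follows 'sub2'."""
    return [int(part) if part.isdigit() else part.lower()
            for part in re.split(r"(\d+)", name)]


def merge_text_files(folder, output_name="merged.txt", encoding="utf-8"):
    """Concatenate every .txt file in `folder` into one file, in natural order.

    Assumes one saved text file per subtitle image (hypothetical layout).
    Returns the source file names in the order they were merged.
    """
    folder = Path(folder)
    sources = sorted(
        (p for p in folder.glob("*.txt") if p.name != output_name),
        key=lambda p: natural_key(p.name),
    )
    chunks = []
    for path in sources:
        text = path.read_text(encoding=encoding).strip()
        if text:  # skip files where nothing was recognized
            chunks.append(text)
    (folder / output_name).write_text("\n".join(chunks) + "\n", encoding=encoding)
    return [p.name for p in sources]
```

Calling `merge_text_files("subs")` on a folder containing `sub1.txt`, `sub2.txt` and `sub10.txt` would write `subs/merged.txt` with the three texts in that order. The natural sort matters because a plain alphabetical sort would put `sub10.txt` before `sub2.txt`.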