kanji koohii FORUM
New manga OCR reader (capture2text alternative) - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Learning resources (http://forum.koohii.com/forum-9.html)
+--- Thread: New manga OCR reader (capture2text alternative) (/thread-13376.html)



New manga OCR reader (capture2text alternative) - Digix - 2015-11-18

Hi all, 
capture2text was not very suitable for manga reading, so I made  better alternative:

Advantages over capture2text:

Automatic text extraction, just one click to read. No precise and annoying selections needed.
Can read bubbles of complex shape.
Significant text processing to remove common OCR mistakes and adjust uppercase/lowercase characters where other OCR software fails.
Image binarization includes upscaling.
It can sometimes read images that cannot be read manually.
Text output is usually good enough for automated translation.


Software is currently in very early version so memory leaks are likely and interface is messy.


Usage:

Just unzip anywhere, run and right click in the bubble you want to read.
You have to click on white space inside the text bubble somewhere close to the center !
within a second you will have text in the clipboard for further processing 
Application will press ESC button to cancel right mouse click menu which pops when you click, so you cant use right button for original purpose.
If you are reading bad quality source you may have to adjust threshold slider to get better binarization results.
If characters are improperly detected you can try to shrink or zoom source and click again  zooming improves recognition accuracy, shrinking improves character detection


Application can be downloaded here:
http://d-h.st/aJev

Since it is in early development I need feedback and suggestions what can be improved.
Also I need better name for my application.

Further development will be mostly focused on more advanced text extraction to make it possible to read text which is embedded directly into image.
Also this is starting point for manga editing application to automate manga translation which will automatically clean and replace Japanese text with English


RE: New manga OCR reader (capture2text alternative) - tokyostyle - 2015-11-18

Why does your application need SSL? Where is it connecting to and for what purpose?


RE: New manga OCR reader (capture2text alternative) - Digix - 2015-11-18

(2015-11-18, 9:30 am)tokyostyle Wrote: Why does your application need SSL? Where is it connecting to and for what purpose?

SSl is used to access translation webpage when you click translate button:
https://www.babelfish.com


RE: New manga OCR reader (capture2text alternative) - xtreme1 - 2015-12-28

(2015-11-18, 9:09 am)Digix Wrote: Hi all, 
capture2text was not very suitable for manga reading, so I made  better alternative:

Advantages over capture2text:

Automatic text extraction, just one click to read. No precise and annoying selections needed.
Can read bubbles of complex shape.
Significant text processing to remove common OCR mistakes and adjust uppercase/lowercase characters where other OCR software fails.
Image binarization includes upscaling.
It can sometimes read images that cannot be read manually.
Text output is usually good enough for automated translation.


Software is currently in very early version so memory leaks are likely and interface is messy.


Usage:

Just unzip anywhere, run and right click in the bubble you want to read.
You have to click on white space inside the text bubble somewhere close to the center !
within a second you will have text in the clipboard for further processing 
Application will press ESC button to cancel right mouse click menu which pops when you click, so you cant use right button for original purpose.
If you are reading bad quality source you may have to adjust threshold slider to get better binarization results.
If characters are improperly detected you can try to shrink or zoom source and click again  zooming improves recognition accuracy, shrinking improves character detection


Application can be downloaded here:
http://www.filedropper.com/reader

Since it is in early development I need feedback and suggestions what can be improved.
Also I need better name for my application.

Further development will be mostly focused on more advanced text extraction to make it possible to read text which is embedded directly into image.
Also this is starting point for manga editing application to automate manga translation which will automatically clean and replace Japanese text with English
Your link is down


RE: New manga OCR reader (capture2text alternative) - Digix - 2016-01-04

new link
http://www.megafileupload.com/io8v/Reader.rar


RE: New manga OCR reader (capture2text alternative) - xtreme1 - 2016-01-05

(2016-01-04, 5:27 pm)Digix Wrote: new link
http://www.megafileupload.com/io8v/Reader.rar

Wow there are some pretty cool ideas in this. I like that you can just click the box of the manga panel to get text, though sometimes it would be useful to manually select. It was still too buggy to use (mouse would freeze then come in laggy after a minute, executing the exe opened like 4 windows every time, and random crashes, and issues with furigana recognition), but I'm really looking forward to using this in the future.


RE: New manga OCR reader (capture2text alternative) - Digix - 2016-01-05

Manual selection will be in the next version
as well as some interface improvements.

Lags will usually happen if you click outside of bubble as it will capture too much of area and try to read it all.
It does not crash on my computer so it is quite hard to fix that problem


furigana is causing real problems here and I still dont know how to remove it easily
I have some cleanup ideas, to extract text from background, maybe it will also work with furigana as well


RE: New manga OCR reader (capture2text alternative) - Digix - 2016-01-07

latest version
http://d-h.st/aJev
or
http://www.filehosting.org/file/details/533315/Reader.rar

Custom selections are possible now -- Click or select desired area, center of selection must not touch any symbol
Horizontal text handling added -- If selection width is twice as big as height horizontal text mode will be activated
White on black text handling -- if selection center is on black area, image will be inverted
Furigana removal  as well as function to increase spacing between columns for better recognition accuracy (may not work on skewed text)

some bug fixes , so application should be crashing less often