Page 1 of 2 12 LastLast
Results 1 to 10 of 12

Thread: GUI for Tesseract-OCR

  1. #1

    Default GUI for Tesseract-OCR

    Do we have a GUI for Tesseract (optical character recognition) in our distribution? I see that gImageReader would be a candidate.
    Last edited by malcolmlewis; 05-Nov-2016 at 08:44. Reason: Fix prefix

  2. #2
    Join Date
    Jun 2008
    Location
    Podunk
    Posts
    26,319
    Blog Entries
    15

    Default Re: GUI for Tesseract-OCR

    Hi
    I think your talking about my build? This version is a RC for the 3.2 version, feel free to test...
    Cheers Malcolm °¿° SUSE Knowledge Partner (Linux Counter #276890)
    SUSE SLE, openSUSE Leap/Tumbleweed (x86_64) | GNOME DE
    If you find this post helpful and are logged into the web interface,
    please show your appreciation and click on the star below... Thanks!

  3. #3

    Default Re: GUI for Tesseract-OCR

    Quote Originally Posted by malcolmlewis View Post
    Hi
    I think your talking about my build? This version is a RC for the 3.2 version, feel free to test...
    Great stuff! (I tagged mistakenly Leap 42.2 instead of 42.1). Where do I get your build?

  4. #4
    Join Date
    Jun 2008
    Location
    Podunk
    Posts
    26,319
    Blog Entries
    15

    Default Re: GUI for Tesseract-OCR

    Quote Originally Posted by hnimmo View Post
    Great stuff! (I tagged mistakenly Leap 42.2 instead of 42.1). Where do I get your build?
    Hi
    Fixed the prefix

    For openSUSE Leap 42.1 it's at version 3.1.2;

    It's in this repo: http://download.opensuse.org/reposit...USE_Leap_42.1/

    Just download the files needed and manually install.
    Cheers Malcolm °¿° SUSE Knowledge Partner (Linux Counter #276890)
    SUSE SLE, openSUSE Leap/Tumbleweed (x86_64) | GNOME DE
    If you find this post helpful and are logged into the web interface,
    please show your appreciation and click on the star below... Thanks!

  5. #5

    Default Re: GUI for Tesseract-OCR

    Quote Originally Posted by hnimmo View Post
    Great stuff! (I tagged mistakenly Leap 42.2 instead of 42.1). Where do I get your build?
    OK. I've got it....searched for packages...not used to going that way....

  6. #6
    Join Date
    Mar 2008
    Location
    Phuket, Thailand
    Posts
    26,490
    Blog Entries
    37

    Default Re: GUI for Tesseract-OCR

    I have been using Malcolmlewis's Tereact with gImageReader since openSUSE-12.1 and currently have it running on Leap-42.1. I blogged about it here :
    https://forums.opensuse.org/entry.ph...-and-Tesseract

    Its no exaggeration on my part to stay that I use it a couple times/week, normally OCR scanning German (and sometimes French) and then translating same to English.

    Malcolm, for me, that packaging has been a lifesaver. Its MOST appreciated.
    .

  7. #7

    Default Re: GUI for Tesseract-OCR

    gImageReader (version 3.1.9 on Leap 42.1 / Gnome 3.16.2) works a treat on an existing jpeg file.

    It produces exactly the same result as I had using tesseract from the command line on a file produced using the Gnome 'Simple Scan' function (why am I not surprised?).

    The file was produced using the Simple Scan option 'Text' to generate a jpeg file (362 kB). Tesseract converted the file to text (600 bytes) with one conversion error. (Capital I was rendered as i)


    However, there seems to be a problem with the gImageReader scanner function (scanning from my HP 8610).
    1) Choice seems to produce only a png file of enormous resolution (44 MB) (Setting: colour, 1200 dpi)
    2) The png file cannot be loaded into gImageReader (error loading image)
    3) I think it is not possible to produce a (small) jpg file similar to the original (above)
    4) repeated attempts to scan result in error "Scan failed: Failed to save image"
    5) reducing to greyscale produced a 27 MB png image that loaded sucessfully
    6) gImageReader crashed thereafter on clicking the 'autodetect layout' button
    7) the resultung OCR generated a text with substantially more errors (51 instances of 1 or more erroneous characters)

    This result is disappointing since the identical text with other scan parameters can be better recognised (see above)

    Repeating the exercise with greyscale and 300 dpi works perfectly on the same text (0 errors). Apparently the above problems seem to be related to the size of the files and scanning density.

    Cheers
    Harvey

  8. #8
    Join Date
    Jun 2008
    Location
    West Yorkshire, UK
    Posts
    3,433

    Default Re: GUI for Tesseract-OCR

    Good to hear about gImageReader. I have been using YAGF for ages; it will be interesting to compare.

  9. #9
    Join Date
    Jun 2008
    Location
    Podunk
    Posts
    26,319
    Blog Entries
    15

    Default Re: GUI for Tesseract-OCR

    Hi
    The 3.1.99 release will only build for openSUSE Leap 42.2 and Tumbleweed due to dependencies, at least it will be ready when 42.2 is ready....
    Cheers Malcolm °¿° SUSE Knowledge Partner (Linux Counter #276890)
    SUSE SLE, openSUSE Leap/Tumbleweed (x86_64) | GNOME DE
    If you find this post helpful and are logged into the web interface,
    please show your appreciation and click on the star below... Thanks!

  10. #10
    Join Date
    Mar 2008
    Location
    Phuket, Thailand
    Posts
    26,490
    Blog Entries
    37

    Default Re: GUI for Tesseract-OCR

    I installed GImageReader and Tesseract on a new openSUSE-42.2 install. It works fine. Many thanks Malcolm for packing this !!
    .

Page 1 of 2 12 LastLast

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •