

    Cloudron Forum


    Tesseract-OCR (Optical Character Recognition) on Cloudron

    App Wishlist
    tesseract-ocr tesseract ocr text
    • LoudLemur

      Tesseract helps your computer recognize text embedded in images and extract it as text. It is a text recognition engine.

      OCR is useful, for example, when editing memes, or in computer gaming when you want to take data from the game and process it in another application.

      There is a Docker image.

      https://github.com/tesseract-ocr/tesseract

      Tesseract might be of use with paperless-ng, which Cloudron already supports. There is a thread mentioning this here:

      https://forum.cloudron.io/topic/6346/multi-language-ocr-support/12?_=1655907503717

      Ubuntu PPA:
      https://launchpad.net/~alex-p/+archive/ubuntu/tesseract-ocr-devel

      Docker (Tesseract 5.0 is out now; I think these images are still only 4.0):
      https://tesseract-ocr.github.io/tessdoc/Docker-Containers.html
      Documentation:
      https://tesseract-ocr.github.io/tessdoc/Home.html

      • girish (Staff)

        A quick reading suggests that this is a CLI tool (and not an app). Tesseract is also already installed in the paperless app, btw. @LoudLemur Are you having trouble with tesseract and paperless?
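
Since Tesseract is a command-line tool, here is a rough sketch of how another program might call it. It assumes the `tesseract` binary is already installed and on the PATH, and that the `eng` language data is available. The function names `build_tesseract_command` and `extract_text` are made up for illustration and are not part of Tesseract or paperless.

```python
import shutil
import subprocess
import sys


def build_tesseract_command(image_path, lang="eng"):
    """Build the argument list for a Tesseract CLI call (hypothetical helper)."""
    # Passing "stdout" as the output base makes tesseract print the text
    # instead of writing it to a .txt file; -l selects the language data.
    return ["tesseract", image_path, "stdout", "-l", lang]


def extract_text(image_path, lang="eng"):
    """Run Tesseract on an image and return the recognized text."""
    # Assumption: the tesseract binary is installed and on the PATH.
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract binary not found on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if tesseract exits non-zero
    )
    return result.stdout


if __name__ == "__main__":
    # Usage: python ocr_sketch.py path/to/image.png
    if len(sys.argv) > 1:
        print(extract_text(sys.argv[1]))
```

For other languages, pass the matching language code (for example `deu`), provided that language data is installed.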
