Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


    Cloudron Forum

    • Register
    • Login
    • Search
    • Categories
    • Recent
    • Tags
    • Popular

    Tesseract-OCR (Optical Character Recognition) on Cloudron

    App Wishlist
    tesseract-ocr tesseract ocr text
    6
    6
    233
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • L
      LoudLemur last edited by

      Tesseract helps your computer recognize text embedded in images and extract it as text. It is a text recognition engine.

      OCR can be useful for example in the editing of memes or in computer gaming, where you wish to take data from the game and process it outside of the game in another application.

      There is a Docker image.

      https://github.com/tesseract-ocr/tesseract

      Tesseract might be of use with paperless-ng, which Cloudron already supports. There is a thread mentioning this here:

      https://forum.cloudron.io/topic/6346/multi-language-ocr-support/12?_=1655907503717

      Ubuntu PPA:
      https://launchpad.net/~alex-p/+archive/ubuntu/tesseract-ocr-devel

      Docker (Tesseract 5.0 is out now, I think these are only 4.0)
      https://tesseract-ocr.github.io/tessdoc/Docker-Containers.html
      Documentation:
      https://tesseract-ocr.github.io/tessdoc/Home.html

      1 Reply Last reply Reply Quote 1
      • Referenced by  L LoudLemur 
      • girish
        girish Staff last edited by

        A quick reading suggests that this is a CLI tool (and not an app). This is also installed in paperless already btw. @LoudLemur Are you having trouble with tesseract and paperless?

        rmdes 1 Reply Last reply Reply Quote 1
        • rmdes
          rmdes @girish last edited by

          @girish Wondering how we can get the Nextcloud app to leverage Tesseract for the OCR-full-text search nextcloud plugin.

          necrevistonnezr 1 Reply Last reply Reply Quote 1
          • necrevistonnezr
            necrevistonnezr @rmdes last edited by

            @rmdes Maybe the easiest way would be https://forum.cloudron.io/topic/8383/nextcloud-all-in-one-aio/ ?

            1 Reply Last reply Reply Quote 1
            • jdaviescoates
              jdaviescoates last edited by

              @Dolgoipa said in Tesseract-OCR (Optical Character Recognition) on Cloudron:

              I have heard of Tesseract before and have used it in a couple of projects. It's a great open-source OCR engine that is easy to use and can be integrated with other applications.

              You said exactly that before.

              I'm inclined to think you are only here to post the link you just shared.

              I use Cloudron with Gandi & Hetzner

              girish 1 Reply Last reply Reply Quote 1
              • girish
                girish Staff @jdaviescoates last edited by

                @jdaviescoates good catch, think it is a bot.

                1 Reply Last reply Reply Quote 1
                • First post
                  Last post
                Powered by NodeBB