Papermerge - Scan & OCR
-
"Open Source Document Management System for Digital Archives
Designed for scanned documents. Perfect software that will help you reduce paper clutter and keep your business organised."- https://www.papermerge.com
- https://www.papermerge.com/demo
- https://github.com/ciur/papermerge
- https://github.com/ciur/papermerge#comparison-with-other-dms
- https://papermerge.readthedocs.io/en/latest/setup/docker.html
The OCR looks decent and maybe better for personal use than MayanEDMS, which also looks excellent.
I can see it being a good pairing with Invoice Ninja / Firefly III.
-
I have begun to package this today. Hope to have something for comment in the next few days.
cc: @marcusquinn
-
@doodlemania2 I've got this thing to a heartbeat status but it has lots of dependencies flying around - trying another approach to packaging to squash em. Will advise on first checkin.
-
@doodlemania2 Some progress - I'm checked in here (link). Do NOT try to run this - it will go kaboom. I've got to work with the author some to see if we can move mod_wsgi, try an embedded apache (in python's virtual environment) and a few other things.
Anyways, if you happen to know python, django, virtualenv - dive in and contribute! If not, I'll keep you all posted as things come together.
-
@doodlemania2 there are many things better than apache, and you can test without it..
https://www.toptal.com/python/pythons-wsgi-server-application-interface
Can also see about uWSGI.
-
@doodlemania2 V annoying when apps want to write data to their own directory and quite bad practice from a security point as executable code directories shouldn't be writable by anything other than the deployment process. That sounds like an upstream issue to report, maybe suggest a feature for a config setting for the write paths.
-
@marcusquinn I've spoken with author - he's sympathetic but appears quite focused on iterative development of features. Don't blame him at all, just a different focus.
-
@doodlemania2 did you have any comparison between papermerge and paperless-ng ? It seems you have started focusing on the latter. Was it just because of packaging issues?
-
@girish said in Papermerge - Scan & OCR:
comparison between papermerge and paperless-ng ?
I tend to do this in the first instance
https://selfhosted.libhunt.com/compare-paperless-ng-vs-papermerge
Looks like Papermerge has the edge on paper but not a lot in it.
-
@girish papermerge was indeed more difficult to package, but also, i really enjoy the auto processing in paperless much more. i didn't do any scientific tests, of course, but after working with both - it fit my needs. I now have an app with about 10K scanned docs on it, all OCRd, tagged, and quite nice!
that's not to say papermerge isn't good - just where i landed