Cloudron Forum


"run rules" on already imported docs

Paperless-ngx
      #1 timconsidine (App Dev) wrote:

      This may be staring me in the face, but I can't see it.

      Paperless-NG does an excellent job of indexing a document on import, e.g. assigning a correspondent and tags.
      But that depends on setting those up in advance.

      I have a large archive of docs I want to bring into the app.
      I can set up the "obvious" ones, but will probably miss a number (e.g. suppliers I have forgotten I dealt with but still have docs from).

      Is it possible to set up, e.g., a correspondent after importing and then "run rules" on already imported docs that were not categorised on initial import?

        #2 nebulon (Staff) wrote:

        @timconsidine I guess you have already looked into https://paperless-ng.readthedocs.io/en/latest/advanced_usage.html#. I also couldn't quickly find an obvious command to run.

        But in general: do you put the documents into the storage folder and then just want Paperless to add them to the index? Or what did you mean by:

        @timconsidine said in "run rules" on already imported docs:

        But that depends on setting those up in advance.

          #3 timconsidine (App Dev) wrote:

          @nebulon thank you.
          My plan is to:

          • upload a folder of docs, too many to review manually
          • let Paperless do its excellent matching and tagging
          • probably then find an unforeseeable number of docs that should have been assigned to correspondents or tags but weren't, because I didn't set those up in advance
          • somehow (this is the question) get Paperless to rerun the matching process on the already imported docs

          Maybe I am over-worrying.
          I'm just conscious that importing this archive will be a big job, and I'm trying to reduce the manual tagging/classifying work after the import.

            #4 neurokrish wrote:

            @timconsidine I think you are looking for the document retagger. I have already used it and it works great!

              #5 timconsidine (App Dev) wrote:

              @neurokrish thank you!

              EDIT: jeez, isn't Paperless-NG awesome!

                #6 ChristopherMag wrote:

                PS: if anyone needs to do this, open the Cloudron terminal for the app and run the following:

                cd src
                python3 manage.py document_retagger -T
                
                  #7 jdaviescoates wrote:

                  @ChristopherMag said in "run rules" on already imported docs:

                  PS, anyone needing to do this, open the cloudron terminal for the app and then run the following:

                  python3 src/manage.py document_retagger -T
                  

                  Sounds like something that'd be useful to add to the docs @Staff 🙂

                    #8 girish (Staff) wrote:

                    @ChristopherMag thanks, documented in https://docs.cloudron.io/apps/paperless-ngx/#document-retagger

                      #9 ChristopherMag wrote:

                      @girish I thought that you could run the command without first cd'ing into the src directory, but when I ran it again today I found that was incorrect.

                      I have edited my post above to first run cd src and then python3 manage.py document_retagger -T.

                      Could you please update the documentation? What I originally posted doesn't work: for some reason, Python throws an exception if you don't cd first.
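
For anyone who wants to script this, here is a minimal sketch of a wrapper that always runs the retagger from inside the src directory, since running it from elsewhere was reported above to fail. The function name `retag` and the `SRC_DIR` variable are made up for illustration and are not part of Paperless or Cloudron. Only the `cd src` step and the `document_retagger -T` command come from this thread.

```shell
#!/bin/sh
# Hypothetical helper (not part of Paperless-ngx or Cloudron): run
# document_retagger from the directory that contains manage.py, because
# calling it from elsewhere was reported in this thread to raise an exception.
# SRC_DIR is an assumed override; it defaults to "src" as in the posts above.
retag() {
    src_dir="${SRC_DIR:-src}"
    if [ ! -f "$src_dir/manage.py" ]; then
        echo "manage.py not found in $src_dir" >&2
        return 1
    fi
    # Use a subshell so the caller's working directory is left unchanged.
    (cd "$src_dir" && python3 manage.py document_retagger "$@")
}
```

Calling `retag -T` from the app's terminal should then behave like the two commands posted above. Since the original question was about correspondents, note that the Paperless-ngx docs, as far as I know, also list `-c` for correspondents and `-t` for document types. Treat that as an assumption and confirm with `python3 manage.py document_retagger --help` before relying on it.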
