Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. Paperless-ngx
  3. "run rules" on already imported docs

"run rules" on already imported docs

Scheduled Pinned Locked Moved Solved Paperless-ngx
9 Posts 6 Posters 1.5k Views 6 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • timconsidineT Offline
    timconsidineT Offline
    timconsidine
    App Dev
    wrote on last edited by
    #1

    This may be staring me in the face but can't see it.

    Paperless-NG does an excellent job of indexing a document on import, eg correspondent, tag.
    But that depends on setting those up in advance.

    I have large archive docs I want to bring in to the app.
    I can setup the "obvious" ones, but will probably miss a number (eg suppliers I have forgotten I dealt with but have docs from them.

    Is it possible to set up eg correspondent after importing and then "run rules" on already imported docs which were not categorised on initial import ?

    nebulonN 1 Reply Last reply
    0
    • timconsidineT timconsidine

      This may be staring me in the face but can't see it.

      Paperless-NG does an excellent job of indexing a document on import, eg correspondent, tag.
      But that depends on setting those up in advance.

      I have large archive docs I want to bring in to the app.
      I can setup the "obvious" ones, but will probably miss a number (eg suppliers I have forgotten I dealt with but have docs from them.

      Is it possible to set up eg correspondent after importing and then "run rules" on already imported docs which were not categorised on initial import ?

      nebulonN Offline
      nebulonN Offline
      nebulon
      Staff
      wrote on last edited by
      #2

      @timconsidine I guess you have looked into https://paperless-ng.readthedocs.io/en/latest/advanced_usage.html# already. I also couldn't quickly find some obvious command to run.

      But generally, so you put the documents into the storage folder and then just want it to put them to the index? Or what did you mean with:

      @timconsidine said in "run rules" on already imported docs:

      But that depends on setting those up in advance.

      timconsidineT 1 Reply Last reply
      0
      • nebulonN nebulon

        @timconsidine I guess you have looked into https://paperless-ng.readthedocs.io/en/latest/advanced_usage.html# already. I also couldn't quickly find some obvious command to run.

        But generally, so you put the documents into the storage folder and then just want it to put them to the index? Or what did you mean with:

        @timconsidine said in "run rules" on already imported docs:

        But that depends on setting those up in advance.

        timconsidineT Offline
        timconsidineT Offline
        timconsidine
        App Dev
        wrote on last edited by
        #3

        @nebulon thank you
        I am thinking to :

        • upload a folder of several docs, too many to look at manually.
        • allow Paperless to do its excellent matching and tagging
        • probably then find an unforeseeable number of docs which should be allocated to correspondents or tags but weren't because I didn';t set this up in advance
        • somehow - this is the question - get Paperless to go through matching process on already imported docs.

        Maybe I am over-worrying.
        Just conscious it will be a big job importing this archive and trying to reduce my manual work tagging/classifying post-import.

        neurokrishN 1 Reply Last reply
        1
        • timconsidineT timconsidine

          @nebulon thank you
          I am thinking to :

          • upload a folder of several docs, too many to look at manually.
          • allow Paperless to do its excellent matching and tagging
          • probably then find an unforeseeable number of docs which should be allocated to correspondents or tags but weren't because I didn';t set this up in advance
          • somehow - this is the question - get Paperless to go through matching process on already imported docs.

          Maybe I am over-worrying.
          Just conscious it will be a big job importing this archive and trying to reduce my manual work tagging/classifying post-import.

          neurokrishN Offline
          neurokrishN Offline
          neurokrish
          wrote on last edited by neurokrish
          #4

          @timconsidine I think you are looking for document retagger. I used this already and it works great!

          timconsidineT 1 Reply Last reply
          3
          • neurokrishN neurokrish

            @timconsidine I think you are looking for document retagger. I used this already and it works great!

            timconsidineT Offline
            timconsidineT Offline
            timconsidine
            App Dev
            wrote on last edited by timconsidine
            #5

            @neurokrish thank you !

            EDIT : jeez, isn't Paperless-NG awesome !

            1 Reply Last reply
            2
            • ChristopherMagC Offline
              ChristopherMagC Offline
              ChristopherMag
              wrote on last edited by ChristopherMag
              #6

              PS, anyone needing to do this, open the cloudron terminal for the app and then run the following:

              cd src
              python3 manage.py document_retagger -T
              
              jdaviescoatesJ girishG 2 Replies Last reply
              2
              • ChristopherMagC ChristopherMag

                PS, anyone needing to do this, open the cloudron terminal for the app and then run the following:

                cd src
                python3 manage.py document_retagger -T
                
                jdaviescoatesJ Offline
                jdaviescoatesJ Offline
                jdaviescoates
                wrote on last edited by
                #7

                @ChristopherMag said in "run rules" on already imported docs:

                PS, anyone needing to do this, open the cloudron terminal for the app and then run the following:

                python3 src/manage.py document_retagger -T
                

                Sounds like something that'd be useful to add to the docs @Staff 🙂

                I use Cloudron with Gandi & Hetzner

                1 Reply Last reply
                0
                • ChristopherMagC ChristopherMag

                  PS, anyone needing to do this, open the cloudron terminal for the app and then run the following:

                  cd src
                  python3 manage.py document_retagger -T
                  
                  girishG Offline
                  girishG Offline
                  girish
                  Staff
                  wrote on last edited by
                  #8

                  @ChristopherMag thanks, documented in https://docs.cloudron.io/apps/paperless-ngx/#document-retagger

                  ChristopherMagC 1 Reply Last reply
                  2
                  • girishG girish

                    @ChristopherMag thanks, documented in https://docs.cloudron.io/apps/paperless-ngx/#document-retagger

                    ChristopherMagC Offline
                    ChristopherMagC Offline
                    ChristopherMag
                    wrote on last edited by
                    #9

                    @girish I thought that you could run the command without first cd'ing into the src directory but when running it again today I found out that was incorrect.

                    I have edited my post above to first include cd src and then python3 manage.py document_retagger -T.

                    Could you please update the documentation as what I originally posted doesn't work for some reason, python throws an exception if you don't cd first?

                    1 Reply Last reply
                    2
                    Reply
                    • Reply as topic
                    Log in to reply
                    • Oldest to Newest
                    • Newest to Oldest
                    • Most Votes


                    • Login

                    • Don't have an account? Register

                    • Login or register to search.
                    • First post
                      Last post
                    0
                    • Categories
                    • Recent
                    • Tags
                    • Popular
                    • Bookmarks
                    • Search