Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. App Wishlist
  3. pd3f - Open-source PDF text extraction using machine learning

pd3f - Open-source PDF text extraction using machine learning

Scheduled Pinned Locked Moved App Wishlist
2 Posts 2 Posters 528 Views 3 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • turianT Offline
    turianT Offline
    turian
    wrote on last edited by
    #1

    Their docker is so nice, I was able to get it up and running on a bare metal server in five minutes. No joke.

    Check out their demo site

    "pd3f is an Open-source PDF text extraction pipeline that is self-hosted, local-first and Docker-based.

    pd3f reconstructs the original continuous text with the help of machine learning."

    luckowL 1 Reply Last reply
    3
    • turianT turian

      Their docker is so nice, I was able to get it up and running on a bare metal server in five minutes. No joke.

      Check out their demo site

      "pd3f is an Open-source PDF text extraction pipeline that is self-hosted, local-first and Docker-based.

      pd3f reconstructs the original continuous text with the help of machine learning."

      luckowL Offline
      luckowL Offline
      luckow
      translator
      wrote on last edited by
      #2

      @turian what do you think. Is it possible to integrate it into workflows like ... upload a PDF into paperless, then into pd3f, back to paperless? Or similar with nextcloud or cubby? Upload and copy & paste from pdf3 into other apps feels wrong 🙂

      Pronouns: he/him | Primary language: German

      1 Reply Last reply
      0
      Reply
      • Reply as topic
      Log in to reply
      • Oldest to Newest
      • Newest to Oldest
      • Most Votes


      • Login

      • Don't have an account? Register

      • Login or register to search.
      • First post
        Last post
      0
      • Categories
      • Recent
      • Tags
      • Popular
      • Bookmarks
      • Search