ArchiveBox on Cloudron - Success!
-
Nebulon, you absolute beauty, this works! Thank you team Cloudron for making internet archiving something you can easily do by self-hosting ArchiveBox.
The hard part is now to figure out which of all the options to select. There are some quite interesting options available.
I tried a quick test archive of a small site using the favicon and screenshot methods. I also set a depth of 1 so it would archive the pages the main URL links to as well.
The process began and pulled in some of the correct urls, but when I try to follow one, expecting to see a snapshot of the archived site, I have this error:
Cloudron documentation is excellent and anticipates errors like this. I ran the update all snapshots command outlined here:
https://docs.cloudron.io/apps/archivebox/
and it showed lots of yellow "extractor failed" messages. I shall try again later with a different, small website and set depth to 0 to see if it makes a difference.
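For anyone wondering what the depth setting changes in practice, here is a rough Python sketch of the idea as I understand it: depth 0 covers only the URL you submit, while depth 1 also covers the pages that URL links to. This is only an illustration and not ArchiveBox's actual code; the helper name urls_to_archive and the LinkCollector class are made up for this example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href values of <a> tags found in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def urls_to_archive(start_url, html, depth):
    """Hypothetical helper: list the URLs a crawl of this depth would cover.

    depth 0 -> only start_url
    depth 1 -> start_url plus every http(s) link found in its HTML
    """
    urls = [start_url]
    if depth >= 1:
        parser = LinkCollector()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links against the page they came from.
            absolute = urljoin(start_url, href)
            is_web_link = absolute.startswith(("http://", "https://"))
            if is_web_link and absolute not in urls:
                urls.append(absolute)
    return urls
```

So even a small page with a few dozen links turns into a few dozen snapshots at depth 1, which is why retrying with depth 0 is a sensible way to narrow down where the extractor failures come from.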
-
@LoudLemur the main packaging work was done by @vladimir-d who has become an invaluable team member by now for pushing new app packages forward. I merely amended package meta-data.
-
Awesome work, folks! I'll try this right away.
-
Currently, the LDAP login is a bit annoying because upstream does not properly set the roles of newly created LDAP users. @vladimir-d is working on getting this fixed upstream.
-
Thanks all for your packaging efforts! Let me know how I can help (via ArchiveBox GitHub issues, as I may not remember to check back here).
I've improved our LDAP documentation: https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration#ldap
Note we also have a new ADMIN_USERNAME + ADMIN_PASSWORD env var option to streamline setup: https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration#admin_username--admin_password
Unfortunately I don't have a lot of experience with LDAP myself, so I may ask for help doing QA if you submit any ArchiveBox PRs to modify the LDAP behavior that you're having problems with.
-
-
PR at https://github.com/ArchiveBox/ArchiveBox/pull/1335 for the LDAP fix.
-
Heya, been playing with archivebox and can't seem to figure out how to generate a functional site.
Here's the example on the demo server:
https://archivebox.demo.cloudron.io/archive/1706242844.530401/index.html (deleted)
Was an archive of https://textconverter.com/split-text-into-paragraphs
The .js for the input textbox and split functions doesn't work in the archived site ;-/
How would one get that working?