Firecrawl on Cloudron - Turn any site into LLM data by web scraping
-
Hey, @ekevu123, thank you for this brilliant app wish! I really hope this is supported on Cloudron soon.
I have heavily edited your initial post to try and use the new template that is being developed for the App Wishlist forum.
- What do you think about the new appearance of your post?
- Is it very objectionable to have somebody mod your post like this?
- Do you have any suggestions about doing this in the future?
Thanks!
-
@ekevu123 The template is posted at https://forum.cloudron.io/topic/12472/please-use-this-template-to-make-an-app-wishlist-request by @LoudLemur. It looks like a good idea to have posts (in this category) formatted a certain way. In other parts of the forum, moderators generally don't edit posts (only obvious typos and language).
I am hoping people don't consider it rude if moderators edit the posts in the App Requests Category alone. Besides, the original poster gets reputation (the up arrow) anyway.
-
As I said, I would mark it accordingly, then it should be fine. I didn't know about the template; I will try to use it next time.
-
Definitely needed! This would greatly expand what N8N & Cloudron can do by leveraging its capabilities!
-
I second this. Firecrawl would be a great addition to the App Library.
-
Has anyone used Firecrawl self-hosted? They describe one main difference between the self-hosted and cloud variants: the cloud version rotates IP addresses, so it is better at getting around blockers. I have never used Firecrawl self-hosted. Has this been an issue for anyone?
-
A new release of Firecrawl became available:
https://github.com/mendableai/firecrawl/releases/tag/v1.11.0
There are lots of new improvements:
Firecrawl v1.11.0 is here!
Major Features
Launched our Firecrawl Index
Speed up scrapes 5x if opted in
Improved Activity Logs
View webhook events
Active crawl management
Fire Enrich Example (Open Source Clay)
Community Java SDK
and a lot more
Features
Improved Playwright tests and webhook test coverage
Added GET /crawl/ongoing endpoint
Introduced tag support in change tracking
Added integration field to jobs and propagated through queue worker
Parallel testing for runpod v2 and updated mu
Ported queryIndexAtSplitLevel to RPC
Enhanced SDK with index and missing parameters
Removed redundant GCS check to improve performance
Added credits_billed field across pipeline
Enabled domain-level index splitting for better map querying
Used index in search and extract operations
Removed unused index columns
Fixes & Improvements
Fixed crawl pre-finishing logic
Refactored callWebhook and added logging
Improved index testing (FIR-2214)
Fixed JS SDK tests
Clarified scrape options usage in README
Fixed missing PLAYWRIGHT_MICROSERVICE_URL in env example
Improved concurrency limit notification emails
Removed query param sanitization that broke extract
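
For anyone wanting to try the new GET /crawl/ongoing endpoint from the list above against a self-hosted instance, here is a minimal sketch of how the request could be built. The base URL (http://localhost:3002), the /v1 prefix, the Bearer authorization header, and the function names are all assumptions for illustration; the release notes only name the endpoint path, and nothing is assumed about the shape of the response.

```python
import json
import urllib.request

# Assumption: a self-hosted instance reachable here; adjust to your setup.
DEFAULT_BASE_URL = "http://localhost:3002"
# Assumption: the endpoint is mounted under a /v1 prefix.
API_PREFIX = "/v1"


def build_ongoing_crawls_request(base_url=DEFAULT_BASE_URL, api_key=None):
    """Build (but do not send) a GET request for the ongoing-crawls endpoint."""
    url = base_url.rstrip("/") + API_PREFIX + "/crawl/ongoing"
    headers = {"Accept": "application/json"}
    if api_key:
        # Assumption: the instance expects a Bearer token when auth is enabled.
        headers["Authorization"] = "Bearer " + api_key
    return urllib.request.Request(url, headers=headers, method="GET")


def fetch_ongoing_crawls(base_url=DEFAULT_BASE_URL, api_key=None, timeout=10):
    """Send the request and return the decoded JSON body as-is."""
    request = build_ongoing_crawls_request(base_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling fetch_ongoing_crawls() needs a running instance; build_ongoing_crawls_request() can be inspected offline to check the URL and headers before sending anything.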