Firecrawl on Cloudron - Turn any site into LLM data by web scraping

    ekevu123 wrote (#3):

    Yeah. Unfortunately, I wasn't able to re-package it for Cloudron, although I tried. So, if someone wants to do it, we all have something new to play with 🙂

      taowang wrote (#4):

      I want this as well!

        taowang wrote (#5):

        When can we have this? It is extremely useful for building AI applications, since it can scrape data from nearly any website. And data is valuable these days.

          igaudette wrote (#6):

          I want this!

            LoudLemur wrote (#7):

            Hey, @ekevu123, thank you for this brilliant app wish! I really hope this is supported on Cloudron soon.

            I have heavily edited your initial post to try and use the new template that is being developed for the App Wishlist forum.

            • What do you think about the new appearance of your post?
            • Is it very objectionable to have somebody mod your post like this?
            • Do you have any suggestions about doing this in the future?

            Thanks!

              ekevu123 wrote (#8):

              Yes, I like the template. However, I think that if someone else's post has been edited by a moderator, this should be very clearly marked.

                girish (Staff) wrote (#9, last edited by joseph):

                @ekevu123 The template (posted by @LoudLemur) is at https://forum.cloudron.io/topic/12472/please-use-this-template-to-make-an-app-wishlist-request and it looks like a good idea to have posts in this category formatted a certain way. In other parts of the forum, moderators generally don't edit posts (except for obvious typos and language).

                I am hoping people don't consider it rude if moderators edit the posts in the App Requests Category alone. Besides, the original poster gets reputation (the up arrow) anyway.

                  ekevu123 wrote (#10):

                  As I said, I would mark it accordingly; then it should be fine. I didn't know about the template. I will try to use it next time.

                    LoudLemur wrote (#11):

                    @ekevu123 The template is a new idea. I have marked your original post as edited by a mod.

                      phillipwilhelm wrote (#12):

                      Definitely needed! This would catapult the possibilities for N8N & Cloudron, which could leverage its capabilities in big ways!

                        SamGreenwood wrote (#13):

                        I second this. Firecrawl would be a great addition to the App Library.

                          firmansi wrote (#14):

                          I would vote for this twice.

                            ekevu123 wrote (#15):

                            Has anyone used Firecrawl self-hosted? They describe one main difference between the self-hosted and cloud variants: the cloud version rotates IP addresses, so it is better at getting around blockers. I have never used Firecrawl self-hosted. Has this been an issue for anyone?

                              LoudLemur wrote (#16):

                              A new release of Firecrawl is available:
                              https://github.com/mendableai/firecrawl/releases/tag/v1.11.0

                              There are lots of improvements:

                              Firecrawl v1.11.0 is here!

                              Major Features
                              • Launched our Firecrawl Index
                              • Speed up scrapes 5x if opted in
                              • Improved Activity Logs
                              • View webhook events
                              • Active crawl management
                              • Fire Enrich Example (Open Source Clay)
                              • Community Java SDK
                              • and a lot more

                              Features
                              • Improved Playwright tests and webhook test coverage
                              • Added GET /crawl/ongoing endpoint
                              • Introduced tag support in change tracking
                              • Added integration field to jobs and propagated through queue worker
                              • Parallel testing for runpod v2 and updated mu
                              • Ported queryIndexAtSplitLevel to RPC
                              • Enhanced SDK with index and missing parameters
                              • Removed redundant GCS check to improve performance
                              • Added credits_billed field across pipeline
                              • Enabled domain-level index splitting for better map querying
                              • Used index in search and extract operations
                              • Removed unused index columns

                              Fixes & Improvements
                              • Fixed crawl pre-finishing logic
                              • Refactored callWebhook and added logging
                              • Improved index testing (FIR-2214)
                              • Fixed JS SDK tests
                              • Clarified scrape options usage in README
                              • Fixed missing PLAYWRIGHT_MICROSERVICE_URL in env example
                              • Improved concurrency limit notification emails
                              • Removed query param sanitization that broke extract
