Cloudron Forum

Firecrawl on Cloudron - Turn any site into LLM data by web scraping

Category: App Wishlist
Tags: firecrawl, web scraping, ai
    LoudLemur wrote (#7):

    Hey, @ekevu123, thank you for this brilliant app wish! I really hope this is supported on Cloudron soon.

    I have heavily edited your initial post to try and use the new template that is being developed for the App Wishlist forum.

    • What do you think about the new appearance of your post?
    • Is it very objectionable to have somebody mod your post like this?
    • Do you have any suggestions about doing this in the future?

    Thanks!

      ekevu123 wrote (#8):

      Yes, I like the template. However, I think that if someone's post has been edited by a moderator, this should be very clearly marked.

        girish (Staff) wrote (#9, last edited by joseph):

        @ekevu123 The template is posted at https://forum.cloudron.io/topic/12472/please-use-this-template-to-make-an-app-wishlist-request by @LoudLemur. It looks like a good idea to have posts in this category formatted a certain way. In other parts of the forum, moderators generally don't edit posts (except to fix obvious typos and language).

        I am hoping people don't consider it rude if moderators edit the posts in the App Requests Category alone. Besides, the original poster gets reputation (the up arrow) anyway.

          ekevu123 wrote (#10):

          As I said, as long as edited posts are marked accordingly, it should be fine. I didn't know about the template; I will try to use it next time.

            LoudLemur wrote (#11):

            @ekevu123 The template is a new idea. I have marked your original post as edited by a moderator.

              phillipwilhelm wrote (#12):

              Definitely needed! This would catapult the possibilities for N8N & Cloudron, which could leverage its capabilities in big ways!

                SamGreenwood wrote (#13):

                I second this. Firecrawl would be a great addition to the App Library.

                  firmansi wrote (#14):

                  I would vote for this twice.

                    ekevu123 wrote (#15):

                    Has anyone used self-hosted Firecrawl? They describe one main difference between the self-hosted and cloud variants: the cloud version rotates IP addresses, so it is better at getting around blockers. I have never used Firecrawl self-hosted. Has this been an issue for anyone?

                      LoudLemur wrote (#16):

                      A new release of Firecrawl is available:
                      https://github.com/mendableai/firecrawl/releases/tag/v1.11.0

                      There are lots of new improvements:

                      Firecrawl v1.11.0 is here!

                      Major Features
                      • Launched our Firecrawl Index
                      • Speed up scrapes 5x if opted in
                      • Improved Activity Logs
                      • View webhook events
                      • Active crawl management
                      • Fire Enrich Example (Open Source Clay)
                      • Community Java SDK
                      • and a lot more

                      Features
                      • Improved Playwright tests and webhook test coverage
                      • Added GET /crawl/ongoing endpoint
                      • Introduced tag support in change tracking
                      • Added integration field to jobs and propagated through queue worker
                      • Parallel testing for runpod v2 and updated mu
                      • Ported queryIndexAtSplitLevel to RPC
                      • Enhanced SDK with index and missing parameters
                      • Removed redundant GCS check to improve performance
                      • Added credits_billed field across pipeline
                      • Enabled domain-level index splitting for better map querying
                      • Used index in search and extract operations
                      • Removed unused index columns

                      Fixes & Improvements
                      • Fixed crawl pre-finishing logic
                      • Refactored callWebhook and added logging
                      • Improved index testing (FIR-2214)
                      • Fixed JS SDK tests
                      • Clarified scrape options usage in README
                      • Fixed missing PLAYWRIGHT_MICROSERVICE_URL in env example
                      • Improved concurrency limit notification emails
                      • Removed query param sanitization that broke extract
