Cloudron Forum

Firecrawl on Cloudron - Turn any site into LLM data by web scraping

Category: App Wishlist
Tags: firecrawl, web scraping, ai
11 Posts 5 Posters 2.2k Views 7 Watching
  • ekevu123 wrote (last edited by LoudLemur)
    #1

    [EDITED by Mod]

    • Main Page: https://www.firecrawl.dev
    • Git: https://github.com/mendableai/firecrawl
    • Licence: GNU Affero General Public License v3.0
    • Docker: Yes https://github.com/mendableai/firecrawl/blob/main/docker-compose.yaml
    • Demo: https://www.firecrawl.dev/playground?url=https%3A%2F%2Fcloudron.io&mode=scrape

    • Summary:
      Firecrawl (https://www.firecrawl.dev) is a self-hostable web scraping tool that prepares data in an LLM-readable format.
      Crawl and convert any website into LLM-ready markdown or structured data. Built by Mendable.ai and the Firecrawl community. Includes powerful scraping, crawling and data extraction capabilities.

    • This repository is in its early development stages. We are still merging custom modules in the mono repo. It's not yet completely ready for full self-host deployment, but you can already run it locally.


    • Notes:
      Cloudron doesn't have a self-hosted scraper yet, so maybe this could be a good addition.
      Here is the self-hosting guide: https://github.com/mendableai/firecrawl/blob/main/SELF_HOST.md

    • Alternative to / Libhunt link: e.g.
    • Screenshots:

    brave_DZHeHo0izL.png brave_yMpKnkfukV.png brave_nEtYQ7lekr.png

    • LoudLemur wrote
      #2

      Wow! The AI community is so brainy! This is a great find, and how wonderful it is that people created it and released it under a Free licence!

      • ekevu123 wrote
        #3

        Yeah. Unfortunately, I wasn't able to re-package it for Cloudron, although I tried. So, if someone wants to do it, we all have something new to play with 🙂

        • taowang wrote
          #4

          I want this as well!

          • taowang wrote
            #5

            When can we have this? This is extremely useful for building AI applications, since it can scrape data from nearly any website. And data is valuable these days.

            • igaudette wrote
              #6

              I want this!

              • LoudLemur wrote
                #7

                Hey, @ekevu123, thank you for this brilliant app wish! I really hope this is supported on Cloudron soon.

                I have heavily edited your initial post to try and use the new template that is being developed for the App Wishlist forum.

                • What do you think about the new appearance of your post?
                • Is it very objectionable to have somebody mod your post like this?
                • Do you have any suggestions about doing this in the future?

                Thanks!

                • ekevu123 wrote
                  #8

                  Yes, I like the template. However, I think that if someone else's post has been edited by a moderator, this should be very clearly marked.

                  • girish (Staff) wrote (last edited by joseph)
                    #9

                    @ekevu123 The template is posted at https://forum.cloudron.io/topic/12472/please-use-this-template-to-make-an-app-wishlist-request by @LoudLemur. It looks like a good idea to have posts in this category formatted a certain way. In other parts of the forum, moderators generally don't edit posts (except to fix obvious typos and language).

                    I am hoping people don't consider it rude if moderators edit the posts in the App Requests Category alone. Besides, the original poster gets reputation (the up arrow) anyway.

                    • ekevu123 wrote
                      #10

                      As I said, I would mark it accordingly; then it should be fine. I didn't know about the template, but I will try to use it next time.

                      • LoudLemur wrote
                        #11

                        @ekevu123 The template is a new idea. I have marked your original post as edited by a mod.
