Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. App Wishlist
  3. Mercury Parser: extract content from URLs (e.g. for RSS aggregators)

Mercury Parser: extract content from URLs (e.g. for RSS aggregators)

Scheduled Pinned Locked Moved App Wishlist
2 Posts 1 Posters 1.5k Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • necrevistonnezrN Online
    necrevistonnezrN Online
    necrevistonnezr
    wrote on last edited by
    #1

    Overview: https://medium.com/@adampash/the-secret-engines-of-the-internet-e517592266ea
    Code: https://github.com/postlight/mercury-parser

    Mercury Parser allows to pull fulltext content from URLs- It's the engine used in Reeder, NewsBlur, Feedbin, News Explorer, Feedly, Apollo (for Reddit), Medium, Bear, Zapier, etc.

    It would allow to self host the service for the Cloudron apps FreshRSS and Tiny Tiny RSS (and maybe even more that I don't think of right now?!)

    Postlight's Mercury Parser extracts the bits that humans care about from any URL you give it. That includes article content, titles, authors, published dates, excerpts, lead images, and more.

    Mercury Parser powers the Mercury AMP Converter and Mercury Reader, a Chrome extension that removes ads and distractions, leaving only text and images for a beautiful reading view on any site.

    Mercury Parser allows you to easily create custom parsers using simple JavaScript and CSS selectors. This allows you to proactively manage parsing and migration edge cases. There are many examples available along with documentation.

    • Chrome extension: https://mercury.postlight.com/reader/
    • FreshRSS plugin: https://github.com/simon-wessel/freshrss-mercury-parser
    • Tiny Tiny RSS Plugin: https://github.com/HenryQW/mercury_fulltext
    necrevistonnezrN 1 Reply Last reply
    4
    • necrevistonnezrN necrevistonnezr

      Overview: https://medium.com/@adampash/the-secret-engines-of-the-internet-e517592266ea
      Code: https://github.com/postlight/mercury-parser

      Mercury Parser allows to pull fulltext content from URLs- It's the engine used in Reeder, NewsBlur, Feedbin, News Explorer, Feedly, Apollo (for Reddit), Medium, Bear, Zapier, etc.

      It would allow to self host the service for the Cloudron apps FreshRSS and Tiny Tiny RSS (and maybe even more that I don't think of right now?!)

      Postlight's Mercury Parser extracts the bits that humans care about from any URL you give it. That includes article content, titles, authors, published dates, excerpts, lead images, and more.

      Mercury Parser powers the Mercury AMP Converter and Mercury Reader, a Chrome extension that removes ads and distractions, leaving only text and images for a beautiful reading view on any site.

      Mercury Parser allows you to easily create custom parsers using simple JavaScript and CSS selectors. This allows you to proactively manage parsing and migration edge cases. There are many examples available along with documentation.

      • Chrome extension: https://mercury.postlight.com/reader/
      • FreshRSS plugin: https://github.com/simon-wessel/freshrss-mercury-parser
      • Tiny Tiny RSS Plugin: https://github.com/HenryQW/mercury_fulltext
      necrevistonnezrN Online
      necrevistonnezrN Online
      necrevistonnezr
      wrote on last edited by
      #2

      Now called „Postlight Parser and under active development: https://github.com/postlight/parser

      1 Reply Last reply
      2
      Reply
      • Reply as topic
      Log in to reply
      • Oldest to Newest
      • Newest to Oldest
      • Most Votes


      • Login

      • Don't have an account? Register

      • Login or register to search.
      • First post
        Last post
      0
      • Categories
      • Recent
      • Tags
      • Popular
      • Bookmarks
      • Search