Cloudron Forum

FYI size of n-gram data sets

7 Posts 6 Posters 2.4k Views 6 Watching
    luckow (translator) wrote (#1):

    EN is around 8 GB

    Pronouns: he/him | Primary language: German

      nebulon (Staff) wrote (#2):

      Good point, we will put that in the docs.

        RazielKanos wrote (#3):

        How can I add another language? Do I just set
        NGRAM_DATASET=("en,de")?

          luckow (translator) wrote, last edited by luckow (#4):

          @RazielKanos NGRAM_DATASET=("en;de") works for me.
          Edit: Sorry, that is not true 🙂

          Pronouns: he/him | Primary language: German

            vladimir.d wrote, last edited by vladimir.d (#5):

            @RazielKanos said in FYI size of n-gram data sets:

            how can i add another language? do i just
            NGRAM_DATASET=("en,de")?

            Basically, it's a Bash array variable, so you should separate the values with whitespace.

            NGRAM_DATASET=("en" "de")
            

            I'm not a German speaker but I heard it works very well.
            Just wondering how it works with two languages.
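
            To illustrate the array point above, here is a minimal Bash sketch comparing the three spellings that came up in this thread. The variable names `comma`, `semicolon`, and `spaced` and the loop are purely illustrative assumptions; how the Cloudron package actually consumes NGRAM_DATASET is not shown here.

            ```bash
            #!/usr/bin/env bash
            # A quoted string containing "," or ";" is still ONE array element.
            comma=("en,de")
            semicolon=("en;de")
            # Whitespace-separated words become separate elements.
            spaced=("en" "de")

            echo "comma:     ${#comma[@]} element(s)"
            echo "semicolon: ${#semicolon[@]} element(s)"
            echo "spaced:    ${#spaced[@]} element(s)"

            # Hypothetical consumer: a script looping over the array sees each language on its own.
            for lang in "${spaced[@]}"; do
                echo "language entry: ${lang}"
            done
            ```

            Only the whitespace-separated form yields one entry per language, which is presumably why the comma and semicolon variants did not work.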

              girish (Staff) wrote (#6):

              The warning is now in https://docs.cloudron.io/apps/languagetool/#n-grams . Also, the way to install n-grams has changed slightly.

                necrevistonnezr wrote (#7):

                @luckow said in FYI size of n-gram data sets:

                EN is around 8 GB

                That is the download size. Unpacked, it takes 14.34 GB of server space for English and 3.06 GB for German.
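
                As a rough way to check this before installing, the sketch below sums the unpacked sizes quoted above for the selected languages and compares the total with the free space on the current filesystem. The helper `unpacked_gb` and the hard-coded sizes are assumptions taken from this post only; newer data sets or other languages may differ, and `df` reports binary units, so treat the comparison as approximate.

                ```bash
                #!/usr/bin/env bash
                # Unpacked sizes in GB as reported in this thread (assumption: they may have changed since).
                unpacked_gb() {
                    case "$1" in
                        en) echo 14.34 ;;
                        de) echo 3.06 ;;
                        *)  echo 0 ;;   # unknown language: size not known from this thread
                    esac
                }

                NGRAM_DATASET=("en" "de")

                # Sum the sizes of the selected languages; awk handles the decimals.
                total_gb=$(for lang in "${NGRAM_DATASET[@]}"; do unpacked_gb "$lang"; done \
                    | awk '{ s += $1 } END { printf "%.2f", s }')
                echo "n-grams need roughly ${total_gb} GB unpacked"

                # Free space on the current filesystem (df -Pk prints 1K blocks; column 4 is "Available").
                free_gb=$(df -Pk . | awk 'NR == 2 { printf "%.2f", $4 / 1024 / 1024 }')
                echo "free space here: about ${free_gb} GB"

                if awk -v need="$total_gb" -v free="$free_gb" 'BEGIN { exit !(free > need) }'; then
                    echo "enough space for the selected n-gram data sets"
                else
                    echo "not enough space for the selected n-gram data sets"
                fi
                ```

                Run it from the directory (or volume) where the n-grams will be stored, since `df .` only looks at the filesystem of the current directory. The download archives need additional temporary space on top of the unpacked total.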

