Cloudron makes it easy to run web apps like WordPress, Nextcloud, GitLab on your server. Find out more or install now.


Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Bookmarks
  • Search
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

Cloudron Forum

Apps | Demo | Docs | Install
  1. Cloudron Forum
  2. LanguageTool
  3. FYI size of n-gram data sets

FYI size of n-gram data sets

Scheduled Pinned Locked Moved LanguageTool
7 Posts 6 Posters 961 Views 6 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • luckowL Offline
    luckowL Offline
    luckow
    translator
    wrote on last edited by
    #1

    EN is around 8 GB

    Pronouns: he/him | Primary language: German

    necrevistonnezrN 1 Reply Last reply
    3
    • nebulonN Offline
      nebulonN Offline
      nebulon
      Staff
      wrote on last edited by
      #2

      good point, we will put that in the docs

      1 Reply Last reply
      0
      • RazielKanosR Offline
        RazielKanosR Offline
        RazielKanos
        wrote on last edited by
        #3

        how can i add another language? do i just
        NGRAM_DATASET=("en,de")?

        luckowL vladimir.dV 2 Replies Last reply
        0
        • RazielKanosR RazielKanos

          how can i add another language? do i just
          NGRAM_DATASET=("en,de")?

          luckowL Offline
          luckowL Offline
          luckow
          translator
          wrote on last edited by luckow
          #4

          @RazielKanos NGRAM_DATASET=("en;de") works for me.
          Sorry. Not true 🙂

          Pronouns: he/him | Primary language: German

          1 Reply Last reply
          0
          • RazielKanosR RazielKanos

            how can i add another language? do i just
            NGRAM_DATASET=("en,de")?

            vladimir.dV Offline
            vladimir.dV Offline
            vladimir.d
            wrote on last edited by vladimir.d
            #5

            @RazielKanos said in FYI size of n-gram data sets:

            how can i add another language? do i just
            NGRAM_DATASET=("en,de")?

            Basically it's a bash script array variable so you should split values by a whitespace.

            NGRAM_DATASET=("en" "de")
            

            I'm not a German speaker but I heard it works very well.
            Just wondering how it works with two languages.

            1 Reply Last reply
            4
            • girishG Offline
              girishG Offline
              girish
              Staff
              wrote on last edited by
              #6

              The warning is now in https://docs.cloudron.io/apps/languagetool/#n-grams . Also, the way to install ngrams has slightly changed.

              1 Reply Last reply
              1
              • luckowL luckow

                EN is around 8 GB

                necrevistonnezrN Offline
                necrevistonnezrN Offline
                necrevistonnezr
                wrote on last edited by
                #7

                @luckow said in FYI size of n-gram data sets:

                EN is around 8 GB

                This is download size. Unpacked it takes 14.34 GB of server space for English and 3.06 GB for German.

                2834EBAC-FFAF-40E4-B4B7-3584448CB671.jpeg 49076E70-5F1C-479F-911E-D1C717771557.jpeg

                1 Reply Last reply
                1
                Reply
                • Reply as topic
                Log in to reply
                • Oldest to Newest
                • Newest to Oldest
                • Most Votes


                • Login

                • Don't have an account? Register

                • Login or register to search.
                • First post
                  Last post
                0
                • Categories
                • Recent
                • Tags
                • Popular
                • Bookmarks
                • Search