What is the intention of /home/yellowtent/boxdata/mail/spamd ?
- 
I was just browsing around the file system out of curiosity and interest, and found /home/yellowtent/boxdata/mail/spamd and didn't see any references to this in the forum or documentation so figured it'd be a good chance to learn how that part of the Cloudron server works. I assume this is where SpamAssassin does it's learning from Spam mailboxes and stores it's Bayes tokens and such? In my case, I saw a ton of folders in there and oddly enough for many mailboxes that don't exist such as my username@differentDomain.tld and I suspect this may have to do with a long time ago when I setup a bunch of aliases in Rainloop to quickly switch to different domains hosted on my server for postmaster/webmaster, etc. I changed tactics a long time ago though and so unsure if this is just old left-over cruft. And if so, is it safe for me to remove those to clean it out a bit? 
- 
Yes, correct. /home/yellowtent/boxdata/mail/spamdcontains:- per-user spamassassin training information. Each spam directory contains bayes_seen and bayes_tok files (bayes is the algorithm for filtering spam).
- custom spamassassin rules in custom.cf
- spam blacklist in acl.cf
 You can safely delete the random mailbox directories. 
- 
Yes, correct. /home/yellowtent/boxdata/mail/spamdcontains:- per-user spamassassin training information. Each spam directory contains bayes_seen and bayes_tok files (bayes is the algorithm for filtering spam).
- custom spamassassin rules in custom.cf
- spam blacklist in acl.cf
 You can safely delete the random mailbox directories. 
- 
Yes, correct. /home/yellowtent/boxdata/mail/spamdcontains:- per-user spamassassin training information. Each spam directory contains bayes_seen and bayes_tok files (bayes is the algorithm for filtering spam).
- custom spamassassin rules in custom.cf
- spam blacklist in acl.cf
 You can safely delete the random mailbox directories. @girish In looking through the directory in more detail, I see what seems like a lot of duplicates, with the expected mailbox and then the same mailbox but at the primary domain of the server. Is that as designed? Seems strange, and not 100% across all of them, but many of them. Also what would be the worst case if I emptied that directory, would it just be that it has to re-learn the spam stuff from the users spam folders? 
- 
@girish In looking through the directory in more detail, I see what seems like a lot of duplicates, with the expected mailbox and then the same mailbox but at the primary domain of the server. Is that as designed? Seems strange, and not 100% across all of them, but many of them. Also what would be the worst case if I emptied that directory, would it just be that it has to re-learn the spam stuff from the users spam folders? @d19dotca said in What is the intention of /home/yellowtent/boxdata/mail/spamd ?: Also what would be the worst case if I emptied that directory, would it just be that it has to re-learn the spam stuff from the users spam folders? Yes, it just loses training data, that's all. Can you give me an example of what you mean by "the expected mailbox and then the same mailbox but at the primary domain of the server" ? 
- 
@robi Not causing any real issues to my knowledge but I like having a “clean” system where I can. And am interested in how those directories get handled under different situations (like when a mailbox is deleted) in case not all use-cases are accounted for. Because it seems to me there are a lot of old mailbox references here not to mention the seemingly duplicates for several of them too (duplicate first part but different domain where they don’t actually have a mailbox on that domain). Thus the questions.  
- 
@d19dotca said in What is the intention of /home/yellowtent/boxdata/mail/spamd ?: Also what would be the worst case if I emptied that directory, would it just be that it has to re-learn the spam stuff from the users spam folders? Yes, it just loses training data, that's all. Can you give me an example of what you mean by "the expected mailbox and then the same mailbox but at the primary domain of the server" ? @girish Okay perfect, I may just delete most of them then at some point to sort of “reset” them. I’ll perhaps send you a listing of the folder over email because I’d like to protect their privacy. But as an example: Let’s say I have a mailbox on example.com, the listing in this directory will have two in some cases for my mailbox: - dustin@example.com
- dustin@<primaryDomainOfCloudronServer>
 And I’m unsure why that is. There’s also a few that were never mailboxes so I have no idea how or why they were created. I can recognize most of them but there’s one or two I have absolutely no idea what it’s referencing as it was never a mailbox nor a name I even recognize. So it’s seem quite odd. I’ll email you a listing though and won’t delete anything yet in case you want to look into it on why and how they could have been created and such. 
- 
@girish Okay perfect, I may just delete most of them then at some point to sort of “reset” them. I’ll perhaps send you a listing of the folder over email because I’d like to protect their privacy. But as an example: Let’s say I have a mailbox on example.com, the listing in this directory will have two in some cases for my mailbox: - dustin@example.com
- dustin@<primaryDomainOfCloudronServer>
 And I’m unsure why that is. There’s also a few that were never mailboxes so I have no idea how or why they were created. I can recognize most of them but there’s one or two I have absolutely no idea what it’s referencing as it was never a mailbox nor a name I even recognize. So it’s seem quite odd. I’ll email you a listing though and won’t delete anything yet in case you want to look into it on why and how they could have been created and such. 
- 
@d19dotca We had a bug a year or so ago where spam detection was being done with incorrect addresses. It has the symptoms you mention. You will see random addresses, cc addresses etc there. You can safely delete them, they won't reappear. If they reappear, let us know. 
- 
@d19dotca We had a bug a year or so ago where spam detection was being done with incorrect addresses. It has the symptoms you mention. You will see random addresses, cc addresses etc there. You can safely delete them, they won't reappear. If they reappear, let us know. 
- 
@d19dotca We had a bug a year or so ago where spam detection was being done with incorrect addresses. It has the symptoms you mention. You will see random addresses, cc addresses etc there. You can safely delete them, they won't reappear. If they reappear, let us know. @girish Just a quick question: When would the folders get re-generated by Cloudron? For context: I moved all the files inside of that directory to /home/ubuntu/spamd-backup/ just so I had it in case I needed to restore. So the directory was then empty, and I restarted the mail service and it auto-generated the acl.cf and custom.cf files, but I don't yet see any user mailbox folders in there. Is that expected? 
- 
Okay I see a couple of mailboxes regenerated in there now so that's good. I feel less scared. haha. Still not sure when it gets generated though, I had tried for example moving one email to my own spam folder and checked but didn't see it generated. Any insight there? 
- 
They get created when the spam filter learnt something about the address. This can happen because of user action - user marked spam or ham (moving in and out of Spam folder). There is also nightly cron job that learns from recent mail (mail of last 3 days) of each user. 
- 
They get created when the spam filter learnt something about the address. This can happen because of user action - user marked spam or ham (moving in and out of Spam folder). There is also nightly cron job that learns from recent mail (mail of last 3 days) of each user. @girish Is it expected to see forwarding addresses created in there? For example, I have one directory created in there for an email that doesn't reside on my server and is used in forwarding only from a mailing list in Cloudron (not from a mailbox). Seems strange that'd be in there, no? 
- 
@d19dotca Yes, you shouldn't see forwarding addresses in spamd directory. Do you know if this address was a mailbox before? If so, can you also double check if /home/yellowtent/boxdata/mail/vmail/<forwardingaddress> exists ? @girish I’ll check in the morning for sure, I’ll double check if it exists somehow in vmail. It shouldn’t though as it’s an @me.com address (Apple-owned) so it’s never been something I could even host at all in the first place. Though now that I think about it, back before Cloudron allowed mailing list to external addresses, it was a mailbox on their business domain and then a forwarding rule assigned to their @me.com address. But that mailbox was removed soon after mailing lists could forward externally and is just a mailing list since the. So maybe there’s still something in there? I’ll update tomorrow  
- 
Oh, the vmail and spamd directories should only ever contain addresses that are hosted on Cloudron. If you see anything not belonging to Cloudron, you can delete it . If they "re-appear", please report here, because that is some bug. I thought we fixed this bug a long time ago. I quickly checked atleast 5 Cloudrons that I could get hold of and none of them have external mailboxes (anymore). 
- 
Oh, the vmail and spamd directories should only ever contain addresses that are hosted on Cloudron. If you see anything not belonging to Cloudron, you can delete it . If they "re-appear", please report here, because that is some bug. I thought we fixed this bug a long time ago. I quickly checked atleast 5 Cloudrons that I could get hold of and none of them have external mailboxes (anymore). 
 


