I'd like to have more insight in how every app and service is performing. For example:
- Average response time for each app
- Whether an app is doing too many I/O operations (and/or swapping)
- Whether I should increase RAM on services such as MySQL
- Some API for error logging and reporting of async tasks
Some general dashboard with a notification system would be brilliant.
Any idea how to approach this without having to build it from scratch or using some expensive and overly complex cloud service?
This indeed would be nice. We have a collectd running for various stats like memory usage, however so far we have had a hard time to make that data really useful in my opinion.
I wonder if this can be done as an app or has to be built into the platform itself.
Maybe one of these? https://phoenixnap.com/blog/docker-container-monitoring-tools
I'll look into Zabbix, thanks for the recommendation