robinhoodexe 8 months ago
We’re running standard Prometheus on Kubernetes (14 on-prem Talos clusters, total of 191 nodes, 1.1k CPU cores, 4.75 TiB memory and 4k pods). We use Thanos to store metrics in self-hosted S3 (seaweedfs) with 30 days retention, aggressively downsample after 3 days. It works pretty well tbh. I’m excited about upgrading to version 3, as it does take a lot of resources to keep going, especially on clusters with a lot of pods being spawned all the time.
xgbi 7 months ago
Hey robinhood, any feedback on Talos? We've been using Talos for our internal clusters for a while, but with quite small ones (3 kube nodes, 5 worker nodes). Upgrading has generally been a non-event, and we're quite happy with them. How do you deploy Thanos? In one of the clusters?
ChocolateGod 8 months ago
> We use Thanos to store metrics in self-hosted S3 (seaweedfs) with 30 days retention, aggressively downsample after 3 days.

Any reason to not just use Mimir for this?
|