| ▲ | blutoot 3 hours ago | |||||||
I'm a little confused. An agent's value-add is to automate what a human actor (in this case, an SRE) does and thus reduces the time taken to recovery, etc. A human SRE never manually detects an error - we already have well-established anomaly detection implementations and wiring them to some ticket generation tool is also an established pattern. My confusion is, what value the "agent" is bringing here. Nothing wrong in competing with the Datadogs of the world. | ||||||||
| ▲ | kemotep 3 hours ago | parent | next [-] | |||||||
I guess if you don’t want to have to pay for Rapid7 or are too lazy to configure the Teams/Slack integration for your EDR? But I mean you still have to pay for a Claude API with Moltclaw or whatever no? | ||||||||
| ▲ | nisegami 3 hours ago | parent | prev | next [-] | |||||||
>A human SRE never manually detects an error - we already have well-established anomaly detection implementations and wiring them to some ticket generation tool is also an established pattern. I'm currently dealing with fallout at job because we were doing all this with humans with no alerts and we missed a couple major issues. This product could have prevented a lot of stress in my case, but it'd be a bit like a bandage on a missing limb. | ||||||||
| ||||||||
| ▲ | esseph 3 hours ago | parent | prev [-] | |||||||
Logs are pretty dry sometimes. INFO gives you a ton but it's low SNR. WARN/ERROR may tell you that something could happen or is happening, but it doesn't tell you the ramifications of that may be. It could be nothing! Now imagine you're getting hundreds, thousands, millions of messages like this an hour? How do you determine what's really important? For instance, if a kubernetes pod on a single node runs out of space, that could be a problem if your app is only running in that node. But what if your app is spread against 30x nodes? It's a triage system with context, at least it sounds like it. It's helping you classify based on actual current or potential problems with the app in the ways that a plain log message does not. | ||||||||
| ||||||||