▲ | Drahflow 9 hours ago | |||||||||||||
The point that the trinity of logs, metrics and traces wastes a lot of engineering effort to pre-select the right metrics (and labels) and storage (by having too many information triplicate), is a good one. > We believe raw data based approach will transform how we use observability data and extract value from it. Yep. We have built quuxLogging on the same premise, but with more emphasis on "raw": Instead of parsing events (wide or not), we treat it fundamentally as a very large set of (usually text) lines and optimized hard on the querying-lots-of-text part. Basically a horizontally scaled (extremely fast) regex engine with data aggregation support. Having a decent way to get metrics from logs ad-hoc completely solves the metric cardinality explosion. | ||||||||||||||
▲ | wvh 8 hours ago | parent | next [-] | |||||||||||||
Many companies are having trouble to even keep Prometheus running without it getting OOM killed though. I understand and agree with the problem this is trying to solve; but the solution will rival the actual business software it is observing in cost and resource usage. And hence, just like in quantum mechanics, observing it will drastically impact the event. | ||||||||||||||
| ||||||||||||||
▲ | thewisenerd 8 hours ago | parent | prev [-] | |||||||||||||
> having a decent way to get metrics from logs ad-hoc completely solves the metric cardinality explosion. last i checked, the span metrics connector[1] was supposed to "solve" this in otel; but i'm not particularly inclined, as configurations are fixed. any data analytics platform worth it's money should be able to do this at runtime (for specified data volume constraints, in reasonable time). in general, structured logging should also help with this; as much as i love regex, i do not think extracting "data" from raw logs is lossless. [1] https://github.com/open-telemetry/opentelemetry-collector-co... |