Built for the failures nobody else can explain.
kprobe exists because there is a class of failure in distributed financial systems that the entire observability industry cannot see. Not because the tools are bad — they are excellent at what they do. But because they sit in the wrong place in the stack.
Where this comes from
The pattern is familiar to anyone who has operated a financial system at scale. A payment fails at 3am. The on-call engineer wakes up, opens Datadog, sees a latency spike. Opens Jaeger, finds a slow span. Spends four hours correlating logs across six microservices, working backwards from an error to something that might have caused it.
At the end of the investigation, nobody can fully confirm the root cause. A hypothesis is formed. A fix is shipped. The fix might work. It might not. If it does not, the incident repeats.
This is not a failure of the tools. Datadog recorded exactly what it saw. Jaeger traced exactly what it was told to trace. The problem is structural — the relevant events happened below where any of these tools can see. In the kernel. In the scheduler. In the memory subsystem.
kprobe was built to close that gap. Not to replace the existing stack, but to extend it downward into the layer where the hardest financial system failures actually originate.
A flight recorder and a debugger. Nothing else.
- A kernel-level event recorder deployed passively alongside your services
- A causal inference engine that constructs cause-and-effect graphs from raw kernel events
- A deterministic replay engine for reproducing production incidents on development machines
- A tool for financial systems engineering teams investigating complex, low-level failures
- Complementary to your existing observability stack
- A monitoring tool — it does not alert on thresholds or track SLOs
- A replacement for Datadog, Jaeger, or OpenTelemetry
- An APM platform — it does not aggregate application performance metrics
- A general-purpose observability tool — it is purpose-built for kernel-level incident investigation
- Appropriate for systems not running on Linux kernel 5.15+
Every decision has a reason.
The choices that make kprobe possible.
kprobe is open source.
kprobe is in active early development. The core pipeline — eBPF probe, Kafka transport, causal engine, Neo4j graph model, gRPC API — is complete. The dashboard and replay panel are in progress.
If you work on financial infrastructure, observability tooling, or low-level systems and want to contribute or share feedback, the repository is open.