eBPF · Causal Graphs · Deterministic Replay

Kernel-level visibility
for financial systems.

kprobe attaches to the Linux kernel using eBPF and captures everything your application-layer tools miss — CPU scheduling decisions, memory pressure events, network packet timing — without touching a single line of your code.

No instrumentation required Attaches directly to kernel tracepoints. No library imports, no redeployment, no code changes ever.
Full causal chain to kernel level Not just what failed — exactly why, down to the kernel event that triggered it, across every service.
Deterministic incident replay Reproduce any production incident on a dev machine hours after it happened. Test fixes before shipping.
Rust + Aya · Go · Kafka · ClickHouse · Neo4j · Kubernetes
live · kernel event stream
03:47:12.004 tcp_sendmsg PID 2841 payment-handler → settlement-svc · 1.2kb
03:47:12.005 sys_write PID 2841 fd=7 · ledger write initiated
03:47:12.821 mm_page_fault PID 4721 addr=0x7f3a · batch-job memory pressure
03:47:12.822 sched_switch PID 2841 preempted · CPU 3 → PID 4721
03:47:13.210 sched_switch PID 2841 resumed · CPU 3 · 388ms delayed
03:47:13.621 sys_write PID 2841 fd=7 · completed · 802ms total
03:47:13.622 tcp_recvmsg PID 2841 timeout — payment-handler · ERR
Causal graph engine
Correlates kernel events with OTel traces on PID + timestamp. Builds a directed causal graph in Neo4j. Any financial event to root kernel cause in milliseconds.
Deterministic replay
Reproduce any production incident exactly on a dev machine via ptrace. Inject timing changes, test fixes against the real event log before shipping.
Nanosecond precision
Every event timestamped at nanosecond resolution. Timeline view zoomable to microsecond level across all services simultaneously.
Financial domain primitives
Settlement boundaries, clearing windows, ledger writes, order book ops — kprobe maps kernel events to your financial domain natively.
Payment #98721 received
payment-handler · 03:47:12.004
Risk check passed
risk-engine · 03:47:12.408
0.4ms
Settlement write initiated
settlement-svc · 03:47:12.819
1.2ms
Memory pressure — kernel
PID 4721 · batch-job competing for RAM · 03:47:12.821
root cause
Settlement write completed
settlement-svc · 03:47:13.621
800ms
Payment failed — timeout exceeded 750ms
payment-handler · 03:47:13.622
+52ms
5 min
median investigation time with kprobe
4 hrs
median investigation time without it
0
lines of application code changed

kprobe was recording the entire time. The causal graph was ready before anyone woke up.

What no other tool sees.

Signal Datadog Jaeger OpenTelemetry kprobe
Distributed traces Yes Yes Instrumented only Zero instrumentation
Database query timing Partial Partial Partial Yes
CPU scheduling decisions Yes
Memory pressure events Yes
Network packet-level timing Yes
Cross-process causal chain Yes
Root cause to kernel level Yes
Deterministic incident replay Yes

One command into any cluster.

Requires Kubernetes 1.26+ and Linux kernel 5.15+. No changes to existing services.

$ helm repo add kprobe https://charts.kprobe.io
$ helm install kprobe kprobe/kprobe --namespace monitoring --create-namespace

Every technical decision was deliberate.

Rust
Rust + Aya
Kernel-side eBPF programs and userspace loader. Compiled to eBPF bytecode. No C anywhere — memory-safe from the kernel up.
Go
Go
Causal engine, replay engine, gRPC API server. Syscall interception for deterministic replay via ptrace.
Kafka
Apache Kafka
KRaft mode, topic-per-event-type. Handles millions of kernel events per second. Durable and replayable, no Zookeeper.
ClickHouse
ClickHouse
Columnar storage for billions of timestamped kernel events. Sub-second analytical queries at scale.
Neo4j
Neo4j
Graph database for causal relationships. Cypher queries traverse from any financial event to root kernel cause in milliseconds.
Kubernetes
Kubernetes
eBPF probe deployed as a DaemonSet across all nodes. Single Helm install deploys the full stack into any existing cluster.