Systems debugging for production reality.

I help AI, IoT, and edge teams debug failures that sit between application code, runtime dependencies, networks, devices, and customer environments.

Failure boundary: app code / runtime / toolchain / network / device / customer site

Production failures rarely stay in one layer.

Recent code changes are not always the cause. Tunnel instability may be a LAN problem. A database startup failure may be a runtime dependency problem. An agent failure may be a toolchain compatibility problem. The work starts by finding the boundary everyone assumed was already known.

How I usually help

Most work starts with a focused diagnostic: define the failure boundary, test the highest-signal hypotheses, and turn the result into something the team can reuse.

Anonymized failure patterns

These notes are anonymized, but the failure patterns are real: toolchain mismatch, replaced runtime dependency, and LAN-level IP conflict.

Where assumptions usually break

App code / Runtime / Toolchain / Network / Device / Customer site

Have a production issue that does not fit the usual checklist?

Send the symptom, environment, and what has already been tried.

Start with a diagnostic note