A new tool category has emerged around observing, recording, and analyzing the interactions between developers and AI coding agents (Claude Code, Cursor, Copilot, Codex, OpenCode, etc.). These tools sit alongside the coding agent -- intercepting traffic, reading session files, hooking into agent events, or integrating into the IDE -- and provide visibility into what was asked, what was done, how much it cost, and what patterns emerge over time.
This is distinct from production LLM observability (Langfuse, LangSmith, Helicone, etc.), which monitors AI applications running in production. The tools here are focused on developer workstations and coding sessions.
The field is roughly six months old as a recognizable category and is evolving rapidly.