ThinkWatch - enterprise AI and MCP bastion host
Rust gateway in front of OpenAI, Anthropic, Gemini, and self-hosted LLMs (plus MCP servers) with RBAC, audit logs, rate limits, and cost tracking. The boring layer enterprises actually need.
44 entries tagged with #observability.
Tools for seeing what your agents, services, and infrastructure are actually doing: token spend, runtime state, traces, and metrics.
CLI that scans ~/.claude for MCPs, skills, hooks, and CLAUDE.md files, shows how many tokens each consumes before you type, and lets you remove items with reversible snapshots.
Local-first Rust TUI that searches Claude Code, Codex, and OpenCode session history with hybrid full-text plus semantic retrieval. Built on ratatui.
Electron capture and inspection tool for Model Context Protocol traffic. Records every HTTP request/response between an IDE and its MCP servers for forensic analysis.
Terminal UI to search, preview, read, and resume Claude Code and Codex sessions in a unified view. More framework integrations planned.
npx dashboard to manage Claude Code memories, configs, and MCP servers. Includes a tool-poisoning scanner, context token budget tracker, duplicate cleanup, and scope management.
Rust TUI and CLI that indexes and searches local coding-agent session history across Codex, Claude Code, Gemini, Cursor, Aider, and seven other providers.
Swift macOS app that browses, archives, and resumes sessions across Codex CLI, Claude Code, OpenCode, Gemini CLI, Factory Droid, GitHub Copilot CLI, and OpenClaw with rate-limit tracking.
tmux sidebar that tracks Amp, Claude Code, Codex, and OpenCode sessions with per-thread markers, a local HTTP API, and live session state.
Rust terminal app that puts every Claude/Codex/Gemini/Kilocode session on a GPU-accelerated infinite canvas you can pan and zoom across.
MCP-fronted router that runs deterministic shell commands locally with ~19 µs latency, bypassing the LLM entirely. Cuts token spend on routine commands inside Claude Code, Codex, Cursor, and Copilot.
Kanban-style UI where agents take tickets, message each other, and review each other's work. Supports Codex, Claude, OpenCode, and 75+ LLM providers.
Swift menu-bar app that surfaces Claude Code sessions in the macOS notch: monitor progress, approve permission prompts, and jump back into a session without leaving the current window.
Single-binary Rust OpenTelemetry receiver and dashboard for local LLM development. Captures spans from agent runs without spinning up Jaeger or a vendor backend.
TypeScript policy engine that wraps agent execution with cryptographic audit trails, human-in-the-loop approvals, and a kill switch, with no code changes to the agent itself.
Lightweight Go daemon that tracks AI API quotas across Anthropic, Codex, Copilot, Antigravity, Synthetic, and Z.ai with a Material Design 3 dashboard. Local SQLite, zero telemetry.
Python debugger built for AI agents: captures every executed function call, return value, local variable, and HTTP request into greppable trace files.
Statusline replacement for Claude Code with real-time cost tracking, MCP server status, and themeable layouts. Bash-based and cross-platform.
CLI proxy for Claude Code, Cursor, Copilot, and Gemini that strips noise from LLM context with declarative YAML filters. Reports 60-90% token savings on typical agent traffic.
Self-hostable gateway that routes requests across Anthropic, OpenAI, and other LLM providers with API-key management, analytics, and per-team policies. Designed for multi-provider agent deployments.
Real-time terminal monitor for Claude Code: cost, cache efficiency, model comparison, and alert thresholds. Targeted at users running long agent sessions who need spend visibility.
eBPF agent that captures network traffic from containers and processes before it is encrypted, attributing every egress to its originating process. Aimed at observability and exfiltration detection in agent runtimes.
React component library for visualizing distributed traces from AI agents. Drop-in widgets for timelines, span trees, and tool-call breakdowns from LangChain or custom runtimes.
Local-first tracker for Claude Code, Codex, Cursor, Gemini, Kiro, OpenCode, and OpenClaw token usage with a web dashboard, native macOS menu bar app, and four desktop widgets. Zero-config setup.
Real-time Claude Code statusline showing token usage, remaining budget, burn rate, and depletion time. Drop-in shell prompt addition.
Electron desktop app that visualizes how Claude Code constructs prompts under the hood. Useful for debugging context, tool calls, and prompt assembly when an agent goes off the rails.
Zero-dependency Go dashboard for OpenClaw AI agents covering cost tracking, token usage, and per-agent monitoring. Single-binary deploy.
Desktop app to browse and analyze Claude Code conversation history. Also reads OpenCode and Codex transcripts so you can compare runs across harnesses.
Token usage analyzer and cost tracker for OpenCode sessions. Breaks down spend across models, tools, and conversations.
Rust TUI plus JSON/Prometheus metrics server for M1-M5 Macs. Requires no sudo and exposes CPU, GPU, and power metrics for Grafana and ad-hoc inspection.
Python proxy that intercepts LLM API traffic and shows real-time token usage, costs, and context window in a terminal dashboard. Useful for debugging prompts and tracking spend.
Rust gateway that fronts LLMs and MCP servers with policy enforcement and observability. Aimed at securing agent traffic in larger deployments.
Open agent control plane in Go that enforces pre-execution policy, approval gates, and audit trails over LangChain, CrewAI, MCP, or any framework.
Rust gateway and debugger for AI agent traffic across Anthropic, OpenAI, Azure, Gemini, DeepSeek, and others. Trace and inspect tool calls in flight.
Python tool that breaks down Claude Code context window usage per tool call. Surfaces which Read or Grep calls are eating the most tokens.
Python TUI that finds where extra tokens are burned in Claude Code sessions, why sessions fail, and what to fix. Built on Textual, focused on debugging your own usage.
Interactive terminal dashboard that breaks down where your AI coding tokens actually go. Surfaces the chat-vs-tool-use split most users get wrong.
Rust TUI that monitors Claude Code and Codex sessions: token spend, context window, rate limits, and ports in real time. Like btop but for agent runtimes.
MCP server that lets agents query and analyze Prometheus metrics through a standardized tool interface. Useful for ops-aware coding agents.
Watch your Claude Code, Codex, and OpenCode sessions from a tiny Go TUI. Shows what each agent is doing, no SaaS dashboard needed.
Watch your Claude Code agents think, branch, and coordinate as they run. Live visualization of subagent trees and tool calls in the browser.
End-to-end platform for evaluating, observing, and improving LLM and agent apps. Tracing, evals, simulations, datasets, and prompt management in one project.
Build, deploy, and scale MCP servers with auth, observability, debugging, telemetry, and a runtime baked in. Skips the per-server boilerplate.
Fast, remote-first log viewer that tails multiple SSH targets at once and renders a histogram timeline. No central log server required.