sqz - context compressor for LLM CLIs
Rust CLI that compresses prompt and context payloads before they hit the model, trading a small accuracy delta for measurable token-cost savings.
Tag
8 entries tagged with #cost-tracking.
Statusline replacement for Claude Code with real-time cost tracking, MCP server status, and themeable layouts. Bash-based and cross-platform.
CLI proxy for Claude Code, Cursor, Copilot, and Gemini that strips noise from LLM context with declarative YAML filters. Reports 60-90% token savings on typical agent traffic.
Real-time terminal monitor for Claude Code: cost, cache efficiency, model comparison, and alert thresholds. Targeted at users running long agent sessions who need spend visibility.
Token usage analyzer and cost tracker for opencode sessions. Breaks down spend across models, tools, and conversations.
Python proxy that intercepts LLM API traffic and shows real-time token usage, costs, and context window in a terminal dashboard. Useful for debugging prompts and tracking spend.
Python TUI that finds where extra tokens are burned in Claude Code sessions, why sessions fail, and what to fix. Built on Textual, focused on debugging your own usage.
Interactive terminal dashboard that breaks down where your AI coding tokens actually go. Surfaces the chat-vs-tool-use split most users get wrong.