Collection · 177 entries

Claude Code tools, plugins, and integrations

The best tools, MCP servers, and harnesses for getting more out of Claude Code - orchestration, observability, telemetry, and remote control.

Claude Code is the largest cluster in this directory for a reason: most of the interesting work in coding agents right now happens around it. The picks below cover orchestration (multi-agent harnesses, parallel runners), observability (token cost, session monitors, live agent views), security (sandboxes, scrubbers, governance), and remote control (Telegram, Slack, mobile).

This isn't an exhaustive list of everything that touches Claude Code - it's the curated subset where I've actually read the code or notes and think the idea is worth knowing about.

GitHub · Hack · Featured

Claude Code Analysis - architectural reverse-engineering of the leaked source

82 docs and 15 diagrams mapping every major subsystem of Claude Code's accidentally exposed 512K-line TypeScript source - YOLO classifier, 93% context compaction, prompt-cache layout, 88+ feature flags, the custom React-Fiber terminal renderer.

Why I saved this - Useful primary source for anyone building a coding agent - the YOLO two-stage classifier, the cache-busting after MCP instructions, and the 6 compaction strategies are the bits nobody else has documented.
GitHub · Tool · Featured

trace-mcp - framework-aware codebase MCP for coding agents

MCP server with 138 tools and cross-language framework awareness (58 integrations across 81 languages). Indexes Laravel/Inertia/Vue, Rails/Hotwire, Django/HTMX edges so agents skip re-deriving call graphs. Decision memory links architectural choices to the code they're about. Local-first ONNX embeddings, optional LSP enrichment.

Why I saved this - Distinct from Qartez - Qartez is structural (PageRank, blast radius), trace-mcp is framework-semantic. The cross-language edges (Laravel controller -> Vue page via Inertia) are the differentiating bit.
GitHub · Library · Featured

Garden Skills - production skill pack for Claude Code, Cursor, and Codex

Three carefully-scoped skills: web-design-engineer (with an anti-cliche blocklist that breaks the generic-AI-landing-page loop), gpt-image-2 (80+ templates, three runtime modes including advisor-only fallback), and kb-retriever (layered data_structure.md navigation for bounded local-KB retrieval). Tested across Claude Code, Claude.ai, Cursor, Codex, Gemini, OpenCode.

Why I saved this - The web-design skill's anti-cliche blocklist is the most opinionated take on 'stop producing the same hero + 3 cards' I've seen.
GitHub · Tool · Featured

wanman - worktree-isolated multi-agent runtime for Claude Code and Codex

Multi-agent runtime that spawns each Claude Code or Codex agent in its own git worktree and home directory. JSON-RPC subprocess control, task pooling, artifact storage. Solves the share-a-directory failure mode that breaks most multi-agent harnesses.

Why I saved this - The 'one-man train' framing is load-bearing: humans observe rather than approve every step. Worktree-per-agent isolation is the upgrade most multi-agent harnesses skip.
GitHub · Tool · Featured

PostTrainBench - can a CLI agent post-train a base LLM in 10 hours?

Benchmark measuring whether Claude Code, Codex CLI, Gemini CLI, and OpenCode can autonomously improve 4 small base models (Qwen3-1.7B/4B, SmolLM3-3B, Gemma-3-4B) on 7 evals (AIME, BFCL, GPQA, GSM8K, HealthBench, HumanEval, Arena Hard) using a single H100 GPU within 10 hours. Includes an agent-as-judge check against reward hacking and baseline-replacement penalties for tampering.

Why I saved this - Current leader: Opus 4.6 via Claude Code at 23.2 average. The reward-hacking safeguards (eval tampering and model-substitution detection, baseline-replacement penalty) are the part most agent benchmarks skip.
GitHub · Tool · Featured

mcptube - Karpathy-style LLM wiki for YouTube

MCP server that turns YouTube videos into a persistent, merging wiki rather than ephemeral vector chunks. Scene-change frame extraction + vision analysis captures slides, code, and diagrams that transcripts miss. 25+ MCP tools, FTS5+LLM hybrid retrieval, version history with source attribution per claim.

Why I saved this - The wiki-merge design is the differentiator vs RAG-over-YouTube clones - one MCP article with citations, not ten near-duplicate chunks. Scene-change extraction is what makes visual-heavy talks usable.
GitHub · Tool · Featured

Claudraband - persistent, resumable Claude Code sessions over HTTP and ACP

Wraps the real Claude Code TUI with a session lifecycle layer. Resumable non-interactive workflows, HTTP daemon for remote/headless control, ACP server for editor integrations (Zed, Toad). Drives your existing Claude Code install rather than reimplementing it - keeps skills, hooks, MCPs, and approvals intact.

Why I saved this - Different from the Claude SDK - Claudraband drives the real CLI from outside, so user-installed skills/hooks/MCPs all still work. The ACP support is the easy path to editor integrations.
GitHub · Library

gpt_image_2_skill - 162-prompt gallery and skill for GPT Image 2

Curated gallery, CLI, and agentic skill for OpenAI's GPT Image 2. 162 reusable prompts across anime, gaming, photography, UI/UX, and research-figure categories. Supports text-to-image, mask-edit, multi-reference edits, and batch ops. Installable as a Claude Code plugin, Codex skill, or standalone CLI via uv.

Why I saved this - Bigger prompt gallery than garden-skills' gpt-image-2 (162 vs 80+) but narrower in scope - this is just the image skill, not a multi-skill pack.

Frequently asked

What's the difference between Claude Code orchestration and observability tools?

Orchestration tools (oh-my-team, agent-flow, paseo) coordinate multiple agents or sessions; observability tools (codeburn, abtop, lazyagent) read session state passively to surface token spend, context usage, and what agents are doing right now.

Are there security tools specifically for Claude Code?

Yes - see the agent security collection. AgentShield scans configs and MCP servers, LLM-Anonymization scrubs data before requests, and Destructive Command Guard blocks dangerous shell commands at runtime.

Which tools work across Claude Code, Codex, and other coding agents?

Most observability and orchestration tools here are multi-engine: agent-deck, codesight, paseo, parallel-code, and agentic-stack all support several coding agents in the same workflow.
