pentest-ai-agents - Claude Code subagents for offensive security
Specialized Claude Code subagents that turn the CLI into a pentest assistant: plan engagements, analyze recon, research exploits, build detections, audit STIGs, and write reports.
Tag
13 entries tagged with #security.
Sandboxes, scanners, proxies, and privacy/anonymization tools. See the curated agent security guide for a structured walkthrough.
Specialized Claude Code subagents that turn the CLI into a pentest assistant: plan engagements, analyze recon, research exploits, build detections, audit STIGs, and write reports.
Pre-installed sandboxed VM with Claude and Codex ready to run, plus git credentials wired up. Removes the 'press enter to accept' loop without exposing the host.
Rust gateway in front of OpenAI, Anthropic, Gemini, and self-hosted LLMs (plus MCP servers) with RBAC, audit logs, rate limits, and cost tracking. The boring layer enterprises actually need.
MCP server tailored for defenders - exposes detection-engineering primitives so agents can author, refactor, and validate SIEM/EDR detections.
Rust TUI that wraps gpg key management: list, generate, sign, export, and edit trust without memorizing flag combinations.
Single Rust binary that decodes, encodes, and validates JWTs from the command line. Standard tool when debugging auth flows.
Embeddable Go secret store using Argon2id and XChaCha20-Poly1305 by default, with four security levels, audit chains, and crash-safe rotation. Vault when Vault is overkill.
CLI, GitHub Action, and GitHub App that scan agent configs, MCP servers, and tool permissions for vulnerabilities. Detects skill poisoning and prompt-injection vectors.
Reverse proxy for Claude Code that strips IPs, hashes, credentials, and PII before requests hit Anthropic. Dual-layer detection: local Ollama LLM plus regex.
Brex's HTTP proxy that uses an LLM judge to vet agent traffic in production. Drop it in front of any agent runtime to block exfiltration and jailbreaks.
Microsoft's policy engine for autonomous agents: zero-trust identity, execution sandboxing, and reliability checks. Maps to all 10 OWASP Agentic Top 10 categories.
Rust CLI that blocks dangerous git and shell commands before an agent can run them. Pattern-matched safety net for autonomous coding agents.
Lightweight, cross-platform process sandbox in Rust. Wraps any command with file, network, and credential controls - built on OpenAI Codex's runtime primitives.