Tag

AI agents and agent runtimes

68 entries tagged with #ai-agent.

Tools and libraries that build, run, or interface with autonomous AI agents.

pentest-ai-agents - Claude Code subagents for offensive security

Specialized Claude Code subagents that turn the CLI into a pentest assistant: plan engagements, analyze recon, research exploits, build detections, audit STIGs, and write reports.

Why I saved this - Authorized-use scope is explicit in the README - it is a research harness, not a 'jailbreak the agent' kit.

#claude-code #security #pentest #ctf #ai-agent

GitHub Tool

OpenSail - open AI workspace platform

Self-hostable platform for building, running, and sharing AI workspace agents and apps with any model. No vendor lock-in - bring your own LLM provider or run local.

#ai-agent #self-hosted #agent-framework #desktop-app #python

GitHub Library

Foundry - foundation layer for agentic intelligence

Python agent runtime and framework aimed at production agentic systems. Early but already has 800+ stars and a clear shape around runtime primitives.

#agent-framework #agent-runtime #ai-agent #python #multi-agent

GitHub Tool

agent-hub - one chat surface for every local and remote agent

Open-source hub that connects to Claude Code, Codex, Hermes, OpenClaw, and other agent runtimes - local or on remote machines - through a single chat UI. Less workflow-tied than Conductor.

#claude-code #codex #multi-agent #ai-agent #remote-control

GitHub Hack

AI Inner OS - inner-monologue plugin for AI CLIs

Cross-tool plugin for Claude Code, Codex CLI, Cursor, and OpenCode CLI that injects an optional 'inner monologue' track alongside normal output. The model decides whether and how to use it.

#claude-code #codex #cursor #ai-agent #prompt-engineering

GitHub Tool

helix-agent - Claude Code as a personal Discord/Telegram bot

Always-on personal agent harness powered by Claude Code with Discord, Telegram, and built-in web UI front-ends. The 'phone in your pocket runs an agent' setup.

#claude-code #telegram #discord #ai-agent #remote-control

GitHub Tool

OpenTabs - API-driven browser agent

Chrome extension and MCP server that lets coding agents drive web tasks by calling site APIs instead of clicking through the DOM. Targets the brittleness of Playwright-style browser automation.

#claude-code #mcp #browser-automation #chrome-extension #ai-agent

GitHub Tool

agent-browser-mcp - real Chrome control over MCP

MCP server that drives an actual Chrome instance via the Chrome DevTools Protocol with page scanning, screenshots, and physical input simulation for agents.

#mcp #browser-automation #chrome #ai-agent #automation

GitHub Tool

anything-analyzer - protocol analyzer with MCP

All-in-one network protocol toolkit with browser capture, MITM proxy, JS hooks, fingerprint spoofing, and an MCP server so agents can drive the analysis directly.

#mcp #network #mitm #ai-agent #developer-tools

GitHub Library

OQP - verification protocol for AI agents

MCP-compatible spec defining four endpoints (capabilities, workflows, execute, assess-risk) so agents can prove a shipped change satisfies business requirements before it goes live.

#mcp #agent-security #evals #verification #ai-agent

GitHub Library

LABE - legal action boundary eval

Public benchmark that tests an agent at the moment it's about to take a high-impact legal action. Same harness, baseline vs verified, measures unjustified action drops and goal-completion gains.

#evals #agent-security #ai-agent #benchmark

GitHub Tool

pipelock - MCP firewall for AI agents

Go-based agent firewall that controls egress from MCP servers, blocking SSRF, DLP leaks, and prompt-injection vectors at the network layer. Acts as a fetch proxy for tool calls.

#mcp #agent-security #go #firewall #ai-agent

GitHub Tool

figma-use - control Figma from the command line

TypeScript CLI exposing 100+ Figma read/write commands, giving AI agents full control to create shapes, components, styles, and exports without the Figma plugin sandbox.

#cli #typescript #ai-agent #design

GitHub Tool

browser-harness - self-healing browser harness via CDP

Experiment from the Browser Use team that replaces Playwright with raw Chrome DevTools Protocol and lets the agent write its own tools. ~600 lines, no framework lock-in.

#ai-agent #developer-tools #scraping

GitHub Tool

Palmier - phone bridge for AI coding agents

Lets you control AI agents running on your computer from your phone, and gives those agents access to phone-side capabilities (push, SMS, calendar, contacts, location). Supports 15+ agent CLIs across Linux, Windows, and macOS.

#claude-code #codex #cli #remote-control #ai-agent

GitHub Tool

Interceptor - agent-driven Chrome control via CLI

Chrome extension paired with a CLI that gives AI agents full browser control: tabs, DOM, navigation, and automation. Aimed at agent-driven web tasks rather than human-recorded scripts.

#browser-automation #cli #ai-agent #typescript

GitHub Tool

surf-cli - agent-agnostic Chrome control CLI

Zero-config CLI that exposes Chrome to AI agents over a uniform interface. Designed to plug into any agent runtime without per-agent configuration.

#browser-automation #cli #ai-agent #javascript

GitHub Library

a2a-java - Java SDK for Agent2Agent protocol

Official Java SDK implementing the Agent2Agent (A2A) protocol for inter-agent messaging and capability discovery. Provides client and server implementations for JVM agent stacks.

#multi-agent #java #protocol #ai-agent

GitHub Hack

awesome-agent-harness - curated harness list

Curated list of agent harnesses, orchestrators, and coding-agent runtimes. Useful index for evaluating multi-agent infrastructure projects.

#awesome-list #multi-agent #orchestration #ai-agent

GitHub Tool

llmgateway - unified LLM provider gateway

Self-hostable gateway that routes requests across Anthropic, OpenAI, and other LLM providers with API-key management, analytics, and per-team policies. Designed for multi-provider agent deployments.

#self-hosted #ai-agent #observability #typescript

GitHub Hack

real-world-rails - 200+ Rails apps for agent search

Aggregated repo of 200+ production open-source Rails apps and engines, intended as a corpus for AI agents to search for real-world architectural patterns. Acts as a grounding dataset rather than a tutorial.

#ai-agent #developer-tools #shell

GitHub Tool

RocketSim - iOS Simulator power tools with agent CLI

30+ tools that extend Xcode's iOS Simulator: testing, debugging, network monitoring, captures, accessibility, and a CLI that lets AI agents drive simulator actions. Used by 80k+ iOS developers.

#ai-agent #developer-tools #cli

GitHub Tool

dirac - open-source coding agent for TerminalBench

Open-source coding agent that scored 65.2% on TerminalBench with Gemini 3 flash, beating Junie CLI and Google's official harness. Run leaderboard-compliant with full transcripts and no AGENTS.md tricks.

#evals #ai-agent #cli

GitHub Tool

auto-memory - progressive session recall CLI

CLI that gives AI coding agents persistent recall across sessions through progressive memory snapshots. Aimed at workflows where context is lost between agent runs.

#agent-memory #cli #python #ai-agent

GitHub Tool

RedAI - validate vulnerabilities in live targets

Security agent that runs scanner agents to surface candidate vulnerabilities, then has validator agents reproduce each one against a running instance. Outputs only confirmed exploitable findings.

#agent-security #multi-agent #ai-agent

GitHub Library

passmark - Playwright AI regression testing

Open-source Playwright library for AI-driven browser regression testing with intelligent caching, auto-healing locators, and multi-model verification. Designed to keep flaky AI tests stable across model versions.

#evals #ai-agent #typescript #developer-tools

GitHub Library

bunqueue - SQLite-backed job queue for Bun

High-performance Bun job queue with SQLite persistence, dead-letter queue, cron scheduling, and S3 backups. Marketed as BullMQ alternative for AI agent workloads.

#typescript #developer-tools #ai-agent

GitHub Tool

Broccoli - cloud sandbox harness for Linear

Open-source harness that pulls coding tasks from Linear, runs them in isolated cloud sandboxes, and opens PRs for human review. Built to manage many concurrent agent jobs without local worktree juggling.

#multi-agent #orchestration #self-hosted #ai-agent

GitHub Hack

TokenBurner - Claude Code skill that burns tokens on demand

A deliberately wasteful Claude Code skill for stress testing, inflating metrics, or just burning budget. Useful for testing observability dashboards and rate-limit handling.

#claude-code #stress-testing #developer-tools #ai-agent

GitHub Tool

stereOS - hardened Linux for AI agents

Nix-based Linux distribution purpose-built for running AI agents. Hardened defaults and an immutable base aimed at sandboxing autonomous coding agents.

#agent-security #self-hosted #ai-agent #sandbox

GitHub Library

agent-prism - React components for agent traces

React component library for visualizing distributed traces from AI agents. Drop-in widgets for timelines, span trees, and tool-call breakdowns from LangChain or custom runtimes.

#observability #typescript #ai-agent #developer-tools

GitHub Library

terminator - Playwright for Windows computer use

Rust SDK for driving Windows applications with native UI Automation, designed as a Playwright-style API for AI agents. Lets LLMs click, type, and read state across desktop apps.

#rust #ai-agent #automation

GitHub Tool

openclaw-dashboard - command center for OpenClaw agents

Zero-dependency Go dashboard for OpenClaw AI agents covering cost tracking, token usage, and per-agent monitoring. Single-binary deploy.

#go #observability #ai-agent #multi-agent

GitHub Tool

sfsym - export Apple SF Symbols as SVG/PDF/PNG

CLI that exports SF Symbols as true vector SVG, PDF, or PNG by walking macOS's private symbol renderer. Designed so an agent can fetch icon assets autonomously during design sessions.

#cli #developer-tools #ai-agent

GitHub Tool

imsg - CLI for Apple Messages so agents can text

Swift CLI that lets your agent read and send iMessages and SMS through Apple's Messages.app. Useful for routing notifications or two-factor codes back into a coding session.

#cli #ai-agent #developer-tools

GitHub Library

browser-harness-js - self-healing browser harness for LLMs

TypeScript browser harness that lets an LLM complete arbitrary tasks with automatic recovery when selectors break or pages restructure. From the browser-use team.

#ai-agent #typescript #automation

GitHub Library

reasonix - DeepSeek-native agent framework

TypeScript agent framework built around DeepSeek with cache-first loops, R1 thought harvesting, and tool-call repair. Ships with an Ink-based TUI for runtime inspection.

#ai-agent #deepseek #typescript #tui #prompt-caching

GitHub Tool

kimi-2-6-code - terminal agent for Moonshot Kimi

Terminal-native coding agent powered by Moonshot's Kimi K2.6 model. TypeScript-based alternative to Claude Code or Codex CLI for users who want to drive Kimi from the shell.

#kimi #cli #typescript #ai-agent #coding-agent

GitHub Tool

pi-annotate - browser-to-AI visual feedback

Chrome extension for clicking elements in a running app, leaving comments, and shipping the annotated context back to an AI agent for fixes. Closes the loop between UI bug reports and code edits.

#browser-extension #ai-agent #developer-tools #javascript

GitHub Tool

Ghost Pepper - local hold-to-talk speech-to-text for macOS

Push-to-talk dictation app for macOS that runs entirely on local models, no data leaves the machine. Designed to drive coding agents and email by voice.

#macos #developer-tools #ai-agent

GitHub Tool

trustgraph - context platform for agents

Graph-native infrastructure for storing, enriching, and retrieving structured agent context. Provides semantic retrieval and portable context cores you can move between agent runtimes.

#ai-agent #agent-memory #context-engineering #python

GitHub Tool

pilot-shell - production gates for Claude Code

TypeScript harness that wraps Claude Code with spec-driven plans, enforced quality gates, and persistent project knowledge. Targets teams shipping production code with the agent rather than prototyping.

#claude-code #typescript #ai-agent #orchestration

GitHub Tool

davia - editable docs for coding agents

Documentation system designed to be read and rewritten by coding agents instead of humans. Stores knowledge in a format that survives long agent sessions.

#typescript #ai-agent #agent-memory #context-engineering

GitHub Hack

awesome-autoresearch - autonomous research agent loops

Curated list of self-improvement loops, research agents, and autoresearch systems following Karpathy's framing. Useful index when designing multi-step agent harnesses.

#claude-code #ai-agent #multi-agent #evals

GitHub Tool

web-agent-protocol - record and replay browser MCP

MCP server that records user browser interactions and lets agents replay them as automation scripts. Bridge between human demonstrations and agent execution.

#mcp #ai-agent #python

GitHub Library

concierge - SDK for building MCP servers

Python SDK for authoring MCP servers with batteries for tool registration, auth, and apps-sdk style flows. Aims to be the universal scaffold for new MCPs.

#mcp #python #ai-agent #developer-tools

GitHub Tool

computer-agent - Rust desktop app for Claude computer use

Rust-based desktop app that lets Claude drive your terminal, browser, mouse, and keyboard via the Anthropic computer-use API. Single binary, multi-model.

#claude-code #rust #ai-agent #computer-use

GitHub Hack

compose-for-agents - Docker Compose recipes for agents

Docker's collection of ready-to-use Compose stacks for orchestrating open-source LLMs, tools, and agent runtimes. Useful starting points for self-hosted setups.

#devops #self-hosted #ai-agent #typescript

GitHub Tool

n8n-install - one-command self-hosted AI automation

Shell installer that deploys n8n, Ollama, Flowise, Supabase, RAG stack, and 30+ tools behind auto-HTTPS. Self-hosted Zapier or Make alternative.

#self-hosted #devops #shell #ai-agent

GitHub Library

secure-exec - npm-compatible Node sandboxing

Lightweight library for sandboxing Node.js code execution from agents without containers or VMs, using runtime isolation. Built for code interpreter use cases.

#agent-security #javascript #ai-agent

GitHub Tool

axe - single-purpose AI agents from TOML

Lightweight Go CLI for defining focused AI agents in TOML and triggering them from pipes, git hooks, cron, or the terminal. No framework, just unix.

#cli #go #ai-agent #developer-tools

GitHub Tool

vllora - debug AI agents across providers

Rust gateway and debugger for AI agent traffic across Anthropic, OpenAI, Azure, Gemini, DeepSeek, and others. Trace and inspect tool calls in flight.

#observability #rust #ai-agent #developer-tools

GitHub Tool

ROSA - natural language agent for ROS robotics

NASA JPL agent that lets developers inspect, diagnose, and operate ROS1/ROS2 robotics systems through natural language. Bridges LLMs with the ROS toolchain.

#ai-agent #python #developer-tools

GitHub Library

open-harness - composable SDK for AI agents

TypeScript SDK for building agents with a code-first composition model: tools, skills, and MCP servers wire together as plain modules. Ships an agent loop you control.

#typescript #ai-agent #mcp #agent-harness #sdk

GitHub Tool

DeepZero - automated kernel driver vuln research

Vulnerability research framework that parses, decompiles, and analyzes Windows kernel drivers for exploitable IOCTLs using AI agents. Sleep through fuzzing campaigns.

#python #agent-security #ai-agent #reverse-engineering #automation

GitHub Tool

tui-use - drive interactive REPLs from agents

Lets agents interact with programs that expect a human at the keyboard - REPLs, debuggers, TUI apps - things bash pipes cannot reach. Fills the gap between shell and full computer-use.

#typescript #tui #ai-agent #cli #developer-tools

GitHub Library

helixent - tiny ReAct agent loop for Bun

Small TypeScript library for ReAct-style agent loops on the Bun stack. Tools, skills, and a coding-focused harness in a minimal package.

#typescript #agent-harness #ai-agent #bun #react-pattern

GitHub Library

cli-to-js - turn any CLI into a JS API

Wraps any command-line tool as a typed JavaScript API agents can call directly. Saves writing a custom MCP for every CLI you want to expose.

#typescript #cli #ai-agent #developer-tools #node

GitHub Tool

Cloudflare Agentic Inbox

Self-hosted email client with an embedded AI agent, running entirely on Cloudflare Workers. No backend to manage, edge-distributed by default.

#cloudflare #email #ai-agent #edge #self-hosted

GitHub Tool

Obscura - Rust headless browser for AI agents

Open-source Rust headless browser built for AI agents and scraping. Lower memory and faster cold starts than Chromium-based stacks like Puppeteer and Playwright.

#scraping #headless-browser #automation #rust #ai-agent

GitHub Tool

1mcp - unified MCP server aggregator

Aggregates many MCP servers behind one endpoint. Acts as an MCP gateway/proxy so clients only configure a single server.

#mcp #gateway #proxy #ai-agent #aggregator

GitHub Tool

mcpc - universal CLI for MCP

MCP client with persistent sessions, stdio + HTTP transports, OAuth 2.1, JSON output for code mode, and a sandbox proxy. Calls any MCP server from a shell.

#mcp #cli #oauth #ai-agent #scripting

GitHub Tool

mcp2cli - turn any MCP/OpenAPI/GraphQL server into a CLI

Runtime adapter that exposes any MCP, OpenAPI, or GraphQL server as a flat CLI. Zero codegen, zero rebuild - handy for shell scripts and agent toolchains.

#mcp #openapi #graphql #cli #ai-agent

GitHub Library

claude-code-java - embeddable Claude Code engine

AI-agent engine for Java apps. CLI plus REST API that wraps the Claude Code execution model so you can drop it into any JVM service.

#claude-code #java #jvm #rest-api #ai-agent

GitHub Tool

cc-telegram-bridge - Codex & Claude Code on Telegram

Multi-bot, multi-engine Telegram bridge with per-bot personality, budget caps, streaming, session resume, and an Agent Bus for parallel pipelines.

#claude-code #codex #telegram #automation #ai-agent

GitHub Library

ESP-Claw - AI agent framework for IoT

Espressif's chat-coding agent framework for ESP32 devices. Brings tool-calling LLM agents to embedded targets with C-level memory budgets.

#iot #esp32 #embedded #ai-agent #c

GitHub Tool

figma-mcp-go - Figma MCP for free users

Go-based MCP server that gives agents read/write Figma access without rate limits. Text-to-design and design-to-code in one binary.

#mcp #figma #design #go #ai-agent

GitHub Tool

OfficeCLI - Word/Excel/PowerPoint for AI agents

C# CLI built specifically for agents to read, edit, and automate Office files. Single binary, no Office install required.

#cli #office #automation #ai-agent #csharp

Browse other tags

#developer-tools200 #claude-code177 #cli169 #rust99 #mcp88 #typescript84 #codex76 #go74 #python65 #self-hosted58 #devops51 #tui49