Discovery
All entries

Tag

Self-hosted tools

58 entries tagged with #self-hosted.

See the curated guide →

Open-source, self-hosted alternatives across PaaS, CI, dev tooling, and personal apps. See the self-hosted dev tools guide for the curated picks.

GitHubToolFeatured

Rapid-MLX - 2-4x faster local LLM inference on Apple Silicon

MLX-native inference engine with OpenAI-compatible API. The novel piece: DeltaNet state snapshots bring prompt caching to non-trimmable architectures (Qwen3.5 hybrids), restoring RNN state in ~0.1ms. 2-5x faster TTFT, native Metal kernels, continuous batching.

Why I saved this - DeltaNet state snapshots are described as the first prompt-cache technique for non-trimmable MLX architectures - Ollama and llama.cpp can't match this on hybrid RNN-attention models today.

Browse other tags