Local AI Runtime Update: What Shipped in Ollama, vLLM, llama.cpp, MLX, and LM Studio in May 2026
May 2026 was a heavy ship month for local AI runtimes. Ollama added Codex App support. vLLM 0.21 stabilised DeepSeek V4 on Blackwell. llama.cpp merged MTP speculative decoding. MLX hit 4x faster on M5. LM Studio shipped stable MTP. Practical runtime-by-runtime changelog.