How to Run GLM-5.2 Locally — Hardware, Quants, and Setup
A practical walkthrough for self-hosting GLM-5.2 (744B MoE, 40B active) on llama.cpp. Quant tables, four hardware paths, exact install commands, verification, and a fallback to the Z.ai cloud API if your rig falls short.