Mellum2: JetBrains' 12B MoE Code Model, Explained for Developers
JetBrains released Mellum2, a 12B Mixture-of-Experts model that activates just 2.5B parameters per token and ships under Apache 2.0. Here's what it is, where it fits in an AI stack, and how to put it to work.