Mellum started as a code completion model.
Mellum2 goes much further – supporting both natural language and code.
Now open source on @hf.co, Mellum2 is a 12B-parameter LLM for routing, RAG, and sub-agents, optimized for ultra-low-latency and high-performance inference.
Learn more: jb.gg/awpava