Mellum started as a code completion model. Mellum2 goes much further – supporting both natural language and code. Now open source on @hf.co, Mellum2 is a 12B-parameter LLM for routing, RAG, and sub-agents, optimized for ultra-low-latency and high-performance inference. Learn more: jb.gg/awpava