trend: non-NVIDIA training
DeepSeek V3.1 was trained on Huawei Ascend NPUs
this one is a South Korean lab training on AMD
mr. TIM
Motif 2.6B — compact model with long context
unique: trained on AMD GPUs
focus is on long context & low hallucination rate — imo this is a growing genre of LLM that enables new search patterns
huggingface.co/Motif-Techno...
huggingface.co
We’re on a journey to advance and democratize artificial intelligence through open source and open science.