The mixture-of-experts model is also an option:
```
cargo run --example qwen --features metal --release -- --prompt "Write a poem about butterflies. <think></think>." --model "3-moe-a3b"
```
Qwen 3 is now supported in Candle!
Run the 3-4B model locally with:
```
cargo run --example qwen --release -- --model 3-4b --prompt 'The capital of France is '
```
On macOS, enable Metal for faster inference:
```
cargo run --example qwen --features metal --release -- --model 3-4b --prompt 'The capital of France is '
```
Clone the repo and test it out. github.com/huggingface/...
Google just released Gemma 3, an open, on-device LLM with vision capabilities and support for over 140 languages. Models range from 1B to 27B parameters.
Day-zero support for multiple frameworks including transformers, MLX, llama.cpp, and more! 💼 🚀
Read more here:
huggingface.co/blog/gemma3
RIFTS Dataset: Solving Critical LLM Conversation Failures
- LLMs 3x less likely to clarify than humans
- 16x less likely to provide follow-up requests
- Early failures predict later breakdowns
- Includes preliminary intervention strategies
huggingface.co/datasets/mic...
I'm writing an article series about creating tensors from scratch in Rust. #tensors #machine-learning #ml #ai
huggingface.co/blog/KeighBe...
🕯️🔥[Candle](github.com/huggingface/...) is now much faster on macOS thanks to a contribution by @EricLBuehler, which brings major speed improvements to the Metal backend.🍎📈
Try it out by running some of our examples with the `--features metal` flag.
#Candle #RustLang #macOS #Metal #HuggingFace
I just published part 2 of my article series about creating tensors from scratch in Rust. This one is about view operations.
#tensors #machine-learning #ml #ai
Take a look here:
huggingface.co/blog/KeighBe...
What to know about DeepSeek
youtu.be/0eMzc-WnBfQ?...
In which we attempt to figure out MoE, o1, scaling, tech reporting, modern semiconductors, microeconomics, and international geopolitics.