//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...
AI Research @Hugging Face 🤗 Contributing to the Chinese ML community.
Adina Yakup









Loading...
dots.tts 🔊 New TTS from Xiaohongshu (RedNote) huggingface.co/collections/... ✨ 2B - Apache 2.0 ✨ Fully continuous architecture (no codec tokens) ✨ 48kHz synthesis ✨ Zero-shot voice cloning
MiniCPM5-1B is an impressive release in the 1B class! huggingface.co/collections/... ✨ 1B - Apache 2.0 ✨ Hybrid reasoning with Think / No-Think modes ✨ 128K context ✨ Runs on CPU/Apple Silicon/GPU ✨ Strong eval result in the same size class
BitCPM4-CANN 🔥Native 1.58-bit LLM training system on Ascend NPUs huggingface.co/collections/... ✨ 0.5B/1B/3B/8B - Apache 2.0 ✨ 6× less memory at inference ✨ Only 4.5% training throughput overhead
PP-OCRv6 just released by Baidu huggingface.co/collections/... ✨ tiny 1.5M / small 7.7M / medium 34.5M ✨ 48+ languages ✨ Supports handwritten/printed/industrial/screen and card text ✨ Edge friendly deployment
LongCat-Video-Avatar 1.5🐱 an audio driven avatar video generation framework from Meituan huggingface.co/meituan-long... ✨ Multi-character + multi-audio support ✨ Drive video from audio alone or audio + image + text ✨ 8-step inference ✨ Whisper-Large powered lip sync ✨ MIT license
4d
18d
Step-3.7-Flash 🔥 New VL model from StepFun_ai huggingface.co/collections/... ✨ 198B / 11B active - MoE ✨ 256K context ✨ 3 reasoning level ✨ Up to 400 tokens/sec 🤯
huggingface.co/papers/2605....
Macaron-V1-Preview-749B 👀 a Mixture-of-LoRA personal agent model from MindLab ✨ 744B base + 5 specialist LoRAs ✨ Generative UI as a core skill ✨ Personal agent focused ✨ 202K context ✨ MIT license huggingface.co/collections/...
Qwen just dropped a new Text to Image benchmark + a judge model huggingface.co/collections/... ✨ 56 fine-grained evaluation facets ✨ Measures creativity beyond prompt alignment ✨ Covers storytelling/typography/design & physical logic ✨ Human aligned judge model (ρ = 0.92)
MiniMax-M3 just dropped 🔥 huggingface.co/MiniMaxAI/Mi... ✨ 428B / 23B active ✨ 1M context ✨ MiniMax Sparse Attention (MSA) And it’s not just weights! - paper: huggingface.co/papers/2606.... - kernel: huggingface.co/kernels/Mini... - Transformers support Love how this was released❤️
21d
1d
21d
Video
14d
15d
4d
15d
9h
Full-pipeline ternary quantized model trained on CANN.
huggingface.co
BitCPM4-CANN - a openbmb Collection
Adina Yakup
Adina Yakup
Step-3.7-Flash - a stepfun-ai Collection
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
Join the discussion on this paper page
Paper page - Qwen-Image-Bench: From Generation to Creation in Text-to-Image Evaluation
huggingface.co
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
Qwen-Image-Bench - a Qwen Collection
Adina Yakup
Adina Yakup
Adina Yakup
Adina Yakup
Adina Yakup
Adina Yakup
Adina Yakup
Adina Yakup
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
Macaron-V1 - a mindlab-research Collection