//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
OK so I must say that LLM processing with a H200 has its merits for R&D. From one running project: "Avg prompt throughput: 6605.5 tokens/s, Avg generation throughput: 1238.8 tokens/s".