//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
If you find yourself asking "how does this model checkpoint differ from the last, and where did it improve/regress?", that's what olmo-eval is for. We're releasing it openly so the community can build on it. šŸ’» Code: buff.ly/veAANKX šŸ“ Blog: buff.ly/64B7dPh
Contribute to allenai/olmo-eval development by creating an account on GitHub.
github.com
GitHub - allenai/olmo-eval
1d
Ai2