This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Epoch AI
epochai.bsky.social
did:plc:enqyiq55ungiw6isng5amift
We’ve run independent evaluations of Grok-3 and Grok-3 mini on our suite of benchmarks!
Grok-3 currently doesn’t do extended reasoning, while Grok-3 mini is a reasoning model. We ran Grok-3 mini with both “low” and “high” reasoning effort.
Full results in thread!
2025-04-14T14:37:13.384Z