This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Sung Kim
sungkim.bsky.social
did:plc:cq4gg3odxz2pzmkx2fuac3u3
What does it mean to "use" test-time compute wisely? How to train to do so? How to measure that scaling it is useful?
"Optimizing LLM Test-Time Compute Involves Solving a Meta-RL Problem"
https://blog.ml.cmu.edu/2025/01/08/optimizing-llm-test-time-compute-involves-solving-a-meta-rl-problem/
2025-01-10T05:20:11.057Z