Andrew White 🐦⬛
andrew.diffuse.one
did:plc:5lmwpxyligfn4bmpeb5a4ejf
A lot of effort in this work went into framing the learning problem for agents. We settled on defining agents using stochastic compute graphs and splitting the environment and agent according to what we want to train. Here are some components of well-known agents as compute graphs:
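A minimal sketch of the framing, not the authors' implementation: a compute graph whose nodes may be deterministic or stochastic, with a `trainable` flag marking which nodes count as the agent and which as the environment. Every name here (`Node`, `run_graph`, `split_agent_env`, the toy `GRAPH`) is hypothetical and chosen only for illustration, and the "policy" node is a random choice standing in for a sampled model call.

```python
import random
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Node:
    name: str
    fn: Callable[..., object]          # computes this node's value from its parents
    parents: List[str] = field(default_factory=list)
    stochastic: bool = False           # True if fn samples (it then receives an rng first)
    trainable: bool = False            # True if we want to train this node


def run_graph(nodes: List[Node], rng: random.Random) -> Dict[str, object]:
    """Evaluate nodes in the given order, assumed to be topologically sorted."""
    values: Dict[str, object] = {}
    for node in nodes:
        args = [values[p] for p in node.parents]
        if node.stochastic:
            values[node.name] = node.fn(rng, *args)
        else:
            values[node.name] = node.fn(*args)
    return values


def split_agent_env(nodes: List[Node]) -> Tuple[List[str], List[str]]:
    """Partition node names: trainable nodes are the agent, the rest the environment."""
    agent = [n.name for n in nodes if n.trainable]
    env = [n.name for n in nodes if not n.trainable]
    return agent, env


# Toy graph (hypothetical): an observation, a sampled action, and a tool response.
GRAPH = [
    Node("observation", lambda: "2+2"),
    Node(
        "policy",
        lambda rng, obs: rng.choice(["calculator", "guess"]),
        parents=["observation"],
        stochastic=True,
        trainable=True,
    ),
    Node(
        "tool",
        lambda action, obs: "4" if action == "calculator" else "unknown",
        parents=["policy", "observation"],
    ),
]

if __name__ == "__main__":
    agent, env = split_agent_env(GRAPH)
    print("agent nodes:", agent)
    print("environment nodes:", env)
    print("one rollout:", run_graph(GRAPH, random.Random(0)))
```

Flipping the `trainable` flag on a node moves it across the agent/environment boundary without changing the graph itself, which is the sense in which the split follows what we want to train.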
2024-12-31T19:54:31.690Z