Interactive Data Lab
idl.uw.edu
did:plc:opdnbsrv3d3bk6y5z3idoqcj
For value comparison tasks, GPT-4 Turbo mostly aligns with human performance data. But for summary tasks, the GPT responses are uncorrelated with human performance and likely unhelpful! This may be because value comparison has received much research and punditry while aggregate perception has received less attention, biasing LLM training data.
2024-11-26T19:12:34.562Z