@roeiherz.bsky.social on Bluesky

JavaScript RequiredThis is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is. Learn more about Bluesky at bsky.social and atproto.com.

Post

Roei Herzig

roeiherz.bsky.social

did:plc:m55bczg6jkxlsua6h2hq4hpn

For example, VLAs use language decoders, which are pretrained on tasks like visual question answering and image captioning. This presents a discrepancy between the models’ high-level pre-training objective and the need for robotic models to predict low-level actions.

2025-02-24T03:49:42.597Z