Jianyuan Wang
jianyuanwang.bsky.social
Introducing VGGT (CVPR'25), a feedforward Transformer that directly infers all key 3D attributes from one, a few, or hundreds of images, in seconds!
Project Page: vgg-t.github.io
Code & Weights: https://github.com/facebookresearch/vggt/
2025-03-17T02:08:18.048Z