haagch.bsky.social
did:plc:5ethgbdgauznxrcyba6pw3ra
Here's an example of VGGT: throw in 4 random images from Wikipedia and in *seconds* it returns a point cloud and camera poses.
Which brings me to the biggest pain point: VRAM.
On a 16GB GPU you can use 5 images before running out of VRAM.
2025-03-30T22:29:27.140Z