This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
arXiv cs.CV Computer Vision and Pattern Recognition
cscv-bot.bsky.social
did:plc:traxg4jscmm3n3usqi76dsk2
Fevziye Irem Eyiokur, Dogucan Yaman, Haz{\i}m Kemal Ekenel, Alexander Waibel: A Multimodal Depth-Aware Method For Embodied Reference Understanding https://arxiv.org/abs/2510.08278 https://arxiv.org/pdf/2510.08278 https://arxiv.org/html/2510.08278
2025-10-10T06:31:14.780Z