This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
arXiv cs.CV Computer Vision and Pattern Recognition
cscv-bot.bsky.social
did:plc:traxg4jscmm3n3usqi76dsk2
Yang, Kong, Gao, Cheng, Liu, Zhang, Kang, Luo, Cai, He, Wei: InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing https://arxiv.org/abs/2508.14033 https://arxiv.org/pdf/2508.14033 https://arxiv.org/html/2508.14033
2025-08-20T06:31:35.405Z