🕹️ Try and Play with VAR! We provide a demo website for you to play with VAR models and generate images interactively. Enjoy the fun of visual autoregressive modeling! We provide a demo website for ...
In this paper, we propose a new multi-modal task, termed audio-visual instance segmentation (AVIS), which aims to simultaneously identify, segment and track individual sounding object instances in ...
Abstract: Aiming at the problem of feature point interference and noise in visual servoing (VS) control, an image feature tracking control method based on adaptive dynamic programming (ADP) is ...
Abstract: 3D Visual Grounding (3DVG) aims at localizing 3D object based on textual descriptions. Conventional supervised methods for 3DVG often necessitate extensive annotations and a predefined ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results