Crime-scene models with detailed measurements once took hours to collect. Now, law-enforcement agencies are increasingly turning to new technology to help speed up the work and better collect evidence ...
Abstract: This work focuses on the problem of 6D pose estimation for novel objects when a reference 3D model or posed reference images are not available. While existing methods can estimate the ...
Abstract: This work explores expanding the capabilities of large language models (LLMs) pretrained on text to generate 3D meshes within a unified model. This offers key advantages of (1) leveraging ...
We propose TesserAct, the first open-source and generalized 4D World Model for robotics, which takes input images and text instructions to generate RGB, depth, and normal videos, reconstructing a 4D ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results