3D Representation of a Continuous Loop

SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language

Abstract: Contrastive Language-Image Pre-training (CLIP) learns robust visual models through language supervision, making it a crucial visual encoding technique for various applications. However, CLIP ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language

Trending now