3D Trigonometry GCSE Questions

Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual Question Answering

Abstract: Text-based Visual Question Answering (TextVQA) aims to produce correct answers for given questions about the images with multiple scene texts. In most cases, the texts naturally attach to ...

IEEE

3D-MoRe: Unified Modal-Contextual Reasoning for Embodied Question Answering

With the growing need for diverse and scalable data in indoor scene tasks, such as question answering and dense captioning, we propose 3D-MoRe, a novel paradigm designed to generate large-scale ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual Question Answering

3D-MoRe: Unified Modal-Contextual Reasoning for Embodied Question Answering

Trending now