Abstract: Temporal language grounding (TLG) is a fundamental and challenging problem for vision and language understanding. Existing methods mainly focus on fully supervised setting with temporal ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results