Abstract: The audio-visual event localization task investigates how audio and visual modalities can mutually enhance video event localization. Current methods often rely on single-modality features or ...
Finally, some good news: More and more people are getting active, according to the Sports and Fitness Industry Association’s (SFIA) recent report. For the first time since they started tracking ...
Abstract: This paper introduces AVCaps, an audio-visual dataset that contains separate textual captions for the audio, visual, and audio-visual contents of video clips. The dataset contains 2061 video ...
ABSTRACT: As morphemes are the smallest phonetic and semantic word formation units in Chinese, the study of morphemes has always been an important part of Chinese language acquisition research. Taking ...
We are delighted to announce that our paper has been officially accepted by the ACM International Conference on Multimedia (ACMMM 2025) and selected for Oral Presentation! Highlights of Review Results ...
This article describes a combined visual and haptic localization experiment that addresses the area of multimodal cueing. The aim of the present investigation was to characterize two-dimensional (2D) ...
Choose from Modality stock illustrations from iStock. Find high-quality royalty-free vector images that you won't find anywhere else.
The manuscript presents a short report investigating mismatch responses in the auditory cortex, following previous studies focused on visual cortex. By correlating mouse locomotion speed with acoustic ...
Vision-language models (VLMs) represent an advanced field within artificial intelligence, integrating computer vision and natural language processing to handle multimodal data. These models allow ...
The congruency sequence effect (CSE) refers to the reduction in the congruency effect in the current trial after an incongruent trial compared with a congruent trial. Although previous studies widely ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results