Abstract: Multilingual multimodal (MM) summarization, involving the processing of multimodal input (MI) data across multiple languages to generate corresponding multimodal summaries (MS) using a ...