xGQA: Cross-Lingual Visual Question Answering

Jonas Pfeiffer; Gregor Geigle; Aishwarya Kamath; Jan-Martin O. Steitz; Stefan Roth; Ivan Vulić; Iryna Gurevych

doi:https://doi.org/10.18653/v1/2022.findings-acl.196

doi.org/10.18653/v1/2022.findings-acl.196

xGQA: Cross-Lingual Visual Question Answering

,

,

..., Iryna Gurevych

41

Published: Jan 1, 2022

Abstract

Recent advances in multimodal vision and language modeling have predominantly focused on the English language, mostly due to the lack of multilingual multimodal datasets to steer modeling efforts. In this work, we address this gap and provide xGQA, a new multilingual evaluation benchmark for the visual question answering task. We extend the established English GQA dataset to 7 typologically diverse languages, enabling us to detect and explore...

Paper Fields

Paper Details

Title

xGQA: Cross-Lingual Visual Question Answering

DOI

doi.org/10.18653/v1/2022.findings-acl.196

Published Date

Jan 1, 2022

Citation AnalysisPro

You’ll need to upgrade your plan to Pro

Looking to understand the true influence of a researcher’s work across journals & affiliations?

Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.

Learn more

Notes

History