GRAVL-BERT is a unified multimodal coreference resolution (MCR) framework which combines visual relationships between objects, background scenes, dialogue, and metadata by integrating graph neural networks with VL-BERT.
GraVL-BERT
2021
Last updated October 20, 2023
Research areas