Enhancing multi-document summarization with cross-document graph-based information extraction

Zixuan Zhang; Heba Elfardy; Markus Dreyer; Kevin Small; Heng Ji; Mohit Bansal


2023


Information extraction (IE) and summarization are closely related: both are tasked with presenting a subset of the information contained in a natural language text. However, while IE extracts structural representations, summarization aims to abstract the most salient information into a generated text summary, and thus potentially encounters the technical limitations of current text generation methods (e.g., hallucination). To mitigate this risk, this work uses structured IE graphs to enhance the abstractive summarization task. Specifically, we focus on improving Multi-Document Summarization (MDS) performance by using cross-document IE output, incorporating two novel components: (1) auxiliary entity and event recognition systems that focus the summary generation model; and (2) an alignment loss between IE nodes and their text spans that reduces inconsistencies between the IE graphs and text representations. Operationally, both the IE nodes and their corresponding text spans are projected into the same embedding space, and the pairwise distance between them is minimized. Experimental results on multiple MDS benchmarks show that summaries generated by our model are more factually consistent with the source documents than those generated by baseline models, while maintaining the same level of abstractiveness.
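
The alignment step described above (projecting IE nodes and their text spans into a shared space and minimizing pairwise distance) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the projection or the distance, so the sketch assumes plain linear projections and mean squared Euclidean distance, and the names `project`, `alignment_loss`, `w_node`, and `w_span` are hypothetical.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def project(matrix: Matrix, vec: Vector) -> Vector:
    """Apply a linear projection, i.e. compute matrix @ vec."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def alignment_loss(
    node_embs: List[Vector],
    span_embs: List[Vector],
    w_node: Matrix,
    w_span: Matrix,
) -> float:
    """Mean squared Euclidean distance between projected node/span pairs.

    Assumption: node_embs[i] is the IE node aligned with text span
    span_embs[i]. Linear projections and squared Euclidean distance are
    illustrative choices, not taken from the paper.
    """
    if len(node_embs) != len(span_embs):
        raise ValueError("each IE node needs exactly one aligned text span")
    if not node_embs:
        return 0.0
    total = 0.0
    for node, span in zip(node_embs, span_embs):
        p_node = project(w_node, node)  # node -> shared space
        p_span = project(w_span, span)  # span -> shared space
        total += sum((a - b) ** 2 for a, b in zip(p_node, p_span))
    return total / len(node_embs)


# Toy example with identity projections: the first pair already matches,
# the second pair differs by 2 in one dimension.
identity = [[1.0, 0.0], [0.0, 1.0]]
nodes = [[1.0, 0.0], [0.0, 1.0]]
spans = [[1.0, 0.0], [0.0, 3.0]]
print(alignment_loss(nodes, spans, identity, identity))  # 2.0
```

In a trained model the two projection matrices would be learned, and a term like this would presumably be added to the summarization objective with some weighting; that combination is also an assumption here rather than a detail stated in the abstract.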
