Causal Intervention for Abstractive Related Work Generation

Liu, Jiachang, Zhang, Qi, Shi, Chongyang, Naseem, Usman, Wang, Shoujin, Hu, Liang, and Tsang, Ivor W. (2023) Causal Intervention for Abstractive Related Work Generation. In: Findings of the Association for Computational Linguistics: EMNLP 2023. pp. 2148-2159. From: EMNLP 2023: Conference on Empirical Methods in Natural Language Processing, 6-10 December 2023, Singapore.

[img]
Preview
PDF (Published Version) - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview
View at Publisher Website: https://doi.org/10.18653/v1/2023.finding...
 
56


Abstract

Abstractive related work generation has attracted increasing attention in generating coherent related work that helps readers grasp the current research. However, most existing models ignore the inherent causality during related work generation, leading to spurious correlations which downgrade the models' generation quality and generalizability. In this study, we argue that causal intervention can address such limitations and improve the quality and coherence of generated related work. To this end, we propose a novel Causal Intervention Module for Related Work Generation (CaM) to effectively capture causalities in the generation process. Specifically, we first model the relations among the sentence order, document (reference) correlations, and transitional content in related work generation using a causal graph. Then, to implement causal interventions and mitigate the negative impact of spurious correlations, we use do-calculus to derive ordinary conditional probabilities and identify causal effects through CaM. Finally, we subtly fuse CaM with Transformer to obtain an end-to-end related work generation framework. Extensive experiments on two real-world datasets show that CaM can effectively promote the model to learn causal relations and thus produce related work of higher quality and coherence.

Item ID: 82150
Item Type: Conference Item (Research - E1)
ISBN: 9798891760615
Copyright Information: Creative Commons 4.0 BY (Attribution) license
Date Deposited: 11 Mar 2024 23:43
FoR Codes: 46 INFORMATION AND COMPUTING SCIENCES > 4602 Artificial intelligence > 460208 Natural language processing @ 100%
SEO Codes: 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220403 Artificial intelligence @ 100%
Downloads: Total: 56
Last 12 Months: 16
More Statistics

Actions (Repository Staff Only)

Item Control Page Item Control Page