TCohPrompt: task-coherent prompt-oriented fine-tuning for relation extraction

Bibliographic Details
Published in: Complex & Intelligent Systems vol. 10, no. 6 (Dec 2024), p. 7565
Main author: Long, Jun
Other authors: Yin, Zhuoying; Liu, Chao; Huang, Wenti
Published: Springer Nature B.V.
Description
Abstract: Prompt-tuning has emerged as a promising approach for improving the performance of classification tasks by converting them into masked language modeling problems through the insertion of text templates. Despite its considerable success, applying this approach to relation extraction is challenging. Predicting the relation, often expressed as a specific word or phrase between two entities, usually requires creating mappings from these terms to an existing lexicon and introducing extra learnable parameters. This can reduce the coherence between the pre-training task and fine-tuning. To address this issue, we propose a novel method for prompt-tuning in relation extraction, aiming to enhance the coherence between fine-tuning and pre-training tasks. Specifically, we avoid the need for a suitable relation word by converting the relation into relational semantic keywords, which are representative phrases that encapsulate the essence of the relation. Moreover, we employ a composite loss function that optimizes the model at both the token and relation levels. At the token level, our approach incorporates the masked language modeling (MLM) loss and the entity pair constraint loss for predicted tokens. At the relation level, we use both the cross-entropy loss and TransE. Extensive experimental results on four datasets demonstrate that our method significantly improves performance in relation extraction tasks. The results show an average improvement of approximately 1.6 points in F1 compared to the current state-of-the-art model. Code is released at https://github.com/12138yx/TCohPrompt.
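Note: The abstract names TransE as one of the relation-level terms but does not give its formula. As a rough illustration only, the sketch below shows the standard TransE idea, in which a triple (head, relation, tail) is considered plausible when head + relation lies close to tail. The function names, the toy embeddings, and the use of a margin ranking loss are assumptions made for this example; the record does not state how the paper combines TransE with the other loss terms.

```python
import math

def transe_distance(head, relation, tail):
    """L2 norm of (head + relation - tail); smaller means a more plausible triple."""
    return math.sqrt(sum((h + r - t) ** 2 for h, r, t in zip(head, relation, tail)))

def margin_ranking_loss(pos_distance, neg_distance, margin=1.0):
    """Hinge loss: the true triple should be at least `margin` closer than a corrupted one.

    Assumption: this is the objective commonly paired with TransE, not
    necessarily the one used in the paper described by this record.
    """
    return max(0.0, margin + pos_distance - neg_distance)

# Toy 2-d embeddings (hypothetical values, for illustration only).
head = [1.0, 0.0]
relation = [0.0, 1.0]
true_tail = [1.0, 1.0]    # exactly head + relation
wrong_tail = [4.0, 5.0]   # a corrupted tail entity

pos = transe_distance(head, relation, true_tail)
neg = transe_distance(head, relation, wrong_tail)
loss = margin_ranking_loss(pos, neg)
print(pos, neg, loss)
```

Here the true triple has distance 0 and the corrupted one is far away, so the hinge term is already satisfied and contributes no loss. In a composite objective of the kind the abstract describes, a term like this would be added to the token-level and cross-entropy terms, with weights that this record does not specify.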
ISSN: 2199-4536, 2198-6053
DOI: 10.1007/s40747-024-01563-4
Source: Advanced Technologies & Aerospace Database