Named Entity Recognition for Code Review Comments

保存先:
書誌詳細
出版年:Programming and Computer Software vol. 50, no. 7 (Dec 2024), p. 511
出版事項:
Springer Nature B.V.
主題:
オンライン・アクセス:Citation/Abstract
Full Text
Full Text - PDF
タグ: タグ追加
タグなし, このレコードへの初めてのタグを付けませんか!
その他の書誌記述
抄録:This paper addresses the problem of named entities recognition from source code reviews. The paper provides a comparative analysis of existing approaches and proposes its own methods to improve the quality of problem solving. Proposed and implemented improvements include: methods to deal with data imbalances, improved tokenization of input data, the use of large arrays of unlabeled data, and the use of additional binary classifiers. To assess quality, a new set of 3000 user code reviews was collected and manually labeled. It is shown that the proposed improvements can significantly increase the performance measured by quality metrics, calculated both at the token level (+22%) and at the entire entity level (+13%).
ISSN:0361-7688
1608-3261
DOI:10.1134/S0361768824700233
ソース:Advanced Technologies & Aerospace Database