Text Classification Using Enhanced Binary Wind Driven Optimization Algorithm
| Published in: | International Journal of Advanced Computer Science and Applications, vol. 16, no. 6 (2025) |
|---|---|
| Main author: | |
| Published by: | Science and Information (SAI) Organization Limited |
| Subjects: | |
| Online access: | Citation/Abstract; Full Text - PDF |
| Abstract: | Document classification using supervised machine learning is now widely used on the internet and in digital libraries. Several studies have focused on English-language document classification. Arabic text, however, is morphologically highly variable, which yields a large number of extracted features and increases the dimensionality of the classification task. To reduce the curse of dimensionality in Arabic text classification, this study proposes a wrapper feature selection (FS) method. Specifically, a hybrid metaheuristic model based on Wind Driven Optimization (WD) and Simulated Annealing, known as WDFS, is designed to solve the FS task for Arabic text. The WD method is first used to optimize the FS task in the exploration phase. WD is then hybridized with simulated annealing, which acts as a local search in the exploitation phase to improve the solutions found by WD. Three classifiers are used to evaluate the features selected by WDFS: K-Nearest Neighbor, Naïve Bayes, and Decision Tree. The proposed WDFS method was assessed on four selected groups of files from a benchmark TREC Arabic newswire text dataset. Comparative results showed that WDFS outperforms other existing Arabic text classification methods in terms of accuracy. The results indicate that WDFS has strong potential for reliably searching the feature space to obtain the optimal combination of features. |
|---|---|
| ISSN: | 2158-107X, 2156-5570 |
| DOI: | 10.14569/IJACSA.2025.01606107 |
| Source: | Advanced Technologies & Aerospace Database |
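
The abstract describes simulated annealing as a local search that refines feature subsets during exploitation. The following is a minimal sketch of that general idea only, not the authors' WDFS implementation: the Wind Driven step is omitted, and the function names, the single-bit-flip neighborhood, the cooling schedule, and the toy fitness function are all assumptions made for illustration.

```python
import math
import random


def simulated_annealing_fs(fitness, start_mask, iters=200, t0=1.0, cooling=0.95, seed=0):
    """Refine a binary feature mask by simulated annealing (maximizes fitness).

    All parameter defaults are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    current = list(start_mask)
    current_fit = fitness(current)
    best, best_fit = list(current), current_fit
    temp = t0
    for _ in range(iters):
        # Neighbor: flip one randomly chosen feature bit.
        neighbor = list(current)
        i = rng.randrange(len(neighbor))
        neighbor[i] = 1 - neighbor[i]
        neighbor_fit = fitness(neighbor)
        delta = neighbor_fit - current_fit
        # Always accept improvements; accept worse moves with
        # probability exp(delta / temp), which shrinks as temp cools.
        if delta >= 0 or rng.random() < math.exp(delta / temp):
            current, current_fit = neighbor, neighbor_fit
            if current_fit > best_fit:
                best, best_fit = list(current), current_fit
        temp *= cooling
    return best, best_fit


def toy_fitness(mask):
    """Hypothetical stand-in for classifier accuracy.

    Rewards selecting features 0, 1, and 2 and penalizes subset size.
    """
    informative = {0, 1, 2}
    hits = sum(1 for i, bit in enumerate(mask) if bit and i in informative)
    return hits / len(informative) - 0.01 * sum(mask)


if __name__ == "__main__":
    start = [1] * 10  # begin with every feature selected
    mask, score = simulated_annealing_fs(toy_fitness, start)
    print(mask, round(score, 3))
```

In a wrapper setting such as the one the abstract describes, `toy_fitness` would presumably be replaced by a function that trains a classifier (for example, K-Nearest Neighbor) on the selected features and returns its validation accuracy.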