Impact of cross-validation designs on cattle behavior prediction using machine learning and deep learning models with tri-axial accelerometer data

Na minha lista:
Detalhes bibliográficos
Publicado no:bioRxiv (Jan 23, 2025)
Autor principal: Wang, Jin
Outros Autores: Yu, Ziwen, Chebel, Ricardo C, Yu, Haipeng
Publicado em:
Cold Spring Harbor Laboratory Press
Assuntos:
Acesso em linha:Citation/Abstract
Full Text - PDF
Full text outside of ProQuest
Tags: Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!

MARC

LEADER 00000nab a2200000uu 4500
001 3158976040
003 UK-CbPIL
022 |a 2692-8205 
024 7 |a 10.1101/2025.01.22.634181  |2 doi 
035 |a 3158976040 
045 0 |b d20250123 
100 1 |a Wang, Jin 
245 1 |a Impact of cross-validation designs on cattle behavior prediction using machine learning and deep learning models with tri-axial accelerometer data 
260 |b Cold Spring Harbor Laboratory Press  |c Jan 23, 2025 
513 |a Working Paper 
520 3 |a Assessing cattle behaviors provides insights into animal health, welfare, and productivity to support on-farm management decisions. Wearable accelerometers offer an alternative approach to traditional human evaluation, providing a more objective and efficient method for predicting cattle behavior. Random cross-validation (CV) is commonly used to evaluate behavior prediction by splitting data into training and testing sets, but it can yield inflated results when records from the same animal are included in both sets. Block CV splits data by block effects, offering a more realistic evaluation but remains underexplored for predicting multi-class imbalanced cattle behavior. Additionally, deep learning (DL) models have not been fully explored for behavior prediction compared to machine learning (ML) models. The objectives of this study were to examine the impact of CV designs on multi-class imbalanced cattle behavior prediction and to compare the performance of ML and DL models. Three ML and two DL models were used to predict the four behaviors of six beef cows from a public tri-axial accelerometer dataset, with model performance evaluated using both hold-out and leave-cow-out CV designs representing random and block CV, respectively. In hold-out CV, ML models achieved accuracies of 0.94 to 0.96 and F1 scores of 0.93 to 0.95, while DL models achieved accuracies of 0.9 to 0.92 and F1 scores of 0.89 to 0.91. In the leave-cow-out CV, ML models obtained accuracies of 0.72 to 0.82 and F1 scores of 0.64 to 0.82, whereas DL models obtained accuracies of 0.76 to 0.82 and F1 scores of 0.64 to 0.76. Generally, ML models outperformed DL models in the hold-out CV, but the multi-layer perceptron DL model demonstrated comparable or superior performance in the leave-cow-out CV. All models performed better with hold-out CV than leave-cow-out CV. Our results suggest that CV designs can affect behavior prediction performance. While a random CV produces seemingly good predictions, these results can be artificially inflated by the data partition. A block CV that strategically partitions data could be a more appropriate design.Competing Interest StatementThe authors have declared no competing interest. 
653 |a Machine learning 
653 |a Behavior 
653 |a Deep learning 
653 |a Cattle 
653 |a Animal models 
653 |a Predictions 
653 |a Farm management 
653 |a Learning algorithms 
700 1 |a Yu, Ziwen 
700 1 |a Chebel, Ricardo C 
700 1 |a Yu, Haipeng 
773 0 |t bioRxiv  |g (Jan 23, 2025) 
786 0 |d ProQuest  |t Biological Science Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3158976040/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3158976040/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full text outside of ProQuest  |u https://www.biorxiv.org/content/10.1101/2025.01.22.634181v1