Impact of cross-validation designs on cattle behavior prediction using machine learning and deep learning models with tri-axial accelerometer data

Na minha lista:

Detalhes bibliográficos
Publicado no:	bioRxiv (Jan 23, 2025)
Autor principal:	Wang, Jin
Outros Autores:	Yu, Ziwen, Chebel, Ricardo C, Yu, Haipeng
Publicado em:	Cold Spring Harbor Laboratory Press
Assuntos:	Machine learning Behavior Deep learning Cattle Animal models Predictions Farm management Learning algorithms
Acesso em linha:	Citation/Abstract Full Text - PDF Full text outside of ProQuest
Tags:	Adicionar Tag Sem tags, seja o primeiro a adicionar uma tag!

MARC


LEADER	00000nab a2200000uu 4500
001	3158976040
003	UK-CbPIL
022			\|a 2692-8205
024	7		\|a 10.1101/2025.01.22.634181 \|2 doi
035			\|a 3158976040
045	0		\|b d20250123
100	1		\|a Wang, Jin
245	1		\|a Impact of cross-validation designs on cattle behavior prediction using machine learning and deep learning models with tri-axial accelerometer data
260			\|b Cold Spring Harbor Laboratory Press \|c Jan 23, 2025
513			\|a Working Paper
520	3		\|a Assessing cattle behaviors provides insights into animal health, welfare, and productivity to support on-farm management decisions. Wearable accelerometers offer an alternative approach to traditional human evaluation, providing a more objective and efficient method for predicting cattle behavior. Random cross-validation (CV) is commonly used to evaluate behavior prediction by splitting data into training and testing sets, but it can yield inflated results when records from the same animal are included in both sets. Block CV splits data by block effects, offering a more realistic evaluation but remains underexplored for predicting multi-class imbalanced cattle behavior. Additionally, deep learning (DL) models have not been fully explored for behavior prediction compared to machine learning (ML) models. The objectives of this study were to examine the impact of CV designs on multi-class imbalanced cattle behavior prediction and to compare the performance of ML and DL models. Three ML and two DL models were used to predict the four behaviors of six beef cows from a public tri-axial accelerometer dataset, with model performance evaluated using both hold-out and leave-cow-out CV designs representing random and block CV, respectively. In hold-out CV, ML models achieved accuracies of 0.94 to 0.96 and F1 scores of 0.93 to 0.95, while DL models achieved accuracies of 0.9 to 0.92 and F1 scores of 0.89 to 0.91. In the leave-cow-out CV, ML models obtained accuracies of 0.72 to 0.82 and F1 scores of 0.64 to 0.82, whereas DL models obtained accuracies of 0.76 to 0.82 and F1 scores of 0.64 to 0.76. Generally, ML models outperformed DL models in the hold-out CV, but the multi-layer perceptron DL model demonstrated comparable or superior performance in the leave-cow-out CV. All models performed better with hold-out CV than leave-cow-out CV. Our results suggest that CV designs can affect behavior prediction performance. While a random CV produces seemingly good predictions, these results can be artificially inflated by the data partition. A block CV that strategically partitions data could be a more appropriate design.Competing Interest StatementThe authors have declared no competing interest.
653			\|a Machine learning
653			\|a Behavior
653			\|a Deep learning
653			\|a Cattle
653			\|a Animal models
653			\|a Predictions
653			\|a Farm management
653			\|a Learning algorithms
700	1		\|a Yu, Ziwen
700	1		\|a Chebel, Ricardo C
700	1		\|a Yu, Haipeng
773	0		\|t bioRxiv \|g (Jan 23, 2025)
786	0		\|d ProQuest \|t Biological Science Database
856	4	1	\|3 Citation/Abstract \|u https://www.proquest.com/docview/3158976040/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch
856	4	0	\|3 Full Text - PDF \|u https://www.proquest.com/docview/3158976040/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch
856	4	0	\|3 Full text outside of ProQuest \|u https://www.biorxiv.org/content/10.1101/2025.01.22.634181v1