WOLO: Wilson Only Looks Once – Estimating ant body mass from reference-free images using deep convolutional neural networks

保存先:
書誌詳細
出版年:bioRxiv (Jan 27, 2025)
第一著者: Plum, Fabian
その他の著者: Plum, Lena, Bischoff, Corvin, Labonte, David
出版事項:
Cold Spring Harbor Laboratory Press
主題:
オンライン・アクセス:Citation/Abstract
Full text outside of ProQuest
タグ: タグ追加
タグなし, このレコードへの初めてのタグを付けませんか!

MARC

LEADER 00000nab a2200000uu 4500
001 3160209611
003 UK-CbPIL
022 |a 2692-8205 
024 7 |a 10.1101/2024.05.15.594277  |2 doi 
035 |a 3160209611 
045 0 |b d20250127 
100 1 |a Plum, Fabian 
245 1 |a WOLO: Wilson Only Looks Once – Estimating ant body mass from reference-free images using deep convolutional neural networks 
260 |b Cold Spring Harbor Laboratory Press  |c Jan 27, 2025 
513 |a Working Paper 
520 3 |a Size estimation is a hard computer vision problem with widespread applications in quality control in manufacturing and processing plants, livestock management, and research on animal behaviour. Image-based size estimation is typically facilitated by either well-controlled imaging conditions, the provision of global cues, or both. Reference-free size estimation remains challenging, because objects of vastly different sizes can appear identical if they are of similar shape. Here, we explore the feasibility of implementing automated and reference-free body size estimation to facilitate large-scale experimental work in a key model species in sociobiology: the leaf-cutter ants. Leaf-cutter ants are a suitable testbed for reference-free size estimation, because their workers differ vastly in both size and shape; in principle, it is therefore possible to infer body mass - a proxy for size - from relative body proportions alone. Inspired by earlier work by E.O. Wilson, who trained himself to discern ant worker size from visual cues alone, we deployed deep learning techniques to achieve the same feat automatically, quickly, at scale, and from reference-free images: Wilson Only Looks Once (WOLO). Using 150,000 hand-annotated and 100,000 computer-generated images, a set of deep convolutional neural networks were trained to estimate the body mass of ant workers from image cutouts. The best-performing WOLO networks achieved errors as low as 11% on unseen data, approximately matching or exceeding human performance, measured for a small group of both experts and non-experts, but were about 1000 times faster. Further refinement may thus enable accurate, high throughput, and non-intrusive body mass estimation in behavioural work, and so eventually contribute to a more nuanced and comprehensive understanding of the rules that underpin the complex division of labour that characterises polymorphic insect societies.Competing Interest StatementThe authors have declared no competing interest.Footnotes* In response to reviewer feedback we: Added a comprehensive Data Availability Statement; Defined a clear goal added in the introduction (Line 100); Shifted focus to a single VGG-style CNN for regression and classification (L170 to L176); Simplified evaluation using standard metrics: categorical accuracy, relative error (MAPE), and prediction stability (L219 to L278); Moved data collection details to Supplementary Information (L681 to L829); Presented detailed results in supplementary tables; main text focuses on key findings; Retrained all networks, simplified evaluation, and performed overall manuscript streamlining; Revised equations for clarity (e.g., distinguishing ground truth and estimates); Clarified terminology (e.g., epochs vs iterations, accuracy definitions); Addressed linguistic issues (e.g., reduced excessive hyphenation); Corrected inaccuracies (e.g., cross-entropy phrasing, loss discussion).* https://zenodo.org/records/11167521* https://zenodo.org/records/11167946* https://zenodo.org/records/14747391* https://zenodo.org/records/14746456* https://github.com/FabianPlum/WOLO 
653 |a Body mass 
653 |a Data collection 
653 |a Division of labor 
653 |a Visual discrimination learning 
653 |a Body size 
653 |a Quality control 
653 |a Neural networks 
653 |a Livestock 
653 |a Workers (insect caste) 
653 |a Information processing 
653 |a Leaves 
653 |a Visual stimuli 
653 |a Deep learning 
700 1 |a Plum, Lena 
700 1 |a Bischoff, Corvin 
700 1 |a Labonte, David 
773 0 |t bioRxiv  |g (Jan 27, 2025) 
786 0 |d ProQuest  |t Biological Science Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3160209611/abstract/embedded/6A8EOT78XXH2IG52?source=fedsrch 
856 4 0 |3 Full text outside of ProQuest  |u https://www.biorxiv.org/content/10.1101/2024.05.15.594277v3