RT Journal T1 A COMPREHENSIVE REVIEW OF OBJECT DETECTION IN ANIMAL AND PLANT USING VISION TRANSFORMER A1 Maolan Lin A1 Zhenchang Gao A1 Honghao Cai A1 Wenliang Liao JF Journal of Animal and Plant Sciences JO JAPS SN 1018-7081 VO 36 IS 2 SP 321 OP 330 YR 2026 FD 2026/02/28 DO DOI https://doi.org/10.36899/JAPS.2026.2.0027 AB

In digital farming, computers serve as primary sensing eyes and object detection is the core vision task that locates and counts the target objects of interest, i.e., plants, fruits and livestock, in various agricultural systems. While, Vision Transformers (ViTs), a natural language processing alternative to convolutional neural networks by capturing global context through self-attention, have shown great potential in object detection. However, the field of ViT-based detectors remains fragmented, with independent advances in plant and animal studies and a lack of comprehensive analysis connecting these domains. To bridge this gap, we conducted a systematic review, retaining 30 primary studies after a dual screening and quality appraisal process—20 focused on plant production and 10 on animal production. Our analysis shows that ViT-based models excel in multi-scale representation, complex scene reasoning, and efficient feature extraction. These capabilities give high accuracy in fruit quality assessment, crop growth monitoring, weed detection, meat grading and livestock behaviour surveillance. However, challenges such as high computational complexity, large parameter sizes, environmental variability, small object detection, and data annotation requirements remain. For researchers and practitioners, this review offers a unified framework to understand ViT-based detection. It pinpoints cross-domain challenges and concludes with a forward-looking pathway to turn these insights into practical, on-farm solutions.

K1 Convolutional neural network, Computer vision, Deep learning; Machine learning, ViT, YOLO PB Pakistan Agricultural Scientists Forum LK https://thejaps.org.pk/AbstractView.aspx?mid=2024-JAPS-2754