Manuscript Abstract

A HYBRID DEEP LEARNING APPROACH FOR PLANT DISEASE CLASSIFICATION: INTEGRATING EFFICIENT NET AND VISION TRANSFORMERS
Babar Ali, Ramzan Talib, Muhammad Kashif Hanif, Muhammad Umer Sarwar

B. Ali¹, R. Talib², M. K. Hanif³*, M. U. Sarwar⁴

¹ Government College University, Faisalabad, Pakistan,
² Government College University, Faisalabad, Pakistan,
³ Government College University, Faisalabad, Pakistan,
⁴ Government College University, Faisalabad, Pakistan,

Corresponding Author: mkashifhanif@gcuf.edu.pk
Page Number(s): 79-96
Published Online First: November 12, 2025
Publication Date: January 20, 2026
ABSTRACT

The accurate classification of plant diseases is crucial for effective crop management and the advancement of sustainable agricultural practices. Early and precise detection of plant diseases is a cornerstone of precision agriculture, enabling the reduction of crop losses and the optimization of resource allocation. In recent years, deep learning models have shown exceptional performance in image-based tasks, including plant disease classification. However, the integration of advanced attention mechanisms, such as Vision Transformers, offers a promising approach for further enhancing the capabilities of these models. This study proposes a hybrid deep learning approach that combines the strengths of EfficientNet and Vision Transformers to improve the accuracy and efficiency of plant disease classification. This work focuses on using EfficientNet’s ability to extract discriminative local features from leaf images and ViT’s capacity to model long-range dependencies between image patches, thereby refining the classification process. Experiments were conducted on the augmented PlantVillage dataset, comprising 61,486 images of 38 distinct plant diseases across 14 plant species. The proposed hybrid EfficientNet-ViT model achieved a classification accuracy of 93%, with precision, recall, and F1-scores of 91%, 93%, and 92%, respectively, outperforming standalone models such as EfficientNet (89% accuracy) and traditional CNNs (e.g., ResNet: 89%). Comparative analysis with other transformer variants (DeiT, SWIN) further demonstrated the robustness of the approach, with SWIN achieving the highest accuracy (94%). The integration of data augmentation techniques improved model generalizability, contributing to a 4% increase in accuracy over non-augmented training. These results present the potential of combining convolutional neural networks with attention-based mechanisms to address complex challenges in precision agriculture.

Keywords: Plant disease classification, EfficientNet, Vision Transformers, hybrid deep learning, attention mechanisms, feature extraction, sustainable agriculture
Open Access: This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( https://creativecommons.org/licenses/by/4.0/).


Download Statistics
This Manuscript
Full Text
8
downloads
Citations
Semantic Scholar1
Semantic Scholar
1
Indicators
Metrics

Cite Score: 1.3

JCR Year: 2025

Indexing
Status

Web of Science (SCIE)

SCOPUS (Q3)

Journal Metrics
Current

Journal Impact Factor: 0.5

HEC Category: W

ISSN Details
Verified

Print ISSN: 1018-7081

Electronic ISSN: 2309-8694

Search the Journal

Use the fields below to search for articles by Title, Author, or Keywords.

All Downloads
Full Text
20,575
downloads
Supplementary
32
downloads