A HYBRID DEEP LEARNING APPROACH FOR PLANT DISEASE CLASSIFICATION: INTEGRATING EFFICIENT NET AND VISION TRANSFORMERS

Babar Ali; Ramzan Talib; Muhammad Kashif Hanif; Muhammad Umer Sarwar

doi:10.36899/JAPS.2026.1.0007

Manuscript Abstract

Download PDF

Volume 36, No. (1), 2026 (February)

A HYBRID DEEP LEARNING APPROACH FOR PLANT DISEASE CLASSIFICATION: INTEGRATING EFFICIENT NET AND VISION TRANSFORMERS

Babar Ali, Ramzan Talib, Muhammad Kashif Hanif, Muhammad Umer Sarwar

B. Ali¹, R. Talib², M. K. Hanif³*, M. U. Sarwar⁴

¹ Government College University, Faisalabad, Pakistan,
² Government College University, Faisalabad, Pakistan,
³ Government College University, Faisalabad, Pakistan,
⁴ Government College University, Faisalabad, Pakistan,

Corresponding Author: mkashifhanif@gcuf.edu.pk

DOI: https://doi.org/10.36899/JAPS.2026.1.0007

Page Number(s): 79-96

Published Online First: November 12, 2025

Publication Date: January 20, 2026

ABSTRACT

The accurate classification of plant diseases is crucial for effective crop management and the advancement of sustainable agricultural practices. Early and precise detection of plant diseases is a cornerstone of precision agriculture, enabling the reduction of crop losses and the optimization of resource allocation. In recent years, deep learning models have shown exceptional performance in image-based tasks, including plant disease classification. However, the integration of advanced attention mechanisms, such as Vision Transformers, offers a promising approach for further enhancing the capabilities of these models. This study proposes a hybrid deep learning approach that combines the strengths of EfficientNet and Vision Transformers to improve the accuracy and efficiency of plant disease classification. This work focuses on using EfficientNet’s ability to extract discriminative local features from leaf images and ViT’s capacity to model long-range dependencies between image patches, thereby refining the classification process. Experiments were conducted on the augmented PlantVillage dataset, comprising 61,486 images of 38 distinct plant diseases across 14 plant species. The proposed hybrid EfficientNet-ViT model achieved a classification accuracy of 93%, with precision, recall, and F1-scores of 91%, 93%, and 92%, respectively, outperforming standalone models such as EfficientNet (89% accuracy) and traditional CNNs (e.g., ResNet: 89%). Comparative analysis with other transformer variants (DeiT, SWIN) further demonstrated the robustness of the approach, with SWIN achieving the highest accuracy (94%). The integration of data augmentation techniques improved model generalizability, contributing to a 4% increase in accuracy over non-augmented training. These results present the potential of combining convolutional neural networks with attention-based mechanisms to address complex challenges in precision agriculture.

Keywords: Plant disease classification, EfficientNet, Vision Transformers, hybrid deep learning, attention mechanisms, feature extraction, sustainable agriculture

Open Access: This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( https://creativecommons.org/licenses/by/4.0/).

Download Statistics

This Manuscript

Full Text

downloads

Citations

Semantic Scholar1

Semantic Scholar

Indicators

Metrics

Cite Score: 1.3

JCR Year: 2025

Indexing

Status

Web of Science (SCIE)

SCOPUS (Q3)

Journal Metrics

Current

Journal Impact Factor: 0.5

HEC Category: W

ISSN Details

Verified

Print ISSN: 1018-7081

Electronic ISSN: 2309-8694

Search the Journal

Use the fields below to search for articles by Title, Author, or Keywords.

All Downloads

Full Text

20,575

downloads

Supplementary

downloads

Manuscript Abstract

Citation Styles

A HYBRID DEEP LEARNING APPROACH FOR PLANT DISEASE CLASSIFICATION: INTEGRATING EFFICIENT NET AND VISION TRANSFORMERS

ABSTRACT

Search the Journal