ConvNext as a Basis for Interpretability in Coffee Leaf Rust Classification

Adrian Chavarro; Diego Renza; Moya-Albor, Ernesto

doi:10.3390/math12172668

ConvNext as a Basis for Interpretability in Coffee Leaf Rust Classification

Journal

Mathematics

ISSN

2227-7390

Date Issued

2024

Author(s)

Adrian Chavarro

Diego Renza

Moya-Albor, Ernesto

Facultad de Ingeniería - CampCM

Type

text::journal::journal article

DOI

10.3390/math12172668

URL

https://scripta.up.edu.mx/handle/20.500.12552/11334

Abstract

The increasing complexity of deep learning models can make it difficult to interpret and fit models beyond a purely accuracy-focused evaluation. This is where interpretable and eXplainable Artificial Intelligence (XAI) come into play to facilitate an understanding of the inner workings of models. Consequently, alternatives have emerged, such as class activation mapping (CAM) techniques aimed at identifying regions of importance for an image classification model. However, the behavior of such models can be highly dependent on the type of architecture and the different variants of convolutional neural networks. Accordingly, this paper evaluates three Convolutional Neural Network (CNN) architectures (VGG16, ResNet50, ConvNext-T) against seven CAM models (GradCAM, XGradCAM, HiResCAM, LayerCAM, GradCAM++, GradCAMElementWise, and EigenCAM), indicating that the CAM maps obtained with ConvNext models show less variability among them, i.e., they are less dependent on the selected CAM approach. This study was performed on an image dataset for the classification of coffee leaf rust and evaluated using the RemOve And Debias (ROAD) metric.