Spatially Recalibrated Convolutional Neural Network for Vehicle Type Recognition

Tan, Shi Hao and Chuah, Joon Huang and Chow, Chee-Onn and Kanesan, Jeevan (2023) Spatially Recalibrated Convolutional Neural Network for Vehicle Type Recognition. IEEE Access, 11. pp. 142525-142537. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2023.3342109.

Full text not available from this repository.

Abstract

Vehicle Type Recognition (VTR) is a significant segment of the vehicle recognition field. It provides an identification method that complements license plate recognition and vehicle make and model recognition. Most recent studies use Convolutional Neural Networks (CNNs) to perform VTR. However, the feature responses obtained from CNNs are not recalibrated according to saliency, which hinders classification performance. In this study, we propose a Spatial Attention Module (SAM) that is compatible with existing CNNs. We aim to exploit the spatial relationship between feature responses by scaling them according to their relative importance, thereby increasing classification accuracy. SAM achieves accuracies of 96.92%, 84.48% and 95.96% on Beijing Institute of Technology (BIT)-Vehicle, Stanford Cars and web-nature Comprehensive Cars (CompCarsWeb), respectively. A qualitative inspection of the learned feature embedding suggests that features within the same group are highly cohesive. Furthermore, an ablation study is conducted to justify the hyperparameters chosen for SAM. SAM is also modular: it is highly compatible with other CNNs and leads to considerable performance improvement. A comparison with existing attention modules suggests that our proposal outperforms them in the VTR application. With inference times of 1 ms and 10 ms, respectively, CaffeNet-SAM and ResNet-SAM are also suitable for real-time classification tasks.
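
The idea of scaling feature responses by their relative spatial importance can be illustrated with a minimal, generic sketch. This is not the SAM architecture from the paper (the full text is not available from this record); it is a simple sigmoid gate, assumed here purely for illustration. The function name `spatial_recalibrate` and the use of the channel mean as a saliency score are hypothetical choices.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def spatial_recalibrate(features):
    """Scale every spatial location of a feature map by an attention weight.

    `features` is a nested list shaped [channels][height][width].
    Assumption (not from the paper): the saliency of a location is the
    mean response across channels, squashed by a sigmoid.
    """
    channels = len(features)
    height = len(features[0])
    width = len(features[0][0])

    # One weight per spatial location, shared by all channels.
    weights = [
        [
            sigmoid(sum(features[c][i][j] for c in range(channels)) / channels)
            for j in range(width)
        ]
        for i in range(height)
    ]

    # Recalibrate: multiply each response by the weight of its location.
    return [
        [
            [features[c][i][j] * weights[i][j] for j in range(width)]
            for i in range(height)
        ]
        for c in range(channels)
    ]
```

Locations with a strong average response keep most of their magnitude, while weak locations are suppressed. A learned module would replace the fixed channel mean with trainable parameters.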

Item Type: Article
Funders: UNSPECIFIED
Uncontrolled Keywords: Convolutional neural network; multi-head self-attention; spatial attention module; transformer; vehicle type recognition
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions: Faculty of Engineering > Department of Electrical Engineering
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 16 Jul 2025 04:09
Last Modified: 16 Jul 2025 04:09
URI: http://eprints.um.edu.my/id/eprint/50984
