YOLO-DC for vehicle detection using deformable convolutional networks and cross-channel coordinate attention
Journal article, Open access, Peer reviewed

Zhaojian Liu, Minghao Zhu, Bo Gao and Kang Zhao
Scientific Reports, Vol. 16(1), 6284
2026
DOI: 10.1038/s41598-026-37094-w
PMID: 41593158
URL: https://doi.org/10.1038/s41598-026-37094-w
Published (Version of record) Open Access

Abstract

Vehicle detection is crucial for intelligent decision support in transportation systems. However, real-time detection of vehicles is challenging due to geometric variations of vehicles and complex environmental factors such as lighting conditions and weather. To address these issues, this paper introduces the You-Only-Look-Once with Deformable Convolution and Cross-channel Coordinate Attention (YOLO-DC) framework, which improves the performance and reliability of vehicle detection. First, YOLO-DC incorporates Cross-channel Coordinate Attention, which combines channel attention and coordinate attention, to more accurately cover target sampling positions and enhance feature extraction from vehicles of various shapes. Second, to better handle vehicles of different sizes, we employ Multi-scale Grouped Convolution to enable multi-scale awareness and streamline parameter sharing. Additionally, we incorporate channel prior convolutional attention so that the model can concentrate on the areas of vehicles that are critical for detection. We also optimize feature fusion by combining C2f (CSP Bottleneck with 2 Convolutions) with FasterNet, which reduces the model size. Experimental results demonstrate that YOLO-DC outperforms the state-of-the-art YOLOv8n method in detecting small, medium, and large vehicles, and in detecting vehicles in adverse weather conditions. In addition to its superior performance, YOLO-DC offers fast detection speed, making it suitable for real-time detection on devices with limited computational power.
Keywords: Deep learning; YOLO; Cross-channel Coordinate attention; Smart transportation; Vehicle detection
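
The abstract does not define Multi-scale Grouped Convolution in detail, so the following is only a rough sketch of why grouping and channel splitting tend to shrink a convolution layer. It counts weights for a standard convolution, a grouped convolution, and a hypothetical multi-scale block that splits the channels evenly and gives each slice its own kernel size. The function names, the channel count of 64, and the kernel sizes (1, 3, 5, 7) are illustrative assumptions, not values taken from the paper.

```python
def conv_params(c_in, c_out, kernel, groups=1):
    """Weight count of a 2D convolution without bias.

    Each output channel only sees c_in / groups input channels,
    so the count is (c_in / groups) * c_out * kernel * kernel.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    return (c_in // groups) * c_out * kernel * kernel


def multi_scale_split_params(channels, kernels):
    """Hypothetical multi-scale block (an assumption, not the paper's design).

    The channels are split evenly into len(kernels) slices, and each
    slice is convolved with its own kernel size.
    """
    if channels % len(kernels):
        raise ValueError("channels must split evenly across kernel sizes")
    branch = channels // len(kernels)
    return sum(conv_params(branch, branch, k) for k in kernels)


standard = conv_params(64, 64, 3)                        # 64 * 64 * 9
grouped = conv_params(64, 64, 3, groups=4)               # 16 * 64 * 9
multi_scale = multi_scale_split_params(64, (1, 3, 5, 7)) # 16 * 16 * (1 + 9 + 25 + 49)

print(standard, grouped, multi_scale)  # 36864 9216 21504
```

Under these assumed settings, the multi-scale split covers receptive fields from 1x1 to 7x7 while using fewer weights than a single standard 3x3 convolution over all 64 channels.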
