RaLiBEV: Radar and LiDAR BEV Fusion Learning for Anchor Box Free Object Detection Systems

Yang, Yanlong, Liu, Jianan, Huang, Tao, Han, Qing Long, Ma, Gang, and Zhu, Bing (2025) RaLiBEV: Radar and LiDAR BEV Fusion Learning for Anchor Box Free Object Detection Systems. IEEE Transactions on Circuits and Systems for Video Technology, 35 (5). pp. 4130-4143.

PDF (Published Version) - Published Version
Restricted to Repository staff only

DOI: 10.1109/TCSVT.2024.3521375

View at Publisher Website: https://doi.org/10.1109/TCSVT.2024.35213...

Abstract

In autonomous driving, LiDAR and radar are crucial for environmental perception. LiDAR offers precise 3D spatial sensing information but struggles in adverse weather like fog. Conversely, radar signals can penetrate rain or mist due to their specific wavelength but are prone to noise disturbances. Recent state-of-the-art works reveal that the fusion of radar and LiDAR can lead to robust detection in adverse weather. Current approaches typically fuse features from various data sources using basic convolutional/transformer network architectures and employ straightforward label assignment strategies for object detection. However, these methods have two main limitations: they fail to adequately capture feature interactions and lack consistent regression constraints. In this paper, we propose a bird’s-eye view fusion learning-based anchor box-free object detection system. Our approach introduces a novel interactive transformer module for enhanced feature fusion and an advanced label assignment strategy for more consistent regression, addressing key limitations in existing methods. Specifically, experiments show that, our approach’s average precision ranks 1^{st} and significantly outperforms the state-of-the-art method by 13.1% and 19.0% at Intersection of Union (IoU) of 0.8 under “Clear+Foggy” training conditions for “Clear” and “Foggy” testing, respectively.


Item ID:	88626
Item Type:	Article (Research - C1)
ISSN:	1558-2205
Keywords:	anchor box free object detection, Autonomous driving, bird eye’s view fusion, deep learning, interactive transformer, label assignment, LiDAR, point cloud, radar, range-azimuth heatmap
Copyright Information:	© 2024 IEEE. All rights reserved, including rights for text and data mining, and training of artificial intelligence and similar technologies. Personal use is permitted, but republication/redistribution requires IEEE permission.
Date Deposited:	04 Jun 2026 04:49
FoR Codes:	46 INFORMATION AND COMPUTING SCIENCES > 4603 Computer vision and multimedia computation > 460304 Computer vision @ 100%
SEO Codes:	27 TRANSPORT > 2703 Ground transport > 270302 Autonomous road vehicles @ 100%
	More Statistics

Actions (Repository Staff Only)

Item Control Page