TY - JOUR
T1 - High-order deep infomax-guided deformable transformer network for efficient lane detection
AU - Gao, Rong
AU - Hu, Siqi
AU - Yan, Lingyu
AU - Zhang, Li
AU - Ruan, Hang
AU - Yu, Yonghong
AU - Ye, Zhiwei
PY - 2023/4/4
Y1 - 2023/4/4
N2 - With the development of deep learning, lane detection models based on deep convolutional neural networks have been widely used in autonomous driving systems and advanced driver assistance systems. However, in the case of harsh and complex environment, the performances of detection models degrade greatly due to the difficulty in merging long-range lane points with global context and exclusion of important higher-order information. To address these issues, we propose a new learning model to better capture lane features, called Deformable Transformer with high-order Deep Infomax (DTHDI) model. Specifically, we propose a Deformable Transformer neural network model based on segmentation techniques for high-accuracy detection, in which local and global contextual information is seamlessly fused and more information about the diversity of lane line shape features is retained, resulting in extraction of rich lane features. Meanwhile, we introduce a mutual information maximization approach for mining higher-order correlations among global shape, local shape, and lane position of lane lines to learn more discriminative representations of lane lines. In addition, we employ a row classification approach to further reduce the computational complexity for robust lane line detection. Our model is evaluated on two popular lane detection datasets. The empirical results show that the proposed DTHDI model outperforms the state-of-the-art methods.
AB - With the development of deep learning, lane detection models based on deep convolutional neural networks have been widely used in autonomous driving systems and advanced driver assistance systems. However, in the case of harsh and complex environment, the performances of detection models degrade greatly due to the difficulty in merging long-range lane points with global context and exclusion of important higher-order information. To address these issues, we propose a new learning model to better capture lane features, called Deformable Transformer with high-order Deep Infomax (DTHDI) model. Specifically, we propose a Deformable Transformer neural network model based on segmentation techniques for high-accuracy detection, in which local and global contextual information is seamlessly fused and more information about the diversity of lane line shape features is retained, resulting in extraction of rich lane features. Meanwhile, we introduce a mutual information maximization approach for mining higher-order correlations among global shape, local shape, and lane position of lane lines to learn more discriminative representations of lane lines. In addition, we employ a row classification approach to further reduce the computational complexity for robust lane line detection. Our model is evaluated on two popular lane detection datasets. The empirical results show that the proposed DTHDI model outperforms the state-of-the-art methods.
UR - https://link.springer.com/article/10.1007/s11760-023-02525-y#citeas
U2 - 10.1007/s11760-023-02525-y
DO - 10.1007/s11760-023-02525-y
M3 - Article
SN - 1863-1703
JO - Signal, Image and Video Processing
JF - Signal, Image and Video Processing
ER -