Computer Vision Faculty Publications

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

Yu Hong, Zhejiang University
Hang Dai, Mohamed Bin Zayed University of Artificial IntelligenceFollow
Yong Ding, Zhejiang University

Document Type

Conference Proceeding

Publication Title

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract

Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brought significant improvement, e.g., Pseudo-LiDAR methods. However, the existing methods usually apply non-end-to-end training strategies and insufficiently leverage the LiDAR information, where the rich potential of the LiDAR data has not been well exploited. In this paper, we propose the Cross-Modality Knowledge Distillation (CMKD) network for monocular 3D detection to efficiently and directly transfer the knowledge from LiDAR modality to image modality on both features and responses. Moreover, we further extend CMKD as a semi-supervised training framework by distilling knowledge from large-scale unlabeled data and significantly boost the performance. Until submission, CMKD ranks 1 st among the monocular 3D detectors with publications on both KITTI test set and Waymo val set with significant performance gains compared to previous state-of-the-art methods. Our code will be released at https://github.com/Cc-Hy/CMKD.

First Page

Last Page

104

DOI

10.1007/978-3-031-20080-9_6

Publication Date

11-3-2022

Keywords

Object detection, Optical radar

Comments

IR conditions: non-described

Recommended Citation

Y. Hong, H. Dai, and Y. Ding. Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection, in Computer Vision (ECCV 2022), Lecture Notes in Computer Science, Oct 2022, vol 13670, pp. 87-104, doi:10.1007/978-3-031-20080-9_6

Link to Full Text

COinS

Computer Vision Faculty Publications

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Keywords

Comments

Recommended Citation

Browse

Contribute

Links

Computer Vision Faculty Publications

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Keywords

Comments

Recommended Citation

Share

Browse

Contribute

Links