Machine Learning Faculty Publications

Box Guided Convolution for Pedestrian Detection

Jinpeng Li, Inception Institute of Artificial Intelligence
Shengcai Liao, University of Chinese Academy of Sciences
Hangzhi Jiang, Inception Institute of Artificial Intelligence
Ling Shao, Mohamed Bin Zayed University of Artificial IntelligenceFollow

Document Type

Conference Proceeding

Publication Title

MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia

Abstract

Occlusions, scale variation and numerous false positives still represent fundamental challenges in pedestrian detection. Intuitively, different sizes of receptive fields and more attention to the visible parts are required for detecting pedestrians with various scales and occlusion levels, respectively. However, these challenges have not been addressed well by existing pedestrian detectors. This paper presents a novel convolutional network, denoted as box guided convolution network (BGCNet), to tackle these challenges simultaneously in a unified framework. In particular, we proposed a box guided convolution (BGC) that can dynamically adjust the sizes of convolution kernels guided by the predicted bounding boxes. In this way, BGCNet provides position-aware receptive fields to address the challenge of large variations of scales. In addition, for the issue of heavy occlusion, the kernel parameters of BGC are spatially localized around the salient and mostly visible key points of a pedestrian, such as the head and foot, to effectively capture high-level semantic features to help detection. Furthermore, a local maximum (LM) loss is introduced to depress false positives and highlight true positives by forcing positives, rather than negatives, as local maximums, without any additional inference burden. We evaluate BGCNet on popular pedestrian detection benchmarks, and achieve the state-of-the-art results, with the significant performance improvement on heavily occluded and small-scale pedestrians.

First Page

1615

Last Page

1624

DOI

10.1145/3394171.3413989

Publication Date

10-12-2020

Keywords

box guided convolution, pedestrian detection, receptive fields, scale variation

Comments

IR conditions: non-described

Recommended Citation

J. Li et al., "Box Guided Convolution for Pedestrian Detection," MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia, pp. 1615 - 1624, Oct 2020.

The definitive version is available at https://doi.org/10.1145/3394171.3413989

Additional Links

DOI link: https://doi.org/10.1145/3394171.3413989

Link to Full Text

COinS

Machine Learning Faculty Publications

Box Guided Convolution for Pedestrian Detection

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Keywords

Comments

Recommended Citation

Additional Links

Browse

Contribute

Links

Machine Learning Faculty Publications

Box Guided Convolution for Pedestrian Detection

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Keywords

Comments

Recommended Citation

Additional Links

Share

Browse

Contribute

Links