Machine Learning Faculty Publications

Visible-Infrared Person Re-Identification via Homogeneous Augmented Tri-Modal Learning

Mang Ye, Inception Institute of Artificial Intelligence
Jianbing Shen, Inception Institute of Artificial Intelligence
Ling Shao, Mohamed Bin Zayed University of Artificial Intelligence

Document Type

Article

Publication Title

IEEE Transactions on Information Forensics and Security

Abstract

Matching person images between the daytime visible modality and the night-time infrared modality (VI-ReID) is a challenging cross-modality pedestrian retrieval problem. Existing methods usually learn multi-modality features from raw images, ignoring the image-level discrepancy. Some methods apply GAN techniques to generate cross-modality images, but this destroys local structure and introduces unavoidable noise. In this paper, we propose a Homogeneous Augmented Tri-Modal (HAT) learning method for VI-ReID, in which an auxiliary grayscale modality is generated from the homogeneous visible images without any additional training process. The grayscale modality preserves the structure information of the visible images and approximates the image style of the infrared modality. Learning with the grayscale visible images forces the network to mine structure relations across multiple modalities, making it robust to color variations. Specifically, we address tri-modal feature learning from both multi-modal classification and multi-view retrieval perspectives. For multi-modal classification, we learn a multi-modality sharing identity classifier with a parameter-sharing network, trained with a homogeneous and heterogeneous identification loss. For multi-view retrieval, we develop a weighted tri-directional ranking loss to optimize the relative distance across multiple modalities. Combined with two invariant regularizers, HAT simultaneously minimizes multiple modality variations. In-depth analysis demonstrates that the homogeneous grayscale augmentation outperforms the current state of the art by a large margin.
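
The grayscale augmentation step mentioned in the abstract can be illustrated with a short sketch. This is not the authors' implementation: the function names `luma` and `to_grayscale`, the ITU-R BT.601 luma weights, and the choice to replicate the gray value across three channels are all assumptions made here for illustration, since the abstract does not specify the conversion.

```python
# Minimal sketch of generating a grayscale "third modality" from a visible image
# with no training involved. All names and weights below are illustrative assumptions.

# Assumed luma weights (ITU-R BT.601); the abstract does not state which are used.
WEIGHTS = (0.299, 0.587, 0.114)


def luma(pixel):
    """Map one (r, g, b) pixel with 0-255 channels to a single 0-255 gray value."""
    r, g, b = pixel
    value = round(WEIGHTS[0] * r + WEIGHTS[1] * g + WEIGHTS[2] * b)
    return max(0, min(255, value))  # clamp against rounding overshoot


def to_grayscale(image):
    """Convert an image (list of rows of (r, g, b) tuples) to grayscale.

    The gray value is repeated in all three channels so the result keeps the
    same shape as the visible input, which is what a parameter-sharing network
    would need (an assumption, not something the abstract confirms).
    """
    return [[(luma(p),) * 3 for p in row] for row in image]


# A tiny 2 x 2 "visible" image: red, green / blue, white.
visible = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (255, 255, 255)],
]
grayscale = to_grayscale(visible)
```

The output keeps the spatial layout of the visible image and discards only the color information, which matches the abstract's description of a modality that preserves structure while approximating the infrared image style.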

First Page

728

Last Page

739

DOI

10.1109/TIFS.2020.3001665

Publication Date

6-11-2020

Keywords

multi-modality, Person re-identification (Re-ID), ranking

Comments

IR Deposit conditions:

OA version (pathway a) Accepted version

No embargo

When accepted for publication, set statement to accompany deposit (see policy)

Must link to publisher version with DOI

Publisher copyright and source must be acknowledged

Recommended Citation

M. Ye, J. Shen and L. Shao, "Visible-Infrared Person Re-Identification via Homogeneous Augmented Tri-Modal Learning," in IEEE Transactions on Information Forensics and Security, vol. 16, pp. 728-739, 2021, doi: 10.1109/TIFS.2020.3001665.

Additional Links

IEEE link: https://doi.org/10.1109/TIFS.2020.3001665
