Machine Learning Faculty Publications

SAFER-STUDENT for Safe Deep Semi-Supervised Learning With Unseen-Class Unlabeled Data

Rundong He, Shandong University
Zhongyi Han, Mohamed Bin Zayed University of Artificial Intelligence
Xiankai Lu, Shandong University
Yilong Yin, Shandong University

Document Type

Article

Publication Title

IEEE Transactions on Knowledge and Data Engineering

Abstract

Deep semi-supervised learning (SSL) methods aim to use abundant unlabeled data to improve seen-class classification. However, in open-world scenarios, collected unlabeled data tend to contain unseen-class data, which degrades generalization on seen-class classification. We formally define this problem as safe deep semi-supervised learning with unseen-class unlabeled data. One intuitive solution is to detect these unseen-class instances during the SSL process and remove them. Nevertheless, the performance of unseen-class identification is limited by the lack of a suitable score function, an uncalibrated model, and the small amount of labeled data. To this end, we propose a safe SSL method called SAFER-STUDENT from the teacher-student view. First, to enhance the ability of the teacher model to distinguish seen from unseen classes, we propose a general scoring framework called Discrepancy with Raw (DR). Second, based on unseen-class data mined by the teacher model from unlabeled data, we calibrate the student model with a newly proposed Unseen-class Energy-bounded Calibration (UEC) loss. Third, based on seen-class data mined by the teacher model from unlabeled data, we propose a Weighted Confirmation Bias Elimination (WCBE) loss to boost the student model's seen-class classification. Extensive studies show that SAFER-STUDENT remarkably outperforms the state of the art, verifying the effectiveness of our method on this under-explored problem.

First Page

318

Last Page

334

DOI

10.1109/TKDE.2023.3279139

Publication Date

5-1-2023

Keywords

Data models, Semisupervised learning, Uncertainty, Training, Deep learning

Comments

IR conditions: non-described

Recommended Citation

R. He, Z. Han, X. Lu and Y. Yin, "SAFER-STUDENT for Safe Deep Semi-Supervised Learning With Unseen-Class Unlabeled Data," in IEEE Transactions on Knowledge and Data Engineering, vol. 36, no. 1, pp. 318-334, Jan. 2024, doi: 10.1109/TKDE.2023.3279139
