SAFER-STUDENT for Safe Deep Semi-Supervised Learning With Unseen-Class Unlabeled Data
Document Type
Article
Publication Title
IEEE Transactions on Knowledge and Data Engineering
Abstract
Deep semi-supervised learning (SSL) methods aim to utilize abundant unlabeled data to improve the seen-class classification. However, in the open-world scenario, collected unlabeled data tend to contain unseen-class data, which would degrade the generalization to seen-class classification. Formally, we define the problem as safe deep semi-supervised learning with unseen-class unlabeled data. One intuitive solution is removing these unseen-class instances after detecting them during the SSL process. Nevertheless, the performance of unseen-class identification is limited by the lack of suitable score function, the uncalibrated model, and the small number of labeled data. To this end, we propose a safe SSL method called SAFER-STUDENT from the teacher-student view. First, to enhance the ability of teacher model to identify seen and unseen classes, we propose a general scoring framework called Discrepancy with Raw (DR). Second, based on unseen-class data mined by teacher model from unlabeled data, we calibrate student model by newly proposed Unseen-class Energy-bounded Calibration (UEC) loss. Third, based on seen-class data mined by teacher model from unlabeled data, we propose Weighted Confirmation Bias Elimination (WCBE) loss to boost seen-class classification of student model. Extensive studies show that SAFER-STUDENT remarkably outperforms the state-of-the-art, verifying the effectiveness of our method in the under-explored problem.
First Page
318
Last Page
334
DOI
10.1109/TKDE.2023.3279139
Publication Date
5-1-2023
Keywords
Data models, Semisupervised learning, Uncertainty, Training, Deep learning
Recommended Citation
R. He, Z. Han, X. Lu and Y. Yin, "SAFER-STUDENT for Safe Deep Semi-Supervised Learning With Unseen-Class Unlabeled Data," in IEEE Transactions on Knowledge and Data Engineering, vol. 36, no. 1, pp. 318-334, Jan. 2024, doi: 10.1109/TKDE.2023.3279139
Comments
IR conditions: non-described