Abstract:
Unsupervised person re-identification (ReID), including fully unsupervised ReID and unsupervised domain adaptive ReID, remains a challenge for the fields of biometrics and computer vision due to its difficulty in learning with unlabeled target domain data. Existing state-of-the-art methods, most of which generate pseudo-labels via unsupervised clustering for model optimization, are inevitably hampered by the under-explored problem of pseudo-label noise. Motivated by this, we propose a novel joint framework termed pseudo-label Noise Prevention, Suppression, and Softening (NPSS) for unsupervised person re-identification. Instead of refining generated label noise after clustering as many existing methods do, we start solving this issue from the source of pseudo-label noise by proposing a new Dynamic Camera-Adaptive Clustering (DCAC), which dynamically involves camera information to prevent noise caused by cross-camera variance, thus improving their quality during clustering. Moreover, we propose an Online Domain Union (ODU) mechanism for the classification model learning on the target domain via involving source domain data with their ground-truth labels, which effectively suppresses the indelible noisy pseudo-labels. Furthermore, we present the Self-Consistency Constraint (SCC) to soften the label noise in a single model with reduced computation and network parameter cost, which achieves intra-sample knowledge ensembling with our global-local SCC and cross-sample knowledge ensembling with our inter-instance SCC. Experiments demonstrate the effectiveness of our method as it surpasses state-of-the-art methods by a large margin on Market-1501, DukeMTMC-ReID, and MSMT17 benchmarks.