GitHub - zhongjingyu1/NDLP: [PR 2026] Official PyTorch Code for "Noise Correction and Distribution Fine-Tuning for Long-Tailed Partial Multi-Label Learning"

NDLP--Noise Correction and Distribution Fine-Tuning for Long-Tailed Partial Multi-Label Learning

The article "Noise Correction and Distribution Fine-Tuning for Long-Tailed Partial Multi-Label Learning" has been accepted by Pattern Recognition (PR 2026).

📹 Problem description of LT-PML

In this paper, we introduce a new and more realistic data setting for multi-label classification (MLC), termed long-tailed partial multi-label learning (LT-PML), which simultaneously incorporates the partial multi-label (PML) and long-tailed (LT) distribution setups. As shown in Figure 1 (a), LT-PML presents two intertwined challenges:

1) Mutual hindrance between noisy annotations and LT distributions. Candidate label sets prevent us from obtaining the class-wise label frequency priors, which are critical for existing LT methods. Moreover, the inherent LT distribution causes head classes to dominate the training data, leading to under-learning of tail classes. This imbalance makes disambiguation of rare classes more challenging. Meanwhile, label ambiguity tends to disproportionately affect tail classes: when annotators are uncertain, they are more likely to assign ambiguous labels to infrequent classes, since these classes are less familiar and less visually distinctive (Figure 1 (b)).

2) Label co-occurrence. Label co-occurrence is common in natural images. As shown in Figure 1 (c), a rare label such as pomegranate may co-occur with a frequent label such as computer, so naive resampling becomes ineffective: oversampling this image to compensate for the rare class inevitably increases the exposure of the head class as well, thereby corrupting the learned representations.
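
The co-occurrence issue can be illustrated with a toy count. The sketch below is not taken from the NDLP code; the dataset and the helper `label_counts` are hypothetical and only show that repeating an image that carries both a tail label and a head label raises both counts.

```python
from collections import Counter


def label_counts(samples):
    """Count how many samples carry each label (hypothetical helper)."""
    counts = Counter()
    for labels in samples:
        counts.update(labels)
    return counts


# Toy data (made up for illustration): "computer" is a head class and
# "pomegranate" is a tail class that only appears together with "computer".
dataset = [{"computer"}] * 8 + [{"computer", "pomegranate"}]
before = label_counts(dataset)

# Naive oversampling: repeat the only tail-class image four more times.
oversampled = dataset + [{"computer", "pomegranate"}] * 4
after = label_counts(oversampled)

print(before["computer"], before["pomegranate"])  # 9 1
print(after["computer"], after["pomegranate"])    # 13 5
```

The tail count rises from 1 to 5, but the head count also rises from 9 to 13, so the head class still dominates after resampling.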

📜 Abstract

Multi-label classifiers in practice often face two departures from standard benchmarks: labels are ambiguous and class frequencies are long-tailed. These conditions frequently co-occur, yielding long-tailed partial multi-label learning (LT-PML), where each sample is associated with a set of candidate labels, but only a subset of these labels is accurate. LT-PML is important because noisy candidate labels obscure the reliable class-frequency priors required by long-tail learning, while the scarcity of tail samples makes label disambiguation more difficult. In addition, label co-occurrence biases resampling and distribution estimation. Therefore, we propose a noise correction and distribution fine-tuning framework for LT-PML (NDLP). NDLP maintains moving-averaged soft pseudo-labels to estimate both label confidence and class distribution, introduces class-aware noise correction to strengthen positive learning from likely true labels, and performs batch-wise distribution fine-tuning to alleviate estimation errors caused by co-occurrence. The results indicate that NDLP outperforms existing methods on LT-PML datasets.
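
As a rough illustration of the first component, the sketch below shows one way moving-averaged soft pseudo-labels could be maintained and turned into a class-distribution estimate. It is an assumption-laden sketch, not the authors' implementation: the function names are hypothetical, the momentum name `eta` is borrowed from the `--eta` flag in the usage section without confirming that the flag plays this role, and restricting updates to candidate labels is an assumed design choice.

```python
def update_pseudo_labels(pseudo, preds, candidates, eta=0.9):
    """Moving-average update of soft pseudo-labels (hypothetical sketch).

    pseudo, preds, candidates are lists of per-sample rows of length
    num_classes. Non-candidate labels are forced to 0 (assumption).
    """
    updated = []
    for p_row, q_row, c_row in zip(pseudo, preds, candidates):
        row = [c * (eta * p + (1.0 - eta) * q)
               for p, q, c in zip(p_row, q_row, c_row)]
        updated.append(row)
    return updated


def estimate_class_distribution(pseudo):
    """Estimate a class prior as the normalized sum of soft pseudo-labels."""
    n_classes = len(pseudo[0])
    totals = [sum(row[k] for row in pseudo) for k in range(n_classes)]
    z = sum(totals)
    if z == 0:
        return [1.0 / n_classes] * n_classes
    return [t / z for t in totals]


# Toy example: two samples, three classes (values made up for illustration).
candidates = [[1, 1, 0], [1, 0, 0]]
pseudo = [[float(c) for c in row] for row in candidates]  # start from candidates
preds = [[0.9, 0.1, 0.8], [0.7, 0.2, 0.1]]                # model probabilities

pseudo = update_pseudo_labels(pseudo, preds, candidates)
prior = estimate_class_distribution(pseudo)
```

In this toy run the second candidate label of the first sample decays towards the model's low prediction, while the estimated prior is computed from soft pseudo-labels rather than from raw candidate counts.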

📕 Requirements

📋 Dataset

To train or evaluate on LT-PML, first download the VOC2012/2007, MSCOCO, and NUS-WIDE datasets. The image paths, labels, and captions for the VOC-MLT and COCO-MLT datasets can be found in the appendix directory. The expected layout is:

├── appendix
    ├── coco
        ├── coco_lt_train.txt
        ├── coco_lt_test.txt
        ├── coco_labels.txt
        ├── longtail2017
            ├── class_freq.pkl
            ├── class_split.pkl
    ├── VOCdevkit
        ├── voc_lt_train.txt
        ├── voc_lt_test.txt
        ├── voc_labels.txt
        ├── longtail2012
            ├── class_freq.pkl
            ├── class_split.pkl
    ├── nuswide
            ├── nw_lt_train.txt
            ├── nw_lt_test.txt
            ├── nw_labels.txt
            ├── class_freq.pkl
            ├── class_split.pkl
├── data
    ├── coco
        ├── train2017
            ├── 0000001.jpg
            ...
        ├── val2017
            ├── 0000002.jpg
            ...
    ├── voc
        ├── VOCdevkit
            ├── VOC2007
                ├── Annotations
                ├── ImageSets
                ├── JPEGImages
                    ├── 0000001.jpg
                    ...
                ├── SegmentationClass
                ├── SegmentationObject
            ├── VOC2012
                ├── Annotations
                ├── ImageSets
                ├── JPEGImages
                    ├── 0000002.jpg
                    ...
                ├── SegmentationClass
                ├── SegmentationObject
    ├── nuswide
        ├── images
            ├── 0_2124494179_b039ddccac_m.jpg
            ...
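
Before training, it can help to confirm that the layout above is in place. The following is a small sketch, assuming the paths are relative to the repository root and that the nesting is read correctly from the tree (the indentation of the nuswide entries is ambiguous); `EXPECTED` and `missing_paths` are hypothetical names, not part of the repository.

```python
from pathlib import Path

# Paths copied from the directory tree above (assumed relative to repo root).
EXPECTED = [
    "appendix/coco/coco_lt_train.txt",
    "appendix/coco/coco_lt_test.txt",
    "appendix/coco/coco_labels.txt",
    "appendix/coco/longtail2017/class_freq.pkl",
    "appendix/coco/longtail2017/class_split.pkl",
    "appendix/VOCdevkit/voc_lt_train.txt",
    "appendix/VOCdevkit/voc_lt_test.txt",
    "appendix/VOCdevkit/voc_labels.txt",
    "appendix/VOCdevkit/longtail2012/class_freq.pkl",
    "appendix/VOCdevkit/longtail2012/class_split.pkl",
    "appendix/nuswide/nw_lt_train.txt",
    "appendix/nuswide/nw_lt_test.txt",
    "appendix/nuswide/nw_labels.txt",
    "appendix/nuswide/class_freq.pkl",
    "appendix/nuswide/class_split.pkl",
    "data/coco/train2017",
    "data/coco/val2017",
    "data/voc/VOCdevkit/VOC2007/JPEGImages",
    "data/voc/VOCdevkit/VOC2012/JPEGImages",
    "data/nuswide/images",
]


def missing_paths(root, expected=EXPECTED):
    """Return the expected files or directories that do not exist under root."""
    root = Path(root)
    return [p for p in expected if not (root / p).exists()]


if __name__ == "__main__":
    for path in missing_paths("."):
        print("missing:", path)
```

Run from the repository root, the script prints one line per missing entry and nothing when the layout is complete.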

📃 Usage

VOC-PMLT

python lmpt/train.py configs/voc/LT_resnet50_pfc_DB.py \
--dataset 'voc-lt' \
--seed '0' \
--batch_size 32 \
--epochs 10 \
--gamma 3 \
--partial_rate 0.3 \
--eta 0.9 \
--alpha_range 0.4,0.8

Set --partial_rate to 0.3 or 0.5.

COCO-PMLT or NW-PMLT

python lmpt/train.py configs/coco/LT_resnet50_pfc_DB.py \
--dataset 'voc-lt' \
--seed '0' \
--batch_size 32 \
--epochs 10 \
--gamma 2 \
--partial_rate 0.05 \
--eta 0.9 \
--alpha_range 0.4,0.8

Set --partial_rate to 0.05 or 0.1.
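
For intuition about the partial rate, the sketch below shows one common way partial multi-label benchmarks build candidate label sets: every true label is kept, and each irrelevant label is added as a false positive with a fixed probability. Whether `--partial_rate` in this repository follows exactly this procedure is an assumption, and `make_candidate_labels` is a hypothetical helper, not a function from the code base.

```python
import random


def make_candidate_labels(true_labels, partial_rate, seed=0):
    """Build candidate label rows from binary ground-truth rows.

    True labels are always kept; each negative label is flipped into the
    candidate set with probability partial_rate (assumed semantics).
    """
    rng = random.Random(seed)
    candidates = []
    for row in true_labels:
        cand = [1 if y == 1 or rng.random() < partial_rate else 0 for y in row]
        candidates.append(cand)
    return candidates


# Toy ground truth: three samples, five classes (made up for illustration).
truth = [
    [1, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 0, 1],
]
noisy = make_candidate_labels(truth, partial_rate=0.3)
```

Under this construction the candidate set is always a superset of the true labels, which is why class frequencies counted from candidates overestimate the real ones.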

🔧 Pre-trained models

COCO-MLT

| Backbone | $$\rho$$ | Total | Head | Medium | Tail | Download |
| --- | --- | --- | --- | --- | --- | --- |
| ResNet-50 | 0.05 | 40.06 | 41.87 | 40.57 | 37.80 | model |
| ResNet-50 | 0.1 | 35.21 | 40.65 | 32.83 | 33.55 | model |

VOC-MLT

| Backbone | $$\rho$$ | Total | Head | Medium | Tail | Download |
| --- | --- | --- | --- | --- | --- | --- |
| ResNet-50 | 0.3 | 59.51 | 59.54 | 70.33 | 51.38 | model |
| ResNet-50 | 0.5 | 40.76 | 43.09 | 52.59 | 30.13 | model |

NW-MLT

| Backbone | $$\rho$$ | Total | Head | Medium | Tail | Download |
| --- | --- | --- | --- | --- | --- | --- |
| ResNet-50 | 0.05 | 29.43 | 33.04 | 23.31 | 51.78 | model |

💗 Acknowledgements

We use code from DBL and LMPT. We thank the authors for releasing their code.

📪 Contact

If you have any questions, please create an issue on this repository or contact us at 23171214508@stu.xidian.edu.cn.
