This repository contains the implementation code for paper:
Targeted Supervised Contrastive Learning for Long-Tailed Recognition
Tianhong Li*, Peng Cao*, Yuan Yuan, Lijie Fan, Yuzhe Yang, Rogerio Feris, Piotr Indyk, Dina Katabi
IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2022)
[Paper]
This repository also contains an unofficial reimplementation for paper:
Exploring Balanced Feature Spaces for Representation Learning
Bingyi Kang, Yu Li, Sa Xie, Zehuan Yuan, Jiashi Feng
The Ninth International Conference on Learning Representations (ICLR 2021)
If you find this code or idea useful, please consider citing our work:
@article{li2021targeted,
title={Targeted Supervised Contrastive Learning for Long-Tailed Recognition},
author={Li, Tianhong and Cao, Peng and Yuan, Yuan and Fan, Lijie and Yang, Yuzhe and Feris, Rogerio and Indyk, Piotr and Katabi, Dina},
journal={arXiv preprint arXiv:2111.13998},
year={2021}
}
With high imbalance ratio, class centers learned by KCL exhibit poor uniformity while class centers learned by TSC are still uniformly distributed and thus TSC achieves better performance.
- Download ImageNet dataset, and place them in your
data_root
. Long-tailed version will be created using train/val splits (.txt files) in corresponding subfolders underimagenet_inat/data/
- Change the
data_root
andsave_folder
inmain_moco_supcon_imba.py
andmain_lincls_imba.py
accordingly for ImageNet-LT. - All trainings are tested on 4 Titan X GPUs.
- PyTorch (>= 1.6, tested on 1.6)
- scikit-learn
- tensorboard-logger
Training of KCL/TSC cosists of two steps: first, the feature encoder (ResNet-50) is pretrained using KCL/TSC losses. Second, a linear classifier is added on top of the feature encoder and fine-tuned using cross-entropy loss.
To generate targets for TSC pre-training, use
python target_generation.py
The generated targets will be stored at optimal_{N}_{M}.npy, where N is the number of classes, M is the dimension of output features.
python main_moco_supcon_imba.py --cos --epochs 200 --K 6 --moco-dim 128 --mlp \ -a resnet50 --name kcl_release \
-a resnet50 --name [YOUR PREFERRED NAME] \
--lr 0.1 --batch-size 256 \
--dist-url 'tcp://localhost:10000' --multiprocessing-distributed --world-size 1 --rank 0 \
--dataset imagenet
python main_moco_supcon_imba.py --cos --epochs 400 --K 6 --moco-dim 128 --mlp --targeted --tr 1 --sep_t --tw 0.2 \
-a resnet50 --name [YOUR PREFERRED NAME] \
--lr 0.1 --batch-size 256 \
--dist-url 'tcp://localhost:10000' --multiprocessing-distributed --world-size 1 --rank 0 \
--dataset imagenet
python main_lincls_imba.py --dataset imagenet --pretrained [PRETRAINED MODEL PATH FROM 1ST STAGE] --epochs 40 --schedule 20 30 --seed 0 -b 2048
Here we show the accuracy of KCL and TSC on ImageNet-LT in 5 trails:
Method | Trial 1 | Trial 2 | Trial 3 | Trial 4 | Trial 5 |
---|---|---|---|---|---|
KCL | 51.2 | 51.6 | 51.5 | 51.2 | 51.3 |
TSC | 52.3 | 53.0 | 52.6 | 52.1 | 52.2 |
NOTE: many/medium/minor classes accuracy could change significantly with different learning rate or batch size in the second stage while overall accuracy remains the same.
This code inherits some codes from MoCo and Classifier-Balancing.
If you have any questions, feel free to contact me through email (tianhong@mit.edu) or Github issues. Enjoy!