Inverse Image Frequency for Long-Tailed Image Recognition



Alexandridis, Konstantinos Panagiotis, Luo, Shan ORCID: 0000-0003-4760-0372, Nguyen, Anh ORCID: 0000-0002-1449-211X, Deng, Jiankang and Zafeiriou, Stefanos
(2023) Inverse Image Frequency for Long-Tailed Image Recognition. IEEE Transactions on Image Processing, 32. pp. 5721-5736.

[img] PDF
Inverse_Image_Frequency.pdf - Author Accepted Manuscript

Download (10MB) | Preview

Abstract

The long-tailed distribution is a common phenomenon in the real world. Extracted large scale image datasets inevitably demonstrate the long-tailed property and models trained with imbalanced data can obtain high performance for the over-represented categories, but struggle for the under-represented categories, leading to biased predictions and performance degradation. To address this challenge, we propose a novel de-biasing method named Inverse Image Frequency (IIF). IIF is a multiplicative margin adjustment transformation of the logits in the classification layer of a convolutional neural network. Our method achieves stronger performance than similar works and it is especially useful for downstream tasks such as long-tailed instance segmentation as it produces fewer false positive detections. Our extensive experiments show that IIF surpasses the state of the art on many long-tailed benchmarks such as ImageNet-LT, CIFAR-LT, Places-LT and LVIS, reaching 55.8% top-1 accuracy with ResNet50 on ImageNet-LT and 26.3% segmentation AP with MaskRCNN ResNet50 on LVIS. Code available at https://github.com/kostas1515/iif.

Item Type: Article
Divisions: Faculty of Science and Engineering > School of Electrical Engineering, Electronics and Computer Science
Depositing User: Symplectic Admin
Date Deposited: 26 Oct 2023 10:03
Last Modified: 21 Nov 2023 15:04
DOI: 10.1109/tip.2023.3321461
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3176456