Learning to Transform for Generalizable Instance-wise Invariance

Singhal, Utkarsh; Esteves, Carlos; Makadia, Ameesh; Yu, Stella X.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.16672 (cs)

[Submitted on 28 Sep 2023 (v1), last revised 15 Feb 2024 (this version, v3)]

Title:Learning to Transform for Generalizable Instance-wise Invariance

Authors:Utkarsh Singhal, Carlos Esteves, Ameesh Makadia, Stella X. Yu

View PDF HTML (experimental)

Abstract:Computer vision research has long aimed to build systems that are robust to spatial transformations found in natural data. Traditionally, this is done using data augmentation or hard-coding invariances into the architecture. However, too much or too little invariance can hurt, and the correct amount is unknown a priori and dependent on the instance. Ideally, the appropriate invariance would be learned from data and inferred at test-time.
We treat invariance as a prediction problem. Given any image, we use a normalizing flow to predict a distribution over transformations and average the predictions over them. Since this distribution only depends on the instance, we can align instances before classifying them and generalize invariance across classes. The same distribution can also be used to adapt to out-of-distribution poses. This normalizing flow is trained end-to-end and can learn a much larger range of transformations than Augerino and InstaAug. When used as data augmentation, our method shows accuracy and robustness gains on CIFAR 10, CIFAR10-LT, and TinyImageNet.

Comments:	Accepted to ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2309.16672 [cs.CV]
	(or arXiv:2309.16672v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.16672

Submission history

From: Utkarsh Singhal [view email]
[v1] Thu, 28 Sep 2023 17:59:58 UTC (7,991 KB)
[v2] Mon, 15 Jan 2024 21:13:58 UTC (7,994 KB)
[v3] Thu, 15 Feb 2024 19:00:07 UTC (8,126 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Transform for Generalizable Instance-wise Invariance

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Transform for Generalizable Instance-wise Invariance

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators