Regularizing Deep Networks With Semantic Data Augmentation
- PMID: 33476265
- DOI: 10.1109/TPAMI.2021.3052951
Regularizing Deep Networks With Semantic Data Augmentation
Abstract
Data augmentation is widely known as a simple yet surprisingly effective technique for regularizing deep networks. Conventional data augmentation schemes, e.g., flipping, translation or rotation, are low-level, data-independent and class-agnostic operations, leading to limited diversity for augmented samples. To this end, we propose a novel semantic data augmentation algorithm to complement traditional approaches. The proposed method is inspired by the intriguing property that deep networks are effective in learning linearized features, i.e., certain directions in the deep feature space correspond to meaningful semantic transformations, e.g., changing the background or view angle of an object. Based on this observation, translating training samples along many such directions in the feature space can effectively augment the dataset for more diversity. To implement this idea, we first introduce a sampling based method to obtain semantically meaningful directions efficiently. Then, an upper bound of the expected cross-entropy (CE) loss on the augmented training set is derived by assuming the number of augmented samples goes to infinity, yielding a highly efficient algorithm. In fact, we show that the proposed implicit semantic data augmentation (ISDA) algorithm amounts to minimizing a novel robust CE loss, which adds minimal extra computational cost to a normal training procedure. In addition to supervised learning, ISDA can be applied to semi-supervised learning tasks under the consistency regularization framework, where ISDA amounts to minimizing the upper bound of the expected KL-divergence between the augmented features and the original features. Although being simple, ISDA consistently improves the generalization performance of popular deep models (e.g., ResNets and DenseNets) on a variety of datasets, i.e., CIFAR-10, CIFAR-100, SVHN, ImageNet, and Cityscapes. Code for reproducing our results is available at https://github.com/blackfeather-wang/ISDA-for-Deep-Networks.
Similar articles
-
FMixCutMatch for semi-supervised deep learning.Neural Netw. 2021 Jan;133:166-176. doi: 10.1016/j.neunet.2020.10.018. Epub 2020 Nov 10. Neural Netw. 2021. PMID: 33217685
-
Fine-Grained Recognition With Learnable Semantic Data Augmentation.IEEE Trans Image Process. 2024;33:3130-3144. doi: 10.1109/TIP.2024.3364500. Epub 2024 Apr 30. IEEE Trans Image Process. 2024. PMID: 38662557
-
Brain-inspired semantic data augmentation for multi-style images.Front Neurorobot. 2024 Mar 26;18:1382406. doi: 10.3389/fnbot.2024.1382406. eCollection 2024. Front Neurorobot. 2024. PMID: 38596181 Free PMC article.
-
Data augmentation for medical imaging: A systematic literature review.Comput Biol Med. 2023 Jan;152:106391. doi: 10.1016/j.compbiomed.2022.106391. Epub 2022 Dec 9. Comput Biol Med. 2023. PMID: 36549032 Review.
-
Generative Adversarial Networks in Medical Image augmentation: A review.Comput Biol Med. 2022 May;144:105382. doi: 10.1016/j.compbiomed.2022.105382. Epub 2022 Mar 5. Comput Biol Med. 2022. PMID: 35276550 Review.
Cited by
-
Ultrasound image-based deep learning to differentiate tubal-ovarian abscess from ovarian endometriosis cyst.Front Physiol. 2023 Feb 7;14:1101810. doi: 10.3389/fphys.2023.1101810. eCollection 2023. Front Physiol. 2023. PMID: 36824470 Free PMC article.
-
A Novel Quick-Response Eigenface Analysis Scheme for Brain-Computer Interfaces.Sensors (Basel). 2022 Aug 5;22(15):5860. doi: 10.3390/s22155860. Sensors (Basel). 2022. PMID: 35957420 Free PMC article.
-
A Review of Performance Prediction Based on Machine Learning in Materials Science.Nanomaterials (Basel). 2022 Aug 26;12(17):2957. doi: 10.3390/nano12172957. Nanomaterials (Basel). 2022. PMID: 36079994 Free PMC article. Review.
-
Semi-Supervised Medical Image Segmentation Guided by Bi-Directional Constrained Dual-Task Consistency.Bioengineering (Basel). 2023 Feb 7;10(2):225. doi: 10.3390/bioengineering10020225. Bioengineering (Basel). 2023. PMID: 36829720 Free PMC article.
-
Data augmentation via warping transforms for modeling natural variability in the corneal endothelium enhances semi-supervised segmentation.PLoS One. 2024 Nov 12;19(11):e0311849. doi: 10.1371/journal.pone.0311849. eCollection 2024. PLoS One. 2024. PMID: 39531418 Free PMC article.
LinkOut - more resources
Full Text Sources
Other Literature Sources