Differentiable Multi-Granularity Human Parsing
- PMID: 37022259
- DOI: 10.1109/TPAMI.2023.3239194
Differentiable Multi-Granularity Human Parsing
Abstract
In this work, we study the challenging problem of instance-aware human body part parsing. We introduce a new bottom-up regime which achieves the task through learning category-level human semantic segmentation as well as multi-person pose estimation in a joint and end-to-end manner. The output is a compact, efficient and powerful framework that exploits structural information over different human granularities and eases the difficulty of person partitioning. Specifically, a dense-to-sparse projection field, which allows explicitly associating dense human semantics with sparse keypoints, is learnt and progressively improved over the network feature pyramid for robustness. Then, the difficult pixel grouping problem is cast as an easier, multi-person joint assembling task. By formulating joint association as maximum-weight bipartite matching, we develop two novel algorithms based on projected gradient descent and unbalanced optimal transport, respectively, to solve the matching problem differentiablly. These algorithms make our method end-to-end trainable and allow back-propagating the grouping error to directly supervise multi-granularity human representation learning. This is significantly distinguished from current bottom-up human parsers or pose estimators which require sophisticated post-processing or heuristic greedy algorithms. Extensive experiments on three instance-aware human parsing datasets (i.e., MHP-v2, DensePose-COCO, PASCAL-Person-Part) demonstrate that our approach outperforms most existing human parsers with much more efficient inference. Our code is available at https://github.com/tfzhou/MG-HumanParsing.
Similar articles
-
Self-Correction for Human Parsing.IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3260-3271. doi: 10.1109/TPAMI.2020.3048039. Epub 2022 May 5. IEEE Trans Pattern Anal Mach Intell. 2022. PMID: 33373297
-
On the Correlation Among Edge, Pose and Parsing.IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):8492-8507. doi: 10.1109/TPAMI.2021.3108771. Epub 2022 Oct 4. IEEE Trans Pattern Anal Mach Intell. 2022. PMID: 34469290
-
Hierarchical Human Semantic Parsing With Comprehensive Part-Relation Modeling.IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3508-3522. doi: 10.1109/TPAMI.2021.3055780. Epub 2022 Jun 3. IEEE Trans Pattern Anal Mach Intell. 2022. PMID: 33513100
-
Pose-Guided Hierarchical Semantic Decomposition and Composition for Human Parsing.IEEE Trans Cybern. 2023 Mar;53(3):1641-1652. doi: 10.1109/TCYB.2021.3107544. Epub 2023 Feb 15. IEEE Trans Cybern. 2023. PMID: 34506295
-
Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer.IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2504-2518. doi: 10.1109/TPAMI.2020.3043268. Epub 2022 Apr 1. IEEE Trans Pattern Anal Mach Intell. 2022. PMID: 33290211
Cited by
-
Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework.J Imaging. 2023 Jun 26;9(7):130. doi: 10.3390/jimaging9070130. J Imaging. 2023. PMID: 37504807 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous