SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation

Tan, Shengbo; Zhang, Zeyu; Cai, Ying; Ergu, Daji; Wu, Lin; Hu, Binbin; Yu, Pengzhang; Zhao, Yang


arXiv:2408.00496 (cs)

[Submitted on 1 Aug 2024]



Abstract: Medical imaging segmentation plays a significant role in the automatic recognition and analysis of lesions. State-of-the-art methods, particularly those utilizing transformers, have been widely adopted in 3D semantic segmentation due to their superior scalability and generalizability. However, plain vision transformers neglect local features and incur high computational complexity. To address these challenges, we make three key contributions. First, we propose SegStitch, an architecture that integrates transformers with denoising ODE blocks. Instead of taking whole 3D volumes as inputs, we adapt axial patches and customize patch-wise queries to ensure semantic consistency. Second, we conduct extensive experiments on the BTCV and ACDC datasets, achieving improvements of up to 11.48% and 6.71%, respectively, in mDSC compared to state-of-the-art methods. Finally, our method is efficient, reducing the number of parameters by 36.7% and the number of FLOPs by 10.7% compared to UNETR. These results suggest promising potential for adapting our method to real-world clinical practice. The code will be available at this https URL
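
The abstract reports its results as mDSC, the mean Dice similarity coefficient. As a rough illustration of how such a score is commonly computed, the sketch below averages per-class Dice over a set of labels. It is not the authors' evaluation code: the function names `dice` and `mean_dice`, the toy label maps, and the convention of scoring a class absent from both maps as 1.0 are all assumptions made for this example.

```python
from typing import Sequence


def dice(pred: Sequence[int], target: Sequence[int], label: int) -> float:
    """Dice similarity coefficient for one class: 2|P & T| / (|P| + |T|)."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of voxels")
    p = sum(1 for v in pred if v == label)      # voxels predicted as `label`
    t = sum(1 for v in target if v == label)    # voxels annotated as `label`
    inter = sum(1 for a, b in zip(pred, target) if a == label and b == label)
    if p + t == 0:
        # Assumed convention: a class missing from both maps counts as perfect.
        return 1.0
    return 2.0 * inter / (p + t)


def mean_dice(pred: Sequence[int], target: Sequence[int],
              labels: Sequence[int]) -> float:
    """Unweighted mean of the per-class Dice scores over `labels`."""
    return sum(dice(pred, target, c) for c in labels) / len(labels)


# Toy flattened label maps (0 = background; 1 and 2 are hypothetical organs).
pred = [0, 1, 1, 2, 2, 0]
target = [0, 1, 0, 2, 2, 2]
print(mean_dice(pred, target, labels=[1, 2]))
```

Real 3D volumes would be flattened to one label per voxel before being passed in. Benchmarks also differ in whether background is included and in how empty classes are handled, so scores computed this way are only comparable under the same conventions.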

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2408.00496 [cs.CV]
	(or arXiv:2408.00496v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.00496

Submission history

From: Zeyu Zhang
[v1] Thu, 1 Aug 2024 12:05:02 UTC (15,782 KB)
