Classification and evaluation strategies of auto-segmentation approaches for PET: Report of AAPM task group No. 211 - PubMed Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jun;44(6):e1-e42.
doi: 10.1002/mp.12124. Epub 2017 May 18.

Classification and evaluation strategies of auto-segmentation approaches for PET: Report of AAPM task group No. 211

Affiliations

Classification and evaluation strategies of auto-segmentation approaches for PET: Report of AAPM task group No. 211

Mathieu Hatt et al. Med Phys. 2017 Jun.

Abstract

Purpose: The purpose of this educational report is to provide an overview of the present state-of-the-art PET auto-segmentation (PET-AS) algorithms and their respective validation, with an emphasis on providing the user with help in understanding the challenges and pitfalls associated with selecting and implementing a PET-AS algorithm for a particular application.

Approach: A brief description of the different types of PET-AS algorithms is provided using a classification based on method complexity and type. The advantages and the limitations of the current PET-AS algorithms are highlighted based on current publications and existing comparison studies. A review of the available image datasets and contour evaluation metrics in terms of their applicability for establishing a standardized evaluation of PET-AS algorithms is provided. The performance requirements for the algorithms and their dependence on the application, the radiotracer used and the evaluation criteria are described and discussed. Finally, a procedure for algorithm acceptance and implementation, as well as the complementary role of manual and auto-segmentation are addressed.

Findings: A large number of PET-AS algorithms have been developed within the last 20 years. Many of the proposed algorithms are based on either fixed or adaptively selected thresholds. More recently, numerous papers have proposed the use of more advanced image analysis paradigms to perform semi-automated delineation of the PET images. However, the level of algorithm validation is variable and for most published algorithms is either insufficient or inconsistent which prevents recommending a single algorithm. This is compounded by the fact that realistic image configurations with low signal-to-noise ratios (SNR) and heterogeneous tracer distributions have rarely been used. Large variations in the evaluation methods used in the literature point to the need for a standardized evaluation protocol.

Conclusions: Available comparison studies suggest that PET-AS algorithms relying on advanced image analysis paradigms provide generally more accurate segmentation than approaches based on PET activity thresholds, particularly for realistic configurations. However, this may not be the case for simple shape lesions in situations with a narrower range of parameters, where simpler methods may also perform well. Recent algorithms which employ some type of consensus or automatic selection between several PET-AS methods have potential to overcome the limitations of the individual methods when appropriately trained. In either case, accuracy evaluation is required for each different PET scanner and scanning and image reconstruction protocol. For the simpler, less robust approaches, adaptation to scanning conditions, tumor type, and tumor location by optimization of parameters is necessary. The results from the method evaluation stage can be used to estimate the contouring uncertainty. All PET-AS contours should be critically verified by a physician. A standard test, i.e., a benchmark dedicated to evaluating both existing and future PET-AS algorithms needs to be designed, to aid clinicians in evaluating and selecting PET-AS algorithms and to establish performance limits for their acceptance for clinical use. The initial steps toward designing and building such a standard are undertaken by the task group members.

Keywords: PET segmentation; PET/CT; treatment assessment; treatment planning.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An illustration of applying the adaptive region growing (ARG) algorithm to PET: (a) plot of segmented volume growing as a function of f, the arrow indicates the location of the transition point f* for a spherical lesion in a PET/CT of a phantom; (b) the thin blue contour indicates the delineated volume V*; (c) – (d) selection of f* and the corresponding delineation for an esophageal tumor. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 2
Figure 2
A schematic representation of the algorithm proposed by De Bernardi, et al., which combines segmentation and PVE recovery within an iterative process. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 3
Figure 3
(a) PET/CT images of a patient with lung cancer in case of atelectasis (lung collapse), with manual segmentation for CT (orange), PET (green) and fused PET/CT (red). (b) The multivalued level sets (MVLS) algorithm initialized (white circle), evolved contours in steps of 10 iterations (black), and the final contour (red). (c) MVLS results shown along with manual contour on the fused PET/CT. (d) MVLS contour superimposed on CT (top) and PET (bottom). Reproduced with permission from El Naqa, et al. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 4
Figure 4
Variation in the optimal threshold value (y axis) obtained according to different settings of the PET reconstruction with varying number of iterations and subsets (from two iterations one subset to eight iterations eight subsets, colored bars), and for spheres of different volumes (x axis) and a sphere‐to‐background ratio of 3.5, for one single scanner model. Reproduced with permission from Ollers, et al. [Color figure can be viewed at wileyonlinelibrary.com]
Figure A1
Figure A1
A graphical illustration of the Jaccard and the Dice similarity coefficients, and of the sensitivity and the positive predictive value (PPV).

Similar articles

Cited by

References

    1. MacManus M, Nestle U, Rosenzweig KE, et al. Use of PET and PET/CT for radiation therapy planning: IAEA expert report 2006‐2007. Radiother Oncol. 2009;91:85–94. - PubMed
    1. Ling CC, Humm J, Larson S, et al. Towards multidimensional radiotherapy (MD‐CRT): biological imaging and biological conformality. Int J Radiat Oncol Biol Phys. 2000;47:551–560. - PubMed
    1. Huang SC. Anatomy of SUV. Standardized uptake value. Nucl Med Biol. 2000;27:643–646. - PubMed
    1. Boellaard R. Standards for PET image acquisition and quantitative data analysis. J Nucl Med. 2009;50:11S–20S. - PubMed
    1. Gambhir SS, Czernin J, Schwimmer J, Silverman DH, Coleman RE, Phelps ME. A tabulated summary of the FDG PET literature. J Nucl Med. 2001;42:1S–93S. - PubMed