1 Introduction

Coronavirus disease 2019 (COVID-19), named by the World Health Organization, refers to pneumonia caused by the 2019 novel coronavirus (2019-nCoV) [1, 2]. As of August 5, 2021, more than 200 million confirmed cases and 4.2 million confirmed deaths had been recorded worldwide. More importantly, the numbers of newly confirmed cases and deaths remain high, and SARS-CoV-2 variants that are more transmissible and possibly more virulent have appeared worldwide [3]. With global vaccine production still insufficient, finding a fast and effective COVID-19 detection method is very important.

The gold standard for diagnosing COVID-19 is reverse transcription polymerase chain reaction (RT–PCR) [4]. However, existing data show that the sensitivity of RT–PCR to COVID-19 infection is not high [5]. As a result, many infected patients are mistakenly classified as uninfected during rapid screening, which hinders COVID-19 prevention and treatment.

As a common medical imaging tool, computed tomography (CT) is a sensitive diagnostic method for COVID-19, and chest CT is an important supplement to RT–PCR in COVID-19 diagnosis [6]. Long et al. [7] reported a CT sensitivity of 97.2%, while the initial rRT-PCR sensitivity was 83.3%. In addition, CT reveals pulmonary manifestations at different stages of infection, which assists doctors in diagnosis and treatment [8].

Although chest CT can serve as an auxiliary diagnostic tool for COVID-19, screening requires a large number of experienced radiologists, which increases their workload and makes misdiagnosis due to fatigue and other causes more likely. With successful deep learning (DL) applications in image classification [9] and natural language processing [10, 11], considerable progress has been achieved in DL-based medical image processing [12]. Deep learning is widely viewed as a crucial tool in COVID-19 diagnosis [13, 14] because it provides auxiliary diagnostic information; this setting differs markedly from other computer vision tasks such as head pose estimation [15], intelligent recommendation [16, 17], robot vision [18], and infrared imaging enhancement [19].

Diagnosing COVID-19 is difficult because the shape, size, and Hounsfield unit (Hu) values of lesions on CT images vary widely, as shown in Fig. 1. Furthermore, CT images of COVID-19 patients resemble those of other diseases (such as community-acquired pneumonia and H1N1), which also complicates diagnosis, as shown in Fig. 2. In response to these problems, we propose MA-Net, a novel network with mutex attention blocks and fusion attention blocks that achieves effective feature representation for COVID-19. We also design an adaptive weight loss function for a better diagnostic effect. The contributions of this paper are as follows:

  1. Aiming at the problem that COVID-19 CT images are similar to those of other diseases, this paper designs a multi-input network with shared parameters and a mutex attention block. The mutex inputs comprise a pair of positive and negative samples. The purpose of the mutex attention block is to amplify the difference between positive and negative samples, enabling the network to obtain more distinguishable features. It significantly improves the ability of deep networks to diagnose COVID-19.

  2. The fusion attention block is designed to fuse features along the channel dimension, selecting features between the two inputs to obtain more representative features. It further strengthens the network's COVID-19 diagnostic capability.

  3. Joint training of multiple loss functions is used in this work. An adaptive weight adjustment mechanism is proposed to automatically tune the weights of the different losses. The experimental results show that the adaptive weights effectively improve the diagnostic performance of the proposed network.

Fig. 1 COVID-19 Dataset-A CT images

Fig. 2 COVID-19 Dataset-C CT images [20]

2 Related work

Due to the COVID-19 outbreak, many researchers have proposed deep learning methods to analyze COVID-19 CT or X-ray images [21,22,23]. For COVID-19 diagnosis on X-ray images, the methods proposed in [24, 25] achieved good results. For COVID-19 diagnosis on CT images [26,27,28], many methods have been proposed and validated on different COVID-19 datasets.

2.1 COVID-19 diagnosis on CT

For private datasets, Ma et al. [29] proposed a COVID-19 diagnosis network based on a multireceptive-field attention module comprising a pyramid convolution module (PCM), a spatial attention block (SAB) and a channel attention block (CAB). The PCM obtains multireceptive-field feature sets and sends them to the SAB and CAB for feature enhancement. The method was trained and verified on the DTDB dataset provided by Beijing Ditan Hospital, Capital Medical University, which includes 40 patients infected with COVID-19 in different periods and 40 people without COVID-19 infection; 97.12% accuracy, 96.89% specificity and 97.21% sensitivity were obtained. Li et al. [30] developed a 3D deep learning framework for COVID-19 diagnosis (COVID-19, community-acquired pneumonia and non-pneumonia), referred to as COVNet, which consists of a sequence of shared-parameter ResNet50 branches. In experiments on a collected dataset containing 4352 chest CT scans from 3322 patients, the per-scan sensitivity and specificity on the independent test set were 90% and 96%, respectively. Harmon et al. [31] proposed a 3D model to differentiate COVID-19 from other clinical entities. Trained on a diverse multinational cohort of 1280 patients to localize the parietal pleura/lung parenchyma and then classify COVID-19 pneumonia, it achieved up to 90.8% accuracy, with 84% sensitivity and 93% specificity, as evaluated on an independent test set (not included in training and validation) of 1337 patients.

For public datasets, Zhao et al. [20] provided a collated COVID-19 CT dataset containing 349 CT images believed to be positive for COVID-19. They also proposed a DenseNet169-based method combining contrastive self-supervised learning (CSSL) [33] and transfer learning (TL) for COVID-19 diagnosis, using CT images with a lung mask as input. On the proposed COVID-19 CT dataset, the accuracy, F1-score and AUC were 85.0%, 85.9% and 92.8%, respectively, although only a few hundred CT images were available for training. The COVID-19 CT dataset [20] provided by Zhao et al. has been widely used. Due to its limited size, Mittal et al. [32] proposed a new clustering method for COVID-19 diagnosis in which a novel variant of the gravitational search algorithm is employed to obtain optimal clusters; a comparative analysis against recent metaheuristic algorithms validated the proposed variant. Verified on the COVID-19 CT dataset [20], it obtained an accuracy of 0.6441. Because traditional methods underperform, many researchers have focused on deep learning for COVID-19 diagnosis. He et al. [34] proposed the Self-Trans method, which integrates contrastive self-supervised learning and transfer learning to learn strong, unbiased feature representations on the limited-size dataset [20] and reduce the risk of overfitting; 86% accuracy, 85% F1-score and 94% AUC were obtained. Wang et al. [35] proposed a joint learning framework that accurately diagnoses COVID-19 by effectively learning from heterogeneous datasets with distribution differences. A powerful backbone was built by redesigning the recently proposed COVID-Net [36] in terms of network architecture and learning strategy, and a contrastive training objective was applied to enhance the domain invariance of semantic embeddings and boost classification performance on each dataset. On the COVID-19 CT dataset [20], the method achieved 78.69% accuracy, 78.83% F1-score and 85.32% AUC.

Ma et al. [29] used the pyramid convolution module to address the varying sizes and shapes of COVID-19 lesions. Zhao et al. [20], Mittal et al. [32] and He et al. [34] focused on the problem of limited COVID-19 data. Li et al. [30] and Harmon et al. [31] focused on using large datasets for the diagnosis of COVID-19 and other lung diseases. These works have achieved excellent results. However, most of them either used large-scale private datasets or evaluated commonly used deep learning models directly. In this paper, we propose a COVID-19 diagnostic method based on a mutex attention block to distinguish COVID-19 from other pulmonary diseases with limited data.

2.2 Image attention mechanism

The attention mechanism was first used in natural language processing (NLP), where it achieved state-of-the-art results. Since then, attention mechanisms in computer vision have attracted increasing research interest, and many outstanding attention modules have been proposed [37, 38]. Attention mechanisms can be broadly divided into channel attention, spatial attention, mixed attention and special attention models.

SE-Net [39], proposed in 2017, is a typical channel attention model that can be embedded in any basic network. It introduced a novel attention unit, the squeeze-and-excitation module (SE module), which adaptively recalibrates channel feature responses by explicitly modeling the interdependence between channels. Experiments show that embedding the SE module into existing basic models (such as ResNet and VGG) brings significant performance improvements to the most advanced deep architectures at small computational cost. Unlike the SE module, which only recalibrates features along the channel dimension, CBAM [40] introduces attention in both the spatial and channel dimensions. Given a set of input feature maps, the CBAM module sequentially infers attention maps along two independent dimensions (channel and spatial) and then multiplies the attention maps with the input feature maps for adaptive feature refinement.
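To make the channel recalibration described above concrete, the following is a minimal PyTorch sketch of an SE-style module; the reduction ratio of 16 is the common default from the SE-Net paper, not a choice specific to this work.

```python
import torch
import torch.nn as nn

class SEModule(nn.Module):
    """Squeeze-and-excitation channel attention (sketch of the SE module [39])."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)  # squeeze: global spatial average
        self.fc = nn.Sequential(             # excitation: bottleneck MLP
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w                          # recalibrate each channel
```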

3 Datasets and methods

3.1 Datasets

We conduct experiments on different COVID-19 datasets to verify our method. All experiments are based on 2D slices. Each dataset is tested separately. The details of the four datasets are shown in Table 1.

Table 1 The number of COVID and NonCOVID CT images in the four COVID-19 datasets

The COVID-19 Dataset-A was provided by The Affiliated Hospital of Qingdao University Medical College and includes 21 CT scans with COVID-19 and 18 CT scans without COVID-19, all labeled by experienced doctors. We extracted 2499 positive-sample slices containing COVID-19 infection from the COVID-19 scans and randomly selected 1908 negative-sample slices from the 18 scans without COVID-19 as “NonCOVID”. We conducted slice-level and patient-level experiments. At the slice level, the dataset was divided into 3527 images for training and 880 for validation. At the patient level, slices from 16 COVID-19 scans and 14 non-COVID-19 scans were used for training, and the rest were used for validation.

The COVID-19 Dataset-B [41] was provided by Ma et al. and includes CT scans of 20 patients. The infection areas were marked by two radiologists and verified by an experienced radiologist. Slices containing COVID-19 infection were extracted as “COVID” (1844 in total); the remaining slices without COVID-19 were regarded as “NonCOVID” (1676 in total). The dataset was divided into 2816 images for training and 704 for validation.

The COVID-19 Dataset-C [20] was provided by Zhao et al. The practicability of the dataset was confirmed by a senior radiologist at Tongji Hospital in Wuhan, China, who diagnosed and treated a large number of patients with COVID-19 during the outbreak from January to April. The dataset provides 8-bit PNG images instead of DICOM files with Hu values, which results in a loss of resolution. In addition, although an original CT scan contains a series of slices, only a few key slices were selected for the dataset, which also negatively affects diagnosis. The COVID-19 Dataset-C is shown in Fig. 2. It contains 349 COVID-19 CT images and 397 CT images without COVID-19. The dataset was divided into 597 images for training and 148 for validation. Because the number of images was so small, we augmented the training data by flipping and rotation.

The COVID-19 Dataset-D [42] is a publicly available COVID-19 CT dataset containing 1252 CT scans positive for SARS-CoV-2 infection (COVID-19) and 1230 CT scans from patients who had other pulmonary diseases or were healthy, for 2482 CT scans in total. The dataset was divided into 1984 images for training and 498 for validation.

3.2 Methodology

The proposed network is based on ResNet50 [43]; its architecture is shown in Fig. 3. Similar to ResNet, the feature extraction layers consist of five mutex attention Res-Layers followed by a global pooling layer and a fully connected layer. The network is designed to take a pair of mutex CT images with opposite labels as input. During training, each mutex pair is randomly selected from the training data. Since the mutex branch does not affect the main forward path, the mutex input is not needed during testing.

Fig. 3 Architecture of the proposed network

The architecture of the mutex attention Res-Layer is shown in the blue dashed box in Fig. 3. As in ResNet, the Res-Layer in mutex attention Res-Layer0 is a 7 × 7 convolution layer with a stride of 2 followed by a max pooling layer with a stride of 2. The Res-Layers in mutex attention Res-Layer1 to Res-Layer4 are made up of different numbers of blocks connected in series. In mutex attention Res-Layer0, the inputs are a pair of mutex CT images; the inputs of mutex attention Res-Layer1 to Res-Layer4 are the output feature maps of the previous mutex attention Res-Layer. Assume the inputs of a mutex attention Res-Layer are Fi and Fm. Frei and Frem are first obtained through the same Res-Layer; the two Res-Layers share parameters, as shown in the blue dashed box of Fig. 3. Frei directly yields Fo. Fam is obtained by feeding Frei and Frem into the mutex attention block (MAB). Then, Fam and Frem are passed through the fusion attention block (FAB) to obtain the output mutex feature maps Fom. This process is described by (1) and (2), as follows:

$$ \begin{array}{@{}rcl@{}} \label {equ1} \left\{\begin{array}{cc} F_{r e i} = W_{r l} * F_{i} \\ F_{r e m} = W_{r l} * F_{m} \end{array}\right. \end{array} $$
(1)
$$ \label {equ2} \left\{\begin{array}{c} F_{o}=F_{r e i} \\ F_{o m}=F A B\left( M A B\left( F_{r e i}, F_{r e m}\right), F_{r e m}\right) \end{array}\right. $$
(2)

where Wrl denotes the Res-Layer parameters, FAB is the fusion attention block, and MAB is the mutex attention block. We introduce them in detail below.
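As a reading aid, the following PyTorch sketch shows how (1) and (2) compose one mutex attention Res-Layer. The submodules `res_layer`, `mab` and `fab` stand for components defined elsewhere (the MAB and FAB are sketched in the next subsections), and routing both inputs through the same `res_layer` realizes the parameter sharing.

```python
import torch.nn as nn

class MutexAttentionResLayer(nn.Module):
    """Sketch of one mutex attention Res-Layer implementing (1)-(2)."""
    def __init__(self, res_layer: nn.Module, mab: nn.Module, fab: nn.Module):
        super().__init__()
        self.res_layer, self.mab, self.fab = res_layer, mab, fab

    def forward(self, f_i, f_m):
        f_rei = self.res_layer(f_i)    # F_rei = W_rl * F_i
        f_rem = self.res_layer(f_m)    # F_rem = W_rl * F_m (shared parameters)
        f_o = f_rei                    # main path passes through unchanged, (2)
        f_am = self.mab(f_rei, f_rem)  # mutex attention, (3)-(5)
        f_om = self.fab(f_am, f_rem)   # fusion attention, (6)-(11)
        return f_o, f_om
```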

3.3 Mutex attention block (MAB)

The inputs of the MAB are the feature maps Frei and Frem (RC×H×W) produced by the Res-Layer, as shown in Fig. 4. First, the squared elementwise difference of Frei and Frem is computed to obtain the distance matrix D (RC×H×W):

$$ d_{c i j}=\left( a_{c i j}^{r e i}-a_{c i j}^{r e m}\right)^{2} $$
(3)

where dcij represents the value of D at position (i, j) on channel c, and \(a_{c i j}^{r e i}\) and \(a_{c i j}^{r e m}\) represent the values of Frei and Frem at (i, j) on channel c, with c ∈ [1, C], i ∈ [1, H], j ∈ [1, W]. Then D is reshaped to C × HW, softmax is performed along the spatial dimension (HW), and the result is reshaped back to RC×H×W to obtain the attention map AD, as shown in (4).

$$ A_{D}=\text{reshape}(\sigma(\text{reshape}(D), \dim=H W)) $$
(4)
Fig. 4 Mutex attention block architecture

where σ is the softmax function.

Elementwise multiplication is then performed between the attention map AD and the input Frem to obtain Fam, as shown in (5).

$$ F_{a m}=A_{D} \odot F_{r e m} $$
(5)

The purpose of the MAB is to reduce the similarity between the input feature maps and the mutex feature maps. Using the distance between mutex feature maps as an attention map effectively increases the discriminability of the two inputs, so the network achieves a better discrimination effect. In the experiments, we visualize the changes in the feature maps to verify the effect of the mutex attention block.
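A minimal PyTorch sketch of the MAB as defined by (3)-(5); tensor names follow the equations, and under this reading the block is parameter-free.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MutexAttentionBlock(nn.Module):
    """Sketch of the MAB: a squared-difference map, softmax-normalized
    over the spatial dimension, reweights the mutex features F_rem."""
    def forward(self, f_rei: torch.Tensor, f_rem: torch.Tensor) -> torch.Tensor:
        b, c, h, w = f_rei.shape
        d = (f_rei - f_rem) ** 2                                       # distance matrix D, (3)
        a_d = F.softmax(d.view(b, c, h * w), dim=-1).view(b, c, h, w)  # attention map A_D, (4)
        return a_d * f_rem                                             # F_am = A_D ⊙ F_rem, (5)
```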

3.4 Fusion attention block (FAB)

The structure of the FAB is shown in Fig. 5. Its inputs are Fam, obtained from the MAB, and Frem. First, elementwise summation of the two inputs yields the mixed feature maps Fmix.

$$ F_{\text{mix}}=F_{a m}+F_{r e m} $$
(6)
Fig. 5 Fusion attention block architecture

Then, global average pooling and global max pooling are performed on the mixed feature maps, and elementwise addition yields the channel feature vector V (RC):

$$ \left\{\begin{array}{c} V_{c 1}=\frac{1}{H \times W} \cdot {\sum}_{i=1}^{H} {\sum}_{j=1}^{W} F_{\text{mix}}^{c}(i, j) \\ V_{c 2}=\max \left( F_{\text{mix}}^{c}(i, j)\right) \\ V_{c}=V_{c 1}+V_{c 2} \end{array}\right. $$
(7)

where Vc is the value on the c-th channel of the channel feature vector V. Two fully connected layers then perform feature fitting and channel dimension reduction to obtain the channel feature vector Z:

$$ Z=f_{C^{\prime}}^{B R}\left( f_{C}^{B R}(V)\right) $$
(8)

where C in \(f_{C}^{B R}\) represents the number of neurons in the fully connected layer, and BR in \(f_{C}^{B R}\) represents the batch normalization [44] and ReLU [45] layers. The obtained Z is passed through two independent fully connected layers to obtain two weight vectors M and N:

$$ \left\{\begin{array}{l} M=f_{C}(Z) \\ N=f_{C}^{\prime}(Z) \end{array}\right. $$
(9)

where C is the number of neurons in the fully connected layers, and both M and N belong to RC. Softmax is applied to the corresponding channels of M and N to obtain the attention weights A of Fam for each channel; the attention weights of Frem are 1 − A. Channelwise multiplication and addition then produce the output fusion feature maps Ffm, as follows:

$$ a_{c}=\frac{e^{M_{c}}}{e^{M_{c}}+e^{{N_{c}}}} $$
(10)
$$ F_{f m}^{c}= a_{c} \times F_{a m}^{c}+(1-a_{c}) \times F_{r e m}^{c} $$
(11)

where ac is the value of the channel weight A on the c-th channel, and \(F_{f m}^{c}\) is the feature map on the c-th channel of Ffm, c ∈ [1, C].

The function of the FAB is to fuse the two sets of feature maps: through learned parameters, the two sets of input feature maps are weighted along the channel dimension.
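The following PyTorch sketch assembles (6)-(11). The bottleneck width C′ (here `channels // reduction`, with `reduction` = 4) is our assumption, since the paper does not state it.

```python
import torch
import torch.nn as nn

class FusionAttentionBlock(nn.Module):
    """Sketch of the FAB, (6)-(11)."""
    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        mid = channels // reduction  # C', an assumed bottleneck width
        self.fc_z = nn.Sequential(   # (8): two FC layers, each with BN + ReLU
            nn.Linear(channels, channels), nn.BatchNorm1d(channels), nn.ReLU(inplace=True),
            nn.Linear(channels, mid), nn.BatchNorm1d(mid), nn.ReLU(inplace=True),
        )
        self.fc_m = nn.Linear(mid, channels)  # (9): weight vector M
        self.fc_n = nn.Linear(mid, channels)  # (9): weight vector N

    def forward(self, f_am: torch.Tensor, f_rem: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = f_am.shape
        f_mix = f_am + f_rem                                 # (6)
        v = f_mix.mean(dim=(2, 3)) + f_mix.amax(dim=(2, 3))  # (7): GAP + GMP
        z = self.fc_z(v)
        m, n = self.fc_m(z), self.fc_n(z)
        a = torch.softmax(torch.stack([m, n]), dim=0)[0]     # (10): a_c per channel
        a = a.view(b, c, 1, 1)
        return a * f_am + (1 - a) * f_rem                    # (11)
```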

3.5 Loss functions

As shown in Fig. 3, our network contains two losses: LCE, the classification loss of the input image and the mutex input image, and LCS, a cosine similarity loss between the pair of inputs.

$$ \begin{array}{@{}rcl@{}} \left\{\begin{array}{c} L_{1}=-y_{i} \log y_{i}^{\prime}-\left( 1-y_{i}\right) \log \left( 1-y_{i}^{\prime}\right) \\ L_{2}=-y_{m} \log y_{m}^{\prime}-\left( 1-y_{m}\right) \log \left( 1-y_{m}^{\prime}\right) \end{array}\right. \end{array} $$
(12)
$$ L_{CE}=L_{1} + L_{2} $$
(13)
$$ L_{CS}=\cos (\theta)= \frac{{\sum}_{i=1}^{C \times H \times W}\left( {V_{i}^{i}} \times {V_{i}^{m}}\right)}{\sqrt{{\sum}_{i=1}^{C \times H \times W}\left( {V_{i}^{i}}\right)^{2}} \times \sqrt{{\sum}_{i=1}^{C \times H \times W}\left( {V_{i}^{m}}\right)^{2}}} $$
(14)

where L1 and L2 are cross-entropy losses, yi and ym are the opposite labels of the input and mutex input, and \(y_{i}^{\prime }\), \(y_{m}^{\prime }\) are the predicted values. LCS uses the cosine distance to minimize the similarity between the two feature maps. In (14), \({V_{i}^{i}}\) and \({V_{i}^{m}}\) (RCHW) are the vectors obtained by reshaping the feature maps of the input and the mutex input.

In this paper, the adaptive weight loss is defined by (15) and (16).

$$ \begin{array}{@{}rcl@{}} \left\{\begin{array}{l} a_{1}=\frac{e^{\frac{1}{L_{C E}}}}{e^{\frac{1}{L_{C E}}}+e^{\frac{1}{L_{C S}}}} \\ a_{2}=\frac{e^{\frac{1}{L_{C S}}}}{e^{\frac{1}{L_{C E}}}+e^{\frac{1}{L_{C S}}}} \end{array}\right. \end{array} $$
(15)
$$ L=a_{1} L_{CE}+a_{2} L_{CS} $$
(16)

In this way, the losses can be adjusted adaptively to make the final loss more balanced.
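A sketch of the joint objective (12)-(16) in PyTorch. Using logits with `binary_cross_entropy_with_logits` and detaching the loss values when computing the weights are our assumptions for numerical stability; (15) also assumes both loss values are positive.

```python
import torch
import torch.nn.functional as F

def adaptive_weight_loss(logit_i, logit_m, y_i, y_m, feat_i, feat_m):
    """Joint loss of (12)-(16); feat_i/feat_m are the feature maps of the
    input and the mutex input, flattened per sample for (14)."""
    l_ce = F.binary_cross_entropy_with_logits(logit_i, y_i) \
         + F.binary_cross_entropy_with_logits(logit_m, y_m)                 # (12)-(13)
    l_cs = F.cosine_similarity(feat_i.flatten(1), feat_m.flatten(1)).mean()  # (14)
    # (15): softmax over the reciprocals of the current loss values
    w = torch.softmax(torch.stack([1.0 / l_ce.detach(), 1.0 / l_cs.detach()]), dim=0)
    return w[0] * l_ce + w[1] * l_cs                                         # (16)
```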

3.6 Inference process

In the inference process, since the mutex path has no effect on the main path, only the test images need to be input; the mutex input is not required. The prediction network is then a plain ResNet50, identical to the backbone. Therefore, the time complexity of the proposed method equals that of ResNet50 and is lower than that of SE-Net and CBAM built on ResNet50. The model size, FLOPs and speed are likewise the same as the backbone, so the proposed method adds no cost at test time.

4 Experiment

We conducted experiments on four datasets: COVID-19 dataset-A, COVID-19 dataset-B, COVID-19 dataset-C and COVID-19 dataset-D. For COVID-19 dataset-A and COVID-19 dataset-B, to verify the effectiveness of the proposed attention, we compared against state-of-the-art attention methods for image classification and the backbone. For the public COVID-19 dataset-C and COVID-19 dataset-D, we compared against excellent algorithms proposed by other researchers that achieved good results on those datasets. We present both quantitative and qualitative analyses; for the qualitative analysis, we used Grad-CAM [47] for visualization.

4.1 Experimental details

The proposed method was implemented in PyTorch [46]. All COVID-19 CT images in our experiments were resized to 224 × 224. Due to the small amount of data in COVID-19 dataset-C, we performed data augmentation including horizontal flipping, vertical flipping and random rotation. Both inputs and mutex inputs were randomly selected during training. We used SGD [47] as the optimizer with an initial learning rate of 0.01. The learning rate decays with the epoch according to:

$$ l r_{d}=\text{lr} \times \left( 0.5^{\frac{e p o c h}{30}}\right) $$
(17)

where epoch is the epoch index. Momentum was set to 0.9, the batch size was 16, and 120 epochs were run. An NVIDIA GeForce GTX 1070 GPU with 8 GB of memory was used. In the test process, only the test images needed to be input; since nothing in the forward pass depends on the mutex input at test time, it was not required.
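The schedule in (17) maps directly onto a `LambdaLR` multiplier in PyTorch; a minimal sketch follows, with a stand-in module in place of MA-Net.

```python
import torch.nn as nn
from torch.optim import SGD
from torch.optim.lr_scheduler import LambdaLR

model = nn.Linear(1, 1)  # stand-in for the MA-Net instance
optimizer = SGD(model.parameters(), lr=0.01, momentum=0.9)
# (17): lr_d = lr * 0.5**(epoch / 30), applied once per epoch
scheduler = LambdaLR(optimizer, lr_lambda=lambda epoch: 0.5 ** (epoch / 30))

for epoch in range(120):
    # ... one training pass over the data loader ...
    scheduler.step()  # updates the rate used in the next epoch
```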

Training took 3.1, 2.5 and 2.1 hours on COVID-19 dataset-A, COVID-19 dataset-B and COVID-19 dataset-C, respectively. Testing took 0.032 seconds per image.

4.2 Evaluation

The metrics employed to quantitatively evaluate classification were accuracy, sensitivity, F1-score, AUC and specificity. The equation of accuracy is as follows:

$$ \text { Accuracy }=\frac{\text {TN}+\text {TP}}{\text {N}+\text {P}} $$
(18)

The sensitivity and specificity measure the classifier’s ability to identify positive samples and negative samples, respectively, as shown in (19) and (20):

$$ \text {Sensitivity} =\frac{\text {TP}}{\text {FN}+\text {TP}} $$
(19)
$$ \text {Specificity} =\frac{\text {TN}}{\text {FP}+\text {TN}} $$
(20)

The F1-score is a common measurement for classification problems; in machine learning competitions involving classification, it is often used as the final evaluation metric. It is the harmonic mean of precision and recall, with a maximum of 1 and a minimum of 0.

$$ \text {Precision}=\frac{\text {TP}}{\text {TP}+\text {FP}} $$
(21)
$$ \text {Recall}=\frac{\text {TP}}{\text {TP}+\text {FN}} $$
(22)
$$ F 1=2 \times \frac{\text {Precision } \times \text { Recall}}{\text {Precision }+\text { Recall}} $$
(23)
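For reference, a small NumPy sketch computing the metrics of (18)-(23) from binary labels and predictions (1 = COVID); edge cases such as an empty class are not handled.

```python
import numpy as np

def classification_metrics(y_true: np.ndarray, y_pred: np.ndarray) -> dict:
    """Metrics of (18)-(23) from binary labels/predictions (1 = COVID)."""
    tp = int(((y_pred == 1) & (y_true == 1)).sum())
    tn = int(((y_pred == 0) & (y_true == 0)).sum())
    fp = int(((y_pred == 1) & (y_true == 0)).sum())
    fn = int(((y_pred == 0) & (y_true == 1)).sum())
    precision = tp / (tp + fp)                          # (21)
    recall = tp / (tp + fn)                             # (22), same as sensitivity
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),    # (18)
        "sensitivity": tp / (tp + fn),                  # (19)
        "specificity": tn / (tn + fp),                  # (20)
        "f1": 2 * precision * recall / (precision + recall),  # (23)
    }
```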

5 Results

5.1 COVID-19 dataset-A

COVID-19 Dataset-A was provided by The Affiliated Hospital of Qingdao University Medical College; its details were introduced in Section 3.1. We used state-of-the-art attention methods in image classification and the backbone for comparison, including ResNet [43], ResNet+CBAM [40] and ResNet+SE [39]. The attention module proposed by Ma et al. [29] for COVID-19 diagnosis was also compared. We conducted slice-level and patient-level experiments to verify the effectiveness of our method. Tables 2 and 3 summarize the experimental results.

Table 2 Slice-level results of different methods on COVID-19 Dataset-A
Table 3 Patient-level results of different methods on COVID-19 Dataset-A

For the slice-level experiment, Table 2 shows that compared with the basic ResNet, our method improved accuracy, sensitivity and specificity by 3.07%, 2.41% and 3.96%, respectively. Compared with SE-Net, our method improved accuracy, sensitivity and specificity by 2.16%, 0.6% and 4.20%, respectively. Over ResNet + CBAM, improvements of 2.67%, 2.81% and 2.63% were obtained. Compared with Ma et al. [29], the proposed attention blocks obtained more satisfactory results in terms of accuracy, sensitivity and specificity. These results illustrate that, compared with widely used classification networks and attention modules, our method achieved better accuracy and the fewest false positives. Fig. 6 shows the ROC curves (left) and confusion matrices (right) of the different methods.

Fig. 6 The ROC curve (left) and confusion matrix (right) of different methods on COVID-19 dataset-A at the patient level

As seen in Table 3, first, the proposed method obtained better results at the patient-level compared to state-of-the-art methods, which is similar to the slice-level experiment. Second, there was little difference between patient-level and slice-level experimental results, which verifies the effectiveness of the proposed method.

In particular, we found that SE-Net tended to obtain higher sensitivity, while ResNet + CBAM tended to obtain higher specificity. SE-Net's probability of missed detection was therefore lower than that of ResNet + CBAM, but ResNet + CBAM produced fewer false positives. Statistical analysis showed that all p values were less than 0.001, confirming that our method differed significantly from the other classification methods.

5.2 COVID-19 dataset-B

COVID-19 dataset-B [41] was provided by Ma et al. Similar to COVID-19 dataset-A, we also used the state-of-the-art attention modules in image classification and backbone for comparison, including ResNet [43], ResNet+CBAM [40] and ResNet+SE [39]. Table 4 summarizes the experimental results.

Table 4 Experimental results of different methods on COVID-19 dataset-B

Similar to the results obtained on COVID-19 dataset-A, the proposed method achieved better accuracy, sensitivity and specificity. Compared with ResNet, ResNet + CBAM and SE-Net, the proposed method improved accuracy by 6.98%, 4.26% and 8.81%, sensitivity by 2.59%, 7.04% and 1.90%, and specificity by 12.33%, 1.20% and 16.42%, respectively. Fig. 7 shows the ROC curves (left) and confusion matrices (right) of the different methods.

Fig. 7 The ROC curve (left) and confusion matrix (right) of different methods on COVID-19 dataset-B

5.3 COVID-19 dataset-C

COVID-19 dataset-C [20] was provided by Zhao et al. It is a public dataset used for COVID-19 diagnosis, and its characteristics were explained in detail in Section 3.1. Due to its small amount of data, we compared against papers proposing different solutions; the methods of [20], [32], [34] and [35] were selected for their excellent performance. Table 5 summarizes the experimental results.

Table 5 Experimental results of different methods on COVID-19 dataset-C

Table 5 reports that compared with the method proposed in [20], our method achieves better accuracy, F1-score and AUC without the assistance of a lung mask. Compared with the method proposed in [34], our method uses random initialization and achieves better results without pretraining on other datasets; both the F1-score and AUC were significantly improved. These results confirm that our method can achieve satisfactory results without other complex techniques even when the dataset is small.

5.4 COVID-19 dataset-D

COVID-19 dataset-D [42] was provided by Soares et al. It is a public dataset used for COVID-19 diagnosis, and its characteristics were explained in detail in Section 3.1. Some researchers have achieved excellent results on this dataset; we focused on comparing with papers that propose new methods, choosing Wang et al. [35], [29] and [48] for comparison. We also used state-of-the-art attention modules in image classification for comparison. Table 6 summarizes the experimental results.

Table 6 Experimental results of different methods on COVID-19 dataset-D

The experimental results show that the proposed network achieved better results than the other methods and attention models. Compared with state-of-the-art methods, our method achieves better accuracy, F1-score and AUC, illustrating its effectiveness for COVID-19 discrimination.

5.5 Ablation experiments

To verify the effectiveness of our proposed loss and attention blocks, we conducted two ablation experiments. For the loss function, we verified the influence of LCS on the diagnosis results. The results are shown in Table 7.

Table 7 Diagnosis results using different loss functions (based on COVID-19 dataset-B)

The experimental results show that the cosine similarity loss LCS improves the diagnostic results; in particular, combined with the adaptive weighting method, it significantly improves accuracy.

We also performed ablation experiments on the influence of different attention blocks. Table 8 shows that MAB significantly improved the diagnostic effect of the network on COVID-19, and FAB further improved the diagnostic effect of the network on this basis. This result was consistent with the visualized result in Fig. 9.

Table 8 Diagnosis results using different attention blocks (based on COVID-19 dataset-B)

6 Discussion

In this work, we proposed a new Res-Layer structure for COVID-19 diagnosis on CT images named mutex attention Res-Layer. It is composed of MAB, FAB and a Res-Layer and can extract more distinguishable features to obtain better COVID-19 diagnosis results with the proposed adaptive weight loss. Experiments on four different COVID-19 datasets verified that our method can achieve better results than other state-of-the-art methods. This reflects the effectiveness and robustness of the proposed method.

For the more regular CT image datasets, COVID-19 dataset-A and COVID-19 dataset-B [41], the proposed network obtains 98.18% and 95.88% accuracy, a significant improvement over other state-of-the-art attention models. Furthermore, the proposed method achieves higher sensitivity and specificity, which means that it better avoids false negatives (FNs) and false positives (FPs) than other methods. As shown in Table 2, ResNet+CBAM [40] tends to obtain better specificity; in other words, it produces many FNs, which is not conducive to COVID-19 diagnosis. Although SE-Net [39] obtains very good sensitivity, only 0.6% lower than our method, it produces a large number of FPs, and its specificity is 4.2% lower than ours. Both experiments demonstrate the superiority of the proposed method.

For the noisy COVID-19 dataset-C [20], the proposed method also obtains satisfactory results. Compared with the methods proposed in [34] and [20], our method obtains higher accuracy, F1-score and AUC with fewer labels than [34] and without the transfer learning used in [20]. This shows that our method can still obtain satisfactory results even when the data are scarce and noisy, which fully demonstrates its robustness. On the other public dataset, COVID-19 dataset-D [42], more satisfactory results are obtained than with other state-of-the-art algorithms.

This article proposes two new attention blocks, MAB and FAB, and the experimental results indicate their effectiveness. In Table 8, compared with the basic ResNet, MAB significantly improves accuracy and specificity, by 4.36% and 10.24%, respectively. MAB amplifies the feature differences between categories and tends to produce fewer FPs. On this basis, FAB performs feature fusion to obtain more representative features, improving accuracy by 1.62% and sensitivity by 3.22% while leaving specificity nearly unchanged.

To confirm the experimental results, we visualized the feature maps of different categories extracted by the network together with their COVID-19 images in Fig. 8, using Grad-CAM [47] on the feature maps output by mutex attention Res-Layer4.

Fig. 8 Visualization results of the proposed network using Grad-CAM. The first row shows the input CT images; the second row shows the visualization results

The visualization results show that our method focuses on the lesion area accurately. For example, the lesion area in Column 3 is very small, while the lesion in Column 4 is diffuse across both lungs; the feature map visualizations of these two columns show that the proposed network is robust to lesions of different sizes. The network also localizes different types of lesions satisfactorily: the first column shows mixed ground-glass opacity, the third column ground-glass opacity, and the second column consolidation, and the high-response areas of the visualizations approach the lesion areas in each case. These results confirm that our method extracts representative features for effective COVID-19 diagnosis, which proves its effectiveness.

We also visualized the influence of the different attention blocks on the feature maps, again using the feature maps output by mutex attention Res-Layer4. Since their size is [2048, 7, 7], we averaged over the channels to obtain a 7 × 7 map and upsampled it to 224 × 224 using bilinear interpolation, as shown in Fig. 9. The figure shows that MAB focuses the features on the most significant lesion region but cannot completely cover some subsidiary lesion areas. For example, in the second row, the original feature map has a diffuse appearance over both lungs, but its coverage far exceeds the lesion area. As shown in the third column, MAB focuses the features mostly on the right lung, the most distinctive area; the highest-response region shrinks to the lesion area, but the lesion in the left lung is ignored. The FAB module provides an explainable remedy for this problem: it fuses the original feature maps with the MAB feature maps, which can also be described as feature selection along the channel dimension. In the same example, the visualization in the fourth column of the second row shows that FAB attends to the left lung as well as the right, enhancing the network's diagnostic effectiveness. Furthermore, the proposed method may help to precisely localize and segment lesions.
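The channel-averaging and upsampling step described above can be sketched in a few lines of PyTorch; the function name and signature are ours, introduced for illustration.

```python
import torch
import torch.nn.functional as F

def feature_heatmap(feat: torch.Tensor, size: int = 224) -> torch.Tensor:
    """Average a [2048, 7, 7] feature map over channels, then
    bilinearly upsample to size x size for visualization."""
    heat = feat.mean(dim=0, keepdim=True).unsqueeze(0)  # -> [1, 1, 7, 7]
    heat = F.interpolate(heat, size=(size, size),
                         mode="bilinear", align_corners=False)
    return heat.squeeze()                               # [size, size] map
```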

Fig. 9 Visualization results of feature maps after using different attention blocks in mutex attention Res-Layer4. The first column shows the input CT images; the second, third and fourth columns show the visualizations of features after the Res-Layer, the MAB and the FAB, respectively

7 Conclusion

This paper proposed the COVID-19 diagnosis network MA-Net, which takes a pair of mutex CT images as input. In particular, this paper proposed a mutex attention block that aims to distinguish the features of mutex input pairs, allowing the network to extract distinguishable features and improve its diagnostic effect. A fusion attention block was then designed to perform feature fusion and further improve diagnostic accuracy. Regarding the loss function, the proposed network includes three losses: the cross-entropy classification losses of the two mutex inputs and the cosine similarity loss of the mutex input pairs. Adaptive weights are used to adjust the weights of these losses. Our method is very robust and achieved satisfactory results on multiple datasets. The accuracy, specificity, sensitivity and AUC reached 98.17%, 97.25%, 98.79% and 99.84% on our own COVID-19 dataset-A, and 94.88%, 95.39%, 94.33% and 99.00% on the public COVID-19 dataset-B. The diagnostic results of our method surpass those of state-of-the-art attention modules for classification. On the public COVID-19 dataset-C and COVID-19 dataset-D, we achieved better results than other excellent COVID-19 diagnosis methods. The experiments and analysis indicate that the proposed network can provide auxiliary quantitative analysis in COVID-19 diagnosis.