TY - JOUR
T1 - Machine-Learning-Enabled Diagnostics with Improved Visualization of Disease Lesions in Chest X-ray Images
AU - Rahman, Md Fashiar
AU - Tseng, Tzu Liang
AU - Pokojovy, Michael
AU - McCaffrey, Peter
AU - Walser, Eric
AU - Moen, Scott
AU - Vo, Alexander
AU - Ho, Johnny C.
N1 - Publisher Copyright:
© 2024 by the authors.
PY - 2024/8
Y1 - 2024/8
N2 - The class activation map (CAM) represents the neural-network-derived region of interest, which can help clarify the mechanism of the convolutional neural network’s determination of any class of interest. In medical imaging, it can help medical practitioners diagnose diseases like COVID-19 or pneumonia by highlighting the suspicious regions in Computational Tomography (CT) or chest X-ray (CXR) film. Many contemporary deep learning techniques only focus on COVID-19 classification tasks using CXRs, while few attempt to make it explainable with a saliency map. To fill this research gap, we first propose a VGG-16-architecture-based deep learning approach in combination with image enhancement, segmentation-based region of interest (ROI) cropping, and data augmentation steps to enhance classification accuracy. Later, a multi-layer Gradient CAM (ML-Grad-CAM) algorithm is integrated to generate a class-specific saliency map for improved visualization in CXR images. We also define and calculate a Severity Assessment Index (SAI) from the saliency map to quantitatively measure infection severity. The trained model achieved an accuracy score of 96.44% for the three-class CXR classification task, i.e., COVID-19, pneumonia, and normal (healthy patients), outperforming many existing techniques in the literature. The saliency maps generated from the proposed ML-GRAD-CAM algorithm are compared with the original Gran-CAM algorithm.
AB - The class activation map (CAM) represents the neural-network-derived region of interest, which can help clarify the mechanism of the convolutional neural network’s determination of any class of interest. In medical imaging, it can help medical practitioners diagnose diseases like COVID-19 or pneumonia by highlighting the suspicious regions in Computational Tomography (CT) or chest X-ray (CXR) film. Many contemporary deep learning techniques only focus on COVID-19 classification tasks using CXRs, while few attempt to make it explainable with a saliency map. To fill this research gap, we first propose a VGG-16-architecture-based deep learning approach in combination with image enhancement, segmentation-based region of interest (ROI) cropping, and data augmentation steps to enhance classification accuracy. Later, a multi-layer Gradient CAM (ML-Grad-CAM) algorithm is integrated to generate a class-specific saliency map for improved visualization in CXR images. We also define and calculate a Severity Assessment Index (SAI) from the saliency map to quantitatively measure infection severity. The trained model achieved an accuracy score of 96.44% for the three-class CXR classification task, i.e., COVID-19, pneumonia, and normal (healthy patients), outperforming many existing techniques in the literature. The saliency maps generated from the proposed ML-GRAD-CAM algorithm are compared with the original Gran-CAM algorithm.
KW - bacterial/viral infection
KW - class activation map (CAM)
KW - convolutional neural networks (CNNs)
KW - disease diagnosis
KW - medical imaging
UR - http://www.scopus.com/inward/record.url?scp=85202608194&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85202608194&partnerID=8YFLogxK
U2 - 10.3390/diagnostics14161699
DO - 10.3390/diagnostics14161699
M3 - Article
C2 - 39202188
AN - SCOPUS:85202608194
SN - 2075-4418
VL - 14
JO - Diagnostics
JF - Diagnostics
IS - 16
M1 - 1699
ER -