scholarly journals Deep Learning Classification of Active Tuberculosis Using Chest X-Rays: Efficacy of Transfer Learning and Generalization Performance of Cross-Population Datasets

Author(s):  
James Devasia ◽  
Hridyanand Goswami ◽  
Subitha Lakshminarayanan ◽  
Manju Rajaram ◽  
Subathra Adithan ◽  
...  

Abstract Chest X-ray based diagnosis of active Tuberculosis (TB) is one of the oldest ubiquitous tests in medical practice. Artificial Intelligence (AI) based automated detection of abnormality in chest radiography is crucial in radiology workflow. Most deep convolutional neural networks (DCNN) for diagnosing TB by transfer learning from natural images and using the same dataset to evaluate the model performance and diagnostic accuracy. However, dataset shift is a known issue in predictive models in AI, which is unexplored. In this work, we fine-tuned, validated, and tested two benchmark architectures and utilized the transfer learning methodology to measure the diagnostic accuracy on cross-population datasets. We achieved remarkable calcification accuracy of 100% and area under the receiver operating characteristic (AUC) 1.000 [1.000 – 1.000] (with a sensitivity 0.985 [0.971 – 1.000] and a specificity of 0.986 [0.971 – 1.000]) on intramural test set, but significant drop in extramural test set. Accuracy on various extramural test sets varies 50% - 70%, AUC ranges 0.527 – 0.865 (sensitivity and specificity fluctuate 0.394 – 0.995 and 0.443 – 0.864 respectively). Diagnostic performance on the intramural test set observed in this study shows that DCNN can accurately classify active TB and normal chest radiographs, however the external test set shows DCNN is less likely to generalize well on models trained on specific population dataset.

2021 ◽  
Author(s):  
Priyavrat Misra ◽  
Niranjan Panigrahi

Abstract With the ongoing outbreak of the COVID-19 global pandemic, the research community still struggles to develop early and reliable prediction and detection mechanisms for this infectious disease. The commonly used RT-PCR test is not readily available in areas with limited testing facilities, and it lags in performance and timeliness. This paper proposes a deep transfer learning-based approach to predict and detect COVID-19 from digital chest radiographs. In this study, three pre-trained convolutional neural network-based models (VGG16, ResNet18, and DenseNet121) have been fine tuned to detect COVID-19 infected patients from chest X-rays (CXRs). The most efficient model is further used to identify the affected regions using an unsupervised gradient-based localization technique. The proposed system uses a classification approach (normal vs. COVID-19 vs. pneumonia vs. lung opacity) using three supervised classification algorithms followed by gradient-based localization. The training, validation and testing of the system are performed using 21165 CXR images (10192 normal, 1345 pneumonia, 3616 COVID-19, and 6012 lung opacity). Simulation and evaluation results are presented using standard performance metrics, viz, accuracy, sensitivity, and specificity.


2021 ◽  
Vol 11 ◽  
Author(s):  
Huiquan Wang ◽  
Chunli Liu ◽  
Zhe Zhao ◽  
Chao Zhang ◽  
Xin Wang ◽  
...  

ObjectiveThis study aimed to evaluate the performance of the deep convolutional neural network (DCNN) to discriminate between benign, borderline, and malignant serous ovarian tumors (SOTs) on ultrasound(US) images.Material and MethodsThis retrospective study included 279 pathology-confirmed SOTs US images from 265 patients from March 2013 to December 2016. Two- and three-class classification task based on US images were proposed to classify benign, borderline, and malignant SOTs using a DCNN. The 2-class classification task was divided into two subtasks: benign vs. borderline & malignant (task A), borderline vs. malignant (task B). Five DCNN architectures, namely VGG16, GoogLeNet, ResNet34, MobileNet, and DenseNet, were trained and model performance before and after transfer learning was tested. Model performance was analyzed using accuracy, sensitivity, specificity, and the area under the receiver operating characteristic curve (AUC).ResultsThe best overall performance was for the ResNet34 model, which also achieved the better performance after transfer learning. When classifying benign and non-benign tumors, the AUC was 0.96, the sensitivity was 0.91, and the specificity was 0.91. When predicting malignancy and borderline tumors, the AUC was 0.91, the sensitivity was 0.98, and the specificity was 0.74. The model had an overall accuracy of 0.75 for in directly classifying the three categories of benign, malignant and borderline SOTs, and a sensitivity of 0.89 for malignant, which was better than the overall diagnostic accuracy of 0.67 and sensitivity of 0.75 for malignant of the senior ultrasonographer.ConclusionDCNN model analysis of US images can provide complementary clinical diagnostic information and is thus a promising technique for effective differentiation of benign, borderline, and malignant SOTs.


2020 ◽  
pp. 1-8
Author(s):  
Amira Rachah ◽  
Olav Reksen ◽  
Nils Kristian Afseth ◽  
Valeria Tafintseva ◽  
Sabine Ferneborg ◽  
...  

Abstract The objective of the study was to evaluate the potential of Fourier transform infrared spectroscopy (FTIR) analysis of milk samples to predict body energy status and related traits (energy balance (EB), dry matter intake (DMI) and efficient energy intake (EEI)) in lactating dairy cows. The data included 2371 milk samples from 63 Norwegian Red dairy cows collected during the first 105 days in milk (DIM). To predict the body energy status traits, calibration models were developed using Partial Least Squares Regression (PLSR). Calibration models were established using split-sample (leave-one cow-out) cross-validation approach and validated using an external test set. The PLSR method was implemented using just the FTIR spectra or using the FTIR together with milk yield (MY) or concentrate intake (CONCTR) as predictors of traits. Analyses were conducted for the entire first 105 DIM and separately for the two lactation periods: 5 ≤ DIM ≤ 55 and 55 < DIM ≤ 105. To test the models, an external validation using an independent test set was performed. Predictions depending on the parity (1st, 2nd and 3rd-to 6th parities) in early lactation were also investigated. Accuracy of prediction (r) for both cross-validation and external test set was defined as the correlation between the predicted and observed values for body energy status traits. Analyzing FTIR in combination with MY by PLSR, resulted in relatively high r-values to estimate EB (r = 0.63), DMI (r = 0.83), EEI (r = 0.84) using an external validation. Only moderate correlations between FTIR spectra and traits like EB, EEI and dry matter intake (DMI) have so far been published. Our hypothesis was that improvements in the FTIR predictions of EB, EEI and DMI can be obtained by (1) stratification into different stages of lactations and different parities, or (2) by adding additional information on milking and feeding traits. Stratification of the lactation stages improved predictions compared with the analyses including all data 5 ≤ DIM ≤105. The accuracy was improved if additional data (MY or CONCTR) were included in the prediction model. Furthermore, stratification into parity groups, improved the predictions of body energy status. Our results show that FTIR spectral data combined with MY or CONCTR can be used to obtain improved estimation of body energy status compared to only using the FTIR spectra in Norwegian Red dairy cattle. The best prediction results were achieved using FTIR spectra together with MY for early lactation. The results obtained in the study suggest that the modeling approach used in this paper can be considered as a viable method for predicting an individual cow's energy status.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Young-Gon Kim ◽  
Sungchul Kim ◽  
Cristina Eunbee Cho ◽  
In Hye Song ◽  
Hee Jin Lee ◽  
...  

AbstractFast and accurate confirmation of metastasis on the frozen tissue section of intraoperative sentinel lymph node biopsy is an essential tool for critical surgical decisions. However, accurate diagnosis by pathologists is difficult within the time limitations. Training a robust and accurate deep learning model is also difficult owing to the limited number of frozen datasets with high quality labels. To overcome these issues, we validated the effectiveness of transfer learning from CAMELYON16 to improve performance of the convolutional neural network (CNN)-based classification model on our frozen dataset (N = 297) from Asan Medical Center (AMC). Among the 297 whole slide images (WSIs), 157 and 40 WSIs were used to train deep learning models with different dataset ratios at 2, 4, 8, 20, 40, and 100%. The remaining, i.e., 100 WSIs, were used to validate model performance in terms of patch- and slide-level classification. An additional 228 WSIs from Seoul National University Bundang Hospital (SNUBH) were used as an external validation. Three initial weights, i.e., scratch-based (random initialization), ImageNet-based, and CAMELYON16-based models were used to validate their effectiveness in external validation. In the patch-level classification results on the AMC dataset, CAMELYON16-based models trained with a small dataset (up to 40%, i.e., 62 WSIs) showed a significantly higher area under the curve (AUC) of 0.929 than those of the scratch- and ImageNet-based models at 0.897 and 0.919, respectively, while CAMELYON16-based and ImageNet-based models trained with 100% of the training dataset showed comparable AUCs at 0.944 and 0.943, respectively. For the external validation, CAMELYON16-based models showed higher AUCs than those of the scratch- and ImageNet-based models. Model performance for slide feasibility of the transfer learning to enhance model performance was validated in the case of frozen section datasets with limited numbers.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Ashwath Radhachandran ◽  
Anurag Garikipati ◽  
Nicole S. Zelin ◽  
Emily Pellegrini ◽  
Sina Ghandian ◽  
...  

Abstract Background Acute heart failure (AHF) is associated with significant morbidity and mortality. Effective patient risk stratification is essential to guiding hospitalization decisions and the clinical management of AHF. Clinical decision support systems can be used to improve predictions of mortality made in emergency care settings for the purpose of AHF risk stratification. In this study, several models for the prediction of seven-day mortality among AHF patients were developed by applying machine learning techniques to retrospective patient data from 236,275 total emergency department (ED) encounters, 1881 of which were considered positive for AHF and were used for model training and testing. The models used varying subsets of age, sex, vital signs, and laboratory values. Model performance was compared to the Emergency Heart Failure Mortality Risk Grade (EHMRG) model, a commonly used system for prediction of seven-day mortality in the ED with similar (or, in some cases, more extensive) inputs. Model performance was assessed in terms of area under the receiver operating characteristic curve (AUROC), sensitivity, and specificity. Results When trained and tested on a large academic dataset, the best-performing model and EHMRG demonstrated test set AUROCs of 0.84 and 0.78, respectively, for prediction of seven-day mortality. Given only measurements of respiratory rate, temperature, mean arterial pressure, and FiO2, one model produced a test set AUROC of 0.83. Neither a logistic regression comparator nor a simple decision tree outperformed EHMRG. Conclusions A model using only the measurements of four clinical variables outperforms EHMRG in the prediction of seven-day mortality in AHF. With these inputs, the model could not be replaced by logistic regression or reduced to a simple decision tree without significant performance loss. In ED settings, this minimal-input risk stratification tool may assist clinicians in making critical decisions about patient disposition by providing early and accurate insights into individual patient’s risk profiles.


Sensors ◽  
2021 ◽  
Vol 21 (13) ◽  
pp. 4520
Author(s):  
Luis Lopes Chambino ◽  
José Silvestre Silva ◽  
Alexandre Bernardino

Facial recognition is a method of identifying or authenticating the identity of people through their faces. Nowadays, facial recognition systems that use multispectral images achieve better results than those that use only visible spectral band images. In this work, a novel architecture for facial recognition that uses multiple deep convolutional neural networks and multispectral images is proposed. A domain-specific transfer-learning methodology applied to a deep neural network pre-trained in RGB images is shown to generalize well to the multispectral domain. We also propose a skin detector module for forgery detection. Several experiments were planned to assess the performance of our methods. First, we evaluate the performance of the forgery detection module using face masks and coverings of different materials. A second study was carried out with the objective of tuning the parameters of our domain-specific transfer-learning methodology, in particular which layers of the pre-trained network should be retrained to obtain good adaptation to multispectral images. A third study was conducted to evaluate the performance of support vector machines (SVM) and k-nearest neighbor classifiers using the embeddings obtained from the trained neural network. Finally, we compare the proposed method with other state-of-the-art approaches. The experimental results show performance improvements in the Tufts and CASIA NIR-VIS 2.0 multispectral databases, with a rank-1 score of 99.7% and 99.8%, respectively.


Sign in / Sign up

Export Citation Format

Share Document