Refine
Document Type
- Article (5)
Language
- English (5)
Has Fulltext
- yes (5)
Is part of the Bibliography
- no (5)
Keywords
- Artificial intelligence (5) (remove)
Institute
- Medizin (5) (remove)
Highlights
• Artificial intelligence systems for mechanically ventilated patients are increasing.
• The clinical and financial impact of these models are often unexamined.
• We developed a generic health-economic model for artificial intelligence systems.
• This model assesses the cost-effectiveness for many different scenarios.
• The developed framework is easily adjustable to other (clinical) situations.
Abstract
Purpose: The health and economic consequences of artificial intelligence (AI) systems for mechanically ventilated intensive care unit patients often remain unstudied. Early health technology assessments (HTA) can examine the potential impact of AI systems by using available data and simulations. Therefore, we developed a generic health-economic model suitable for early HTA of AI systems for mechanically ventilated patients.
Materials and methods: Our generic health-economic model simulates mechanically ventilated patients from their hospitalisation until their death. The model simulates two scenarios, care as usual and care with the AI system, and compares these scenarios to estimate their cost-effectiveness.
Results: The generic health-economic model we developed is suitable for estimating the cost-effectiveness of various AI systems. By varying input parameters and assumptions, the model can examine the cost-effectiveness of AI systems across a wide range of different clinical settings.
Conclusions: Using the proposed generic health-economic model, investors and innovators can easily assess whether implementing a certain AI system is likely to be cost-effective before an exact clinical impact is determined. The results of the early HTA can aid investors and innovators in deployment of AI systems by supporting development decisions, informing value-based pricing, clinical trial design, and selection of target patient groups.
Purpose: To determine whether machine learning assisted-texture analysis of multi-energy virtual monochromatic image (VMI) datasets from dual-energy CT (DECT) can be used to differentiate metastatic head and neck squamous cell carcinoma (HNSCC) lymph nodes from lymphoma, inflammatory, or normal lymph nodes.
Materials and methods: A retrospective evaluation of 412 cervical nodes from 5 different patient groups (50 patients in total) having undergone DECT of the neck between 2013 and 2015 was performed: (1) HNSCC with pathology proven metastatic adenopathy, (2) HNSCC with pathology proven benign nodes (controls for (1)), (3) lymphoma, (4) inflammatory, and (5) normal nodes (controls for (3) and (4)). Texture analysis was performed with TexRAD® software using two independent sets of contours to assess the impact of inter-rater variation. Two machine learning algorithms (Random Forests (RF) and Gradient Boosting Machine (GBM)) were used with independent training and testing sets and determination of accuracy, sensitivity, specificity, PPV, NPV, and AUC.
Results: In the independent testing (prediction) sets, the accuracy for distinguishing different groups of pathologic nodes or normal nodes ranged between 80 and 95%. The models generated using texture data extracted from the independent contour sets had substantial to almost perfect agreement. The accuracy, sensitivity, specificity, PPV, and NPV for correctly classifying a lymph node as malignant (i.e. metastatic HNSCC or lymphoma) versus benign were 92%, 91%, 93%, 95%, 87%, respectively.
Conclusion: Machine learning assisted-DECT texture analysis can help distinguish different nodal pathology and normal nodes with a high accuracy.
Background: To assess the potential of radiomic features to quantify components of blood in intraaortic vessels to non-invasively predict moderate-to-severe anemia in non-contrast enhanced CT scans. Methods: One hundred patients (median age, 69 years; range, 19–94 years) who received CT scans of the thoracolumbar spine and blood-testing for hemoglobin and hematocrit levels ± 24 h between 08/2018 and 11/2019 were retrospectively included. Intraaortic blood was segmented using a spherical volume of interest of 1 cm diameter with consecutive radiomic analysis applying PyRadiomics software. Feature selection was performed applying analysis of correlation and collinearity. The final feature set was obtained to differentiate moderate-to-severe anemia. Random forest machine learning was applied and predictive performance was assessed. A decision-tree was obtained to propose a cut-off value of CT Hounsfield units (HU). Results: High correlation with hemoglobin and hematocrit levels was shown for first-order radiomic features (p < 0.001 to p = 0.032). The top 3 features showed high correlation to hemoglobin values (p) and minimal collinearity (r) to the top ranked feature Median (p < 0.001), Energy (p = 0.002, r = 0.387), Minimum (p = 0.032, r = 0.437). Median (p < 0.001) and Minimum (p = 0.003) differed in moderate-to-severe anemia compared to non-anemic state. Median yielded superiority to the combination of Median and Minimum (p(AUC) = 0.015, p(precision) = 0.017, p(accuracy) = 0.612) in the predictive performance employing random forest analysis. A Median HU value ≤ 36.5 indicated moderate-to-severe anemia (accuracy = 0.90, precision = 0.80). Conclusions: First-order radiomic features correlate with hemoglobin levels and may be feasible for the prediction of moderate-to-severe anemia. High dimensional radiomic features did not aid augmenting the data in our exemplary use case of intraluminal blood component assessment.
Background: Bone age (BA) assessment performed by artificial intelligence (AI) is of growing interest due to improved accuracy, precision and time efficiency in daily routine. The aim of this study was to investigate the accuracy and efficiency of a novel AI software version for automated BA assessment in comparison to the Greulich-Pyle method.
Methods: Radiographs of 514 patients were analysed in this retrospective study. Total BA was assessed independently by three blinded radiologists applying the GP method and by the AI software. Overall and gender-specific BA assessment results, as well as reading times of both approaches, were compared, while the reference BA was defined by two blinded experienced paediatric radiologists in consensus by application of the Greulich-Pyle method.
Results: Mean absolute deviation (MAD) and root mean square deviation (RSMD) were significantly lower between AI-derived BA and reference BA (MAD 0.34 years, RSMD 0.38 years) than between reader-calculated BA and reference BA (MAD 0.79 years, RSMD 0.89 years; p < 0.001). The correlation between AI-derived BA and reference BA (r = 0.99) was significantly higher than between reader-calculated BA and reference BA (r = 0.90; p < 0.001). No statistical difference was found in reader agreement and correlation analyses regarding gender (p = 0.241). Mean reading times were reduced by 87% using the AI system.
Conclusions: A novel AI software enabled highly accurate automated BA assessment. It may improve efficiency in clinical routine by reducing reading times without compromising the accuracy compared with the Greulich-Pyle method.
Objectives: To analyze the performance of radiological assessment categories and quantitative computational analysis of apparent diffusion coefficient (ADC) maps using variant machine learning algorithms to differentiate clinically significant versus insignificant prostate cancer (PCa). Methods: Retrospectively, 73 patients were included in the study. The patients (mean age, 66.3 ± 7.6 years) were examined with multiparametric MRI (mpMRI) prior to radical prostatectomy (n = 33) or targeted biopsy (n = 40). The index lesion was annotated in MRI ADC and the equivalent histologic slides according to the highest Gleason Grade Group (GrG). Volumes of interest (VOIs) were determined for each lesion and normal-appearing peripheral zone. VOIs were processed by radiomic analysis. For the classification of lesions according to their clinical significance (GrG ≥ 3), principal component (PC) analysis, univariate analysis (UA) with consecutive support vector machines, neural networks, and random forest analysis were performed. Results: PC analysis discriminated between benign and malignant prostate tissue. PC evaluation yielded no stratification of PCa lesions according to their clinical significance, but UA revealed differences in clinical assessment categories and radiomic features. We trained three classification models with fifteen feature subsets. We identified a subset of shape features which improved the diagnostic accuracy of the clinical assessment categories (maximum increase in diagnostic accuracy ΔAUC = + 0.05, p < 0.001) while also identifying combinations of features and models which reduced overall accuracy. Conclusions: The impact of radiomic features to differentiate PCa lesions according to their clinical significance remains controversial. It depends on feature selection and the employed machine learning algorithms. It can result in improvement or reduction of diagnostic performance.