Refine
Document Type
- Article (8)
Language
- English (8)
Has Fulltext
- yes (8) (remove)
Is part of the Bibliography
- no (8)
Keywords
- Artificial intelligence (8) (remove)
Institute
Highlights
• Artificial intelligence systems for mechanically ventilated patients are increasing.
• The clinical and financial impact of these models are often unexamined.
• We developed a generic health-economic model for artificial intelligence systems.
• This model assesses the cost-effectiveness for many different scenarios.
• The developed framework is easily adjustable to other (clinical) situations.
Abstract
Purpose: The health and economic consequences of artificial intelligence (AI) systems for mechanically ventilated intensive care unit patients often remain unstudied. Early health technology assessments (HTA) can examine the potential impact of AI systems by using available data and simulations. Therefore, we developed a generic health-economic model suitable for early HTA of AI systems for mechanically ventilated patients.
Materials and methods: Our generic health-economic model simulates mechanically ventilated patients from their hospitalisation until their death. The model simulates two scenarios, care as usual and care with the AI system, and compares these scenarios to estimate their cost-effectiveness.
Results: The generic health-economic model we developed is suitable for estimating the cost-effectiveness of various AI systems. By varying input parameters and assumptions, the model can examine the cost-effectiveness of AI systems across a wide range of different clinical settings.
Conclusions: Using the proposed generic health-economic model, investors and innovators can easily assess whether implementing a certain AI system is likely to be cost-effective before an exact clinical impact is determined. The results of the early HTA can aid investors and innovators in deployment of AI systems by supporting development decisions, informing value-based pricing, clinical trial design, and selection of target patient groups.
Contemporary information systems make widespread use of artificial intelligence (AI). While AI offers various benefits, it can also be subject to systematic errors, whereby people from certain groups (defined by gender, age, or other sensitive attributes) experience disparate outcomes. In many AI applications, disparate outcomes confront businesses and organizations with legal and reputational risks. To address these, technologies for so-called “AI fairness” have been developed, by which AI is adapted such that mathematical constraints for fairness are fulfilled. However, the financial costs of AI fairness are unclear. Therefore, the authors develop AI fairness for a real-world use case from e-commerce, where coupons are allocated according to clickstream sessions. In their setting, the authors find that AI fairness successfully manages to adhere to fairness requirements, while reducing the overall prediction performance only slightly. However, they find that AI fairness also results in an increase in financial cost. Thus, in this way the paper’s findings contribute to designing information systems on the basis of AI fairness.
Purpose: To determine whether machine learning assisted-texture analysis of multi-energy virtual monochromatic image (VMI) datasets from dual-energy CT (DECT) can be used to differentiate metastatic head and neck squamous cell carcinoma (HNSCC) lymph nodes from lymphoma, inflammatory, or normal lymph nodes.
Materials and methods: A retrospective evaluation of 412 cervical nodes from 5 different patient groups (50 patients in total) having undergone DECT of the neck between 2013 and 2015 was performed: (1) HNSCC with pathology proven metastatic adenopathy, (2) HNSCC with pathology proven benign nodes (controls for (1)), (3) lymphoma, (4) inflammatory, and (5) normal nodes (controls for (3) and (4)). Texture analysis was performed with TexRAD® software using two independent sets of contours to assess the impact of inter-rater variation. Two machine learning algorithms (Random Forests (RF) and Gradient Boosting Machine (GBM)) were used with independent training and testing sets and determination of accuracy, sensitivity, specificity, PPV, NPV, and AUC.
Results: In the independent testing (prediction) sets, the accuracy for distinguishing different groups of pathologic nodes or normal nodes ranged between 80 and 95%. The models generated using texture data extracted from the independent contour sets had substantial to almost perfect agreement. The accuracy, sensitivity, specificity, PPV, and NPV for correctly classifying a lymph node as malignant (i.e. metastatic HNSCC or lymphoma) versus benign were 92%, 91%, 93%, 95%, 87%, respectively.
Conclusion: Machine learning assisted-DECT texture analysis can help distinguish different nodal pathology and normal nodes with a high accuracy.
Background: To assess the potential of radiomic features to quantify components of blood in intraaortic vessels to non-invasively predict moderate-to-severe anemia in non-contrast enhanced CT scans. Methods: One hundred patients (median age, 69 years; range, 19–94 years) who received CT scans of the thoracolumbar spine and blood-testing for hemoglobin and hematocrit levels ± 24 h between 08/2018 and 11/2019 were retrospectively included. Intraaortic blood was segmented using a spherical volume of interest of 1 cm diameter with consecutive radiomic analysis applying PyRadiomics software. Feature selection was performed applying analysis of correlation and collinearity. The final feature set was obtained to differentiate moderate-to-severe anemia. Random forest machine learning was applied and predictive performance was assessed. A decision-tree was obtained to propose a cut-off value of CT Hounsfield units (HU). Results: High correlation with hemoglobin and hematocrit levels was shown for first-order radiomic features (p < 0.001 to p = 0.032). The top 3 features showed high correlation to hemoglobin values (p) and minimal collinearity (r) to the top ranked feature Median (p < 0.001), Energy (p = 0.002, r = 0.387), Minimum (p = 0.032, r = 0.437). Median (p < 0.001) and Minimum (p = 0.003) differed in moderate-to-severe anemia compared to non-anemic state. Median yielded superiority to the combination of Median and Minimum (p(AUC) = 0.015, p(precision) = 0.017, p(accuracy) = 0.612) in the predictive performance employing random forest analysis. A Median HU value ≤ 36.5 indicated moderate-to-severe anemia (accuracy = 0.90, precision = 0.80). Conclusions: First-order radiomic features correlate with hemoglobin levels and may be feasible for the prediction of moderate-to-severe anemia. High dimensional radiomic features did not aid augmenting the data in our exemplary use case of intraluminal blood component assessment.
Nowadays, digitalization has an immense impact on the landscape of jobs. This technological revolution creates new industries and professions, promises greater efficiency and improves the quality of working life. However, emerging technologies such as robotics and artificial intelligence (AI) are reducing human intervention, thus advancing automation and eliminating thousands of jobs and whole occupational images. To prepare employees for the changing demands of work, adequate and timely training of the workforce and real-time support of workers in new positions is necessary. Therefore, it is investigated whether user-oriented technologies, such as augmented reality (AR) and virtual reality (VR) can be applied “on-the-job” for such training and support—also known as intelligence augmentation (IA). To address this problem, this work synthesizes results of a systematic literature review as well as a practically oriented search on augmented reality and virtual reality use cases within the IA context. A total of 150 papers and use cases are analyzed to identify suitable areas of application in which it is possible to enhance employees' capabilities. The results of both, theoretical and practical work, show that VR is primarily used to train employees without prior knowledge, whereas AR is used to expand the scope of competence of individuals in their field of expertise while on the job. Based on these results, a framework is derived which provides practitioners with guidelines as to how AR or VR can support workers at their job so that they can keep up with anticipated skill demands. Furthermore, it shows for which application areas AR or VR can provide workers with sufficient training to learn new job tasks. By that, this research provides practical recommendations in order to accompany the imminent distortions caused by AI and similar technologies and to alleviate associated negative effects on the German labor market.
Background: Bone age (BA) assessment performed by artificial intelligence (AI) is of growing interest due to improved accuracy, precision and time efficiency in daily routine. The aim of this study was to investigate the accuracy and efficiency of a novel AI software version for automated BA assessment in comparison to the Greulich-Pyle method.
Methods: Radiographs of 514 patients were analysed in this retrospective study. Total BA was assessed independently by three blinded radiologists applying the GP method and by the AI software. Overall and gender-specific BA assessment results, as well as reading times of both approaches, were compared, while the reference BA was defined by two blinded experienced paediatric radiologists in consensus by application of the Greulich-Pyle method.
Results: Mean absolute deviation (MAD) and root mean square deviation (RSMD) were significantly lower between AI-derived BA and reference BA (MAD 0.34 years, RSMD 0.38 years) than between reader-calculated BA and reference BA (MAD 0.79 years, RSMD 0.89 years; p < 0.001). The correlation between AI-derived BA and reference BA (r = 0.99) was significantly higher than between reader-calculated BA and reference BA (r = 0.90; p < 0.001). No statistical difference was found in reader agreement and correlation analyses regarding gender (p = 0.241). Mean reading times were reduced by 87% using the AI system.
Conclusions: A novel AI software enabled highly accurate automated BA assessment. It may improve efficiency in clinical routine by reducing reading times without compromising the accuracy compared with the Greulich-Pyle method.
Objectives: To analyze the performance of radiological assessment categories and quantitative computational analysis of apparent diffusion coefficient (ADC) maps using variant machine learning algorithms to differentiate clinically significant versus insignificant prostate cancer (PCa). Methods: Retrospectively, 73 patients were included in the study. The patients (mean age, 66.3 ± 7.6 years) were examined with multiparametric MRI (mpMRI) prior to radical prostatectomy (n = 33) or targeted biopsy (n = 40). The index lesion was annotated in MRI ADC and the equivalent histologic slides according to the highest Gleason Grade Group (GrG). Volumes of interest (VOIs) were determined for each lesion and normal-appearing peripheral zone. VOIs were processed by radiomic analysis. For the classification of lesions according to their clinical significance (GrG ≥ 3), principal component (PC) analysis, univariate analysis (UA) with consecutive support vector machines, neural networks, and random forest analysis were performed. Results: PC analysis discriminated between benign and malignant prostate tissue. PC evaluation yielded no stratification of PCa lesions according to their clinical significance, but UA revealed differences in clinical assessment categories and radiomic features. We trained three classification models with fifteen feature subsets. We identified a subset of shape features which improved the diagnostic accuracy of the clinical assessment categories (maximum increase in diagnostic accuracy ΔAUC = + 0.05, p < 0.001) while also identifying combinations of features and models which reduced overall accuracy. Conclusions: The impact of radiomic features to differentiate PCa lesions according to their clinical significance remains controversial. It depends on feature selection and the employed machine learning algorithms. It can result in improvement or reduction of diagnostic performance.
Artificial Intelligence (AI) and Machine Learning (ML) are currently hot topics in industry and business practice, while management-oriented research disciplines seem reluctant to adopt these sophisticated data analytics methods as research instruments. Even the Information Systems (IS) discipline with its close connections to Computer Science seems to be conservative when conducting empirical research endeavors. To assess the magnitude of the problem and to understand its causes, we conducted a bibliographic review on publications in high-level IS journals. We reviewed 1,838 articles that matched corresponding keyword-queries in journals from the AIS senior scholar basket, Electronic Markets and Decision Support Systems (Ranked B). In addition, we conducted a survey among IS researchers (N = 110). Based on the findings from our sample we evaluate different potential causes that could explain why ML methods are rather underrepresented in top-tier journals and discuss how the IS discipline could successfully incorporate ML methods in research undertakings.