Refine
Document Type
- Article (3)
Language
- English (3)
Has Fulltext
- yes (3)
Is part of the Bibliography
- no (3)
Keywords
- biomarker (1)
- data science (1)
- dementia (1)
- endocannabinoids (1)
- lipidomic analysis (1)
- lipids (1)
- machine learning (1)
- pain (1)
- quantitative sensory testing (1)
- sensory neuropathy (1)
Institute
- Medizin (3)
The evaluation of pharmacological data using machine learning requires high data quality. Therefore, data preprocessing, that is, cleaning analytical laboratory errors, replacing missing values or outliers, and transforming data adequately before actual data analysis, is crucial. Because current tools available for this purpose often require programming skills, preprocessing tools with graphical user interfaces that can be used interactively are needed. In collaboration between data scientists and experts in bioanalytical diagnostics, a graphical software package for data preprocessing called pguIMP is proposed, which contains a fixed sequence of preprocessing steps to enable reproducible interactive data preprocessing. As an R-based package, it also allows direct integration into this data science environment without requiring any programming knowledge. The implementation of contemporary data processing methods, including machine-learning-based imputation techniques, ensures the generation of corrected and cleaned bioanalytical data sets that preserve data structures such as clusters better than is possible with classical methods. This was evaluated on bioanalytical data sets from lipidomics and drug research using k-nearest-neighbors-based imputation followed by k-means clustering and density-based spatial clustering of applications with noise. The R package provides a Shiny-based web interface designed to be easy to use for non–data analysis experts. It is demonstrated that the spectrum of methods provided is suitable as a standard pipeline for preprocessing bioanalytical data in biomedical research domains. The R package pguIMP is freely available at the comprehensive R archive network (https://cran.r-project.org/web/packages/pguIMP/index.html).
Based on accumulating evidence of a role of lipid signaling in many physiological and pathophysiological processes including psychiatric diseases, the present data driven analysis was designed to gather information needed to develop a prospective biomarker, using a targeted lipidomics approach covering different lipid mediators. Using unsupervised methods of data structure detection, implemented as hierarchal clustering, emergent self-organizing maps of neuronal networks, and principal component analysis, a cluster structure was found in the input data space comprising plasma concentrations of d = 35 different lipid-markers of various classes acquired in n = 94 subjects with the clinical diagnoses depression, bipolar disorder, ADHD, dementia, or in healthy controls. The structure separated patients with dementia from the other clinical groups, indicating that dementia is associated with a distinct lipid mediator plasma concentrations pattern possibly providing a basis for a future biomarker. This hypothesis was subsequently assessed using supervised machine-learning methods, implemented as random forests or principal component analysis followed by computed ABC analysis used for feature selection, and as random forests, k-nearest neighbors, support vector machines, multilayer perceptron, and naïve Bayesian classifiers to estimate whether the selected lipid mediators provide sufficient information that the diagnosis of dementia can be established at a higher accuracy than by guessing. This succeeded using a set of d = 7 markers comprising GluCerC16:0, Cer24:0, Cer20:0, Cer16:0, Cer24:1, C16 sphinganine, and LacCerC16:0, at an accuracy of 77%. By contrast, using random lipid markers reduced the diagnostic accuracy to values of 65% or less, whereas training the algorithms with randomly permuted data was followed by complete failure to diagnose dementia, emphasizing that the selected lipid mediators were display a particular pattern in this disease possibly qualifying as biomarkers.
High glucosylceramides and low anandamide contribute to sensory loss and pain in Parkinson's disease
(2020)
Background: Parkinson's disease (PD) causes chronic pain in two‐thirds of patients, in part originating from sensory neuropathies. The aim of the present study was to describe the phenotype of PD‐associated sensory neuropathy and to evaluate its associations with lipid allostasis, the latter motivated by recent genetic studies associating mutations of glucocerebrosidase with PD onset and severity. Glucocerebrosidase catalyzes the metabolism of glucosylceramides.
Methods: We used quantitative sensory tests, pain ratings, and questionnaires and analyzed plasma levels of multiple bioactive lipid species using targeted lipidomic analyses. The study comprised 2 sets of patients and healthy controls: the first 128 Israeli PD patients and 224 young German healthy controls for exploration, the second 50/50 German PD patients and matched healthy controls for deeper analyses.
Results: The data showed a 70% prevalence of PD pain and sensory neuropathies with a predominant phenotype of thermal sensory loss plus mechanical hypersensitivity. Multivariate analyses of lipids revealed major differences between PD patients and healthy controls, mainly originating from glucosylceramides and endocannabinoids. Glucosylceramides were increased, whereas anandamide and lysophosphatidic acid 20:4 were reduced, stronger in patients with ongoing pain and with a linear relationship with pain intensity and sensory losses, particularly for glucosylceramide 18:1 and glucosylceramide 24:1.
Conclusions: Our data suggest that PD‐associated sensory neuropathies and PD pain are in part caused by accumulations of glucosylceramides, raising the intriguing possibility of reducing PD pain and sensory loss by glucocerebrosidase substituting or refolding approaches. © 2020 The Authors. Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society.