Refine
Year of publication
Document Type
- Article (23)
- Doctoral Thesis (1)
- Preprint (1)
Has Fulltext
- yes (25)
Is part of the Bibliography
- no (25)
Keywords
- Hodgkin lymphoma (2)
- Machine learning (2)
- Prostate cancer (2)
- classical Hodgkin lymphoma (2)
- gene expression (2)
- Affymetrix (1)
- Artificial intelligence (1)
- B-cell immunology (1)
- B-cell lymphoma (1)
- BATF3 (1)
Institute
Background: Prostate cancer is a major health concern in aging men. Paralleling an aging society, prostate cancer prevalence increases emphasizing the need for efcient diagnostic algorithms.
Methods: Retrospectively, 106 prostate tissue samples from 48 patients (mean age,
66 ± 6.6 years) were included in the study. Patients sufered from prostate cancer (n = 38) or benign prostatic hyperplasia (n = 10) and were treated with radical prostatectomy or Holmium laser enucleation of the prostate, respectively. We constructed tissue microarrays (TMAs) comprising representative malignant (n = 38) and benign (n = 68) tissue cores. TMAs were processed to histological slides, stained, digitized and assessed for the applicability of machine learning strategies and open–source tools in diagnosis of prostate cancer. We applied the software QuPath to extract features for shape, stain intensity, and texture of TMA cores for three stainings, H&E, ERG, and PIN-4. Three machine learning algorithms, neural network (NN), support vector machines (SVM), and random forest (RF), were trained and cross-validated with 100 Monte Carlo random splits into 70% training set and 30% test set. We determined AUC values for single color channels, with and without optimization of hyperparameters by exhaustive grid search. We applied recursive feature elimination to feature sets of multiple color transforms.
Results: Mean AUC was above 0.80. PIN-4 stainings yielded higher AUC than H&E and
ERG. For PIN-4 with the color transform saturation, NN, RF, and SVM revealed AUC of 0.93 ± 0.04, 0.91 ± 0.06, and 0.92 ± 0.05, respectively. Optimization of hyperparameters improved the AUC only slightly by 0.01. For H&E, feature selection resulted in no increase of AUC but to an increase of 0.02–0.06 for ERG and PIN-4.
Conclusions: Automated pipelines may be able to discriminate with high accuracy between malignant and benign tissue. We found PIN-4 staining best suited for classifcation. Further bioinformatic analysis of larger data sets would be crucial to evaluate the reliability of automated classifcation methods for clinical practice and to evaluate potential discrimination of aggressiveness of cancer to pave the way to automatic precision medicine.
In gastric cancer (GC), there are four molecular subclasses that indicate whether patients respond to chemotherapy or immunotherapy, according to the TCGA. In clinical practice, however, not every patient undergoes molecular testing. Many laboratories have used well-implemented in situ techniques (IHC and EBER-ISH) to determine the subclasses in their cohorts. Although multiple stains are used, we show that a staining approach is unable to correctly discriminate all subclasses. As an alternative, we trained an ensemble convolutional neuronal network using bagging that can predict the molecular subclass directly from hematoxylin–eosin histology. We also identified patients with predicted intra-tumoral heterogeneity or with features from multiple subclasses, which challenges the postulated TCGA-based decision tree for GC subtyping. In the future, deep learning may enable targeted testing for molecular subtypes and targeted therapy for a broader group of GC patients. © 2022 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of The Pathological Society of Great Britain and Ireland.
Bioinformatics analysis quantifies neighborhood preferences of cancer cells in Hodgkin lymphoma
(2017)
Motivation Hodgkin lymphoma is a tumor of the lymphatic system and represents one of the most frequent lymphoma in the Western world. It is characterized by Hodgkin cells and Reed-Sternberg cells, which exhibit a broad morphological spectrum. The cells are visualized by immunohistochemical staining of tissue sections. In pathology, tissue images are mainly manually evaluated, relying on the expertise and experience of pathologists. Computational quantification methods become more and more essential to evaluate tissue images. In particular, the distribution of cancer cells is of great interest.
Results Here, we systematically quantified and investigated cancer cell properties and their spatial neighborhood relations by applying statistical analyses to whole slide images of Hodgkin lymphoma and lymphadenitis, which describes a non-cancerous inflammation of the lymph node. We differentiated cells by their morphology and studied the spatial neighborhood relation of more than 400,000 immunohistochemically stained cells. We found that, according to their morphological features, the cells exhibited significant preferences for and aversions to cells of specific profiles as nearest neighbor. We quantified differences between Hodgkin lymphoma and lymphadenitis concerning the neighborhood relations of cells and the sizes of cells. The approach can easily be applied to other cancer types.
Zur genomweiten Genexpressionsanalyse werden Microarray-Experimente verwendet. Ziel dieser Arbeit ist es, Methoden zur Präprozessierung von Microarrays der Firma Affymetrix zu evaluieren und die VSN-Methode für Experimente mit weniger als 1000 Zellen zu verbessern. Bei dieser Technologie wird die Expression jedes Gens durch mehrere Probessets gemessen. Jedes Probeset besteht aus einem Perfect-Match (PM) und einem dazugehörigen Mismatch (MM). Der Expressionswert pro Gen wird durch ein vierstufiges Verfahren aus den einzelnen Probe-Werten berechnet: Hintergrundkorrektur, Normalisierung, PM-Adjustierung und Aggregation. Für jeden dieser Schritte existieren mehrere Algorithmen. Dazu dienten die im affy-Paket des Bioconductor implementierten Methoden MAS5, RMA, VSN und die Methode sRMA von Cope et al. [Cope et al., 2006] in Kombination mit der Methode VSN von Huber et al. [Huber et al., 2002]. Den ersten Teil dieser Arbeit bildet die Reanalyse der Datensätze von Küppers et al. [Küppers et al., 2003] und Piccaluga et al. [Piccaluga et al., 2007] mit der VSN-Methode. Dabei konnte gezeigt werden, dass die VSN-Methode gegenüber Klein et al. [Klein et al., 2001] Vorteile zeigt. Bei beiden Datensätzen wurden zusätzliche Gene gefunden, die für die Pathogenese der jeweiligen Tumorarten wichtig sein können. Einige der zusätzlich gefunden Gene wurden durch andere wissenschaftliche Arbeiten bestätigt. Die Gene, die bisher in keinem Zusammenhang mit der untersuchten Tumorart stehen, sind eine Möglichkeit für die weitere Forschung. Vor allem der Zytokine/Zytokine Signalweg wurde bei beiden Reanalysen als überrepräsentiert erkannt. Da für einige Microarray-Experimente die Anzahl der Zellen und damit die Menge an mRNA nur begrenzt zur Verfügung stehen, müssen die Laborarbeit und die statistischen Analysen angepasst werden. Hierzu werden fünf Methoden für die Präprozessierung untersucht, um zu evaluieren, welche Methode geeignet ist, derartige Expressionsdaten zu verrechnen. Auf Basis eines Testdatensatzes der bereits zur Etablierung des Laborprozesses diente werden Expressionswerte durch empirische Verteilung, Gammaverteilung und ein linear gemischtes Modell simuliert. Die Simulation lässt sich in vier Schritte einteilen: Wahl der Verteilung, Simulation der Expressionsmatrix, Simulation der differentiellen Expression, Sortierung der Probes innerhalb des Probesets. Anschließend werden die fünf Präprozessierungsmethoden mit diesen simulierten Expressionsdaten auf ihre Sensitivität und Spezifität untersucht. Während sich bei den empirisch und gammaverteilt simulierten Expressionsdaten kein eindeutiges Ergebnis abzeichnet, hat sVSN bei den Daten aus dem linear gemischten Modell die größte Sensitivität und die größte Spezifität. Der in dieser Arbeit entwickelte sVSN-Algorithmus wurde zum ersten Mal angewendet und bewertet. Abschließend wird ein Teildatensatz von Brune et al. verwendet und hinsichtlich der fünf Präprozessierungsmethoden untersucht. Die Ergebnisse der sVSN-Methode wird im Detail weiter verfolgt. Die zusätzlich gefunden Gene können durch bereits veröffentlichte Arbeiten bestätigt werden. Letztendlich zeigt sich, dass neuere statistische Methoden (wie das im Rahmen dieser Arbeit entwickelte sVSN) bei der Analyse von Affymetrix Microarrays einen Vorteil bringen. Die sVSN und sRMA Methoden zeigen Vorteile, da die Probes nach der Normalisierung gewichtet werden, bevor diese aggregiert werden. Die MAS5-Methode schneidet am schlechtesten ab und sollte bei geringen Zellmengen nicht eingesetzt werden. Für die Analyse mit geringer Menge an mRNA müssen weitere Untersuchungen vorgenommen werden, um eine geeignete statistische Methode für die Analyse der Expressionsdaten zu finden.
Spinocerebellar ataxia type 2 (SCA2) is caused by polyglutamine expansion in Ataxin-2 (ATXN2). This factor binds RNA/proteins to modify metabolism after stress, and to control calcium (Ca2+) homeostasis after stimuli. Cerebellar ataxias and corticospinal motor neuron degeneration are determined by gain/loss in ATXN2 function, so we aimed to identify key molecules in this atrophic process, as potential disease progression markers. Our Atxn2-CAG100-Knock-In mouse faithfully models features observed in patients at pre-onset, early and terminal stages. Here, its cerebellar global RNA profiling revealed downregulation of signaling cascades to precede motor deficits. Validation work at mRNA/protein level defined alterations that were independent of constant physiological ATXN2 functions, but specific for RNA/aggregation toxicity, and progressive across the short lifespan. The earliest changes were detected at three months among Ca2+ channels/transporters (Itpr1, Ryr3, Atp2a2, Atp2a3, Trpc3), IP3 metabolism (Plcg1, Inpp5a, Itpka), and Ca2+-Calmodulin dependent kinases (Camk2a, Camk4). CaMKIV–Sam68 control over alternative splicing of Nrxn1, an adhesion component of glutamatergic synapses between granule and Purkinje neurons, was found to be affected. Systematic screening of pre/post-synapse components, with dendrite morphology assessment, suggested early impairment of CamKIIα abundance together with the weakening of parallel fiber connectivity. These data reveal molecular changes due to ATXN2 pathology, primarily impacting excitability and communication.
Background: Nodular lymphocyte predominant Hodgkin lymphoma (NLPHL) usually presents in middle aged men and shows an indolent clinical behavior. However, up to 30% of the patients present a secondary transformation into aggressive diffuse large B cell lymphoma (DLBCL). The aim of the present study was to characterize morphology and immunophenotype of this kind of DLBCL in detail and compare it with conventional DLBCL.
Methods: Morphology and immunophenotype of 33 cases of NLPHL with simultaneous or sequential transformation into DLBCL were investigated. These cases were compared with 41 de novo DLBCL in Finnish men.
Results: The majority of cases exhibited different immunophenotypes in the NLPHL and the DLBCL components. The immunophenotype of the DLBCL secondary to NLPHL was heterogeneous. However, BCL6, EMA, CD75 and J-chain were usually expressed in both components (≥73% positive). Overall, the NLPHL component was more frequently positive for EMA, CD75 and J-chain than the DLBCL component. In contrast, B cell markers, CD10 and BCL2, were more frequently expressed and were expressed at higher levels in the DLBCL component than in the NLPHL component. In the independent series of de novo DLBCL 4 cases could be identified with a growth pattern and immunophenotype that suggested that they had arisen secondarily from NLPHL.
Conclusions: The morphology and immunophenotype of DLBCL arisen from NLPHL is heterogeneous. Further characterization of the particular molecular features of this subgroup is warranted to be able to better identify these cases among conventional DLBCL.
Background: Increased glycolytic activity is a hallmark of cancer, allowing staging and restaging with 18F-fluorodeoxyglucose-positron-emission-tomography (PET). Since interim-PET is an important prognostic tool in Hodgkin lymphoma (HL), the aim of this study was to investigate the expression of proteins involved in the regulation of glucose metabolism in the different HL subtypes and their impact on clinical outcome.
Methods: Lymph node biopsies from 54 HL cases and reactive lymphoid tissue were stained for glucose transporter 1 (GLUT1), lactate dehydrogenase A (LDHA) and lactate exporter proteins MCT1 and MCT4. In a second series, samples from additional 153 HL cases with available clinical data were stained for GLUT1 and LDHA.
Results: Membrane bound GLUT1 expression was frequently observed in the tumor cells of HL (49% of all cases) but showed a broad variety between the different Hodgkin lymphoma subtypes: Nodular sclerosing HL subtype displayed a membrane bound GLUT1 expression in the Hodgkin-and Reed-Sternberg cells in 56% of the cases. However, membrane bound GLUT1 expression was more rarely observed in tumor cells of lymphocyte rich classical HL subtype (30%) or nodular lymphocyte predominant HL subtype (15%). Interestingly, in both of these lymphocyte rich HL subtypes as well as in progressively transformed germinal centers, reactive B cells displayed strong expression of GLUT1. LDHA, acting downstream of glycolysis, was also expressed in 44% of all cases. We evaluated the prognostic value of different GLUT1 and LDHA expression patterns; however, no significant differences in progression free or overall survival were found between patients exhibiting different GLUT1 or LDHA expression patterns. There was no correlation between GLUT1 expression in HRS cells and PET standard uptake values.
Conclusions: In a large number of cases, HRS cells in classical HL express high levels of GLUT1 and LDHA indicating glycolytic activity in the tumor cells. Although interim-PET is an important prognostic tool, a predictive value of GLUT1 or LDHA staining of the primary diagnostic biopsy could not be demonstrated. However, we observed GLUT1 expression in progressively transformed germinal centers and hyperplastic follicles, explaining false positive results in PET. Therefore, PET findings suggestive of HL relapse should always be confirmed by histology.
The pathogenesis of nodular lymphocyte–predominant Hodgkin lymphoma (NLPHL) and its relationship to other lymphomas are largely unknown. This is partly because of the technical challenge of analyzing its rare neoplastic lymphocytic and histiocytic (L&H) cells, which are dispersed in an abundant nonneoplastic cellular microenvironment. We performed a genome-wide expression study of microdissected L&H lymphoma cells in comparison to normal and other malignant B cells that indicated a relationship of L&H cells to and/or that they originate from germinal center B cells at the transition to memory B cells. L&H cells show a surprisingly high similarity to the tumor cells of T cell–rich B cell lymphoma and classical Hodgkin lymphoma, a partial loss of their B cell phenotype, and deregulation of many apoptosis regulators and putative oncogenes. Importantly, L&H cells are characterized by constitutive nuclear factor {kappa}B activity and aberrant extracellular signal-regulated kinase signaling. Thus, these findings shed new light on the nature of L&H cells, reveal several novel pathogenetic mechanisms in NLPHL, and may help in differential diagnosis and lead to novel therapeutic strategies.
In pathology, tissue images are evaluated using a light microscope, relying on the expertise and experience of pathologists. There is a great need for computational methods to quantify and standardize histological observations. Computational quantification methods become more and more essential to evaluate tissue images. In particular, the distribution of tumor cells and their microenvironment are of special interest. Here, we systematically investigated tumor cell properties and their spatial neighborhood relations by a new application of statistical analysis to whole slide images of Hodgkin lymphoma, a tumor arising in lymph nodes, and inflammation of lymph nodes called lymphadenitis. We considered properties of more than 400, 000 immunohistochemically stained, CD30-positive cells in 35 whole slide images of tissue sections from subtypes of the classical Hodgkin lymphoma, nodular sclerosis and mixed cellularity, as well as from lymphadenitis. We found that cells of specific morphology exhibited significant favored and unfavored spatial neighborhood relations of cells in dependence of their morphology. This information is important to evaluate differences between Hodgkin lymph nodes infiltrated by tumor cells (Hodgkin lymphoma) and inflamed lymph nodes, concerning the neighborhood relations of cells and the sizes of cells. The quantification of neighborhood relations revealed new insights of relations of CD30-positive cells in different diagnosis cases. The approach is general and can easily be applied to whole slide image analysis of other tumor types.
Objectives: To analyze the performance of radiological assessment categories and quantitative computational analysis of apparent diffusion coefficient (ADC) maps using variant machine learning algorithms to differentiate clinically significant versus insignificant prostate cancer (PCa). Methods: Retrospectively, 73 patients were included in the study. The patients (mean age, 66.3 ± 7.6 years) were examined with multiparametric MRI (mpMRI) prior to radical prostatectomy (n = 33) or targeted biopsy (n = 40). The index lesion was annotated in MRI ADC and the equivalent histologic slides according to the highest Gleason Grade Group (GrG). Volumes of interest (VOIs) were determined for each lesion and normal-appearing peripheral zone. VOIs were processed by radiomic analysis. For the classification of lesions according to their clinical significance (GrG ≥ 3), principal component (PC) analysis, univariate analysis (UA) with consecutive support vector machines, neural networks, and random forest analysis were performed. Results: PC analysis discriminated between benign and malignant prostate tissue. PC evaluation yielded no stratification of PCa lesions according to their clinical significance, but UA revealed differences in clinical assessment categories and radiomic features. We trained three classification models with fifteen feature subsets. We identified a subset of shape features which improved the diagnostic accuracy of the clinical assessment categories (maximum increase in diagnostic accuracy ΔAUC = + 0.05, p < 0.001) while also identifying combinations of features and models which reduced overall accuracy. Conclusions: The impact of radiomic features to differentiate PCa lesions according to their clinical significance remains controversial. It depends on feature selection and the employed machine learning algorithms. It can result in improvement or reduction of diagnostic performance.