The search result changed since you submitted your search request. Documents might be displayed in a different sort order.
  • search hit 10 of 19
Back to Result List

Comparative assessment of automated algorithms for the separation of one-dimensional Gaussian mixtures

  • Motivation: Gaussian mixture models (GMMs) are probabilistic models commonly used in biomedical research to detect subgroup structures in data sets with one-dimensional information. Reliable model parameterization requires that the number of modes, i.e., states of the generating process, is known. However, this is rarely the case for empirically measured biomedical data. Several implementations are available that estimate GMM parameters differently. This work aims to provide a comparative evaluation of automated GMM fitting methods. Results and conclusions: The performance of commonly used algorithms for automatic parameterization and mode number determination was compared with respect to reproducing the ground truth of generated data derived from multiple normal distributions. Four main variants of Gaussian mode number detection algorithms and five variants of GMM parameter estimation methods were tested in a combinatory scenario. The combination of best performing mode number determination algorithms and GMM parameter estimation methods was then tested on artificial and real-live data sets known to display a GMM structure. None of the tested methods correctly determined the underlying data structure consistently. The likelihood ratio test had the best performance in identifying the mode number associated with the best GMM fit of the data distribution while the Markov chain Monte Carlo (MCMC) algorithm was best for GMM parameter estimation while. The combination of the two methods of number determination algorithms and GMM parameter estimation was consistently among the best and overall outperformed the available implementations. Implementation: An automated tool for the detection of GMM based structures in (biomedical) datasets was created based on the present results and made freely available in the R library “opGMMassessment” at https://cran.r-project.org/package=opGMMassessment.

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Jörn LötschORCiDGND, Sebastian MalkuschORCiDGND, Alfred UltschGND
URN:urn:nbn:de:hebis:30:3-755721
DOI:https://doi.org/10.1016/j.imu.2022.101113
ISSN:2352-9148
Parent Title (English):Informatics in Medicine Unlocked
Publisher:Elsevier
Place of publication:Amsterdam [u.a.]
Document Type:Article
Language:English
Date of Publication (online):2022/10/22
Date of first Publication:2022/10/22
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2023/09/11
Tag:Biomedical informatics; Data science; Data structure detection; Machine learning
Volume:34
Issue:101113
Page Number:13
HeBIS-PPN:513142290
Institutes:Medizin
Dewey Decimal Classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
6 Technik, Medizin, angewandte Wissenschaften / 61 Medizin und Gesundheit / 610 Medizin und Gesundheit
Sammlungen:Universitätspublikationen
Licence (German):License LogoCreative Commons - CC BY - Namensnennung 4.0 International