Control of dataset bias in combined affymetrix cohorts of triple negative breast cancer

  • Heterogenous subtypes of breast cancer need to be analyzed separately. Pooling of datasets can provide reasonable sample sizes but dataset bias is an important concern. We assembled a combined dataset of 579 Affymetrix microarrays from triple negative breast cancer (TNBC) in Gene Expression Omnibus (GEO) series GSE31519. We developed a method for selecting comparable datasets and to control for the amount of dataset bias of individual probesets.

Download full text files

Export metadata

Metadaten
Author:Thomas Karn, Achim Rody, Volkmar MüllerORCiDGND, Marcus Schmidt, Sven BeckerORCiDGND, Uwe Holtrich, Lajos Pusztai
URN:urn:nbn:de:hebis:30:3-442051
DOI:https://doi.org/10.1016/j.gdata.2014.09.014
ISSN:2213-5960
Pubmed Id:https://pubmed.ncbi.nlm.nih.gov/26484129
Parent Title (English):Genomics data
Publisher:Elsevier
Place of publication:Amsterdam [u. a.]
Document Type:Article
Language:English
Date of Publication (online):2017/06/12
Year of first Publication:2014
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2017/06/12
Tag:Breast cancer; Dataset bias; Gene expression; Microarray; Pooling
Volume:2
Page Number:3
First Page:354
Last Page:356
Note:
© 2014 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/).
HeBIS-PPN:428607896
Institutes:Medizin / Medizin
Dewey Decimal Classification:6 Technik, Medizin, angewandte Wissenschaften / 61 Medizin und Gesundheit / 610 Medizin und Gesundheit
Sammlungen:Universitätspublikationen
Licence (German):License LogoCreative Commons - Namensnennung-Nicht kommerziell-Keine Bearbeitung 3.0