SubtypeExploreR is a user-friendly platform to compare the expression of genes of interest or gene sets of interest in the Pathway-Derived Subtypes (PDS), Consensus Molecular Subtypes (CMS) and intrinsic-Consensus Molecular Subtypes (iCMS) within two colorectal cancer cohorts, FOCUS and GSE39582.
The FOCUS cohort consists of n=360 formalin-fixed paraffin-embedded primary colorectal tumour samples transcriptionally profiled by microarray (Almac Xcel array). More information on this cohort can be found in the associated publication, by Malla et al., entitled “In-depth Clinical and Biological Exploration of DNA Damage Immune Response as a Biomarker for Oxaliplatin Use in Colorectal Cancer” or Gene Expression Omnibus (GEO) under accession number GSE156915.
The GSE39582 cohort consists of n=566 primary colon tumour samples transcriptionally profiled by microarray (Affymetrix U133Plus2 array). For more information on this cohort, see the associated publication, by Marisa et al., entitled “Gene expression classification of colon cancer into molecular subtypes: characterization, validation, and prognostic value”.
SubtypeExploreR has two analysis modules, Expression Boxplots and ssGSEA.
The Expression Boxplots tab enables the user to select a gene of interest and compare the expression of that gene in the PDS subtypes, CMS subtypes or iCMS subtypes.
The ssGSEA tab enables the user to perform single sample gene set enrichment analysis (ssGSEA) for a gene signature. Users can choose an existing gene signature from the Hallmark, KEGG, BioCarta, Reactome or Pathway Interaction Database (PID) gene set collections. Alternatively, the user can select the “Custom” option and then enter a list of gene symbols to define their own gene signature. ssGSEA scores will then be calculated for the chosen/user-defined signature in each sample in each cohort and the user can choose to compare the scores between PDS subtypes, CMS subtypes or iCMS subtypes.
The PDS subtype of each sample was determined using the
PDSpredict
function from the
PDSclassifier
R package with the default threshold
of 0.6.
Samples were assigned to a CMS subtype by the
CMSclassifier
R package (v1.0.0) with the random forest method via the
classifyCMS.RF
function. For the FOCUS cohort the
minPosterior
threshold was set to 0.4 while for the
GSE39582 cohort a minPosterior
threshold of 0.5 was
used.
For iCMS classification, the iCMS gene signatures were extracted to
create an iCMS template that was subsequently used with the nearest
template prediction (NTP) method embedded in the
CMScaller
R package (v2.0.1), as previously described by Joanito et al.
in the
original
iCMS publication. Samples with an FDR
greater than 0.05
were classed as unknown (UNK).