Colorectal Cancer Prediction via Applying Recursive Cluster Elimination with Intra-cluster Feature Elimination on Metagenomic Pathway Data


Temiz M., Kuzudisli C., Yousef M., Bakir-Gungor B.

International Conference on Intelligent and Fuzzy Systems, INFUS 2024, Çanakkale, Turkey, 16 - 18 July 2024, vol.1089 LNNS, pp.285-292 identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 1089 LNNS
  • Doi Number: 10.1007/978-3-031-67195-1_34
  • City: Çanakkale
  • Country: Turkey
  • Page Numbers: pp.285-292
  • Keywords: biomarkers, colorectal cancer, machine learning, metagenomics, pathway
  • Abdullah Gül University Affiliated: Yes

Abstract

Advances in next-generation sequencing and in “-omics” technologies enable the characterization of the human gut microbiome. Colorectal cancer (CRC), the third most common cancer worldwide, is caused by genetic mutations, environmental influences, and abnormalities in the gut microbiota. The aim of this study is to identify pathways that influence host metabolism in CRC patients. The CRC-related metagenomic dataset used in this study contains the relative abundance values of 551 pathways calculated for 1262 samples. Here, two different approaches based on the feature grouping reduce the number of features by considering relevant features as groups, eliminate irrelevant features, and perform classification. The recursive cluster elimination with intra-cluster feature elimination (RCE-IFE) approach achieves an AUC of 0.72 using an average of 66.2 features on CRC-associated metagenomics dataset. In these experiments, P163-PWY: L-lysine fermentation to acetate and butanoate and PWY-6151: S-adenosyl-L-methionine cycle I pathways are identified as potential biomarkers associated with CRC. These experiments also reduce the number of features reported by both approaches in P163-PWY: L-lysine fermentation to acetate and butanoate and PWY-6151: Sadenosyl-L-methionine cycle I pathways reported by both approaches are considered possible CRC-related biomarkers. This study contributes to the molecular diagnosis and treatment of colorectal cancer by revealing the pathways associated with CRC. Our results are promising for the study of the gut microbiota and its role in CRC.