Volume 27, Issue 5
  • ISSN: 1386-2073
  • E-ISSN: 1875-5402

Abstract

Objective: Gene expression profiles are a valuable data source for studying tumors, but they are high-dimensional and highly redundant. Gene selection is therefore an important step in microarray data classification. Methods: This paper proposes a feature selection method based on the maximum mutual information coefficient and graph theory. Each feature (gene) in the expression data is treated as a vertex, and the maximum mutual information coefficient between genes measures the relationship between vertices, which yields an undirected graph. Core and coritivity theory is then used to determine the feature subset. Results: To avoid reliance on any single classifier or evaluation metric, we evaluated classification performance with three classification models and three evaluation metrics: accuracy, F1-score, and AUC. Experimental results on six different types of genetic data show that the proposed algorithm achieves high accuracy and robustness compared with other advanced feature selection methods. Conclusion: The method considers the importance and correlation of features at the same time and addresses the problem of gene selection in microarray data classification.
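
The graph construction and core search outlined in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation, and several parts are assumptions: absolute Pearson correlation stands in for the maximum mutual information coefficient, the edge `threshold` is a hypothetical parameter, and the coritivity is taken in its commonly stated form, the maximum of ω(G − S) − |S| over vertex cuts S, where ω counts connected components. The brute-force search is only practical for very small graphs, and how the resulting core maps to the final gene subset is not specified in the abstract, so it is left out.

```python
from itertools import combinations
from math import sqrt


def abs_pearson(x, y):
    """Absolute Pearson correlation; an assumed stand-in for MIC, not MIC itself."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant feature carries no linear dependence
    return abs(cov) / sqrt(vx * vy)


def build_graph(features, threshold, score=abs_pearson):
    """One vertex per feature; an edge where the pairwise score reaches `threshold`.

    `threshold` is a hypothetical parameter introduced for this sketch.
    """
    adj = {i: set() for i in range(len(features))}
    for i, j in combinations(range(len(features)), 2):
        if score(features[i], features[j]) >= threshold:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def count_components(adj, removed):
    """Number of connected components left after deleting the vertices in `removed`."""
    seen = set(removed)
    count = 0
    for start in adj:
        if start in seen:
            continue
        count += 1
        seen.add(start)
        stack = [start]
        while stack:  # depth-first traversal of one component
            v = stack.pop()
            for w in adj[v]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
    return count


def core_and_coritivity(adj):
    """Brute-force search for a vertex cut S maximizing components(G - S) - |S|.

    Returns (S, value), or (None, None) if the graph has no vertex cut.
    """
    vertices = list(adj)
    best_set, best_value = None, None
    for k in range(1, len(vertices) - 1):
        for subset in combinations(vertices, k):
            parts = count_components(adj, subset)
            if parts < 2:
                continue  # removing this subset does not disconnect the graph
            value = parts - k
            if best_value is None or value > best_value:
                best_set, best_value = set(subset), value
    return best_set, best_value
```

With a real dataset, `score` could be swapped for an actual MIC estimator; the rest of the sketch only depends on it returning a number per feature pair.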

/content/journals/cchts/10.2174/1386207326666230413085646
2024-03-01
2025-10-21

  • Article Type: Research Article
Keyword(s): cancer classification; feature selection; filters; Gene expression; graph theory; MIC