Performance of Feature Selection Methods

Edward R. Dougherty; Jianping Hua; Chao Sima

doi:10.2174/138920209789177629

ISSN: 1389-2029
E-ISSN: 1875-5488

Performance of Feature Selection Methods
By Edward R. Dougherty, Jianping Hua and Chao Sima
Source: Current Genomics, Volume 10, Issue 6, Sep 2009, p. 365 - 374
DOI: https://doi.org/10.2174/138920209789177629
- Available online: 01 Sep 2009

Abstract

High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.

Article metrics loading...

/content/journals/cg/10.2174/138920209789177629

2009-09-01

2026-02-25

From This Site

/content/journals/cg/10.2174/138920209789177629

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/cg/10.2174/138920209789177629

Article Type: Research Article

Performance of Feature Selection Methods

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Polytene Chromosomes – A Portrait of Functional Organization of the Drosophila Genome

Early Stages of XY Sex Chromosomes Differentiation in the Fish Hoplias malabaricus (Characiformes, Erythrinidae) Revealed by DNA Repeats Accumulation

Mosaic Brain Aneuploidy in Mental Illnesses: An Association of Low-level post-zygotic Aneuploidy with Schizophrenia and Comorbid Psychiatric Disorders

A Postgenomic Perspective on Molecular Cytogenetics

Detecting Chromosome Condensation Defects in Gulf War Illness Patients

Integrative Bioinformatics Analysis for Targeting Hub Genes in Hepatocellular Carcinoma Treatment

Small Supernumerary Marker Chromosome May Provide Information on Dosage-insensitive Pericentric Regions in Human

Behavioral Variability and Somatic Mosaicism: A Cytogenomic Hypothesis

FAT4 Mutation is Related to Tumor Mutation Burden and Favorable Prognosis in Gastric Cancer

Gene-knockdown Methods for Silencing Nuclear-localized Insulin Receptors in Lung Adenocarcinoma Cells: A Bioinformatics Approach