
Prediction of Human Protein Subcellular Locations with Feature Selection and Analysis


In this paper, we propose a strategy for predicting the subcellular locations of human proteins using multi-step feature selection. Each protein is first encoded by features derived from KEGG and GO enrichment scores. After an initial feature reduction, 9958 features remain, and these are ranked by the Minimum Redundancy Maximum Relevance (mRMR) method. The ranked features are then filtered by an incremental feature selection (IFS) procedure to obtain a compact feature set. Random forest (RF) is used as the prediction model and achieves an overall prediction accuracy of 67.72%, as evaluated by ten-fold cross-validation. The KEGG pathways and GO terms corresponding to the selected features are analyzed in depth and are deemed the terms most relevant to human protein subcellular location.
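
The IFS step described above can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation and rests on several assumptions: the feature ranking is taken as given (standing in for the mRMR output), a simple nearest-centroid classifier replaces random forest to keep the example short, and the data are synthetic. All function names and parameters are hypothetical.

```python
import random
from statistics import mean


def k_fold_indices(n, n_splits, seed=0):
    """Shuffle 0..n-1 and deal the indices into n_splits folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::n_splits] for i in range(n_splits)]


def nearest_centroid_accuracy(X, y, features, folds):
    """Cross-validated accuracy of a nearest-centroid classifier.

    Assumption: this classifier is only a stand-in for random forest.
    """
    correct = 0
    for test_idx in folds:
        held_out = set(test_idx)
        # Group training rows by class, restricted to the chosen features.
        groups = {}
        for i in range(len(X)):
            if i not in held_out:
                groups.setdefault(y[i], []).append([X[i][f] for f in features])
        centroids = {c: [mean(col) for col in zip(*rows)]
                     for c, rows in groups.items()}
        for i in test_idx:
            row = [X[i][f] for f in features]
            pred = min(centroids, key=lambda c: sum(
                (a - b) ** 2 for a, b in zip(row, centroids[c])))
            correct += pred == y[i]
    return correct / len(X)


def incremental_feature_selection(X, y, ranking, n_splits=10, seed=0):
    """Score the top-k ranked features for k = 1..len(ranking); keep the best k.

    On ties, max() returns the first maximum, i.e. the smallest feature set.
    """
    folds = k_fold_indices(len(X), n_splits, seed)
    curve = [(k, nearest_centroid_accuracy(X, y, ranking[:k], folds))
             for k in range(1, len(ranking) + 1)]
    best_k, best_acc = max(curve, key=lambda r: r[1])
    return ranking[:best_k], best_acc, curve


# Synthetic toy data (an assumption): features 0-1 carry signal, 2-4 are noise.
rng = random.Random(1)
X, y = [], []
for label in (0, 1):
    for _ in range(30):
        informative = [rng.gauss(3 * label, 1), rng.gauss(3 * label, 1)]
        noise = [rng.gauss(0, 1) for _ in range(3)]
        X.append(informative + noise)
        y.append(label)

ranking = [0, 1, 2, 3, 4]  # assumed to come from a ranking step such as mRMR
selected, best_acc, curve = incremental_feature_selection(X, y, ranking)
print("selected features:", selected)
print("ten-fold accuracy:", round(best_acc, 3))
```

The `curve` list holds one (k, accuracy) pair per candidate feature-set size, which is the IFS curve one would inspect to choose the compact set. In a setting like the one described, the ranking would come from mRMR and the scoring function would wrap a random forest.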
