FermatS: A Novel Numerical Representation for Protein Sequence Comparison and DNA-binding Protein Identification

Yanping Zhang; Ya Gao; Jianwei Ni; Pengcheng Chen; Xiaosheng Wang

doi:10.2174/1386207323999201117111738

ISSN: 1386-2073
E-ISSN: 1875-5402

Source: Combinatorial Chemistry & High Throughput Screening, Volume 24, Issue 10, Nov 2021, p. 1746 - 1753
DOI: https://doi.org/10.2174/1386207323999201117111738
- Available online: 01 Nov 2021

Abstract

Aims: To analyze protein sequence similarity and predict DNA-binding proteins with a simple and effective method based on protein sequence information. Background: With the rapidly growing volume of molecular biology data, low-complexity computational methods are needed to accurately infer protein structure, function, and evolution. Objective: To develop a novel computational algorithm for analyzing and comparing protein sequences. Methods: The numerical characteristics of a protein sequence are obtained from three components: a global position representation based on Fermat spiral curves, a local position representation based on the normalized moments of inertia of those curves, and the composition of the 20 amino acids. Results: The method was applied to analyze the similarity/dissimilarity of nine ND5 proteins, and the results are consistent with biological evolution theory. Furthermore, a logistic regression model with 5-fold cross-validation was built to predict DNA-binding proteins; on the unbalanced dataset it outperformed DNAbinder, iDNA-prot, DNA-prot and gDNA-prot by 0.0069-0.609 in F-measure and by 0.293-0.898 in MCC. Conclusion: These results show that the method, named FermatS, is effective for comparing, recognizing, and predicting protein sequences.
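The abstract does not give the exact construction, so the following is only a minimal sketch of the kind of features the Methods sentence describes, not the authors' FermatS implementation. It assumes that sequence position i is mapped to the angle theta = i * step on the Fermat spiral r = a * sqrt(theta), that each residue carries unit mass, and that the "normalized" moment of inertia is the mean squared distance of the points from their centroid. The function names, the parameters `a` and `step`, and the 40-dimensional feature layout are all hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def composition(seq):
    """Fraction of each of the 20 amino acids in seq."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    counts = Counter(seq)
    return [counts[aa] / len(seq) for aa in AMINO_ACIDS]


def fermat_points(seq, residue, a=1.0, step=math.pi / 10):
    """Spiral points for every position where `residue` occurs.

    Assumption: 1-based position i maps to theta = i * step,
    and the radius follows the Fermat spiral r = a * sqrt(theta).
    """
    points = []
    for i, ch in enumerate(seq, start=1):
        if ch == residue:
            theta = i * step
            r = a * math.sqrt(theta)
            points.append((r * math.cos(theta), r * math.sin(theta)))
    return points


def normalized_moment_of_inertia(points):
    """Mean squared distance from the centroid (unit masses assumed)."""
    if not points:
        return 0.0
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    return sum((x - cx) ** 2 + (y - cy) ** 2 for x, y in points) / n


def features(seq):
    """Hypothetical 40-dimensional vector: composition + per-residue inertia."""
    inertia = [
        normalized_moment_of_inertia(fermat_points(seq, aa))
        for aa in AMINO_ACIDS
    ]
    return composition(seq) + inertia
```

Under these assumptions, two sequences could be compared by a distance between their `features` vectors, and the same vectors could serve as input to a logistic regression classifier; the paper's actual descriptors and distance measure may differ.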


Article Type: Research Article

Keyword(s): Fermat spiral; identification of DNA-binding proteins; logistic regression; mass; moment of inertia; similarity/dissimilarity of species
