-
oa Editorial [Hot Topic: Machine Learning Models in Protein Bioinformatics (Guest Editors: Lukasz Kurgan & Yaoqi Zhou)]
- Source: Current Protein and Peptide Science, Volume 12, Issue 6, Sep 2011, p. 455 - 455
-
- 01 Sep 2011
Abstract
Bioinformatics is a relatively new field concerned with the computational analysis and prediction of properties of biomolecules, DNA, RNA, and proteins, in particular, on a genomic/proteomic scale. Machine learning models play increasingly important roles in development of novel methodologies, summarization, and high-throughput analysis in the bioinformatics field. Advances in the related area, including protein structure and function prediction [1, 2], structural bioinformatics [3], and peptide analysis [4] were recently summarized, and several works that overview specific sub-areas of protein bioinformatics, such as prediction of secondary structure [5, 6], helical transmembrane proteins [7], localization and targeting [8], binding sites [9, 10], and RNA-binding [11], were published in the last couple of years. This issue provides a comprehensive overview of current efforts related to the analysis of protein data, from sequences to structures to functions. It consists of two parts, the first with five reviews and the second that includes seven original methodology papers. The first review by Xin and Radivojac summarizes approaches for the computational identification of functional residues in protein structures and discusses their applications in functional proteomics, including prediction of catalytic residues, post-translational modifications, and nucleic acid-binding sites. The second manuscript by Kurgan and Disfani provides a comprehensive review of ten onedimensional structural descriptors of proteins and comparatively summarizes over eighty computational models that are used to predict these descriptors from the protein sequences, primarily focusing on the prediction of secondary structure, relative solvent accessibility, and disorder. The review by Gromiha and Huang discusses machine learning-based and statistical methods for the computational prediction of protein folding rates and stability. The fourth paper by Zhou and coworkers overviews and compares current techniques for the prediction of small open reading frames and emphasizes the need for further research in this area. The last review introduces cellular automata and concentrates on its applications in the protein bioinformatics. The first original research paper by Kihara and coworkers describes the three-dimensional Zernike descriptor, which is used to describe molecular surfaces, and overviews several applications of this descriptor. In the next paper, Qin and Zhou introduce their DISPLAR method that aims at the accurate protein structure-based prediction of DNA binding sites. The manuscript by Xu and coworkers describes and evaluates a new sampling-based machine learning method to rank protein structural models by integrating multiple scores and features. The next two original contributions describe new methodologies for the protein model quality assessment. The work by Martin, Mirabello, and Pollastri concerns an efficient knowledge-based approach that utilizes neural network pairwise interaction fields. The paper by Meller and coworkers introduces a method based on the prediction of relative solvent accessibility using support vector regression, which is applied to soluble and alpha-helical membrane proteins. The next contribution by Hwang et al. investigates a relation between contact numbers and catalytic residues to build a simple and effective predictor of the catalytic residues. We close the issue with the paper by Yin, Fan, and Shen which proposes and evaluates an accurate nearest neighbor-based method for the prediction of the conotoxin superfamily. We are excited to deliver this comprehensive issue that tackles a diverse set of developments in the area of protein bioinformatics. We hope that it will constitute an indispensable resource for bioinformaticians, computer scientists, computational biologists, biophysicists, and biochemists. Last but not least, we would like to thank all the authors who make this issue possible. We are in great debt to the 26 anonymous reviewers from around the world who delivered timely and useful comments to the authors. The guest editors also express their gratitude to the Editor-in-Chief Prof. Ben M. Dunn for his invitation and support that resulted in the successful completion of this special issue.