Deep Convolutional Neural Networks for Predicting Hydroxyproline in Proteins

HaiXia Long; Mi Wang; HaiYan Fu

doi:10.2174/1574893612666170221152848

ISSN: 1574-8936
E-ISSN: 2212-392X

Source: Current Bioinformatics, Volume 12, Issue 3, Jun 2017, p. 233 - 238
DOI: https://doi.org/10.2174/1574893612666170221152848
Available online: 01 Jun 2017

Abstract

Background: Protein hydroxyproline is one type of post-translational modification (PTM). Because a protein sequence contains many uncharacterized proline (P) residues, the question that needs to be answered is: which ones can be hydroxylated, and which ones cannot? The answer will not only give a deeper understanding of the hydroxylation mechanism but can also support drug development. The ever-growing demand for better handling of protein sequences in the post-genomic age presents new prediction challenges.

Objective: To address these challenges, our objective is to develop computational methods that identify these sites quickly and accurately.

Method: We propose a new approach for predicting hydroxyproline using a deep learning model, the convolutional neural network (CNN). We employed pseudo amino acid composition (PseAAC) to identify these proteins and used the position-specific scoring matrix (PSSM) to represent samples as input to the CNN model.

Results and Conclusion: In our experiments, K-fold cross-validation on benchmark datasets demonstrated the potential of the CNN for identifying protein hydroxyproline as well as proteins with other types of PTM.
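The abstract does not give implementation details, so the following is only a minimal sketch of two steps it implies: collecting a candidate sample for each proline (P) residue, and splitting samples for K-fold cross-validation. The function names, the flank size, the `X` padding character, and the unshuffled fold assignment are all assumptions made for illustration, not the authors' settings. The PSSM encoding and the CNN itself are not shown.

```python
from typing import List, Tuple


def proline_windows(seq: str, flank: int = 6, pad: str = "X") -> List[Tuple[int, str]]:
    """Return (1-based position, window) for every proline (P) in seq.

    Each window has length 2 * flank + 1 and is centered on the proline.
    The flank size and the padding character are illustrative assumptions.
    """
    seq = seq.upper()
    padded = pad * flank + seq + pad * flank
    windows = []
    for i, residue in enumerate(seq):
        if residue == "P":
            # Residue i sits at index i + flank in the padded string,
            # so the centered window is padded[i : i + 2 * flank + 1].
            windows.append((i + 1, padded[i : i + 2 * flank + 1]))
    return windows


def k_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Split range(n_samples) into k (train, test) index pairs, without shuffling."""
    folds = [list(range(n_samples))[j::k] for j in range(k)]
    splits = []
    for j in range(k):
        # Training set is every fold except fold j; fold j is held out.
        train = sorted(idx for m, fold in enumerate(folds) if m != j for idx in fold)
        splits.append((train, folds[j]))
    return splits


if __name__ == "__main__":
    # Hypothetical toy sequence, not taken from the benchmark datasets.
    for position, window in proline_windows("MGPPGAPGK", flank=3):
        print(position, window)
```

In a pipeline like the one the abstract describes, each window would then be turned into a numeric representation (for example, the PSSM rows for its residues) before being passed to the classifier, and each held-out fold would be scored in turn.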

Article Type: Research Article

Keyword(s): convolutional neural network; deep learning; position-specific scoring matrix (PSSM); Protein hydroxyproline; pseudo amino acid composition (PseAAC)
