Current Bioinformatics - Volume 20, Issue 2, 2025
Volume 20, Issue 2, 2025
-
-
Intersecting Peptidomics and Bioactive Peptides in Drug Therapeutics
Peptidomics is the study of total peptides that describe the functions, structures, and interactions of peptides within living organisms. It comprises bioactive peptides derived naturally or synthetically designed that exhibit various therapeutic properties against microbial infections, cancer progression, inflammation, etc. With the current state of the art, Bioinformatics tools and techniques help analyse large peptidomics data and predict peptide structure and functions. It also aids in designing peptides with enhanced stability and efficacy. Peptidomics studies are gaining importance in therapeutics as they offer increased target specificity with the least side effects. The molecular size and flexibility of peptides make them a potential drug candidate for designing protein-protein interaction inhibitors. These features increased their drug potency with the considerable increase in the number of peptide drugs available in the market for various health commodities. The present review extensively analyses the peptidomics field, focusing on different bioactive peptides and therapeutics, such as anticancer peptide drugs. Further, the review provides comprehensive information on in silico tools available for peptide research. The importance of personalised peptide medicines in disease therapy is discussed along with the case study. Further, the major limitations of peptide drugs and the different strategies to overcome those limitations are reviewed.
-
-
-
An Exploratory Review on Recent Computational Approaches Devised for MiRNA Disease Association Prediction
Authors: S. Sujamol, E.R. Vimina and U. KrishnakumarRecent evidence demonstrated the fundamental role of miRNAs as disease biomarkers and their role in disease progression and pathology. Identifying disease related miRNAs using computational approaches has become one of the trending topics in health informatics. Many biological databases and online tools were developed for uncovering novel disease-related miRNAs. Hence, a brief overview regarding the disease biomarkers, miRNAs as disease biomarkers and their role in complex disorders is given here. Various methods for calculating miRNA and disease similarities are included and the existing machine learning and network based computational approaches for detecting disease associated miRNAs are reviewed along with the benchmark dataset used. Finally, the performance matrices, validation measures and online tools used for miRNA Disease Association (MDA) predictions are also outlined.
-
-
-
GB5mCPred: Cross-species 5mc Site Predictor Based on Bootstrap-based Stochastic Gradient Boosting Method for Poaceae
Authors: Dipro Sinha, Tanwy Dasmandal, Md Yeasin, Dwijesh Chandra Mishra, Anil Rai and Sunil ArchakBackgroundOne of the most prevalent epigenetic alterations in all three kingdoms of life is 5mC, which plays a part in a wide range of biological functions. Although in-vitro techniques are more effective in detecting epigenetic alterations, they are time and cost-intensive. Artificial intelligence-based in silico approaches have been used to overcome these obstacles.
AimThis study aimed to develop a ML-based predictor for the detection of 5mC sites in Poaceae.
ObjectiveThe objective of this study was the evaluation of machine learning and deep learning models for the prediction of 5mC sites in rice.
MethodsIn this study, the vectorization of DNA sequences has been performed using three distinct feature sets- Oligo Nucleotide Frequencies (k = 2), Mono-nucleotide Binary Encoding, and Chemical Properties of Nucleotides. Two deep learning models, long short-term memory (LSTM) and Bidirectional LSTM (Bi-LSTM), as well as nine machine learning models, including random forest, gradient boosting, naïve bayes, regression tree, k-Nearest neighbour, support vector machine, adaboost, multiple logistic regression, and artificial neural network, were investigated. Also, bootstrap resampling was used to build more efficient models along with a hybrid feature selection module for dimensional reduction and removal of irrelevant features of the vector space.
ResultsRandom Forest gains the maximum accuracy, specificity and MCC, i.e., 92.6%, 86.41% and 0.84. Gradient Boosting obtained the maximum sensitivity, i.e., 96.85%. The Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) technique showed that the best three models were Random Forest, Gradient Boosting, and Support Vector Machine in terms of accurate prediction of 5mC sites in rice. We developed an R-package, ‘GB5mCPred,’ and it is available in CRAN (https://cran.r-project.org/web/packages/GB5mcPred/index.html). Also, a user-friendly prediction server was made based on this algorithm (http://cabgrid.res.in:5474/).
ConclusionWith nearly equal TOPSIS scores, Random Forest, Gradient Boosting, and Support Vector Machine ended up being the best three models. The major rationale may be found in their architectural design since they are gradual learning models that can capture the 5mC sites more correctly than other learning models.
-
-
-
Hybrid Feature Extraction for Breast Cancer Classification Using the Ensemble Residual VGG16 Deep Learning Model
IntroductionBreast Cancer (BC) is a significant cause of high mortality amongst women globally and probably will remain a disease posing challenges about its detectability. Advancements in medical imaging technology have improved the accuracy and efficiency of breast cancer classification. However, tumor features' complexity and imaging data variability still pose challenges.
MethodsThis study proposes the Ensemble Residual-VGG-16 model as a novel combination of the Deep Residual Network (DRN) and VGG-16 architecture. This model is purposely engineered with maximal precision for the task of breast cancer diagnosis based on mammography images. We assessed its performance by accuracy, recall, precision, and the F1-Score. All these metrics indicated the high performance of this Residual-VGG-16 model. The diagnostic residual-VGG16 performed exceptionally well with an accuracy of 99.6%, precision of 99.4%, recall of 99.7%, F1 score of 98.6%, and Mean Intersection over Union (MIoU) of 99.8% with MIAS datasets.
ResultsSimilarly, the INBreast dataset achieved an accuracy of 93.8%, a precision of 94.2%, a recall of 94.5%, and an F1-score of 93.4%.
ConclusionThe proposed model is a significant advancement in breast cancer diagnosis, with high accuracy and potential as an automated grading.
-
-
-
Investigating Full-Length circRNA Transcripts to Reveal circRNA-Mediated Regulation of Competing Endogenous RNAs in Gastric Cancer
Authors: Jingjing Liu, Quan Yuan, Runqiu Cai, Jian Zhao, Juan Chen, Meng Zhang, Yulan Wang, Minhui Zhuang, Tianyi Xu, Xiaofeng Song and Jing WuBackgroundCircular RNAs (circRNAs) play important regulatory roles in the progression of gastric cancer (GC), but the exact mechanisms governing their regulation remain incompletely understood. Prior studies typically used back-spliced junctions (BSJs) to represent a range of circRNA isoforms, overlooking the prevalence of alternative splicing (AS) events within circRNAs, which could lead to unreliable or even incorrect conclusions in subsequent analyses, hindering our comprehension of the specific functions of circRNAs in GC.
ObjectiveThis study aimed to explore the potential functional roles of the dysregulated circRNA transcripts in GC and provide new biomarkers and effective novel therapeutic strategies for GC treatment.
MethodsRNA-seq data with rRNA depletion and RNase R treatment was employed to characterize the expression profiles of circRNAs in GC, and RNA-seq data only with rRNA depletion was employed to identify differentially expressed mRNAs in GC. Based on the full-sequence information and accurate isoform-level quantification of circRNA transcripts calculated by the CircAST tool, we performed a series of bioinformatic analyses. A circRNA-miRNA-hub gene regulatory network was constructed to reveal the circRNA-mediated regulation of competing endogenous RNAs in GC, and then the protein-protein interaction (PPI) network was built to identify hub genes.
ResultsA total of 18,398 circular transcripts were successfully reconstructed in the samples. Herein, 351 upregulated and 177 downregulated circRNA transcripts were identified. Functional enrichment analysis revealed that their parental genes were strongly associated with GC. After several screening steps, 19 dysregulated circRNA transcripts, 40 related miRNAs, and 65 target genes (mRNAs) were selected to construct the ceRNA network. Through PPI analysis, five hub genes (COL5A2, PDGFRB, SPARC, COL1A2, and COL4A1) were excavated. All these hub genes may play vital roles in gastric cancer cell proliferation and invasion.
ConclusionOur study revealed a comprehensive profile of full-length circRNA transcripts in GC, which could provide potential prognostic biomarkers and targets for GC treatment. The results would be helpful for further studies on the biological roles of circRNAs in GC and offer new mechanistic insights into the pathogenesis of GC.
-
-
-
CLPr_in_ML: Cleft Lip and Palate Reconstructed Features with Machine Learning
Authors: Baitong Chen, Ning Li and Wenzheng BaoBackgroundCleft lip and palate are two of the most common craniofacial congenital malformations in humans. It influences tens of millions of patients worldwide. The hazards of this disease are multifaceted, extending beyond the obvious facial malformation to encompass physiological functions, oral health, psychological well-being, and social aspects.
ObjectiveThe primary objective of our study is to demonstrate the importance of imaging in detecting cleft lip and palate. By observing the morphological and structural abnormalities involving the lip and palate through imaging methods, this study aims to establish imaging as the primary diagnostic approach for this disease.
MethodsIn this work, we proposed a novel model to analyze unilateral complete cleft lip and palate after velopharyngeal closure and non-left lip and palate patients from the Department of Stomatology of Xuzhou First People's Hospital, Conical Beam CT (CBCT) images in silicon. In order to demonstrate the generalization, the simulated dataset was constructed using the random disturbance factor, which is from the actual dataset. We extracted several raw features from CBCT images in detail. Then, we proposed a novel feature reconstruction method, including six types of reconstructed factors, to reconstruct the existing features. Then, the reconstructed features weretrained with machine learning algorithms. Finally, the testing and independent data model was utilized to analyze the performance of this work.
ResultsBy comparing different operator features, the min operator, max operator, average operator, and all operators can achieve good performances in both the testing set and the independent set.
ConclusionWith the different operator features, the majority of classification models, including Gradient Boosting, Hist Gradient Boosting, Multilayer Perceptron, lightGBM, and broadened learning, classification algorithms can get the well-performances in the selected reconstructed feature operators.
-
Volumes & issues
-
Volume 20 (2025)
-
Volume 19 (2024)
-
Volume 18 (2023)
-
Volume 17 (2022)
-
Volume 16 (2021)
-
Volume 15 (2020)
-
Volume 14 (2019)
-
Volume 13 (2018)
-
Volume 12 (2017)
-
Volume 11 (2016)
-
Volume 10 (2015)
-
Volume 9 (2014)
-
Volume 8 (2013)
-
Volume 7 (2012)
-
Volume 6 (2011)
-
Volume 5 (2010)
-
Volume 4 (2009)
-
Volume 3 (2008)
-
Volume 2 (2007)
-
Volume 1 (2006)
Most Read This Month
