Volume 25, Issue 3

Current Genomics - Volume 25, Issue 3, 2024

- Genomics in Diabetic Kidney Disease: A 2024 Update
  
  Authors: Stefanos Roumeliotis, Maria Divani, Eleni Stamellou and Vassilios Liakopoulos
  
  https://doi.org/10.2174/0113892029300247240325080421
  
  Diabetic Kidney Disease (DKD) remains the leading cause of Chronic Kidney Disease and End-Stage Kidney Disease (ESKD) worldwide, with an increasing epidemiological burden. Nevertheless, disease awareness remains low, early diagnosis is difficult, and therapeutic management is ineffective. These shortcomings might be attributed to the fact that DKD is a highly heterogeneous disease, with disparities and variability in clinical presentation and progression patterns. Beyond the study of environmental risk factors, genetic studies have emerged as a novel and promising tool in the field of DKD. Three decades ago, family studies first reported that inherited genetic factors might confer significant risk for DKD development and progression. During the past decade, genome-wide association studies (GWASs) screening the whole genome in large, multi-ethnic, population-based cohorts identified genetic risk variants associated with traits defining DKD in both type 1 and type 2 diabetes. Herein, we aim to summarize the existing data on progress in the genomics of DKD, describe how the GWAS revolution has expanded our understanding of pathophysiologic disease mechanisms, and, finally, suggest potential future directions.
  

- Ramifications of m6A Modification on ncRNAs in Cancer
  
  By Rashid Mehmood
  
  https://doi.org/10.2174/0113892029296712240405053201
  
  N6-methyladenosine (m6A) is an RNA modification wherein the N6-position of adenosine is methylated. It is one of the most prevalent internal modifications of RNA and regulates various aspects of RNA metabolism. m6A is deposited by m6A methyltransferases, removed by m6A demethylases, and recognized by reader proteins, which modulate splicing, export, translation, and stability of the modified mRNA. Recent evidence suggests that various classes of noncoding RNAs (ncRNAs), including microRNAs (miRNAs), circular RNAs (circRNAs), and long noncoding RNAs (lncRNAs), are also targeted by this modification. Depending on the ncRNA species, m6A may affect the processing, stability, or localization of these molecules. The m6A-modified ncRNAs are implicated in a number of diseases, including cancer. In this review, the author summarizes the role of m6A modification in the regulation and functions of ncRNAs in tumor development. Moreover, the potential applications in cancer prognosis and therapeutics are discussed.
  

- Prediction of Deleterious Single Amino Acid Polymorphisms with a Consensus Holdout Sampler
  
  Authors: Óscar Álvarez-Machancoses, Eshel Faraggi, Enrique J. deAndrés-Galiana, Juan L. Fernández-Martínez and Andrzej Kloczkowski
  
  https://doi.org/10.2174/0113892029236347240308054538
  
  Background: Single Amino Acid Polymorphisms (SAPs) or nonsynonymous Single Nucleotide Variants (nsSNVs) are the most common genetic variations. They result from missense mutations, where a single base pair substitution changes the genetic code in such a way that the triplet of bases (codon) at a given position codes for a different amino acid. Since genetic mutations sometimes cause genetic diseases, it is important to understand and predict which variations are harmful and which are neutral (not causing changes in the phenotype). This can be posed as a classification problem. Methods: Computational methods using machine intelligence are gradually replacing repetitive and exceedingly expensive mutagenesis experiments. By and large, the uneven quality, deficiencies, and irregularities of nsSNV datasets undermine the usefulness of artificial intelligence-based methods. Consequently, more robust and accurate approaches are needed to address these problems. In the present paper, we present a consensus classifier built on the holdout sampler, which appears robust and accurate and outperforms all other popular methods. Results: We produced 100 holdouts to test the structures and diverse classification variables of diverse classifiers during the training phase. The best-performing holdouts were chosen to develop a consensus classifier, which was tested using a k-fold (1 ≤ k ≤ 5) cross-validation method. We also examined which protein properties have the biggest impact on the accurate prediction of the effects of nsSNVs. Conclusion: Our Consensus Holdout Sampler outperforms other popular algorithms and gives excellent results, with high accuracy and low standard deviation. The advantage of our method emerges from using a tree of holdouts, where diverse ML/AI-based programs are sampled in diverse ways.
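
The general idea of training on many random holdouts and combining the best of them into a consensus can be illustrated with a minimal sketch. This is not the authors' implementation: the one-feature threshold classifier, the synthetic "deleteriousness score" data, the 75/25 split, the choice to keep the top 10 holdouts, and every function name below are assumptions made purely for illustration.

```python
import random


def fit_threshold(train):
    """Fit a one-feature classifier: the midpoint between the two class means."""
    pos = [x for x, y in train if y == 1]
    neg = [x for x, y in train if y == 0]
    return (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2.0


def accuracy(threshold, data):
    """Fraction of samples whose thresholded score matches the label."""
    return sum((x > threshold) == bool(y) for x, y in data) / len(data)


def holdout_models(data, n_holdouts=100, train_frac=0.75, keep=10, seed=0):
    """Train one model per random holdout and keep the best-validating ones."""
    rng = random.Random(seed)
    scored = []
    for _ in range(n_holdouts):
        shuffled = data[:]
        rng.shuffle(shuffled)
        cut = int(train_frac * len(shuffled))
        train, valid = shuffled[:cut], shuffled[cut:]
        # Skip degenerate splits whose training part lacks one of the classes.
        if len({y for _, y in train}) < 2:
            continue
        threshold = fit_threshold(train)
        scored.append((accuracy(threshold, valid), threshold))
    scored.sort(reverse=True)
    return [threshold for _, threshold in scored[:keep]]


def consensus_predict(thresholds, x):
    """Majority vote over the retained models (1 = deleterious, 0 = neutral)."""
    votes = sum(x > t for t in thresholds)
    return 1 if 2 * votes > len(thresholds) else 0


# Synthetic, hypothetical data: a single score per variant, two classes.
rng = random.Random(1)
data = [(rng.gauss(0.0, 1.0), 0) for _ in range(100)]
data += [(rng.gauss(3.0, 1.0), 1) for _ in range(100)]

models = holdout_models(data)
print(consensus_predict(models, 2.8), consensus_predict(models, 0.1))
```

In practice the weak learner would be replaced by real classifiers trained on protein-level features, and the consensus would be evaluated with cross-validation as the abstract describes.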
  

- DHFS-ECM: Design of a Dual Heuristic Feature Selection-based Ensemble Classification Model for the Identification of Bamboo Species from Genomic Sequences
  
  Authors: Aditi R. Durge and Deepti D. Shrimankar
  
  https://doi.org/10.2174/0113892029268176240125055419
  
  Background: Analyzing genomic sequences plays a crucial role in understanding biological diversity and classifying Bamboo species. Existing methods for genomic sequence analysis suffer from limitations such as complexity, low accuracy, and the need for constant reconfiguration in response to evolving genomic datasets. Aim: This study addresses these limitations by introducing a novel Dual Heuristic Feature Selection-based Ensemble Classification Model (DHFS-ECM) for the precise identification of Bamboo species from genomic sequences. Methods: The proposed DHFS-ECM method employs a Genetic Algorithm to perform dual heuristic feature selection. This process maximizes inter-class variance, leading to the selection of informative N-gram feature sets. Subsequently, intra-class variance levels are used to create optimal training and validation sets, ensuring comprehensive coverage of class-specific features. The selected features are then processed through an ensemble classification layer, combining multiple stratification models for species-specific categorization. Results: Comparative analysis with state-of-the-art methods demonstrates that DHFS-ECM achieves remarkable improvements in accuracy (9.5%), precision (5.9%), recall (8.5%), and AUC performance (4.5%). Importantly, the model maintains its performance even with an increased number of species classes due to the continuous learning facilitated by the Dual Heuristic Genetic Algorithm Model. Conclusion: DHFS-ECM offers several key advantages, including efficient feature extraction, reduced model complexity, enhanced interpretability, and increased robustness and accuracy through the ensemble classification layer. These attributes make DHFS-ECM a promising tool for real-time clinical applications and a valuable contribution to the field of genomic sequence analysis.
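
To make the notion of selecting N-gram features by inter-class variance concrete, here is a small sketch. It deliberately swaps the Genetic Algorithm named in the abstract for a plain ranking by variance, so it shows only the scoring criterion, not the DHFS-ECM search procedure. The toy sequences, the species labels, and all function names are hypothetical.

```python
from collections import Counter


def ngram_freqs(seq, n=3):
    """Relative frequency of every overlapping n-gram in a sequence."""
    counts = Counter(seq[i:i + n] for i in range(len(seq) - n + 1))
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()}


def inter_class_variance(samples, n=3):
    """Score each n-gram by the variance of its per-class mean frequency."""
    by_class = {}
    for seq, label in samples:
        by_class.setdefault(label, []).append(ngram_freqs(seq, n))
    grams = {g for profiles in by_class.values() for p in profiles for g in p}
    scores = {}
    for g in grams:
        class_means = [
            sum(p.get(g, 0.0) for p in profiles) / len(profiles)
            for profiles in by_class.values()
        ]
        mu = sum(class_means) / len(class_means)
        scores[g] = sum((m - mu) ** 2 for m in class_means) / len(class_means)
    return scores


def top_features(samples, n=3, k=5):
    """Return the k n-grams with the highest inter-class variance."""
    scores = inter_class_variance(samples, n)
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(scores, key=lambda g: (-scores[g], g))[:k]


# Hypothetical toy sequences standing in for genomic reads of two species.
samples = [
    ("ACGACGACGACG", "species_a"),
    ("ACGACGACGTTT", "species_a"),
    ("TTTTTTGGGTTT", "species_b"),
    ("TTTGGGTTTTTT", "species_b"),
]

print(top_features(samples, n=3, k=2))
```

A search heuristic such as a Genetic Algorithm would explore subsets of these features rather than ranking them one by one, which matters when features are correlated.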
  

- Testing the Significance of Ranked Gene Sets in Genome-wide Transcriptome Profiling Data Using Weighted Rank Correlation Statistics
  
  Authors: Min Yao, Hao He, Binyu Wang, Xinmiao Huang, Sunli Zheng, Jianwu Wang, Xuejun Gao and Tinghua Huang
  
  https://doi.org/10.2174/0113892029280470240306044159
  
  Background: Popular gene set enrichment analysis approaches assume that the genes in a gene set contribute equally to the statistic. However, the genes in transcription factor (TF)-derived gene sets, or in gene sets constructed from TF targets identified by ChIP-Seq experiments, have a rank attribute, as each of these genes has been assigned a p-value that indicates how likely the gene is to truly belong to the gene set. Objectives: Ignoring the rank information during enrichment analysis leads to improper statistical inference. We address this issue by developing a new method to test the significance of ranked gene sets in genome-wide transcriptome profiling data. Methods: A method was proposed that first creates ranked gene sets and gene lists and then applies weighted Kendall's tau rank correlation statistics to the test. After introducing top-down weights to the genes in the gene set, a new software tool called "Flaver" was developed. Results: Theoretical properties of the proposed method were established, and its differences from the GSEA approach were demonstrated by analyzing transcriptome profiling data across 55 human tissues and 176 human cell lines. The results indicated that the TFs identified by our method have a higher tendency to be differentially expressed across the tissues analyzed than those identified by competing methods. It significantly outperforms the well-known gene set enrichment analysis tools GOStats (9%) and GSEA (17%) in analyzing well-documented human RNA transcriptome datasets. Conclusions: The method excels at detecting gene sets in which the gene ranks are correlated with the expression levels of the genes in the transcriptome data.
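
A weighted Kendall's tau with top-down weights can be sketched in a few lines. The pairwise-product weighting and the reciprocal-rank weights used here are assumptions chosen for illustration; the exact statistic implemented in Flaver may differ, and the function names are hypothetical.

```python
from itertools import combinations


def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def weighted_kendall_tau(x, y, weights):
    """Kendall's tau in which each pair (i, j) counts with weight w_i * w_j.

    Tied pairs contribute zero to the numerator but still count in the
    denominator, so heavy ties pull the statistic toward zero.
    """
    num = 0.0
    den = 0.0
    for i, j in combinations(range(len(x)), 2):
        w = weights[i] * weights[j]
        num += w * sign(x[i] - x[j]) * sign(y[i] - y[j])
        den += w
    return num / den


def top_down_weights(ranks):
    """Assumed weighting: reciprocal of the gene-set rank (rank 1 is the top)."""
    return [1.0 / r for r in ranks]


# Hypothetical example: five genes ranked within a gene set (1 = strongest).
set_rank = [1, 2, 3, 4, 5]
weights = top_down_weights(set_rank)

swap_at_bottom = [1, 2, 3, 5, 4]  # expression ranks disagree on genes 4 and 5
swap_at_top = [2, 1, 3, 4, 5]     # expression ranks disagree on genes 1 and 2

tau_bottom = weighted_kendall_tau(set_rank, swap_at_bottom, weights)
tau_top = weighted_kendall_tau(set_rank, swap_at_top, weights)
print(tau_bottom, tau_top)
```

With uniform weights both permutations give the same tau, whereas the top-down weights penalize a disagreement among the highest-ranked genes more heavily than one near the bottom of the list.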
  

- Detection and Quantification of 5moU RNA Modification from Direct RNA Sequencing Data
  
  Authors: Jiayi Li, Feiyang Sun, Kunyang He, Lin Zhang, Jia Meng, Daiyun Huang and Yuxin Zhang
  
  https://doi.org/10.2174/0113892029288843240402042529
  
  Background: Chemically modified therapeutic mRNAs have gained momentum recently. In addition to commonly used modifications (e.g., pseudouridine), 5-methoxyuridine (5moU) is considered a promising substitution for uridine in therapeutic mRNAs. Accurate identification of 5moU would be crucial for the study and quality control of relevant in vitro-transcribed (IVT) mRNAs. However, current methods are deficient in providing quantitative means of detecting this modification. Utilizing the capabilities of Oxford Nanopore direct RNA sequencing, in this study, we present NanoML-5moU, a machine-learning framework designed specifically for the read-level detection and quantification of 5moU modification in IVT data. Materials and Methods: Nanopore direct RNA sequencing data from both 5moU-modified and unmodified control samples were collected. Subsequently, a comprehensive analysis and modeling of signal event characteristics (mean and median current intensities, standard deviations, and dwell times) were performed. Furthermore, classical machine learning algorithms, notably the Support Vector Machine (SVM), Random Forest (RF), and XGBoost, were employed to discern 5moU modifications within NNUNN (where N represents A, C, U, or G) 5-mers. Results: Notably, the signal event attributes pertaining to each constituent base of the NNUNN 5-mers, in conjunction with the XGBoost algorithm, exhibited remarkable performance (with a maximum AUROC of 0.9567 in the "AGTTC" reference 5-mer dataset and a minimum AUROC of 0.8113 in the "TGTGC" reference 5-mer dataset). This performance markedly exceeded that of the prevailing background error comparison model (ELIGOs AUC 0.751 for site-level prediction). The model's performance was further validated on a series of curated datasets with customized modification ratios designed to emulate broader data patterns, demonstrating its general applicability to the quality control of IVT mRNA vaccines. The NanoML-5moU framework is publicly available on GitHub (https://github.com/JiayiLi21/NanoML-5moU). Conclusion: NanoML-5moU enables accurate read-level profiling of 5moU modification with nanopore direct RNA sequencing and is a powerful tool specialized in unveiling signal patterns in in vitro-transcribed (IVT) mRNAs.
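
For readers unfamiliar with the reported metric, the following sketch shows how an AUROC can be computed from read-level scores and how a modification ratio might be estimated by thresholding them. It is not taken from the NanoML-5moU code base: the scores, labels, the 0.5 threshold, and the function names are all hypothetical.

```python
def auroc(scores, labels):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form).

    It is the probability that a randomly chosen modified read (label 1)
    scores higher than a randomly chosen unmodified read (label 0), with
    ties counted as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def modification_ratio(scores, threshold=0.5):
    """Fraction of reads called modified at an assumed score threshold."""
    return sum(s >= threshold for s in scores) / len(scores)


# Hypothetical per-read classifier scores for a single 5-mer site.
scores = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]

print(auroc(scores, labels), modification_ratio(scores))
```

The pairwise form is quadratic in the number of reads, which is fine for illustration; a rank-based computation would be preferable for large datasets.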
  

- Genomic and Metagenomic Insights into the Distribution of Nicotine-degrading Enzymes in Human Microbiota
  
  Authors: Ying Guan, Zhouhai Zhu, Qiyuan Peng, Meng Li, Xuan Li, Jia-Wei Yang, Yan-Hong Lu, Meng Wang and Bin-Bin Xie
  
  https://doi.org/10.2174/0113892029302230240319042208
  
  Introduction: Nicotine degradation is a new strategy to block nicotine-induced pathology. The potential of human microbiota to degrade nicotine has not been explored. Aims: This study aimed to uncover the genomic potential of human microbiota to degrade nicotine. Method: To address this issue, we performed a systematic annotation of Nicotine-Degrading Enzymes (NDEs) from genomes and metagenomes of human microbiota. A total of 26,295 genomes and 1,596 metagenomes of human microbiota were downloaded from public databases, and five types of NDEs were annotated with a custom pipeline. We found 959 NdhB, 785 NdhL, 987 NicX, three NicA1, and three NicA2 homologs. Results: Genomic classification revealed that six phylum-level taxa, including Proteobacteria, Firmicutes, Firmicutes_A, Bacteroidota, Actinobacteriota, and Chloroflexota, can produce NDEs, with Proteobacteria encoding all five types of NDEs studied. Analysis of NicX prevalence revealed differences among body sites. NicX homologs were found at high prevalence in gut and oral samples but were not found in lung samples. NicX was found in samples from both smokers and non-smokers, although the prevalence may differ. Conclusion: This study represents the first systematic investigation of NDEs from the human microbiota, providing new insights into the physiology and ecological functions of human microbiota and shedding new light on the development of nicotine-degrading probiotics for the treatment of smoking-related diseases.
  
