Current Bioinformatics - Volume 21, Issue 1, 2026
Volume 21, Issue 1, 2026
-
-
Decoding the Genetic Code: Scientific Exploration of the Telomere-to-Telomere (T2T) Genome
More LessAuthors: Linjuan Wang, Yujia Li, Zhepei Zhang, Fengcheng Song, Yufei Zan, Ranxi Zheng and Zhengrong YuanWith the development of third-generation sequencing technology, genome assembly has entered a new era. The combination of multiple sequencing methods can combine the strengths of each, resulting in higher-quality assembly results. Telomere-to-Telomere (T2T) genome refers to a zero gap genome that is assembled at the T2T level of one or more chromosomes through the combination of multiple sequencing technologies, such as Pacific Biosciences High-Fidelity (PacBio HiFi), Oxford Nanopore Technologies (ONT) Ultra-long, High-throughput Chromosome Conformation Capture (Hi-C), and others. High-quality reference genomes are the basis for genomics research, and the T2T genome has enabled the exploration of unknown areas of the genome, such as telomeres and centromeres, which has provided a more in-depth direction for research. This review provides a comprehensive overview of the T2T genome, reviewing the development of sequencing technologies and then outlining sequencing strategies, assembly methods, quality assessment processes, and analysis software for the T2T genome with practical examples. Representative T2T genomic species of plants, animals, and microorganisms (e.g., human, Arabidopsis, brewer's yeast, and so on) are presented separately. A summary of the potential and challenges of current T2T genomic applications is provided, along with an outlook for future developments.
-
-
-
Identification and Analysis of Plant miRNAs: Evolution of In-silico Resources and Future Challenges
More LessAuthors: Abhishek Kushwaha, Hausila Prasad Singh and Noopur SinghEndogenous small RNAs (miRNA) are the key regulators of numerous eukaryotic lineages playing an important role in a broad range of plant development. Computational analysis of miRNAs facilitates the understanding of miRNA-based regulations in plants. The discovery of small non-coding RNAs has led to a greater understanding of gene regulation, and the development of bioinformatic tools has enabled the identification of microRNAs (miRNAs) and their targets. The need for comprehensive miRNA analysis is being accomplished by the development of advanced computational tools/algorithms and databases. Each resource has its own specificity and limitations for the analysis. This review provides a comprehensive overview of various algorithms used by computational tools, software, and databases for plant miRNA analysis. However, over a period of about two decades, a lot of knowledge has been added to our understanding of the biogenesis and functioning of miRNAs in other plants. Several parameters were already integrated and others need to be incorporated in order to give more accurate and efficient results. The reassessment of computational recourses (based on old algorithms) is required on the basis of new miRNA research and development. Generally, computational methods, including ab-initio and homology search-based methods, are used for miRNA identification and target prediction. This review presents the new challenges faced by the existing computational methods and the need to develop new tools and advanced algorithms and highlight the limitations of existing computational tools and methods, and emphasizing the need for a comprehensive platform for miRNA gene exploration.
-
-
-
Screening of Candidate Chemical Regulators for the m6A Writer MTA in Arabidopsis
More LessAuthors: Beilei Lei, Chengchao Jia, Cuixia Tan, Pengjun Ding, Zenglin Li, Jing Yang, Jiyuan Liu, Xiaomin Wei, Shiheng Tao and Chuang MaBackgroundThe MTA gene encodes a core component of m6A methyltransferase complex, which plays a crucial role in the post-transcriptional modification of RNA that influences many vital processes in plants. However, due to the constraint of embryonic lethality in MTA knockout mutation, the molecular function of MTA gene has yet to be comprehensively investigated.
ObjectiveThe aim of this study is to investigate the expression and regulation of MTA in Arabidopsis.
MethodsA large-scale transcriptome and genome analysis were carried out for the expression and nsSNP (non-synonymous Single Nucleotide Polymorphism) studies. Structured-based virtual screening, molecular dynamics simulation, binding free energy calculation and m6A modification level assay were employed to mine and validate MTA regulators from COCONUT natural product database.
ResultsTissue-specific expression and stress-responsive expression patterns of MTA were observed in Arabidopsis. nsSNPs from the 1,001 Arabidopsis project were not detected in the binding site of the methyl-donor substrate S-adenosylmethionine (SAM) in MTA. 10 small molecules were identified as potential regulators, among which CNP0251613 (adenosine diphosphate glucose, ADPG) was selected and validated to decrease m6A levels at 10 µM vs. the control in Arabidopsis.
ConclusionOur results provide a new insight and chemical entity into the in-depth study of RNA m6A writer MTA in plants.
-
-
-
DSPE: An End-to-End Drug Synergy Combination Prediction Algorithm for Echinococcosis
More LessAuthors: Haitao Li, Liyuan Jiang, Yuanyuan Chu, Yuansheng Liu, Chunhou Zheng and Yansen SuBackgroundEchinococcosis, a parasitic disease caused by the larvae of the Echinococcus parasite, poses a serious threat to human health. Medication is an indispensable means of treatment for Echinococcosis; however, due to the less satisfactory efficacy of single drugs, identifying effective drug combinations for the treatment of Echinococcosis is essential. Yet, current predictive models for drug synergy in Echinococcosis face accuracy challenges due to data scarcity, method limitations, and insufficient feature representation.
ObjectiveThis work aims to design an end-to-end method to predict drug synergistic combinations, which enables efficient and accurate identification of drug combinations against Echinococcosis.
MethodsIn this work, an end-to-end method, named DSPE, is proposed for predicting anti-Echinococcosis drug synergistic combinations. In DSPE, a dataset of Echinococcosis drug synergistic combinations is constructed by retrieving and extracting information from related scientific articles. Further, DSPE employs a residual graph attention network to deeply analyze drug characteristics and their interactions, thereby enhancing the performance of deep learning models. It also explores the protein-protein interaction network related to Echinococcosis, using node2vec combined with an attention mechanism to efficiently encode disease features. Finally, it predicts the synergy of drug combinations based on the Bliss score by integrating drug combinations and disease features.
ResultsExperimental evidence shows that DSPE outperforms five state-of-the-art algorithms in predicting drug combination effects by leveraging disease-target information and single-agents for the treatment.
ConclusionDSPE improves prediction accuracy and addresses the issue of data scarcity for new diseases, offering new insights and methods for the development of treatment plans for parasitic diseases in the future.
-
-
-
CNVviso: Visualisation Solution for Copy Number Variation Data from Ultra-Low Coverage Whole-Genome Sequencing in Preimplantation Genetic Testing
More LessAuthors: An Nguyen Huu, Nhung Trinh Thi Hong, Huong Bui Bich, Tien Vuong Quang, Dung Tien Pham and Hoi Le ThiBackgroundCNVviso is a web-based NGS data visualization tool designed to be user-friendly for medical researchers who desire visualization-based analysis of large amounts of data with no-code technology.
ObjectiveUnlike small tools that require extensive programming skills, CNVviso is designed to minimize hard coding and repetitive operations for ultra-low coverage whole-genome sequencing data from preimplantation genetic testing.
MethodsCNVviso was completely programmed and developed based on the R programming language and Shiny framework. CNVviso source code is available for download under the MIT Licence from https://github.com/Anegin24/CNVviso.
ResultsThe execution architecture of CNVviso allows users to input CNV data, which is then automatically visualized as graphs.
ConclusionThe charts display output of CNVviso are lively and interactive, enabling users to exploit the information effectively. The tool also provides tabular results containing aggregated information.
-
-
-
A Deep Learning Method for Identifying G-Protein Coupled Receptors Based on a Feature Pyramid Network and Attention Mechanism
More LessAuthors: Zhe Lv, Siqin Hu, Xin Wei and Wang Ren QiuBackgroundG-protein coupled receptors (GPCRs) represent a large family of membrane proteins, distinguished by their seven-transmembrane helical structures. These receptors play a pivotal role in numerous physiological processes. Nowadays, many researchers have proposed computational methods to identify GPCRs. In the past, we introduced a powerful method, EMCBOW-GPCR, which was designed for this purpose. However, the feature extraction technique employed is susceptible to out-of-vocabulary errors, indicating the potential for enhanced accuracy in GPCR identification.
MethodsTo solve the challenges, we propose a novel approach termed GPCR-AFPN. This method leverages the FastText algorithm to effectively extract features from protein sequences. Additionally, it employs a powerful deep neural network as the predictive model to improve prediction accuracy.
ResultsTo validate the efficacy of the proposed GPCR-AFPN method, we conducted five-fold cross-validation and independent tests, respectively. The experimental results indicate that GPCR-AFPN outperforms existing methods.
ConclusionOverall, our proposed method, GPCR-AFPN, can improve the accuracy of GPCR identification. For the convenience of researchers interested in applying our latest advancements, a user-friendly webserver for GPCR-AFPN is available at www.lzzzlab.top/gpcrafpn/, and the corresponding code can be downloaded at https://github.com/454170054/GPCR-AFPN.
-
-
-
Exploring Coding Sequence Length Distributions Across Taxonomic Kingdoms Based on Maximum Information Principle
More LessBackgroundGenetic information about organisms' traits is stored and encoded in deoxyribonucleic acid (DNA) sequences. The fundamental inquiry into the storage mechanisms of this genetic information within genomes has long been of interest to geneticists and biophysicists.
ObjectiveThe objective of this study was to investigate the distribution of coding sequence (CDS) lengths in species genomes across different kingdoms.
MethodsIn this study, we used the maximum entropy principle and the gamma distribution model based on a comprehensive dataset including viruses, archaea, bacteria, and eukaryote species.
ResultsOur study result revealed unique patterns in CDS length distributions among kingdoms and CDS lengths exhibit a right-skewed distribution, with varying preferences among kingdoms. Eukaryotes displayed bimodal distributions, with CDS sequences longer than those of prokaryotes. Fitting the gamma distribution model revealed differences in shape and scale parameters among kingdoms, with eukaryotes exhibiting larger scale parameters, indicating longer CDS sequences. Additionally, analysis of moments highlighted the complexity of eukaryotic genomes relative to prokaryotes.
ConclusionThis study result deepens our understanding of genome evolution and provides valuable insights for biological research.
-
Volumes & issues
-
Volume 21 (2026)
-
Volume 20 (2025)
-
Volume 19 (2024)
-
Volume 18 (2023)
-
Volume 17 (2022)
-
Volume 16 (2021)
-
Volume 15 (2020)
-
Volume 14 (2019)
-
Volume 13 (2018)
-
Volume 12 (2017)
-
Volume 11 (2016)
-
Volume 10 (2015)
-
Volume 9 (2014)
-
Volume 8 (2013)
-
Volume 7 (2012)
-
Volume 6 (2011)
-
Volume 5 (2010)
-
Volume 4 (2009)
-
Volume 3 (2008)
-
Volume 2 (2007)
-
Volume 1 (2006)
Most Read This Month