HaShRECA: Hadoop Based Short Read Error Correction Algorithm for Genome Assembly

Muhammad Tahir; Muhammad Sardaraz; Ataul Aziz Ikram; Hassan Bajwa

doi:10.2174/157489361004150922151409

ISSN: 1574-8936
E-ISSN: 2212-392X

Source: Current Bioinformatics, Volume 10, Issue 4, Sep 2015, p. 469 - 475
DOI: https://doi.org/10.2174/157489361004150922151409
Available online: 01 Sep 2015

Abstract

Next-generation high-throughput sequencing technologies have opened up new and challenging research opportunities. In particular, next-generation sequencers produce a massive amount of short-read data in a single run. However, this short-read data is far more susceptible to errors than data from shotgun sequencing. There is therefore a pressing need for faster and more accurate statistical and computational tools to analyze it. We present HaShRECA, a new short-read error correction algorithm that is based on probabilistic analysis of potential read errors and runs on the Hadoop MapReduce framework. Experimental results show that HaShRECA is more accurate, and more time- and space-efficient, than previous algorithms.
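
The abstract does not describe how HaShRECA works internally, so the following is only a generic illustration of how short-read error detection can be phrased as map and reduce steps. It is not the authors' algorithm. It counts k-mers across reads and flags rare ones as possible sequencing errors, which is a common k-mer spectrum heuristic. The function names, the toy reads, the value of `k`, and the `min_count` threshold are all assumptions made for this sketch, and it runs in plain Python rather than on Hadoop.

```python
from collections import defaultdict


def map_kmers(read, k):
    """Map step: emit (k-mer, 1) for every k-length window of one read."""
    for i in range(len(read) - k + 1):
        yield read[i:i + k], 1


def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each distinct k-mer."""
    totals = defaultdict(int)
    for kmer, count in pairs:
        totals[kmer] += count
    return dict(totals)


def weak_kmers(counts, min_count):
    """Flag k-mers seen fewer than min_count times as possible errors."""
    return {kmer for kmer, count in counts.items() if count < min_count}


# Hypothetical toy input: the third read differs from the others in its last base.
reads = ["ACGTACGT", "ACGTACGT", "ACGTACGA"]

pairs = [pair for read in reads for pair in map_kmers(read, 4)]
counts = reduce_counts(pairs)
suspects = weak_kmers(counts, min_count=2)
```

In a real MapReduce job the framework would group the emitted pairs by key between the two steps. Here a single in-memory reducer stands in for that shuffle.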

Article Type: Research Article

Keyword(s): Algorithm; genome; MapReduce; next-generation sequencing; short-read errors
