Skip to content
2000
Volume 10, Issue 4
  • ISSN: 1574-8936
  • E-ISSN: 2212-392X

Abstract

Next-generation high-throughput sequencing technologies have opened up new and challenging research opportunities. In particular, Next-generation sequencers produce a massive amount of short-reads data in a single run. However, the large amount of short-reads data produced is highly susceptible to errors, as compared to shotgun sequencing. Therefore, there is a peremptory demand to design fast and more accurate statistical and computational tools to analyze this data. We present HaShRECA, a new short-reads error correction algorithm based on probabilistic analysis of potential read errors that utilizes the Hadoop MapReduce framework. Experimental results show that HaShRECA is more accurate, as well as time and space efficient as compared to previous algorithms.

Loading

Article metrics loading...

/content/journals/cbio/10.2174/157489361004150922151409
2015-09-01
2025-09-02
Loading full text...

Full text loading...

/content/journals/cbio/10.2174/157489361004150922151409
Loading

  • Article Type:
    Research Article
Keyword(s): Algorithm; genome; mapreduce; next generation sequencing; short read errors
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test