Skip to content
2000
Volume 14, Issue 3
  • ISSN: 2666-2558
  • E-ISSN: 2666-2566

Abstract

Background: Owing to increased growth in satellite imagery, the development of an architecture that rapidly and efficiently identifies similar images has become crucial. Hadoop has become a de-facto platform for storing large amounts of data. Apache Spark and MapReduce have also become key frameworks for distributed processing of big data. Objective: This paper proposes a novel Distributed Content-Based Image Retrieval (DCBIR) architecture that leverages the qualities of these engines, which were not utilized in previous studies. Methods: Features of 40 satellite images with sizes greater than 500 MB were indexed, on a 15-node Hadoop cluster with two different databases viz. Neo4J, a graph database, and HBase, a columnar database. Results: Performance and Scalability of both indexing and query phases, along with precision and recall were observed for both databases. Conclusion: Experimental results show that the proposed system can efficiently perform image retrieval on large remote sensing images.

Loading

Article metrics loading...

/content/journals/rascs/10.2174/2666255813666191126095114
2021-04-01
2025-10-19
Loading full text...

Full text loading...

/content/journals/rascs/10.2174/2666255813666191126095114
Loading

  • Article Type:
    Research Article
Keyword(s): Distributed computing; hadoop; HBase; image retrieval; MapReduce; Neo4J; satellite images; spark
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test