Skip to content
2000
Volume 9, Issue 2
  • ISSN: 2352-0965
  • E-ISSN: 2352-0973

Abstract

The multimedia data under Spark cloud computing platform are composed of many different types of entities, and all kinds of entity attributes are not exactly same. The traditional multimedia data clustering process based on spectral clustering algorithm assumes that multimedia data are composed of independent entities of the same type, and entities are unrelated with the obtained clustering results giving significant error. A multimedia data clustering method based on semi supervised K-means is proposed, and the K-means clustering algorithm is introduced, on the basis of it, the optimal initial clustering center is determined by the iterative method of graph theory. According to ideology similar to tree clustering algorithm, clustering center is expanded based on the maximum distance tree clustering algorithm the similarity is computed between two data objects and between object and cluster, and the clustering of multimedia data is realized under Spark cloud platform. The simulation results show that the proposed method has better speedup ratio, expansion rate and clustering accuracy, and can be applied to the multimedia data clustering under Spark cloud computing platform.

Loading

Article metrics loading...

/content/journals/raeeng/10.2174/2352096509999160819164126
2016-08-01
2025-09-30
Loading full text...

Full text loading...

/content/journals/raeeng/10.2174/2352096509999160819164126
Loading

  • Article Type:
    Research Article
Keyword(s): Clustering; data; multimedia; spark cloud computing platform
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test