Skip to content
2000

Machine Learning-based High-Dimensional Text Document Classification and Clustering

image of Machine Learning-based High-Dimensional Text Document Classification and Clustering
Preview this chapter:

Text classification is a difficult technique. Many techniques have been developed to decrease the dimension of feature vectors for use in text classification due to their enormous size. This work provides a detailed discussion of unique parameters utilising an optic clustering strategy, as well as a review of some of the most essential text categorization algorithms. In this case, the words are clustered according to their level of similarity. Each cluster's membership function is based on the mean along with the standard deviation of its data. Finally, characteristics are chosen from each grouping. Each cluster's extracted feature is the weighted sum of its words. There's also no need to guess or use trial-and-error approaches to determine the optimal number of clusters.

/content/books/9789815305395.chapter-13
dcterms_subject,pub_keyword
-contentType:Journal -contentType:Figure -contentType:Table -contentType:SupplementaryData
10
5
Chapter
content/books/9789815305395
Book
false
en
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test