Comparative analysis of dimension reduction techniques for k-means clustering In highdimensional data

dc.contributor.authorMeerigama, K. M. W. G. T.
dc.date.accessioned2024-08-16T07:38:55Z
dc.date.available2024-08-16T07:38:55Z
dc.date.issued2015
dc.description.abstractA computation including high dimensional data is cumbersome due to high demand for computational power. Additionally, high dimensional data is hard to visualize. Converting the high dimensional data into lower dimensions greatly reduces the computational burden and will allow the data to be visualized easily. However, the efficiency of the processed data after reducing the dimensionality which is measured by Sum of Squared Error(SSE) method in k-means clustering algorithm should still be explored. This research study focuses on the use of dimensionality reduction methods as a preprocessing step to improve the accuracy and the efficiency of clustering high- dimensional data. This is demonstrated by considering principal component analysis (PCA), independent component analysis (lCA) and combination of PCA and ICA (in this method , first apply PCA then ICA) as dimensionality reduction methods for k-means clustering algorithm. The observed results from PCA and ICA methods yield lower error rates compared to the use of original dimensions when clustering text data with k-means clustering algorithm.
dc.identifier.urihttps://ir.lib.pdn.ac.lk/handle/20.500.14444/591
dc.language.isoen_US
dc.publisherUniversity of Peradeniya
dc.subjectHigh-dimensional data
dc.subjectK-means clustering
dc.titleComparative analysis of dimension reduction techniques for k-means clustering In highdimensional data
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
.Meerigama 2015.pdf
Size:
352.82 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections