How is clustering affected by high-dimensional data, and how can the quality of clusters generated be improved in such cases?
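One problem with clustering high-dimensional data is that common distance metrics, such as Euclidean distance, become less informative: as the number of dimensions grows, the distances between pairs of points tend to concentrate around similar values, so the nearest and farthest neighbors of a point become hard to tell apart.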
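To make the effect on Euclidean distance concrete, here is a minimal sketch that measures how much the distances from one query point to a set of random points vary as the dimensionality increases. The function name `relative_contrast`, the uniform random data, and the sample sizes are assumptions chosen for illustration; real datasets may behave differently.

```python
import numpy as np


def relative_contrast(n_points, n_dims, seed=0):
    """Return (max_dist - min_dist) / min_dist for one random query point.

    Hypothetical helper: points are drawn uniformly from the unit hypercube.
    A value close to 0 means the nearest and farthest points are almost
    equally far away, so Euclidean distance separates them poorly.
    """
    rng = np.random.default_rng(seed)
    points = rng.random((n_points, n_dims))
    query = rng.random(n_dims)
    # Euclidean distance from the query to every point
    dists = np.linalg.norm(points - query, axis=1)
    return (dists.max() - dists.min()) / dists.min()


if __name__ == "__main__":
    for n_dims in (2, 10, 100, 1000):
        contrast = relative_contrast(n_points=1000, n_dims=n_dims)
        print(f"dims={n_dims:5d}  relative contrast={contrast:.3f}")
```

Under these assumptions the relative contrast should shrink as the number of dimensions grows, which is one way to see why distance-based clustering tends to struggle in high-dimensional spaces.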