What are some options for clustering on categorical data? What if the dataset contains a combination of numeric and categorical features?
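K-Modes is a modification of K-Means for datasets in which every feature is categorical. Instead of Euclidean distance, it measures the dissimilarity between two points as the number of features on which they mismatch, and it represents each cluster by its per-feature mode rather than its mean.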
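To make the match/mismatch idea concrete, here is a minimal pure-Python sketch of the K-Modes loop. The function names (`matching_dissimilarity`, `cluster_mode`, `k_modes`), the toy records, and the choice to pass in the starting modes by hand are all assumptions made for illustration; a real implementation would typically use a smarter initialization and more careful tie-breaking.

```python
from collections import Counter


def matching_dissimilarity(a, b):
    """Count the features on which two categorical records differ."""
    return sum(x != y for x, y in zip(a, b))


def cluster_mode(records):
    """Return the most frequent category of each feature across the records."""
    return tuple(Counter(column).most_common(1)[0][0] for column in zip(*records))


def k_modes(records, initial_modes, max_iter=10):
    """Illustrative K-Modes loop: assign to the nearest mode, then recompute modes."""
    modes = list(initial_modes)
    labels = []
    for _ in range(max_iter):
        # Assignment step: each record joins the cluster whose mode it mismatches least.
        labels = [
            min(range(len(modes)), key=lambda j: matching_dissimilarity(r, modes[j]))
            for r in records
        ]
        # Update step: each cluster's new mode is the per-feature majority category.
        new_modes = []
        for j, old_mode in enumerate(modes):
            members = [r for r, label in zip(records, labels) if label == j]
            new_modes.append(cluster_mode(members) if members else old_mode)
        if new_modes == modes:  # Stop once the modes no longer change.
            break
        modes = new_modes
    return labels, modes


# Hypothetical toy data: (color, size, shape).
records = [
    ("red", "small", "round"),
    ("red", "small", "square"),
    ("red", "medium", "round"),
    ("blue", "large", "square"),
    ("blue", "large", "round"),
    ("green", "large", "square"),
]
labels, modes = k_modes(records, initial_modes=[records[0], records[3]])
print(labels)
print(modes)
```

The only changes relative to K-Means are the distance function (a mismatch count instead of a squared Euclidean distance) and the centroid update (a mode instead of a mean); the assign-then-update structure is the same.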