I just ran PCA and then K-means Clustering algorithm on my data, after running the algorithm I get 3 clusters. I am trying to figure out which clusters my input belongs to , in
To group instances by their assigned cluster id
N_CLUSTERS = 3
clusters = [x_10d[X_clustered == i] for i in range(N_CLUSTERS)]
# replace x_10d with where you want to retrieve data
# to have a look
for i, c in enumerate(clusters):
print('Cluster {} has {} members: {}...'.format(i, len(c), c[0]))
# which prints
# Cluster 0 has 37 members: [0.95690664 0.07578273 0.0094432 ]...
# Cluster 1 has 30 members: [0.03124354 0.97932615 0.47270528]...
# Cluster 2 has 33 members: [0.26331688 0.5039502 0.72568873]...
If you want to add the cluster labels back in your dataframe, and assuming x_10d is your dataframe, you can do:
x_10d["cluster"] = X_clustered
This will add a new column in your dataframe called "cluster" which should contain the cluster label for each of your rows.