Feature agglomeration: How to retrieve the features that make up the clusters?

Question

I am using the scikit-learn's feature agglomeration to use a hierarchical clustering procedure on features rather than on the observations.

This is my code:

from sklearn import cluster
import pandas as pd

#load the data
df = pd.read_csv('C:/Documents/data.csv')
agglo = cluster.FeatureAgglomeration(n_clusters=5)
agglo.fit(df)
df_reduced = agglo.transform(df)

My original df had the shape (990, 15), after using feature agglomeration, df_reduced now has (990, 5).

How do now find out how the original 15 features have been clustered together? In other words, what original features from df make up each of the 5 new features in df_reduced?

σηγ · Accepted Answer

After fitting the clusterer, agglo.labels_ contains a list that tells in which cluster in the reduced dataset each feature in the original dataset belongs.

Feature agglomeration: How to retrieve the features that make up the clusters?

Answers (2)

Related Questions