iven a column find the highest correlated variable with the specified column

Question

As the title indicates, I have a dataframe named df.

Given a variable ( a specified column of df) I want to find the column with the highest correlation value with that variable.

Here's what I tried soo far :

def highest_correlated(df, column):
   sol = -1
   for col in df.columns:
       while col != column:
             corr = df[column].corr(df[col])
             if corr>sol:
                sol = corr
  return sol

The problem with this is that it takes too much time, and at the end I don't get any results, anyone can help me find a solution?

iven a column find the highest correlated variable with the specified column

Answers (1)

Related Questions