LiadHa

Reputation: 11

Pandas mistake while sorting values

I'm trying to sort my dataframe by the 'date' and 'hour' columns, but it sorts 01/11/2020 before dates like 24/10/2020.

import pandas as pd

df = pd.read_csv("some_folder")
df = df.sort_values(by=['date', 'hour']).reset_index(drop=True)

Code Output

In the picture you can see the sorting error.

Upvotes: 0

Views: 113

Answers (1)

Andrej Kesely

Reputation: 195528

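read_csv does not parse dates by default, so the date column holds strings and is sorted character by character; that is why "01/11/2020" comes before "24/10/2020". Convert the date column to datetime with pd.to_datetime before sorting: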

import pandas as pd

df = pd.read_csv("some_folder")
# Parse the day-first strings (DD/MM/YYYY) into real datetime values
df['date'] = pd.to_datetime(df['date'], dayfirst=True)
df = df.sort_values(by=['date', 'hour']).reset_index(drop=True)
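To see the difference without the original CSV, here is a minimal sketch on made-up data. The three rows and the variable names string_sorted and date_sorted are assumptions for illustration only, and an explicit format="%d/%m/%Y" is used in place of dayfirst=True, assuming every date follows that pattern.

```python
import pandas as pd

# Hypothetical sample rows; the real CSV from the question is not shown.
df = pd.DataFrame({
    "date": ["01/11/2020", "24/10/2020", "25/10/2020"],
    "hour": [3, 1, 2],
})

# While 'date' holds strings, sorting compares them character by character,
# so "01/11/2020" lands ahead of the October dates.
string_sorted = df.sort_values(by=["date", "hour"]).reset_index(drop=True)

# After parsing with an explicit day-first format, sorting is chronological.
df["date"] = pd.to_datetime(df["date"], format="%d/%m/%Y")
date_sorted = df.sort_values(by=["date", "hour"]).reset_index(drop=True)

print(string_sorted)
print(date_sorted)
```

The explicit format is stricter than dayfirst=True: it should raise an error on a value that does not match instead of guessing, which may be preferable when the file is known to be consistent.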

Upvotes: 4
