Is it expected behavior to manually reindex a Frame after sorting? Is this a bug?

Question

I have a time series of stellar flux that is periodic in nature. For this data I created a DataFrame with a time column and flux column.

file: Path = localdir / file_path.csv
est_period: float = #number_I_estimated
df = DataFrame(file, names=['t','f'])
df['stack_t'] = df['t'] % est_period

stacked = df[['stack','f']].sort_values(by='stack')

I create a new column by applying the modulus % operation to the time 't' series with the estimated period and stack the series on top of itself by calling df.sort_values(by='stack_t').

I noticed that DataFrame.sort_values(inplace=True) seems to not reindex the data set. If you sort the data set, then find the minimum of f over mask=stacked['stack_t']>somenumber then it turns out that argmin(stacked[mask]['f']) returns the index from df, not from stacked.

To get this to work it turns out that I have to manually reindex the array:

stacked = df[['stack','f']].sort_values(by='stack').reindex(range(0,len(df)))

Is this expected behavior? .sort_values already returns a copy of df. Why is the copy not reindexed?

Is it expected behavior to manually reindex a Frame after sorting? Is this a bug?

Answers (1)

Related Questions