numpy interpolation using pandas

Question

I am trying to do interpolation for pandas columns belonging to different dataframes with different sampling rates. I stripped the timestamp and used the count value as index. I looked at multiple ways to do interpolation on pandas and could not come with an elegant solution. Here's my hack using np.interp method. Is there a better method or alternative in pandas? Thanks in advance !

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
plt.style.use('seaborn-deep')

df1 = pd.DataFrame({'S1':np.random.random(10)})
df2 = pd.DataFrame({'S2':np.random.random(5)})

# Interpolate df2 keeping first and last values and increase length of df2  to len(df1)
df1_index = np.arange(0,1,np.float(1/np.float(len(df1))))
df2_index = np.arange(0,1,np.float(1/np.float(len(df2))))
df2_on_df1 = np.interp(df1_index, df2_index, df2['S2'])
print len(df1), len(df2), len(df2_on_df1)
plt.plot(df2_on_df1)
plt.hold
# plt.plot(df1)
plt.plot(df2)
plt.legend(loc='upper right')
plt.show()

numpy interpolation using pandas

Answers (1)

Related Questions