Python Pandas: How to Insert one Missing Row?

Question

In the following dataframe, every group of c should have three values of b, and the second value of a in each group should be the average of the first and third values of a.

What is the easiest way to insert the "missing" row with a=48, b=42, c=4 between index = 0 and index = 1?

import pandas as pd

df_x = pd.DataFrame({"a": [47, 49, 55, 54, 53, 24, 27, 30], "b": [41, 43, 51, 52, 53, 41, 42, 43], "c": [4, 4, 5, 5, 5, 4, 4, 4]})
df_x
Out[14]: 
    a   b  c
0  47  41  4
1  49  43  4
2  55  51  5
3  54  52  5
4  53  53  5
5  24  41  4
6  27  42  4
7  30  43  4

If I use groupby('c').transform(my_func) or groupby('c').apply(my_func), I run into the problem that my_func is called twice for the first group.

Tai · Accepted Answer

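pandas's DataFrame.insert method only works for columns. To insert a row, we can use numpy.insert instead. The downside is that it builds a new array, and therefore a new DataFrame, rather than modifying df_x in place. It can serve as an alternative to pd.concat, DataFrame.append (removed in pandas 2.0), or pd.merge.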

import numpy as np
import pandas as pd

df_x = pd.DataFrame({"a": [47, 49, 55, 54, 53, 24, 27, 30], "b": [41, 43, 51, 52, 53, 41, 42, 43], "c": [4, 4, 5, 5, 5, 4, 4, 4]})

pd.DataFrame(np.insert(df_x.values, 1, values=[48, 42, 4], axis=0))


    0   1   2
0   47  41  4
1   48  42  4
2   49  43  4
3   55  51  5
4   54  52  5
5   53  53  5
6   24  41  4
7   27  42  4
8   30  43  4
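
Note that the result above has lost the column names (they became 0, 1, 2), because df_x.values is a plain NumPy array. A minimal sketch that passes the original column labels back in, assuming all columns share one numeric dtype as they do here (the name df_new is only for illustration):

```python
import numpy as np
import pandas as pd

df_x = pd.DataFrame({
    "a": [47, 49, 55, 54, 53, 24, 27, 30],
    "b": [41, 43, 51, 52, 53, 41, 42, 43],
    "c": [4, 4, 5, 5, 5, 4, 4, 4],
})

# Insert the new row before position 1 of the underlying array,
# then rebuild the frame with the original column labels.
new_values = np.insert(df_x.values, 1, values=[48, 42, 4], axis=0)
df_new = pd.DataFrame(new_values, columns=df_x.columns)

print(df_new.head(3))
```

With mixed column types (for example strings next to integers), df_x.values would fall back to an object array, so this approach is best kept to frames with a single dtype.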

In np.insert(df_x.values, 1, values=[48, 42, 4], axis=0), the second argument, 1, is the index before which the new values are inserted, and axis=0 means a row is inserted rather than a column.
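
For comparison, the same insertion can be sketched with pd.concat, which keeps the column names and per-column dtypes without going through NumPy. The helper insert_row below is a hypothetical name introduced for illustration, not a pandas function:

```python
import pandas as pd

def insert_row(df, pos, row):
    """Return a new frame with `row` (a dict) placed before position `pos`.

    Hypothetical helper for illustration; it does not modify `df`.
    """
    new_row = pd.DataFrame([row], columns=df.columns)
    # Split the frame at `pos`, put the new row in between, and
    # renumber the index so it runs 0..n again.
    parts = [df.iloc[:pos], new_row, df.iloc[pos:]]
    return pd.concat(parts, ignore_index=True)

df_x = pd.DataFrame({
    "a": [47, 49, 55, 54, 53, 24, 27, 30],
    "b": [41, 43, 51, 52, 53, 41, 42, 43],
    "c": [4, 4, 5, 5, 5, 4, 4, 4],
})

df_new = insert_row(df_x, 1, {"a": 48, "b": 42, "c": 4})
print(df_new.head(3))
```

Like the np.insert version, this builds a new DataFrame; df_x itself is left unchanged.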
