pandas partial join on multiindex

Question

So, this is my problem:

dfa = pd.DataFrame({"a": [["a", "b", "c"][int(k/10)] for k in range(30)],
                    "b": ["a" + repr([10, 20, 30, 40, 50, 60][int(k/5)]) for k in range(30)],
                    "c": np.arange(30),
                    "d": np.random.normal(size=30)}).set_index(["a","b","c"])
dfb = pd.DataFrame({"a": [["a", "b", "c"][int(k/2)] for k in range(6)],
                    "b": ["a" + repr([10, 20, 30, 40, 50, 60][k]) for k in range(6)],
                    "m": np.random.normal(size=6)**2}).set_index(["a","b"])

Essentially I have two dataframes with multi-indices and I want to divide dfa.d by dfb.m, joining on ("a", "b"). I can't naively do dfa.d / dfb.m or join because it says that merging with more than one level overlap on a multi-index is not implemented.

The most straightforward (lol) way of doing this that I found is:

dfc = dfa.reset_index().set_index(["a", "b"]).join(dfb)
dfc["r"] = dfc.d / dfc.m
dfd = dfc.reset_index().set_index(["a", "b", "c"])[["r"]]

Any shortcuts?

pandas partial join on multiindex

Answers (1)

Related Questions