Remove a single value from each column randomly from pandas dataframe?

Question

I have a DataFrame like this:

df

           a       b           c              d          e           f
0     Banana  Orange      Lychee  Custardapples  Jackfruit   Pineapple
1      Apple    Pear  Strawberry      Muskmelon    Apricot       Peach
2  Raspberry  Cherry        Plum           Kiwi      Mango  Blackberry

I want to remove a single value from each column randomly.

E.g.:

           a       b           c              d          e           f
0     Banana  Orange              Custardapples  Jackfruit
1               Pear  Strawberry                   Apricot       Peach
2  Raspberry                Plum           Kiwi             Blackberry

Nicolas Gervais · Accepted Answer

For each column, you can pick a random row index and set that cell to "":

for i in range(df.shape[1]):
    df.iloc[np.random.randint(df.shape[0]), i] = ""

Full code:

import pandas as pd
import numpy as np

df = pd.read_clipboard()
print(df)

           a       b           c              d          e           f
0     Banana  Orange      Lychee  Custardapples  Jackfruit   Pineapple
1      Apple    Pear  Strawberry      Muskmelon    Apricot       Peach
2  Raspberry  Cherry        Plum           Kiwi      Mango  Blackberry

Loop over all columns (the blanked cells will differ on each run):

for i in range(df.shape[1]):
    df.iloc[np.random.randint(df.shape[0]), i] = ""

           a       b       c              d          e           f
0             Orange  Lychee  Custardapples  Jackfruit   Pineapple
1      Apple                      Muskmelon    Apricot            
2  Raspberry  Cherry    Plum                            Blackberry
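
If you need the result to be reproducible, or want to keep the original frame untouched, the same loop can be wrapped in a small helper. This is only a sketch: the name `blank_one_per_column` and its `fill` and `seed` parameters are made up for illustration, and it swaps `np.random.randint` for a seeded `np.random.default_rng` generator.

```python
import numpy as np
import pandas as pd


def blank_one_per_column(df, fill="", seed=None):
    """Return a copy of df with one randomly chosen cell per column set to `fill`.

    Hypothetical helper, not part of pandas.
    """
    rng = np.random.default_rng(seed)  # pass a seed for repeatable results
    out = df.copy()                    # leave the caller's frame unchanged
    n_rows, n_cols = out.shape
    # Draw one random row position per column, independently of each other.
    rows = rng.integers(0, n_rows, size=n_cols)
    for col, row in enumerate(rows):
        out.iloc[row, col] = fill
    return out


df = pd.DataFrame({
    "a": ["Banana", "Apple", "Raspberry"],
    "b": ["Orange", "Pear", "Cherry"],
    "c": ["Lychee", "Strawberry", "Plum"],
})

result = blank_one_per_column(df, seed=0)
print(result)
```

Calling it again with the same `seed` should blank the same cells; leaving `seed=None` gives a different pick each run, like the original loop.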
