Replacing only some rows in the same column

Question

Given a column:

name 
Jules
Jules
Jules
Jules
Vince

I need to replace only the top-half of ocurrences of Jules for Quentin

Such as:

name 
Quentin
Quentin
Jules
Jules
Vince

How do I replace only some values in a given column?

To further ellaborate, the location of Jules will never be the same.

I've thought about iterating like this, but it did not work:

countOfJules = df['name'].value_counts()['Jules']
halfLenght = int(countoftbd/2)
listed = df['name'].to_list()
counter = 1

for eachname in listed:
    if eachname == 'Jules' and counter <= halfLenght:
        listed[:] == 'Quentin'
        counter += 1

Quang Hoang · Accepted Answer

It is rather straightforward:

# where name is Jules
is_jules = df['name'].eq('Jules')

# total `Jules` in `name`
num_jules = is_jules.sum()

# first half `Jules`
first_half = is_jules.cumsum().le(num_jules//2)

df.loc[is_jules & first_half, 'name'] = 'Quentin'

Output:

      name
0  Quentin
1  Quentin
2    Jules
3    Jules
4    Vince

Replacing only some rows in the same column

Answers (2)

Related Questions