Filter Pandas dataframe row and replace value in column

Question

I got a big list of phone numbers in all sorts of formats:

df = pd.DataFrame(
    {'phone': ['0123/12345', '0123-23456', '0123/4455-10', '0123-4455-22'],
     'name': ['A-1', 'B-1', 'C/3', 'D/7']})

  name phone
0 A-1  0123/12345
1 B-1  0123-23456
2 C/3  0123/4455-10
3 D/7  0123-4455-22

The formats I want are in rows #0 and #2.

When I concentrate on #1, I tried the following:

df.loc[(df.phone.str.count('-')==1) &
       (df.phone.str.count('/')==0)].apply(lambda x: x.str.replace('-', '/'))

And this does the trick on the number, but unfortunately also on the name column:

  name phone
1 B/1  0123/23456

But the name column must not be changed.

So I have two questions:

How can I filter the row and only change the phone column?
How can I go with #3, where I would want to replace the first occurrence of '-' to '/'?

Rayhane Mama · Accepted Answer

Or if you're no fan of regex (like me), you can simply do this:

df['phone'] = df.phone.apply(lambda x: x.replace('-','/',1) if '/' not in x else x)
print(df)

  name         phone
0  A-1    0123/12345
1  B-1    0123/23456
2  C/3  0123/4455-10
3  D/7  0123/4455-22

Probably not the best or fastest way, still I feel more comfortable with it since I don't know regex yet.

Hope that was helpful.

Filter Pandas dataframe row and replace value in column

Answers (2)

Related Questions