Pandas Characters between two spaces

Question

I have dataframe like one below

df = pd.DataFrame({'vals': [1, 2, 3, 4, 5], 'ids': [u'a iball is', u'aaa vcat ll', u'c cnut bb', u'fdfdf qbell l', 'bxyz zbat c']})

I am trying to replace the the first string of characters between the first and second space position with x in ids column

I want my data frame to look some thing like this

df = pd.DataFrame({'vals': [1, 2, 3, 4, 5], 'ids': [u'a xball is', u'aaa xcat ll', u'c xnut bb', u'fdfdf xbell l', 'bxyz xbat c']})

Umar.H · Accepted Answer

use str.replace with capturing groups.

\1 will apply to the first word after a space at the start of a string.

^ asserts a pattern at the start of a line.

\w matches any word [A-Za-z0-9_]

+ is a greedy match to match the previous token as many times as possible.

df['ids'].str.replace('(^\w+\s)(\w{1})', r'\1x')

0       a xball is
1      aaa xcat ll
2        c xnut bb
3    fdfdf xbell l
4      bxyz xbat c
Name: ids, dtype: object

Pandas Characters between two spaces

Answers (2)

Related Questions