Replacing column data with Pandas .replace not working

Question

I am attempting to turn data from a .csv into a Pandas df:

df = pd.read_csv('congress1.csv', delimiter = ';', names = ['Name', 'Years', 'Position', 'Party', 'State', 'Congress'], header = 0)

I want to replace the data in column "Congress" - "1(1789-1790)" - with a single date - "1789":

df['Congress'] = df['Congress'].replace('1(1789-1790)', '1789')

However, doing so does not change any of my data. If I, say, include inplace=True

df['Congress'] = df['Congress'].replace('1(1789-1790)', '1789', inplace=True)

... my data in that column of course becomes null. Yet I can't seem to replace this string with anything meaningful.

willeM_ Van Onsem · Accepted Answer

There are two problems here:

df['Congress'] = df['Congress'].str.replace(r'1\(1789-1790\)', '1789')

So we replace the string part with 1789.

Answers (1)