Remove '
' in text in pandas python

Question

The following code is current code that i use to remove in ['text'] column:

df = pd.read_csv('file1.csv')

df['text'].replace('\s+', ' ', regex=True, inplace=True) # remove extra whitespace
df['text'].replace('
',' ', regex=True) # remove 
 in text

header = ["text", "word_length", "author"]

df_out = df.to_csv('sn_file1.csv', columns = header, sep=',', encoding='utf-8')

I've tried too from the suggestions:

df['text'].replace('
', '')
df['text'] = df['text'].str.replace('
', '').str.replace('\s+', ' ').str.strip()

Output: ' What a smartass! Like he knows anything about real estate deals too...'

The code to remove whitespace is working. But not in removing the . Anyone can help me on this matter? Thanks.

I've tried to solve based on the suggestion from this link too removing newlines from messy strings in pandas dataframe cells? but it's still not working.

Solved:

df['text'].replace(r'\s+|\n', ' ', regex=True, inplace=True)

Remove '\n' in text in pandas python

Answers (1)

Related Questions

Remove &#39;\n&#39; in text in pandas python

Answers (1)

Related Questions

Remove '\n' in text in pandas python