Mystic Odin
Mystic Odin

Reputation: 289

Removing Trailing White Spaces with python

I have a script that loops over several search/replace regex in python, one of those operations is remove trailing spaces I've tried:

re.sub(r"""\s+$""", '', str)

re.sub(r""" +$""", r"""""", str)

and

re.sub(r""" +$""", r"""""", str, re.M)

I found several answers that simply recommended using strip my problem is that I want to integrate this in the regex replace mechanism.

Upvotes: 5

Views: 12502

Answers (2)

ideasman42
ideasman42

Reputation: 48028

This removes trailing space using regex:

import os
import re
PATH = '/path/to/source'

re_strip = re.compile(r'[ \t]+(\n|\Z)')

for path, dirs, files in os.walk(PATH):
    for f in files:
        file_name, file_extension = os.path.splitext(f)
        if file_extension == '.py':
            path_name = os.path.join(path, f)
            with open(path_name, 'r') as fh:
                data = fh.read()

            data = re_strip.sub(r'\1', data)

            with open(path_name, 'w') as fh:
                fh.write(data)

Upvotes: 1

Martin Ender
Martin Ender

Reputation: 44259

The function is sub and takes the target string as an argument (and returns a modified copy):

str = re.sub(r'\s+$', '', str)

or if you want to remove trailing spaces from multiple lines in a single string, use one of these:

str = re.sub(r'\s+$', '', str, 0, re.M)
str = re.sub(r'\s+$', '', str, flags=re.M)

The 0 is the count parameter (where 0 means no limit) and then re.M makes $ match at line endings. If you don't specify flags explicitly, you need that additional parameter, because flags is actually the fifth one.

Note that you only need triple quotes for multiline strings. What's important is the r for the pattern.

Alternatively, rstrip is used to remove trailing whitespace:

str = str.rstrip()

Upvotes: 10

Related Questions