extract strings between two strings in python using regular expression

Question

"<>THIS is the place to stay at when visiting the historical area of Seattle.

Your right on the water front near the ferry's and great sea food hotel.

The breakfast was great. <>"

Above is my sample text. I want to print the strings fall in between <> & <>. I want my output to be free of new line character , like this:

THIS is the place to stay at when visiting the historical area of Seattle. Your right on the water front near the ferry's and great sea food hotel.The breakfast was great.

I have tried the following piece of code:

import re
pattern = re.compile(r'\<>(.+?)\<>',re.DOTALL|re.MULTILINE)
text = """<>THIS is the place to stay at when visiting the historical area of Seattle.

Your right on the water front near the ferry's and great sea food hotel.

The breakfast was great.
<>"""
results = pattern.findall(text)
print results

But I am getting results like this :

["THIS is the place to stay at when visiting the historical area of Seattle.

Your right on the water front near the ferry's and great sea food hotel.

The breakfast was great.
"]

But I don't want any new line characters in my resulting string.

Wiktor Stribiżew · Accepted Answer

Use .replace(" ", "") on each found match (use comprehension) to replace any newline with an empty string.

See the demo:

results = [x.replace("
", "") for x in pattern.findall(text)]
# => ["THIS is the place to stay at when visiting the historical area of Seattle.Your right on the water front near the ferry's and great sea food hotel.The breakfast was great."]

extract strings between two strings in python using regular expression

Answers (2)

Related Questions