How to get multiple regex matches in Python?

Question

I have this text:

???



I am trying to scrape what is inside data-compare="HERE" (there are multiple matches).

I know how to do this in C# using a MatchCollection, but in Python I am confused by re.search and re.match. I have also noticed that the regex that works in C# does not work in Python.

Could somebody explain how to get this done?

vivekagr · Accepted Answer

re.findall returns all the matches in the string as a list.

>>> import re
>>> s = '...'  # the text from the question
>>> result = re.findall(r'data-compare="([\d/]+)"', s)
>>> print(result)
['/80174649/2550/', '/8131239/2550/']
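
Since the text from the question is not reproduced here, a self-contained version may be easier to try out. This is only a sketch: the sample string `s` below is hypothetical, made up so that it contains the two data-compare values from the output above.

```python
import re

# Hypothetical sample text: the HTML from the question is not available,
# so these tags are invented to contain the two values shown in the output.
s = (
    '<a class="item" data-compare="/80174649/2550/">first</a>\n'
    '<a class="item" data-compare="/8131239/2550/">second</a>'
)

# re.findall scans the whole string and returns every non-overlapping match.
# Because the pattern has exactly one capture group, each list element is
# the text of that group rather than the full match.
result = re.findall(r'data-compare="([\d/]+)"', s)
print(result)  # ['/80174649/2550/', '/8131239/2550/']
```

The r prefix makes the pattern a raw string, so \d reaches the regex engine unchanged.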

Explanation

The desired output, such as '/80174649/2550/', contains only digits and forward slashes, so the pattern targets only those characters.

In ([\d/]+), [\d/] matches a single character that is either a digit (signified by \d) or a forward slash /.

The + symbol means that the preceding pattern [\d/] must occur one or more times, since each value contains several digits and slashes.

The enclosing parentheses form a capture group, so only the part matched by [\d/]+ is captured and returned, not the surrounding data-compare="...".
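
To see what the parentheses change, the same pattern can be run with and without the capture group. The sketch below uses a hypothetical sample string, since the original text is not shown. It also uses re.finditer, which yields one match object per match and is probably the closest thing to iterating over a C# MatchCollection, and re.search, which stops at the first match.

```python
import re

# Hypothetical sample text, invented for illustration only.
s = ('<a data-compare="/80174649/2550/">x</a> '
     '<a data-compare="/8131239/2550/">y</a>')

# Without a capture group, findall returns the entire matched text.
whole = re.findall(r'data-compare="[\d/]+"', s)

# With a capture group, findall returns only the captured part.
captured = re.findall(r'data-compare="([\d/]+)"', s)

# finditer yields match objects; group(1) is the captured part.
values = [m.group(1) for m in re.finditer(r'data-compare="([\d/]+)"', s)]

# search returns only the first match (or None if nothing matches).
first = re.search(r'data-compare="([\d/]+)"', s).group(1)

print(whole)
print(captured)
print(values)
print(first)
```

Here captured and values hold the same two strings; finditer is mainly useful when the match positions or several groups are needed as well.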
