How to fix this non working regex pattern matching?

Question

I have a pattern I am trying to match using re.compile. However, I cannot get the script to yield the desired result. Below is an example of some HTML code I am hoping to scrape, from the below HTML I hope to produce two list items.

Also below is my attempt at selecting the two list items:

import re

def getData():  

    trans_array = "" ##HTML data here
    pattern2 = re.compile('(.*)')

    print re.findall(pattern2, trans_array)

getData()

My feeling is that the code I used should work, but it has not. Any advice or comments would be appreciated.

Oleg Eterevsky · Accepted Answer

By default . in regular expression does not match new line characters. Add flags=re.S parameter to re.compile, and your regexp will work.

How to fix this non working regex pattern matching?

Answers (2)

Related Questions