How to fix my nonworking Python regex match?

Question

I want to grab the whole number out of this string: 'some 344.3404.3 numbers'.

On the Pythex regex tester this works with [\d\.]* (a digit or a dot, repeated zero or more times). In Python I get back the whole string:

Input:

import re
re.match(r'[\d\.]*', 'some 344.3404.3 numbers').string

Output:

'some 344.3404.3 numbers'

What am I missing?

Running Python 3.3.5 on Windows 7, 64-bit.

Tim Pietzcker · Accepted Answer

The string attribute of a regex match object contains the input string of the match, not the matched content.
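To make the difference concrete, here is a small sketch that uses the pattern and input from the question. The variable names text and m are only for illustration.

```python
import re

text = 'some 344.3404.3 numbers'
m = re.match(r'[\d\.]*', text)

# .string is the input that was passed in, regardless of what matched
print(repr(m.string))   # 'some 344.3404.3 numbers'

# .group() is the text that actually matched; here it is empty, because
# '*' allows zero repetitions and the string starts with 's'
print(repr(m.group()))  # ''
print(m.span())         # (0, 0)
```

So the call in the question did succeed, but only with a zero-length match at position 0.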

If you want the (first) matching part, you need to change three things:

1. use re.search(), because re.match() will only find a match at the start of the string,
2. call the group() method of the match object,
3. use + instead of *, or you'll get an empty (zero-length) match unless the match happens to be at the start of the string.

Therefore, use

>>> re.search(r'[\d.]+', 'some 344.3404.3 numbers').group()
'344.3404.3'

or

>>> re.findall(r'[\d.]+', 'some 344.3404.3 numbers more 234.432')
['344.3404.3', '234.432']

if you expect more than one match.
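As a rough illustration of why each change matters, the sketch below varies one thing at a time on the question's input. The variable names (anchored, empty, found, number) are made up for this example, and the None check is an added precaution rather than part of the answer above.

```python
import re

text = 'some 344.3404.3 numbers'

# re.match() only tries position 0, and '+' needs at least one character,
# so nothing matches at the leading 's'
anchored = re.match(r'[\d.]+', text)
print(anchored)  # None

# re.search() scans forward, but '*' is already satisfied by zero
# characters at position 0, so the first match is empty
empty = re.search(r'[\d.]*', text)
print(repr(empty.group()))  # ''

# re.search() combined with '+' skips ahead to the first real run
found = re.search(r'[\d.]+', text)

# search() returns None when nothing matches, so guard before .group()
number = found.group() if found else None
print(number)  # 344.3404.3
```

Calling .group() directly on the result, as in the one-liners above, raises AttributeError when there is no match, which is why the last step checks for None first.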
