how to extract numbers from a string ignoring number-letter mixtures

Question

For example, the following regex extracts all "non-numbers" from a string

re.sub(r"\b[0-9]+\b", "", "5 1 inch c5 bolts 10")
'  inch c5 bolts '

How do I do the opposite? That is, how do I extract the numbers '5 1 10'? (Note: c5 is not a number, so it should not be included in the result)

mgilson · Accepted Answer

It looks like you already know about word boundaries... You're just looking for a word boundary, a string of numbers (and only numbers) and then another word boundary. The regex for that is \b\d+\b:

>>> re.findall(r'\b\d+\b', "5 1 inch c5 bolts 10")
['5', '1', '10']

how to extract numbers from a string ignoring number-letter mixtures

Answers (2)

Related Questions