(Python) Splitting string only on single instance of delimiter

Question

I'm trying to extract numeric values from text strings that use dashes as delimiters, but also to indicate negative values:

"1.3"          # [1.3]
"1.3-2-3.9"    # [1.3, 2, 3.9]
"1.3-2--3.9"   # [1.3, 2, -3.9]
"-1.3-2--3.9"  # [-1.3, 2, -3.9]

At the moment, I'm manually checking for the "--" sequence, but this seems really ugly and prone to breaking.

def get_values(text):
    return map(lambda s: s.replace('n', '-'), text.replace('--', '-n').split('-'))

I've tried a few different approaches, using both the str.split() function and re.findall(), but none of them have quite worked.

For example, the following pattern should match all the valid strings, but I'm not sure how to use it with findall:

r"^-?\d(\.\d*)?(--?\d(\.\d*)?)*$"

Is there a general way to do this that I'm not seeing? Thanks!

Casimir et Hippolyte · Accepted Answer

You can try to split with this pattern with a lookbehind:

(?<=[0-9])-

(An hyphen preceded by a digit)

>>> import re
>>> re.split('(?<=[0-9])-', text)

With this condition, you are sure to not be after the start of the string or after an other hyphen.

Answers (2)