split by regex and add matches to dictionary

Question

first time posting here.

I'd like to 1) parse the following text: "keyword: some keywords concept :some concepts"

and 2) store the result in a dictionary: ['keyword']=>'some keywords', ['concept']=>'some concepts'.

There may be 0 or 1 'space' before each 'colon'. The following is what I've tried so far.

import re

sample_text = "keyword: some keywords concept :some concepts"

p_res = re.compile(r"(\S+\s?):").split(sample_text)  # Task 1

d_inc = dict([(k, v) for k, v in zip(p_res[::2], p_res[1::2])])  # Task 2

However, the resulting list p_res is wrong: it has an empty entry at index 0, which in turn produces the wrong dict. Is there something wrong with my regex?

Avinash Raj · Accepted Answer

Use re.findall to capture the groups of every match as a list of tuples, then pass that list to dict to convert it to a dictionary.

>>> import re
>>> s = 'keyword: some keywords concept :some concepts'
>>> dict(re.findall(r'(\S+)\s*:\s*(.*?)\s*(?=\S+\s*:|$)', s))
{'concept': 'some concepts', 'keyword': 'some keywords'}
>>>

The regex above captures each key and its corresponding value in two separate groups.
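To make the two groups easier to see, here is a sketch of the same pattern written with re.VERBOSE so each piece carries a comment. The names PAIR_RE and parse_pairs are made up for illustration and are not part of the original answer.

```python
import re

# Same pattern as in the answer, spread out with re.VERBOSE.
PAIR_RE = re.compile(r"""
    (\S+)             # group 1: the key, a run of non-whitespace characters
    \s*:\s*           # the colon, with optional whitespace on either side
    (.*?)             # group 2: the value, matched lazily
    \s*               # trailing whitespace, kept out of the value
    (?=\S+\s*:|$)     # stop just before the next "key:" or at the end of the string
""", re.VERBOSE)


def parse_pairs(text):
    """Return a dict built from every key/value pair found in text."""
    return dict(PAIR_RE.findall(text))


print(parse_pairs("keyword: some keywords concept :some concepts"))
```

Because the pattern has exactly two capturing groups, re.findall returns a list of 2-tuples, which is the shape dict accepts.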

I assume that the input string contains only key-value pairs and that the keys won't contain any whitespace characters.
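As for why the split attempt in the question misbehaves: re.split always returns the text that precedes the first match, and here the first match starts at index 0, so that text is the empty string. With a capturing group in the pattern, the captured keys are also included in the result, which means the keys sit at the odd indices and the values at the even indices from 2 onward. A minimal sketch of a split-based variant under the same assumptions, with \s? moved outside the group so the key carries no trailing space (the names parts and d_inc are only illustrative):

```python
import re

sample_text = "keyword: some keywords concept :some concepts"

# The capturing group makes re.split keep the keys in the result.
parts = re.split(r"(\S+)\s?:", sample_text)
# parts[0] is the (empty) text before the first key, so skip it.

# Keys are at indices 1, 3, ...; values at 2, 4, ... and need stripping.
d_inc = {k: v.strip() for k, v in zip(parts[1::2], parts[2::2])}
print(d_inc)
```

The findall approach avoids both the index bookkeeping and the strip call, which is probably why it reads more cleanly here.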

