Split Markdown by section using a Ruby regex

Question

I have this Markdown as a string:

# section 1


any type of valid markdown text. /notations here
 
Sample text for testing:
abcdefghijklmnopqrstuvwxyz ABCDEFGHIJKLMNOPQRSTUVWXYZ
0123456789 _+-.,!@$%^&*();/|<>"'
12345 -98.7 3.141 .6180 9,000 +42
555.123.4567    +1-(800)-555-2468
foo@demo.net    bar.ba@test.co.uk
www.demo.com    http://foo.co.uk/
http://regexr.com/foo.html?q=bar
https://mediatemple.net
- list 1
- list 2
[www.asdf.com](some description)

## sec 1.1
 blah

# header 2


## 2.1


### 2.2

# some_section

## 3.1

I would like to split the string by top-level section, so the output should be a list of 3 strings. The first entry should be '# section 1 ## 1.1 blah '.

The regex I'm using is /[^#]# [\s\S]+?(?=#)/ . How do I match a string without ' #' at the end? Also, my regex matches the whole string instead of the output I need.

Sample at http://regexr.com/3ev83 . Thanks.

akuhn · Accepted Answer

Try this:

string.split(/(?=^# )/)

And if you want to split at a heading of any level (#, ##, ###, and so on):

string.split(/(?=^#+ )/)
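
As a minimal sketch, here are both calls applied to a shortened version of the sample from the question. The names `markdown`, `top_level` and `all_headings` are only illustrative and not part of the original answer.

```ruby
# Shortened version of the sample from the question (illustrative only).
markdown = <<~MD
  # section 1

  any text
  - list 1

  ## sec 1.1
   blah

  # header 2

  ## 2.1

  ### 2.2

  # some_section

  ## 3.1
MD

# Split only before top-level headings (a line starting with "# ").
top_level = markdown.split(/(?=^# )/)
puts top_level.length   # 3
puts top_level.first    # "# section 1" together with its "## sec 1.1" subsection

# Split before a heading of any level (one or more "#" followed by a space).
all_headings = markdown.split(/(?=^#+ )/)
puts all_headings.length
```

Because nothing is consumed by the split, joining the pieces gives back the original string. Note that a line starting with "# " inside a code block in the Markdown would also be treated as a heading by this pattern.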

How does this work?

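^ matches the beginning of a line (in Ruby this is every line, not only the start of the string)
(?=...) is a lookahead: it matches without consuming any characters, so the heading stays in each resulting piece
There is no need to match line endings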
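
To illustrate why the lookahead matters, here is a small comparison on a made-up two-section string. The names `text`, `kept` and `consumed` are hypothetical.

```ruby
text = "# a\nbody\n# b\nmore\n"

# With the lookahead the match is zero-width, so "# " stays in each piece.
kept = text.split(/(?=^# )/)
p kept      # ["# a\nbody\n", "# b\nmore\n"]

# Without it, the matched "# " is consumed as the separator,
# and the match at position 0 produces a leading empty string.
consumed = text.split(/^# /)
p consumed  # ["", "a\nbody\n", "b\nmore\n"]
```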
