F# binding or composing together parsers acting on separate source streams

Question

How do I compose parser functions so that they run on different source streams, while the later ones depend on the earlier ones' results? Take the following two, for example:

let outerP = many (pchar 'a') |>> (fun l -> l.Length)
let innerP x = pstring "something" |>> (fun str -> (str,x))

With a single source, binding works nicely:

let combinedP = outerP >>= innerP
run combinedP "aasomething"

But as part of a more complex project, I need to parse several separate files together, with the later parsers using the earlier ones' output. E.g., I have:

let outerSource = "aaaaa"
let innerSource = "something"

The obvious solution is to just concatenate the files, but that is not very scalable, especially because there is in fact a list of inner source files, etc.

Background: I am new to functional programming and not sure whether this is taking function composition too far, but it seems like it should be a good solution here; I just can't figure out how to apply it in this case. Below is the working but non-functional-style solution, which leads to multi-level nested code in the real project.

What works with the separate source files:

let combinedScript =
    let outerR = run outerP outerSource
    match outerR with
    | Success (outerParam,_,_) ->
        let innerR = run (innerP outerParam) innerSource
        innerR

In the real code, this is a 4-level-deep pyramid of doom, and looking at it, it is basically what bind does, just with an extra change (the different source).

rmunn · Accepted Answer

Your last sentence contains a clue to a good functional way to do this: "... looking at it, it is basically what bind does, just with an extra change (the different source)"

Let's turn your 4-level pyramid of doom into a nice-looking expression by implementing our own bind-like function. I'm going to take your combinedScript expression and turn outerP and outerSource (and innerP and innerSource) into function parameters, and you'll hopefully be pleased with the results.

let combinedScript (outerP, outerSource) (innerP, innerSource) =
    let outerR = run outerP outerSource
    match outerR with
    | Success (outerParam,_,_) ->
        let innerR = run (innerP outerParam) innerSource
        innerR
    | Failure (msg, err, state) ->
        Failure (msg, err, state)

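// And we'll define an operator for the same pattern. To make it chainable,
// its left operand is the result of the previous step, not a (parser, source) pair.
let (>==>) outerR (innerP, innerSource) =
    match outerR with
    | Success (outerParam,_,_) -> run (innerP outerParam) innerSource
    | Failure (msg, err, state) -> Failure (msg, err, state)

// Now you can parse your four files like this
// (parserB, parserC and parserD are functions of the previous step's result)
let parseResult =
    run parserA fileA
    >==> (parserB, fileB)
    >==> (parserC, fileC)
    >==> (parserD, fileD)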

What's really great about functional programming is that I wrote the above without having to even think about it, because turning pyramids of doom into flat lists is a well-known recipe. As you said, that's basically what "bind" does. And all I did above is follow the standard recipe for writing a "bind" function. If you don't yet know the standard recipe for "bind" functions, https://fsharpforfunandprofit.com/series/map-and-bind-and-apply-oh-my.html is the best explanation I've found. If you're anything like me, you'll have to read it about four or five times before something finally goes "click" in your brain, but once you have that "Ah-HA!" moment, you'll gain a far deeper understanding of the power of functional programming, and how it lets you do really advanced things really simply.
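To make the "standard recipe" a bit more concrete, here is a minimal sketch that needs no parser library at all and uses F#'s built-in Result type instead. The step functions (parseInt, requirePositive, halve) are hypothetical and exist only for illustration; FSharp.Core already ships an equivalent Result.bind, and the hand-written bind below just spells out what it does.

```fsharp
// A hand-written bind for the built-in Result type.
let bind (f: 'a -> Result<'b, 'e>) (r: Result<'a, 'e>) : Result<'b, 'e> =
    match r with
    | Ok value -> f value   // success: feed the value into the next step
    | Error e -> Error e    // failure: skip the remaining steps

// Hypothetical steps, each of which may fail.
let parseInt (s: string) : Result<int, string> =
    match System.Int32.TryParse s with
    | true, n -> Ok n
    | false, _ -> Error (sprintf "not an integer: %s" s)

let requirePositive (n: int) : Result<int, string> =
    if n > 0 then Ok n else Error "must be positive"

let halve (n: int) : Result<int, string> =
    if n % 2 = 0 then Ok (n / 2) else Error "must be even"

// Nested version: one more level of indentation per step (the "pyramid").
let nested (s: string) : Result<int, string> =
    match parseInt s with
    | Ok n ->
        match requirePositive n with
        | Ok p -> halve p
        | Error e -> Error e
    | Error e -> Error e

// Flat version: the same logic, with bind doing the matching.
let flat (s: string) : Result<int, string> =
    parseInt s |> bind requirePositive |> bind halve

printfn "%A" (flat "42")   // Ok 21
printfn "%A" (flat "7")    // Error "must be even"
```

Both versions return the same results for the same inputs; the flat one simply moves the repeated match into one reusable function, which is the same move as the bind-like function for parsers above.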

P.S. If that article series is too advanced for where you are right now in your understanding of FP, then try https://fsharpforfunandprofit.com/posts/recipe-part2/ and https://fsharpforfunandprofit.com/rop/. One of those might be a better introduction to these concepts, depending on how much you already understand.
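For anyone who wants to try the idea on the two sources from the question, here is a minimal script sketch. It assumes F# Interactive (dotnet fsi) with FParsec fetched from NuGet through the #r directive, and it uses a hypothetical helper named thenParse in place of an operator; the type annotations are only there to keep the script free of value-restriction errors.

```fsharp
#r "nuget: FParsec"   // assumption: run as a script with dotnet fsi
open FParsec

// The two parsers from the question, with explicit types.
let outerP : Parser<int, unit> =
    many (pchar 'a') |>> List.length

let innerP (x: int) : Parser<string * int, unit> =
    pstring "something" |>> (fun str -> (str, x))

// Hypothetical helper: take the previous step's result and, on success,
// build the next parser from it and run that parser on a different source.
let thenParse (nextP: 'a -> Parser<'b, unit>, nextSource: string)
              (prev: ParserResult<'a, unit>) : ParserResult<'b, unit> =
    match prev with
    | Success (value, _, _) -> run (nextP value) nextSource
    | Failure (msg, err, state) -> Failure (msg, err, state)

let outerSource = "aaaaa"
let innerSource = "something"

let result =
    run outerP outerSource
    |> thenParse (innerP, innerSource)

match result with
| Success ((str, count), _, _) ->
    printfn "parsed %s after %d leading 'a' characters" str count
| Failure (msg, _, _) ->
    printfn "parse failed: %s" msg
```

Further sources can be chained by piping into thenParse again, with each step receiving only the result of the step immediately before it.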
