Matching but ignoring nested parentheses with JISON

Question

I'm working on a grammar for a templating system. I've hit a snag and can't figure out how to solve it. I've simplified the test case down to the part that shows the problem.

Example Strings:

(foo) - works
(foo()) - fails: Expecting 'parenEnd', got 'parenInterior'
foo (foo) bar
foo (foo(function() { console.log('stuff'); })) bar
foo (foo.bar.baz("stuff")) bar

The rule is that within a parenthetical, anything goes: any characters at all. I don't need to validate the contents, and I don't need to ensure they match a proper format. On the other hand, as I understand it, I do need to keep track of opening and closing ( and ), otherwise the lexer can't know where one parenthetical statement ends and another begins, as in (foo()) (bar). To keep track of that I'm using a paren start condition with a counter that is incremented whenever a ( is hit inside a parenthetical and decremented whenever a ) is hit.
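To make the counting scheme concrete, here is a minimal plain-JavaScript sketch of the token stream I expect the lexer to produce. It is a hand-written stand-in, not Jison's API: the tokenize and tokenTypes helpers are hypothetical names, and text outside a parenthetical is simply skipped here for brevity.

```javascript
// Hypothetical stand-in for the lexer rules described above; not Jison's API.
function tokenize(input) {
  const tokens = [];
  let parenCount = 0; // 0 means we are outside any parenthetical
  let i = 0;
  while (i < input.length) {
    const ch = input[i];
    if (ch === "(") {
      // The outermost "(" opens the parenthetical; nested ones are interior text.
      tokens.push({ type: parenCount === 0 ? "parenStart" : "parenInterior", text: ch });
      parenCount++;
      i++;
    } else if (ch === ")" && parenCount > 0) {
      // Only the ")" that brings the counter back to 0 closes the parenthetical.
      parenCount--;
      tokens.push({ type: parenCount === 0 ? "parenEnd" : "parenInterior", text: ch });
      i++;
    } else if (parenCount > 0) {
      // Consume a run of non-paren characters, like the [^\)\(]+ rule.
      let j = i;
      while (j < input.length && input[j] !== "(" && input[j] !== ")") j++;
      tokens.push({ type: "parenInterior", text: input.slice(i, j) });
      i = j;
    } else {
      i++; // simplification: anything outside a parenthetical is skipped
    }
  }
  return tokens;
}

// Convenience helper: just the token names, in order.
function tokenTypes(input) {
  return tokenize(input).map((t) => t.type);
}
```

With this sketch, tokenTypes("(foo)") gives parenStart, parenInterior, parenEnd, while tokenTypes("(foo())") gives parenStart followed by three parenInterior tokens and then parenEnd. If the real lexer behaves the same way, a parser rule that accepts exactly one parenInterior would see a second parenInterior where it expects parenEnd.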

The problem is that it's just not working. The main culprit is that the lexer never appears to hit my ")" rule, yet it hits the "(" rule just fine. The two rules look syntactically the same, so why does one work and the other not?

Grammar

%lex

%x paren

%%

\s+                   /* skip whitespace */
"("         { this.begin("paren"); parenCount = 1; return "parenStart"; };
<paren>"("            { console.log("parenStart", parenCount); parenCount++; return "parenInterior"; };
<paren>")"            { console.log("parenEnd", parenCount); parenCount--; if (parenCount === 0) { this.popState(); return "parenEnd"; } else { return "parenInterior"; } };
<paren>[^\)\(]+       { console.log(this); return "parenInterior"; };
<<EOF>>               return 'EOF';
.                     return 'INVALID';

/lex

%start expressions

%% /* language grammar */

expressions
    : parenStart parenInterior parenEnd { return $1 + $2 + $3; }
    ;

%%

parenCount = 0;
