How to remove ambiguity from this syntax (antlr4)

Question

I am writing a tool to generation sequence diagram from some text. I need to support this two syntax:

anInstance:AClass.DoSomething() and
participant A -> participant B: Any character except for (<>{}?)etc..

Let's call the fist one strict syntax and the second one free syntax. In anInstance:AClass.DoSomething(), I need it to be matched by to ( ID ':' ID ) as in the strict syntax. However, :AClass.DoSomething() will be first matched by CONTENT. I am thinking some kind of lookahead, checking if -> is there but not able to figure it out.

`Strict` syntax

message
 : to '.' signature
 ;
signature
 : methodName '()'
 ;
to
 : ID ':' ID
 ;
methodName
 : ID
 ;

ID
 : [a-zA-Z_] [a-zA-Z_0-9]*
 ;

`Free` syntax

asyncMessage
 : source '->' target content
 ;
source
 : ID+
 ;
target
 : ID+
 ;
content
 : CONTENT
 ;

ID
 : [a-zA-Z_] [a-zA-Z_0-9]*
 ;
CONTENT
 : ':' ~[
]+
 ;
SPACE
 : [ 	
] -> channel(HIDDEN)
 ;

Jiri Tousek · Accepted Answer

You need to understand how ANTLR lexer works:

It uses whichever rule matches the longest part of the input (starting at current position)
In case multiple rules can match the same input (i.e. same length), the first one (in order they're defined in) is used

With your current lexer rules, CONTENT takes precedence whenever you encounter an : so ':' ID will never be matched.

With ANTLR 4, you should probably use modes in this case - when you encounter the : in the free form, switch to a "free" mode and define a lexer rule CONTENT to be only available in the "free" mode.

See this question for an idea about how ANTLR 4 lexer modes work.

How to remove ambiguity from this syntax (antlr4)

`Strict` syntax

`Free` syntax

Answers (1)

Related Questions

How to remove ambiguity from this syntax (antlr4)

Strict syntax

Free syntax

Answers (1)

Related Questions

`Strict` syntax

`Free` syntax