Solving ambiguous input: mismatched input

Question

I have this grammar:

grammar MkSh;

script
  : (statement
    | targetRule
    )*
  ;

statement
  :  assignment
  ;

assignment
  :  ID '=' STRING
  ;

targetRule
  : TARGET ':' TARGET*
  ;

ID
  :  ('a'..'z'|'A'..'Z'|'_') ('a'..'z'|'A'..'Z'|'0'..'9'|'_')*
  ;

WS
  : ( ' '
    | '\t'
    | '\r'
    | '\n'
    ) -> channel(HIDDEN)
  ;

STRING
  : '"' CHR* '"'
  ;

fragment
CHR
  : ('a'..'z'|'A'..'Z'|' ')
  ;

TARGET
  :  ('a'..'z'|'A'..'Z'|'0'..'9'|'_'|'-'|'/'|'.')+
  ;

and this input file:

hello="world"

target: CLASSES

When running my parser I'm getting this error:

line 3:6 mismatched input ':' expecting '='
line 3:15 mismatched input ';' expecting '='

This happens because the parser is taking "target" as an ID instead of a TARGET. I want the parser to choose the rule based on the separator character (':' vs '=').

How can I get that to happen?

(This is my first Antlr project so I'm open to anything.)

cantSleepNow · Accepted Answer

First, you need to know that the word target is matched as an ID token and not as a TARGET token. The lexer works independently of the parser, so it cannot look at the following ':' or '=' to decide, and when two lexer rules match the same text, the rule written first wins. Since you have written the rule ID before TARGET, target will always be recognized as ID by the lexer. Notice that the word target fully satisfies both the ID and the TARGET lexer rules, which means (I'm going to suppose that you are writing a language) that a target name can also be a valid identifier. The book "The Definitive ANTLR Reference" has a section titled "Treating Keywords As Identifiers" that deals with exactly this kind of issue. I suggest you take a look at it. Or, if you prefer the quick answer, the solution is to use lexer modes. It would also be better to split the grammar into a separate parser grammar and lexer grammar.
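
As a rough illustration, here is a minimal sketch of a parser-level workaround in the spirit of the "keywords as identifiers" idea. Note that this is not the lexer-modes approach, and the rule name target is a hypothetical name introduced only for this example. It assumes ANTLR 4 and that the whitespace alternatives in WS were meant to be '\t', '\r' and '\n'. The lexer rules are otherwise left as in the question; only targetRule changes and target is added.

```antlr
grammar MkSh;

script
  : (statement
    | targetRule
    )*
  ;

statement
  :  assignment
  ;

assignment
  :  ID '=' STRING
  ;

// Changed: refer to the parser rule 'target' instead of the TARGET token.
targetRule
  : target ':' target*
  ;

// New (hypothetical rule name): a target name may arrive as either token type.
// Plain words such as "target" or "CLASSES" lex as ID because ID is listed first;
// names containing '-', '/' or '.', or starting with a digit, still lex as TARGET.
target
  : ID
  | TARGET
  ;

ID
  :  ('a'..'z'|'A'..'Z'|'_') ('a'..'z'|'A'..'Z'|'0'..'9'|'_')*
  ;

WS
  : ( ' '
    | '\t'
    | '\r'
    | '\n'
    ) -> channel(HIDDEN)
  ;

STRING
  : '"' CHR* '"'
  ;

fragment
CHR
  : ('a'..'z'|'A'..'Z'|' ')
  ;

TARGET
  :  ('a'..'z'|'A'..'Z'|'0'..'9'|'_'|'-'|'/'|'.')+
  ;
```

With this change the lexer still produces ID for target, but the parser no longer cares: it should pick assignment or targetRule from the token that follows the name ('=' or ':'), which is the behaviour asked for in the question. One thing to keep in mind is that newlines are hidden here, so the list of dependencies after ':' simply runs until the next name that is followed by '=' or ':'.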
