Gocc to ignore things in lexical parser

Question

Is there ways to tell gocc to ignore things in lexical parser? E.g., for

2022-01-18 11:33:21.9885 [21] These are strings that I need to egnore, until - MYKW - Start Active One: 1/18/2022 11:33:21 AM

I want to tell gocc to ignore from [21] all the way to until. Here is what I've been trying:

/* Lexical part */

_letter : 'A'-'Z' | 'a'-'z' | '_' ;
_digit : '0'-'9' ;

_timestamp1 : _digit | ' ' | ':' | '-' | '.' ;
_timestamp2 : _digit | ' ' | ':' | '/' | 'A' | 'P' | 'M' ;

_ignore :  '['  { . } ' '  '-'  ' '  'M'  'Y'  'K'  'W'  ' '  '-'  ' ' ;

_lineend : [ '
' ] '
' ;


timestamp       : _timestamp1 { _timestamp1 } _ignore ;
taskLogStart    : 'S'  't'  'a'  'r'  't'  ' ' ;
jobName     : { . } _timestamp2 { _timestamp2 } _lineend ;


/* Syntax part */
Log
 : timestamp taskLogStart jobName ;

However, the parser failed at:

error: expected timestamp; got: unknown/invalid token "2022-01-18 11:33:21.9885 [21] T"

The reason I think it should be working is that, the following ignore rule works perfectly fine for white spaces:

!lineComment  : '/' '/' { . } '
' ;
!blockComment : '/' '*' { . | '*' } '*' '/' ;

and I'm just applying the above rule into my normal text parsing.

Gocc to ignore things in lexical parser

Answers (1)

Related Questions