Reputation: 387
I'm using Uima Ruta 2.5.0 Version. In this, Symbols like Γ,Δ were coming under CW .Why its Happening?
Input Γ Δ
Upvotes: 1
Views: 132
Reputation: 3113
The CW annotation like the other TokenSeed annotations is created by a JFlex lexer. The rule for CW is [:uppercase:][:lowercase:]*
whereas [:uppercase:]
is defined by the Unicode properties \p{Uppercase}
. Both of your example symbols are greek uppercase letters.
DISCLAIMER: I am a developer of UIMA Ruta
Upvotes: 1