How to preserve original term during transliteration in Elasticsearch with ICU plugin?

Question

I'm using the following ICU transform filter to perform transliteration:

"transliterate": {
    "type": "icu_transform",
    "id": "Any-Latin; NFD; [:Nonspacing Mark:] Remove; NFC"
}

The problem is that this filter replaces the original term in the index, so searching in the native language is not possible with a terms query like this:

{
  "terms" : {
    "field" : [
      "term"
    ],
    "boost" : 1.0
  }
}

Is there any way to make the icu_transform filter produce two terms, the original one and the transliterated one?

If not, I think the best solution would be a mapping with copy_to to another field, plus an analyzer for that field that omits the transliterate filter. Can you suggest something more efficient?
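
For reference, here is a minimal sketch of the copy_to fallback I have in mind. It is only an illustration, not something I have tuned. The mapping type `doc`, the field names `title` and `title_native`, and the analyzer names `translit_analyzer` and `native_analyzer` are all hypothetical placeholders, and the `standard` tokenizer and `lowercase` filter are assumptions rather than my real analysis chain. It would be sent as the body of the index-creation request.

```json
{
  "settings": {
    "analysis": {
      "filter": {
        "transliterate": {
          "type": "icu_transform",
          "id": "Any-Latin; NFD; [:Nonspacing Mark:] Remove; NFC"
        }
      },
      "analyzer": {
        "translit_analyzer": {
          "type": "custom",
          "tokenizer": "standard",
          "filter": ["lowercase", "transliterate"]
        },
        "native_analyzer": {
          "type": "custom",
          "tokenizer": "standard",
          "filter": ["lowercase"]
        }
      }
    }
  },
  "mappings": {
    "doc": {
      "properties": {
        "title": {
          "type": "text",
          "analyzer": "translit_analyzer",
          "copy_to": "title_native"
        },
        "title_native": {
          "type": "text",
          "analyzer": "native_analyzer"
        }
      }
    }
  }
}
```

With this layout, `title` would hold only transliterated terms and `title_native` only the original-script terms, so a native-language query would have to target `title_native` (or both fields). The cost is that every value is analyzed and indexed twice, which is why I am asking whether a single filter could emit both terms instead.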

I'm using Elasticsearch 5.6.4
