Accessing the normalized tokens of an ElasticSearch document

Question

I'm using the standard English analyzer on text fields in my ElasticSearch docs.

I'm interested in accessing the list of normalized terms. For example, if the text is "Set the shape to semi-transparent by calling set_trans(5)", I want to access the normalized tokens set, shape, semi, transpar, call, set_tran, 5.

Is that possible?

Jettro Coenradie · Accepted Answer

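I would use the term vectors endpoint (_termvectors) for this. It returns the terms, as produced by the field's analyzer, for the fields of a particular document: https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-termvectors.html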
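As a rough sketch of how that could look from Python using only the standard library: the code below builds a term vectors request for one field and flattens the reply into a list of terms ordered by token position. The base URL, index name, document id, and field name passed in are placeholders, and the helper names (build_termvectors_request, terms_in_order, fetch_terms) are made up for illustration. It assumes the response has the usual term_vectors -> field -> terms -> tokens layout with positions enabled.

```python
import json
import urllib.request


def build_termvectors_request(base_url, index, doc_id, field):
    """Return (url, body_bytes) for a term vectors request on one field."""
    url = f"{base_url}/{index}/_termvectors/{doc_id}"
    body = {
        "fields": [field],
        "positions": True,          # needed to restore the original token order
        "offsets": False,
        "payloads": False,
        "term_statistics": False,
        "field_statistics": False,
    }
    return url, json.dumps(body).encode("utf-8")


def terms_in_order(response, field):
    """List the analyzed terms of `field`, one entry per token, by position."""
    terms = response.get("term_vectors", {}).get(field, {}).get("terms", {})
    positioned = [
        (token["position"], term)
        for term, info in terms.items()
        for token in info.get("tokens", [])
    ]
    return [term for _, term in sorted(positioned)]


def fetch_terms(base_url, index, doc_id, field):
    """Call the endpoint and return the ordered terms (requires a live cluster)."""
    url, body = build_termvectors_request(base_url, index, doc_id, field)
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as reply:
        return terms_in_order(json.load(reply), field)
```

With a cluster running, something like fetch_terms("http://localhost:9200", "docs", "1", "text") would return the terms for that document's text field. A term that occurs several times appears once per occurrence; wrap the result in set(...) if only the distinct terms matter.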
