Reputation: 1801
How do I get a list of all individual tokens of a text field along with their document frequency. I want this to build a domain specific list of frequent (and therefore useless) stop words.
This question covers all the methods I found so far but
Upvotes: 0
Views: 569
Reputation: 2089
You will have to enable field_data on your field to do this. But be careful it can impact a lot the heap memory used.
https://www.elastic.co/guide/en/elasticsearch/reference/current/fielddata.html
Upvotes: 1