Obtain number of matches in a single field from elasticsearch

Question

I want to obtain the number of times a term appears in each of my hits, along with the search results (e.g., I want to know that "hello" appeared 2 times in "hello hi hello").

However, my problem is even trickier because I want to use a Soundex-style phonetic filter. For example, if I search for "great" and it matches "test test grate that great", then I want to know that my term appeared 2 times, because "great" is phonetically identical to "grate".

Here is what my index looks like:

{
    "lecture" : {  
        "properties" : {  
            "transcript" : {  
                "type" : "string",
                "analyzer" : "lecture_analyzer"
             },
            "file_id" : {
                "type" : "string"
            }
        }
    }
}

The lecture_analyzer looks like this:

{
    "tokenizer":  "standard",
    "filter": [
        "dbl_metaphone"
    ]
}

dbl_metaphone is the token filter I use for phonetic matching.

Now when I issue the following query:

{
    "query" : {
        "bool" : {
            "must" : [
                {"match": { "transcript" : "grate" }},
                {"term": { "file_id" : "21648371" }}
            ]
        }
    }
}

I get the following result:

{
  ...
  "hits" : {
    "total" : 1,
    "max_score" : 3.519093,
    "hits" : [ {
      ...
      "_id" : "21648371",
      "_score" : 3.519093,
      "_source" : {
        "transcript" : "ok that's great, grate that carrot please",
        "file_id" : "21648371"
      }
    } ]
  }
}

However, I want to know that my term "grate" appeared twice in my hit: once for "grate", and once for "great" due to the dbl_metaphone filter I used.
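
To make the desired count concrete, here is a minimal Python sketch of the number I am after, computed outside Elasticsearch. It is only an illustration under assumptions: it uses a simplified Soundex as a stand-in for dbl_metaphone (so it will not agree with the real filter on every word), a rough regex in place of the standard tokenizer, and the names soundex and count_phonetic_matches are hypothetical helpers, not part of any Elasticsearch API.

```python
import re

# Letter-to-digit table for a simplified American Soundex.
# Assumption: Soundex stands in for dbl_metaphone purely for illustration.
_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        _CODES[ch] = digit


def soundex(word):
    """Return a 4-character Soundex code for a single word."""
    word = word.lower()
    if not word:
        return ""
    encoded = word[0].upper()
    prev = _CODES.get(word[0], "")
    for ch in word[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            encoded += digit
        prev = digit  # vowels reset prev, so repeated codes across vowels count
    return (encoded + "000")[:4]


def count_phonetic_matches(term, text):
    """Count tokens in text whose phonetic code equals that of term."""
    # Rough approximation of the standard tokenizer: words with optional apostrophe part.
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    target = soundex(term)
    return sum(1 for token in tokens if soundex(token) == target)


if __name__ == "__main__":
    transcript = "ok that's great, grate that carrot please"
    print(count_phonetic_matches("grate", transcript))
```

For the transcript in the hit above, this counts both "great" and "grate" as matches for "grate", which is the per-hit number I would like Elasticsearch to return with the results.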

Does anyone know how to do this?
