ash__999
ash__999

Reputation: 169

Elasticsearch Java API - How to get the number of documents without retrieving the documents

I need to get the number of documents in an index. not the documents themselves, but just this "how many" .

What's the best way to do that?

There is https://www.elastic.co/guide/en/elasticsearch/reference/current/search-count.html. but I'm looking to do this in Java.

There also is https://www.elastic.co/guide/en/elasticsearch/client/java-api/2.4/count.html, but it seems way old.

I can get all the documents in the given index and come up with "how many". But there must be a better way.

Upvotes: 6

Views: 18938

Answers (6)

wingman__7
wingman__7

Reputation: 749

2021 Solution

I went through the solutions posted and none of them are convincing. You may get the job done by setting size of the search request to 0 but that's not the correct way. For counting purposes we should use the count API because count consumes less resources/bandwidth and it doesn't require to fetch documents, scoring and other internal optimisations.

You must use the Count API for Java (link attached below) to get the count of the documents. Following piece of code should get the job done.

  • Build query using QueryBuilder

  • Pass the query and list of indexes to the CountRequest() constructor

  • Get CountResponse() object by doing client.count(countReq)

  • Extract/Return the value by doing countResp.getCount()

    CountRequest countReq = new CountRequest(indexes, query);

    CountResponse countResp = client.count(countReq, RequestOptions.DEFAULT);

    return countResp.getCount();

Read the second link for more information.

Important Links

Count API vs Search API : Counting number of documents using Elasticsearch

Count API for Java : https://www.elastic.co/guide/en/elasticsearch/client/java-rest/current/java-rest-high-count.html

Upvotes: 2

Z.ABC
Z.ABC

Reputation: 91

Elastic - Indices Stats

Indices level stats provide statistics on different operations happening on an index. The API provides statistics on the index level scope (though most stats can also be retrieved using node level scope).

prepareStats(indexName) client.admin().indices().prepareStats(indexName).get().getTotal().getDocs().getCount();

Upvotes: 4

Akash Babu
Akash Babu

Reputation: 41

Just an addition to @evanjd's answer

import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.action.search.SearchType;
import org.elasticsearch.index.query.QueryBuilders.*;

 SearchResponse response = client.prepareSearch("your_index_goes_here")
   .setTypes("YourTypeGoesHere")
   .setQuery(QueryBuilders.termQuery("some_field", "some_value"))
   .setSize(0) // Don't return any documents, we don't need them.
   .get();

 SearchHits hits = response.getHits();
 long hitsCount = hits.getTotalHits().value;

we need to add .value to get long value of total hits otherwise it will be a string value like "6 hits"

long hitsCount = hits.getTotalHits().value;

long hitsCount = hits.getTotalHits().value;

Upvotes: 4

enesaltinok
enesaltinok

Reputation: 55

Breaking changes after 7.0; you need to set track_total_hits to true explicitly in the search request.

https://www.elastic.co/guide/en/elasticsearch/reference/current/breaking-changes-7.0.html#track-total-hits-10000-default

Upvotes: 3

Biplab
Biplab

Reputation: 139

We can also get lowLevelClient from highLevelClient and invoke the "_count" rest API like "GET /twitter/_doc/_count?q=user:kimchy".

Upvotes: 1

evanjd
evanjd

Reputation: 415

Use the search API, but set it to return no documents and retrieve the count of hits from the SearchResponse object it returns.

For example:

import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.action.search.SearchType;
import org.elasticsearch.index.query.QueryBuilders.*;

SearchResponse response = client.prepareSearch("your_index_goes_here")
    .setTypes("YourTypeGoesHere")
    .setQuery(QueryBuilders.termQuery("some_field", "some_value"))
    .setSize(0) // Don't return any documents, we don't need them.
    .get();

SearchHits hits = response.getHits();
long hitsCount = hits.getTotalHits();

Upvotes: 15

Related Questions