Kapil Earanky

Reputation: 211

Improve query performance for large number of records in rethinkdb

I have a RethinkDB table with 150 mn records containing schema-less JSON data. I'm querying a nested field in the JSON, for example the 'Gate No.' field in the JSON below.

{ "Name": "XYZ", "Age": 22, "Address": { "Gate No.": 7, "Society": "ABC" } }

When I ran the same query on a table containing 1 mn records, it returned in 680 ms; with 150 mn records, however, the query doesn't return at all. From the web console, it runs for a while and then gives an error: Query terminated by an unknown cause. From my Java application, the query seems to run forever.

I've tried sharding with 4 servers, each holding ~37 mn documents but that doesn't seem to improve the situation. How can I get the query to run?

PS.: My JSON data is completely schema-less, so indexing the data is not a viable option.

Upvotes: 0

Views: 373

Answers (1)

dalanmiller

Reputation: 3672

Based on the extra information you've given, I'd tackle this problem like this:

First, you should definitely create an index. You said your data is schema-less, but I assume all or most of the entries have an Address and Gate No. field? If so, you can create an index like this (in the Data Explorer, or translate this query to Java):

r.db("dbName").table("tableName").indexCreate('gate_no', r.row("Address")("Gate No."))

Without an index, every such query falls back to a full table scan. You typically want to create indexes for the queries you run often, so if you're constantly going to look up addresses by Gate No., this will help.
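To see why this matters, here's a plain-Python illustration (nothing RethinkDB-specific; the data is made up) of the difference between scanning every document and consulting a prebuilt lookup structure:

```python
# Illustrative: why an index beats a table scan.
docs = [
    {"Name": "XYZ", "Age": 22, "Address": {"Gate No.": 7, "Society": "ABC"}},
    {"Name": "PQR", "Age": 30, "Address": {"Gate No.": 3, "Society": "DEF"}},
]

# Table scan: touch every document, on every query (O(n) per query).
scan_hits = [d for d in docs if d["Address"]["Gate No."] == 7]

# "Index": build the lookup structure once; each query afterwards is a
# cheap lookup instead of a scan.
index = {}
for d in docs:
    index.setdefault(d["Address"]["Gate No."], []).append(d)
index_hits = index.get(7, [])

assert scan_hits == index_hits
```

At 150 mn documents, that per-query scan is exactly what's timing out.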

Now use .getAll, passing the value you want to find and explicitly naming the index to use; this returns the matching documents. (It's called .getAll because, unlike primary keys, a secondary index value need not be unique, so multiple documents can match.)

r.db("dbName").table("tableName").getAll(7, {index: 'gate_no'})

This will get you your result in a fraction of the time: an index lookup is roughly O(log n), depending on the current balance of the index tree, rather than the O(n) of a full table scan.

As for the Data Explorer, it's really a tool for simple explorations of your data set and wasn't designed to handle millions of documents. For these kinds of tests, I find dropping into ipython or the Node REPL makes things a lot easier and more translatable to the end product.
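For completeness, here's roughly what the whole flow looks like from ipython with the official Python driver (a sketch only: it assumes a server on localhost and placeholder db/table names):

```python
import rethinkdb as r

conn = r.connect("localhost", 28015)

# Create the index once, then wait for it to finish building
# (building an index over 150 mn documents takes a while):
r.db("dbName").table("tableName").index_create(
    "gate_no", r.row["Address"]["Gate No."]).run(conn)
r.db("dbName").table("tableName").index_wait("gate_no").run(conn)

# The indexed query:
cursor = r.db("dbName").table("tableName").get_all(7, index="gate_no").run(conn)
for doc in cursor:
    print(doc)
```

Note the index_wait: .getAll will error if you query an index that's still being constructed.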

Lastly, the Java driver shouldn't be behaving like that. You should definitely open an issue on GitHub so we can properly investigate - https://github.com/rethinkdb/rethinkdb/issues/new.

Upvotes: 3
