Cassandra CQL3 order by clustered key efficiency (with limit clause?)

Question

I have the following table (using CQL3):

create table test (
  shard text,
  tuuid timeuuid,
  some_data text,
  status text,
  primary key (shard, tuuid, some_data, status)
);

I would like to get rows ordered by tuuid. But this is only possible when I restrict shard - I get this is due to performance.

I have shard purely for sharding, and I can potentially restrict its range of values to some small range [0-16) say. Then, I could run a query like this:

select * from test where shard in (0,...,15) order by tuuid limit L;

I may have millions of rows in the table, so I would like to understand the performance characteristics of such a order by query. It would seem like the performance could be pretty bad in general, BUT with a limit clause of some reasonable number (order of 10K), this may not be so bad - i.e. a 16 way merge but with a fairly low limit.

Any tips, advice or pointers into the code on where to look would be appreciated.

Cassandra CQL3 order by clustered key efficiency (with limit clause?)

Answers (1)

Related Questions