Query in Cassandra that will sort the whole table by a specific field

Question

I have a table like this

CREATE TABLE my_table(
category text,
name text,
PRIMARY KEY((category), name)
) WITH CLUSTERING ORDER BY (name ASC);

I want to write a query that will sort by name through the entire table, not just each partition.

Is that possible? What would be the "Cassandra way" of writing that query?

I've read other answers in the StackOverflow site and some examples created single partition with one id (bucket) which was the primary key but I don't want that because I want to have my data spread across the nodes by category

bric3 · Accepted Answer

Always model cassandra tables through the access patterns (relational db / cassandra fill different needs).

Up to Cassandra 2.X, one had to model new column families (tables) for each access pattern. So if your access pattern needs a specific column to be sorted then model a table with that column in the partition/clustering key. So the code will have to insert into both the master table and into the projection table. Note depending on your business logic this may be difficult to synchronise if there's concurrent update, especially if there's update to perform after a read on the projections.
With Cassandra 3.x, there is now materialized views, that will allow you to have a similar feature, but that will be handled internally by Cassandra. Not sure it may fit your problem as I didn't play too much with 3.X but that may be worth investigation.

More on materialized view on their blog.

Query in Cassandra that will sort the whole table by a specific field

Answers (2)

Related Questions