Guigui

Reputation: 1115

Datamodeling for Aerospike

I am investigating Aerospike. We need to use it as a cache (no need for persistence), as the data only lives for a very short period of time: we create it, we read it, and then the goal is to delete it as fast as possible based on some processing in a service.

Our data looks something like this:

Record:
- RecordId
- ClientId
- Partition
- Region
- Size
- May have X number of custom attributes (I will probably limit the number of the attributes)

ClientId here represents the multitenancy we want to implement. We will only ever query records that belong to one specific ClientId.

We need to query this data on different fields. I know that this is not easy in Aerospike, as it only supports one secondary-index filter per query. Since we need to support a large number of records (probably in the range of several million), we want to partition our records based on their Partition field. That should allow queries to run faster and make post-processing easier.

Each record would have the same format within a Partition, but it may differ from one partition to another.

To solve this problem, I want to model my data in Aerospike like this:

Sets:

Partition_{ClientId} (string equality filter)
   Key: RecordId
   Bin: Partition
   Index: Partition

Region_{ClientId} (string equality filter)
   Key: RecordId
   Bin: Region
   Index: Region

Size_{ClientId} (integer range search)
   Key: RecordId
   Bin: Size
   Index: Size

With as many sets as necessary to filter my data. We would then query the different sets and intersect the results of the queries to get the filtered records.
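Under this model the intersection happens client-side. A minimal pure-Python sketch of that step, assuming each single-filter query returns the matching RecordIds (the record IDs and values below are hypothetical):

```python
def intersect_results(*result_sets):
    """Intersect the RecordId sets returned by each single-filter query."""
    if not result_sets:
        return set()
    ids = set(result_sets[0])
    for other in result_sets[1:]:
        ids &= set(other)
    return ids

# Hypothetical results of three separate secondary-index queries:
partition_hits = {"r1", "r2", "r3", "r4"}   # Partition == "P7"
region_hits    = {"r2", "r3", "r5"}         # Region == "North"
size_hits      = {"r3", "r4", "r5"}         # Size > 300

matching = intersect_results(partition_hits, region_hits, size_hits)
print(sorted(matching))  # ['r3']
```

Note that each query still has to materialize its full result set before the intersection, which is part of what makes this model expensive.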

First question: I am doing this because, from what I read, there is no easy way to filter a set based on several filters. Is this a correct assumption?

Second question: with that model we would reach the limit on the number of sets in one namespace much faster. Is there any other way to model this sort of data while still being efficient?

Upvotes: 0

Views: 115

Answers (1)

pgupta

Reputation: 5415

You can have a maximum of 1023 sets and define a maximum of 256 secondary indexes. If the number of partitions is limited (under 1023), use the partition as the secondary index. SIs are built in process RAM and give you the advantage of a faster first grouping of eligible records for your query. Then filter using Expressions on ClientID and whatever other conditions you need.

Records also have metadata: the expiration time (TTL) in your case, or the LastUpdateTime of the record (or neither). If you can first filter on metadata that gives a definitive GO/NOGO, that is fast, because metadata is in RAM (assuming Community Edition), and it saves reading the record from disk for the other bin-value filtering. Bin data is on disk, assuming you are using storage-engine device. If this is a cache and you are using storage-engine memory, then bin data retrieval will also be faster.

So, you can execute queries like this: for PartitionId == 220, give me all records for ClientID == 3005 where the remaining life (TTL) is greater than 3600 seconds, Region == "North", and Size > 300. That is, you can build any combination of logic that evaluates to true or false on the record metadata and/or the bin values, or on bin values only. For this example query, you only need an SI on PartitionId.
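The boolean logic such an expression encodes can be sketched in plain Python (this is only an illustration of the filter, not the Aerospike Expressions API; the record shape is hypothetical). The SI on PartitionId does the first grouping server-side, so the per-record check only covers the remaining conditions, with the in-RAM metadata checked before the on-disk bins:

```python
def record_matches(meta, bins):
    """GO/NOGO check for one record already selected by the SI on PartitionId.

    Metadata (in RAM) is evaluated first; bin values (on disk with
    storage-engine device) are only read if the metadata check passes.
    """
    # Metadata filter: remaining life (TTL) must exceed 3600 seconds.
    if meta["ttl"] <= 3600:
        return False
    # Bin-value filters, mirroring the example query in the answer.
    return (
        bins["ClientID"] == 3005
        and bins["Region"] == "North"
        and bins["Size"] > 300
    )

print(record_matches({"ttl": 7200},
                     {"ClientID": 3005, "Region": "North", "Size": 512}))  # True
print(record_matches({"ttl": 100},
                     {"ClientID": 3005, "Region": "North", "Size": 512}))  # False
```

In a real client you would express the same logic as a filter expression attached to the query policy, so the evaluation happens server-side per record instead of in your application.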

Upvotes: 2
