Reputation: 4555
After bulk load of data using sstableloader each Cassandra node ends up with ~3.000 sstables of size ~32MB each.
In an attempt to reduce the no. of sstables I run 'nodetool compact ' on each node.
This compaction puts tremendous pressure on the heap. I tried with 8GB heap (and also 16 GB though I know it is adviced against). In both cases the C* nodes end up doing garbage collection for ~90 secs per sweep. In all, the compaction is not able to complete.
Each machine has 32 GB physical memory. The bulk loaded table uses STCS and caching = 'keys_only'.
This leads to a number of questions:
UPDATE
Actually I have these amounts of sstables - most of similar size. Major compaction cannot complete because of memory shortage. And I cannot find a way to make minor compaction kick in:
Upvotes: 1
Views: 920
Reputation: 5249
Try lowering the number of concurrent compactors and the threshold for max. in memory row size. Are you using SSDs?
Upvotes: 2