Marcel Sufryd

Reputation: 21

Low CouchDB performance for randomized operations

I am benchmarking the performance of CouchDB and MongoDB using the YCSB benchmarking tool. Unfortunately, it seems I am doing something wrong, because the difference in performance for single, random operations is huge:

Workload A (50/50 read/update), 16 query threads, 120 sec runtime (results are very similar with 20 minute runtimes):

CouchDB 1.6.1: Overall throughput: 1076 ops/sec, 99th percentile read latency of 13ms, 99th percentile update latency of 13ms

MongoDB 3.0.6: Overall throughput: 11203 ops/sec, 99th percentile read latency of 1ms, 99th percentile update latency of 1ms

As you can see, CouchDB is terribly slow for randomized reads and updates. The documentation recommends using bulk operations, which might be fine for inserts, but I do not see how I would realize bulk reads, considering YCSB issues reads one by one.
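For reference, here is a rough sketch of what a batched read could look like against CouchDB's _all_docs endpoint, in case anyone sees a way to map YCSB's one-by-one reads onto it (the database name and document IDs are placeholders):

```python
# Rough sketch of batching point reads via CouchDB's _all_docs endpoint,
# assuming CouchDB 1.6 on localhost and a database named "ycsb" (placeholder).
import requests

COUCH_URL = "http://localhost:5984/ycsb"

def bulk_read(doc_ids):
    # One POST with a "keys" body fetches many documents in a single round trip.
    resp = requests.post(
        COUCH_URL + "/_all_docs",
        params={"include_docs": "true"},
        json={"keys": doc_ids},
    )
    resp.raise_for_status()
    # Rows for missing keys carry an "error" field and no "doc".
    return [row.get("doc") for row in resp.json()["rows"]]

print(bulk_read(["user1000", "user1001", "user1002"]))
```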

Testing environment:

What I have tried to improve throughput:

Possible explanations for CouchDB's slowness:

Question: Do you see any other ways of improving CouchDB's performance?

Edit: delayed_commits is set to true in CouchDB, so I am starting to doubt forced fsync as the cause.
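That setting can be double-checked over CouchDB 1.x's HTTP config API; a quick probe, assuming the default port and no admin authentication:

```python
# Quick check of the delayed_commits setting via CouchDB 1.x's _config API,
# assuming the default port 5984 and no admin credentials.
import requests

r = requests.get("http://localhost:5984/_config/couchdb/delayed_commits")
print(r.json())  # "true" means commits are batched instead of fsynced per write
```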

Upvotes: 1

Views: 706

Answers (1)

Kxepal

Reputation: 4679

The answer here is simple: CouchDB ensures that every write hits disk with an fsync() call, while MongoDB is allowed to keep writes in memory for a while and tell you that everything is fine, until the next accidental shutdown, when you lose your data. RAM versus disk is the main performance factor between them.
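One way to see how much of the gap is durability rather than raw speed would be to rerun the MongoDB side with journaled write acknowledgement, so both systems wait for disk before confirming. A minimal pymongo sketch (the database and collection names are assumptions, not necessarily what YCSB uses):

```python
# Sketch: make MongoDB wait for the journal before acknowledging a write,
# which brings its durability guarantee closer to CouchDB's per-write fsync.
# Database and collection names are placeholders.
from pymongo import MongoClient
from pymongo.write_concern import WriteConcern

client = MongoClient("localhost", 27017)
coll = client["ycsb"].get_collection(
    "usertable",
    write_concern=WriteConcern(j=True),  # acknowledge only after journal commit
)
coll.update_one({"_id": "user1000"}, {"$set": {"field0": "new-value"}})
```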

Next comes the protocol: HTTP is text-based, while MongoDB uses its own binary protocol. Needless to say, binary protocols are more compact and efficient.

But the main problem here is that your benchmark is synthetic. You assume that your database is used for simple reads and writes, like a data bag, while real databases are used for more complex operations: queries, index lookups, joins, data validation and so on. And there, business logic matters.

For a more realistic benchmark, you should take some application, make it work with both databases, and benchmark the business workflow with them, not blind reads and writes. I am pretty sure your numbers will even out, because business logic is much slower than any database.

So I'm sorry that you wasted your time on this.

Upvotes: 1
