How does ZeroMQ queue and send queued messages?

Question

I am the author of an R package (clustermq⁰) that distributes function calls on HPC schedulers using using the ZeroMQ bindings (rzmq). I have used a simple combination of REQ/REP sockets, with the workers requesting first the common data for all tasks (the function to call and constant arguments), and then data for each call they should evaluate from the master. This worked well so far, because running the computations is usually an order of magnitude slower than sending and receiving the data.

One issue however is that the common data can have several hundred MB in size, while the iterated data is generally small. It can thus happen that the master will be busy sending a huge chunk of common data and can not send iterated data at the same time. Because of this, there is a noticeable delay when starting up a distributed computation.

However, this may not be caused by the actual sending but rather preparing the message. The documentation states:

ZeroMQ does not send the message (single or multipart) right away, but at some indeterminate later time.

So I'm wondering:

Is ZeroMQ sending out data that we put in the queue with send() one after the other or in parallel?¹ Would this make a difference or is it likely negligible? Is there a way to influence this?
- As far as I understand, switching from REP to ROUTER here will not change anything.² Is this correct?
- If it's serial I might want to separate the data into slow and fast sockets
Is the main delay likely caused by what happens before, i.e. copying big chunks of memory for creating the message object?³ (I already serialize only once)
- In this case I would want to interface with ZeroMQ message objects without copying

Note that I'm looking for an answer from the design rationale of ZeroMQ and not a comment that I can benchmark.

Some clarifications below:

⁰ This is not meant to be implemented in the theoretically most efficient way, but rather using functions that rzmq provides. The goal is to improve upon packages that store everything on NAS and retrieve it from there (which is quite a low bar). This is a side project and I am not a systems engineer (and I'm not proficient in low-level ZeroMQ). I am benchmarking both overhead and real-world (a.k.a. my actual work) examples, but that hasn't made it into the docs yet.

¹ Assume the cases (TCP): one REP master and n REQ clients; one ROUTER master and n REQ clients; PUSH/PULL as alternative approach. Is there another way to interface with this apart from using different sockets (probably not from high-level bindings like rzmq but pointing me to relevant low-level documentation would also help; I haven't found this information in the user guide)

² What I mean by this is that if I connect REQ clients to a ROUTER master, I manage the envelopes myself (and have to send the id and empty frame manually), but this does not change the code that ZeroMQ uses under the hood to send messages. Or does it? Where is this documented? (I couldn't find it in the user guide)

³ A valid answer for this would be that bottlenecks are memory copy for initializing the message in the main thread and then sending the messages to one client after the other in a separate thread, not blocking the main (if that is the case, or whatever actually happens to the message)

How does ZeroMQ queue and send queued messages?

Answers (1)

1 ) showing Zero-code means any answer may be just at very high-level

Related Questions