Checking if ZMQ subscriber socket is still alive

Question

I have two applications that need to communicate over a ZMQ pub/sub socket using python. The publisher only runs for a couple hours and then shuts down. The subscriber should always be ready and waiting for available messages. The system has been working well while I've been developing it but now it's in production and the subscriber is sitting for long periods with no messages coming in. This has likely resulted in it "going to sleep" and not accepting new messages from the publisher with the publisher having no errors in sending these undelivered messages.

My subscriber is set up as follows:

context = zmq.Context()
socket = context.socket(zmq.SUB)
socket.connect("tcp://localhost:10001")
socket.setsockopt_string(zmq.SUBSCRIBE, "")

while True:
    message = socket.recv_string()
    do_something(message)

Messages can arrive at any time and interval (typically no less than 0.5 seconds) so I don't want to really use a non-blocking recv_string as missing a message can screw up the rest of the system. I'm using Windows 7 and I think this is something to do with the TCP aliveness timing out but I've got no insight to this. Ideally, I'd like to have the connection never timeout or at least be able to tell when it has so I can reestablish the socket.

Another possible thing I could do is to just send empty messages from another thread to the subscriber but there has to be a cleaner way to detect when the socket is suddenly not available. I don't want to use a broker as that just complicates my system and removes some of the portability that I'm aiming for.

Checking if ZMQ subscriber socket is still alive

Answers (1)

Related Questions