pdeva

Reputation: 45541

How to scale websocket connection load as one adds/removes servers?

To explain the problem:

With HTTP:

Assume there are 100 requests/second arriving.

  1. If there are 4 servers, the load balancer (LB) can distribute the load across them evenly, 25/second per server
  2. If I add a server (5 servers total), the LB rebalances to 20/second per server
  3. If I remove a server (3 servers total), the load per server rises to 33.3/second

So the load per server rebalances automatically as I add/remove servers, since each connection is so short-lived.

With Websockets

Assume there are 100 clients, 2 servers (behind a LB)

  1. The LB initially balances each incoming connection evenly, so each server has 50 connections.
  2. However, if I add a server (3 servers total), the 3rd server gets 0 connections, since the existing 100 clients are already connected to the first 2 servers.
  3. If I remove a server (1 server total), all 100 clients will reconnect and are now served by 1 server.

Problem

Since websocket connections are persistent, adding/removing a server does not increase/decrease load per server until the clients decide to reconnect.

How does one then efficiently scale websockets and manage load per server?

Upvotes: 8

Views: 1339

Answers (1)

winrid

Reputation: 99

This is similar to problems the gaming industry has been trying to solve for a long time. Gaming is an area with many concurrent, persistent connections and a need for fast communication between many clients.

Options:

  1. Slave/master architecture, where the master keeps a connection to each slave to monitor health, load, etc. When a client joins the session/application, it pings the master and the master responds with the next server to use. This is a kind of client-side load balancing, except you are using server-side heuristics.

This prevents your clients from blowing up a single server. You'll have to have the client poll the master before establishing the WS connection, but that is simple.

This way you can also scale out to multiple masters if you need to and put them behind load balancers.

If you need to send a message between servers there are many options for that (handle it yourself, queues, etc).

This is how my drawing app Pixmap for Android, which I built last year, works. Works very well too.
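The master's "next server" decision above could be sketched like this (all names and the in-memory load table are illustrative assumptions, not from the answer; a real master would get these counts from its health-check connections to the slaves):

```python
# Hypothetical load table the master maintains from slave health reports:
# hostname -> current WebSocket connection count.
server_load = {
    "ws1.example.com": 48,
    "ws2.example.com": 50,
    "ws3.example.com": 0,   # freshly added server starts empty
}

def next_server() -> str:
    """Return the least-loaded server for the next client to connect to."""
    host = min(server_load, key=server_load.get)
    # Optimistically bump the count so back-to-back requests don't all
    # pile onto the same host before the next health report arrives.
    server_load[host] += 1
    return host

print(next_server())  # the new, empty ws3 absorbs incoming clients first
```

Because new clients always land on the least-loaded host, a freshly added server fills up naturally instead of sitting idle behind a dumb round-robin LB.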

  2. Client-side load balancing, where the client connects to a random host name. This is how Watch.ly works. Each host can then be its own load balancer and cluster of servers for safety. Risky but simple.
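The client side of that option is tiny. A minimal sketch, assuming a few illustrative host names (each of which could front its own cluster):

```python
import random

# Hypothetical pool of host names baked into (or fetched by) the client.
HOSTS = ["ws1.example.com", "ws2.example.com", "ws3.example.com"]

def pick_host() -> str:
    """Pick a host uniformly at random before opening the WebSocket.

    No central coordinator: with enough clients, load spreads out
    statistically across the pool.
    """
    return random.choice(HOSTS)
```

Adding a server means adding a host name to the pool; only new connections land on it, which is exactly the rebalancing gap the question describes.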

  3. Traditional load balancing, i.e. round robin. Hard to beat haproxy. This should be your first approach and will scale to many thousands of concurrent users. It doesn't solve the problem of redistributing load, though. One way to solve that with this setup is to push an event to your clients telling them to reconnect (and have each attempt to reconnect after a random timeout so you don't kill your servers).

Upvotes: 1
