Rundeck in bad state, restarting spawns multiple broken instances

Question

I recently hit a situation where Rundeck's service logs indicated it was still functional, but the web GUI was down and lsof -i :4443 showed nothing listening on Rundeck's web port. The Rundeck command line was also down: none of the rd commands (e.g. rd-queue) ever returned.

rundeckd restart (or alternatively rundeckd stop; rundeckd start) gave the expected output, but only spawned more processes running the Rundeck jar. The final solution was to force-kill all of these processes and start Rundeck via the init script.

Is there a more sophisticated way to check whether Rundeck is still up, aside from checking logs and rundeckd status? Status reported it as up and running, which it most certainly was not.
What might cause Rundeck to enter this state? Is it possible for Rundeck to still be functional and executing jobs while only the web UI is down? Is it possible to restart or fix only the web UI without restarting Rundeck and thus killing all running jobs?
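
For reference, the kind of check I have in mind would probe the web port directly rather than trust the status output, since the lsof result was the only thing that showed the real state. A minimal sketch, assuming bash (for its /dev/tcp pseudo-device) and the port 4443 from my setup; the function names port_is_open and check_rundeck are hypothetical helpers, not part of Rundeck:

```shell
#!/usr/bin/env bash
# Sketch only: reports whether anything accepts TCP connections on a port.
# The default host and port are assumptions taken from the setup described above.

port_is_open() {
    local host="$1" port="$2"
    # bash's /dev/tcp attempts a TCP connect; the subshell closes fd 3 on exit.
    (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null
}

check_rundeck() {
    local host="${1:-localhost}" port="${2:-4443}"
    if port_is_open "$host" "$port"; then
        echo "listening"
    else
        echo "not listening"
        return 1
    fi
}
```

Calling check_rundeck with no arguments probes localhost:4443 and returns non-zero when nothing is listening, so it could sit alongside rundeckd status in a monitoring script. It only proves that the port accepts connections, not that the web UI or job execution is healthy.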
