Reputation: 8910
This question is part of my continuing exploration of Docker and in some ways follows up on one of my earlier questions. I have now understood how one can get a full application stack (effectively a mini VPS) working by linking together a bunch of Docker containers. For example, one could create a stack that provides Apache + PHP5 with a sheaf of extensions, plus Redis, Memcached and MySQL, all running on top of Ubuntu, with or without an additional data container to make it easy to persist user data.
All very nice and elegant. However, I cannot help but wonder: that is 5 containers to run that little VPS (I count 5, not 6, since Apache + PHP5 go into one container). So suppose I have 100 such VPSs running? That means I have 500 containers running! I understand the arguments here - it is easy to compose new app stacks, to update one component of the stack, etc. But are there no unnecessary overheads to operating this way?
Suppose I did this:

Write up a little shell script, start.sh:
#!/bin/bash
# start all the services, then block so the container stays alive
service memcached start
service redis-server start
....
service apache2 start
while :
do
    :
done
In my Dockerfile I have
ADD start.sh /usr/local/bin/start.sh
RUN chmod +x /usr/local/bin/start.sh
....
ENTRYPOINT ["/bin/bash"]
CMD ["/usr/local/bin/start.sh"]
I then get that container up and running:
docker run -d -p 8080:80 -v /var/droidos/site:/var/www/html -v /var/droidos/logs:/var/log/apache2 droidos/minivps
and I am in business. Now, when I want to shut down that container programmatically, I can do so by executing a single docker command.
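For example, capturing the container ID when I launch it (the CID variable name is just for illustration):

CID=$(docker run -d -p 8080:80 -v /var/droidos/site:/var/www/html -v /var/droidos/logs:/var/log/apache2 droidos/minivps)
docker stop "$CID"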
There are many questions of a similar nature to be found when one Googles for them. Apart from the arguments I have reproduced above, one of the commonest reasons given for the one-app-per-container approach is "that is the way Docker is designed to work". What I would like to know is: apart from convention, are there concrete technical reasons why running multiple services in one container like this is a bad idea?
Upvotes: 37
Views: 27180
Reputation: 1018
Besides all the issues mentioned in https://phusion.github.io/baseimage-docker/, the other key win is local IPC over Unix Domain Sockets, rather than going through localhost.

Remote calls are a source of many evils; that is why Unix Domain Sockets were invented.

I am not particularly fond of .NET, but this article explains the whys well: https://andrewlock.net/using-unix-domain-sockets-with-aspnetcore-and-httpclient/

I am far from a Kubernetes SME, but AFAIK communication between the containers of a Kubernetes Pod can go over UDS.

My advice would be: find your balance between the number of containers and multiple processes per container, but be aware that you want UDS whenever possible.
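As a concrete sketch of what that looks like inside a single container (the socket path is illustrative): configure Redis to listen on a Unix domain socket instead of a TCP port, then point clients at the socket file.

# redis.conf - disable TCP, listen on a Unix domain socket instead
port 0
unixsocket /var/run/redis/redis.sock
unixsocketperm 770

# connect over the socket rather than localhost:6379
redis-cli -s /var/run/redis/redis.sock ping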
Upvotes: 0
Reputation: 2820
@Bryan's answer is solid, particularly regarding the low overhead of a container that runs just one process.

That said, you should at least read the arguments at https://phusion.github.io/baseimage-docker/, which makes a case for having containers with multiple processes. Without them, docker is light on provision for basics such as an init process that reaps zombies, a syslog daemon, and cron.
baseimage-docker runs an init process which fires up a few processes besides the main one in the container.
For some purposes this is a good idea, but be aware that, for instance, running a cron daemon and a syslog daemon in every container adds a bit more overhead. I expect that as the docker ecosystem matures we'll see better solutions that don't require this.
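For reference, the baseimage-docker convention looks roughly like this; a minimal sketch, assuming a memcached service (the image tag and the memcached.sh script are placeholders, and runit expects the run script to keep its daemon in the foreground):

# Dockerfile
FROM phusion/baseimage:<tag>
RUN mkdir -p /etc/service/memcached
COPY memcached.sh /etc/service/memcached/run
RUN chmod +x /etc/service/memcached/run
CMD ["/sbin/my_init"]

# memcached.sh - the runit service script
#!/bin/sh
exec /usr/bin/memcached -u memcache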
Upvotes: 7
Reputation: 12200
A container is basically a process. There is no technical issue with running 500 processes on a decent-sized Linux system, although they will have to share the CPU(s) and memory.
The cost of a container over a process is some extra kernel resources to manage namespaces, file systems and control groups, and some management structures inside the Docker daemon, particularly to handle stdout and stderr.
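You can see those per-container namespaces directly on the host; a quick illustration, assuming a running container named minivps:

PID=$(docker inspect -f '{{.State.Pid}}' minivps)
sudo ls -l /proc/$PID/ns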
The namespaces are introduced to provide isolation, so that one container does not interfere with any others. If your group of 5 containers forms a unit that does not need this isolation, then you can share the network namespace between them using --net=container:<name>. There is no feature at present to share cgroups, AFAIK.
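A quick sketch of that (the image names here are illustrative, following the droidos/* naming from the question):

docker run -d --name web droidos/apache
docker run -d --net=container:web droidos/redis
# both containers now share one network namespace,
# so Apache can reach Redis on localhost:6379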
What is wrong with what you suggest:

stdout and stderr will be intermingled for the five processes

Upvotes: 26