Schitti
Schitti

Reputation: 27879

Python multiple threads accessing same file

I have two threads, one which writes to a file, and another which periodically moves the file to a different location. The writes always calls open before writing a message, and calls close after writing the message. The mover uses shutil.move to do the move.

I see that after the first move is done, the writer cannot write to the file anymore, i.e. the size of the file is always 0 after the first move. Am I doing something wrong?

Upvotes: 14

Views: 34086

Answers (3)

Alex Martelli
Alex Martelli

Reputation: 881555

Locking is a possible solution, but I prefer the general architecture of having each external resource (including a file) dealt with by a single, separate thread. Other threads send work requests to the dedicated thread on a Queue.Queue instance (and provide a separate queue of their own as part of the work request's parameters if they need result back), the dedicated thread spends most of its time waiting on a .get on that queue and whenever it gets a requests goes on and executes it (and returns results on the passed-in queue if needed).

I've provided detailed examples of this approach e.g. in "Python in a Nutshell". Python's Queue is intrinsically thread-safe and simplifies your life enormously.

Among the advantages of this architecture is that it translates smoothly to multiprocessing if and when you decide to switch some work to a separate process instead of a separate thread (e.g. to take advantage of multiple cores) -- multiprocessing provides its own workalike Queue type to make such a transition smooth as silk;-).

Upvotes: 30

Xorlev
Xorlev

Reputation: 8643

Check out http://www.evanfosmark.com/2009/01/cross-platform-file-locking-support-in-python/

You can use a simple lock with his code, as written by Evan Fosmark in an older StackOverflow question:

from filelock import FileLock

with FileLock("myfile.txt"):
    # work with the file as it is now locked
    print("Lock acquired.")

One of the more elegant libraries I've ever seen.

Upvotes: 4

Eli Bendersky
Eli Bendersky

Reputation: 273386

When two threads access the same resources, weird things happen. To avoid that, always lock the resource. Python has the convenient threading.Lock for that, as well as some other tools (see documentation of the threading module).

Upvotes: 8

Related Questions