Python mmap "memory leak"

Question

I have a reasonably large file (~4 GB on disk) that I want to access with Python's mmap module to gain some familiarity with memory maps. I'm on a 64-bit system and am running something similar to the example below. When I run it, the process's memory consumption increases continually. I've profiled it with pympler and nothing stands out. Can someone point me to resources that describe what's going on under the hood and how to correct this, so I can scan through the file without this "memory leak" consuming all my memory? Thanks!

import mmap

with open("/path/to/large.file", "rb") as j:
    mm = mmap.mmap(j.fileno(), 0, access=mmap.ACCESS_READ)

pos = 0
while True:
    new_pos = mm.find(b"10", pos)
    if new_pos == -1:  # find() returns -1 once there are no more matches
        break
    print(new_pos)
    pos = new_pos + 1

EDIT: The file looks something like this:

0000001, data
0000002, more data
...
...

With this many sequential values in the first column, there will be a lot of hits for find(b"10").
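For context on the behaviour described above: with a read-only, file-backed mapping, the growth a process monitor reports is typically resident-set accounting for pages that have been touched, and the kernel can usually reclaim those clean pages under memory pressure, so it may not be a leak in the usual sense. If the reported number still needs to stay bounded, one option is to map the file in fixed-size windows instead of all at once. The following is only a sketch under stated assumptions: `find_all_windowed` and the 64 MiB default window are hypothetical names and values chosen for illustration, and `needle` is assumed to be non-empty.

```python
import mmap
import os


def find_all_windowed(path, needle, window=64 * 1024 * 1024):
    """Yield the absolute offset of every occurrence of `needle` in `path`,
    mapping roughly `window` bytes at a time (hypothetical helper)."""
    gran = mmap.ALLOCATIONGRANULARITY
    # mmap offsets must be multiples of the allocation granularity,
    # so round the window down to a multiple of it (but keep at least one unit).
    window = max(gran, window - window % gran)
    size = os.path.getsize(path)
    overlap = len(needle) - 1  # extra bytes so matches can straddle a boundary
    with open(path, "rb") as f:
        for start in range(0, size, window):
            length = min(window + overlap, size - start)
            # Each window is unmapped when its `with` block exits.
            with mmap.mmap(f.fileno(), length,
                           access=mmap.ACCESS_READ, offset=start) as mm:
                pos = mm.find(needle)
                # Report only matches that begin inside this window's own range.
                while pos != -1 and pos < window:
                    yield start + pos
                    pos = mm.find(needle, pos + 1)
```

Usage would look like `for offset in find_all_windowed("/path/to/large.file", b"10"): print(offset)`. The trade-off is extra map and unmap calls in exchange for a bounded mapping size; the results should match a single full-file scan.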
