Why can't multithreading speed up an mmap task?

Question

I have a big task that needs to read 500 files (50 GB in total).

For every file, I need to read it and do some calculation on its data. It is pure computation, nothing else. I can guarantee the tasks are independent; they only share some singleton objects that are read but never written (I don't think that is the problem).

Currently, I use mmap to get a pointer to the start of each file's content and loop over it to do the calculation.
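
Roughly, the per-file work looks like the sketch below. This is not the real code: `process_file` is a hypothetical name, the byte sum is only a stand-in for the real calculation, and it assumes a POSIX system.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>

#include <fcntl.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

// Hypothetical stand-in for the real calculation: sums every byte of one file.
std::uint64_t process_file(const std::string& path) {
    int fd = ::open(path.c_str(), O_RDONLY);
    if (fd < 0) throw std::runtime_error("open failed: " + path);

    struct stat st;
    if (::fstat(fd, &st) != 0) {
        ::close(fd);
        throw std::runtime_error("fstat failed: " + path);
    }
    if (st.st_size == 0) {  // mmap rejects a zero-length mapping
        ::close(fd);
        return 0;
    }

    const std::size_t len = static_cast<std::size_t>(st.st_size);
    void* addr = ::mmap(nullptr, len, PROT_READ, MAP_PRIVATE, fd, 0);
    ::close(fd);  // the mapping stays valid after the descriptor is closed
    if (addr == MAP_FAILED) throw std::runtime_error("mmap failed: " + path);

    const unsigned char* p = static_cast<const unsigned char*>(addr);
    std::uint64_t sum = 0;
    // The first access to each page may fault and wait for the disk,
    // so this loop mixes computation with file I/O.
    for (std::size_t i = 0; i < len; ++i) sum += p[i];

    ::munmap(addr, len);
    return sum;
}
```

Calling `process_file("some_file.bin")` once per file, one after another, corresponds to the single-threaded run.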

Running the task in a single thread takes 30 s.

Running it in a thread pool with 6 threads takes 35 s.

My machine has 16 GB of memory and a 2.2 GHz CPU with 8 threads.

I have tried many settings and carefully made sure the tasks are independent.
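
The way the files are handed out to threads is roughly like the sketch below. It is a simplified illustration with plain `std::thread` rather than the real thread pool; `run_on_pool` and the `work` callback are hypothetical names.

```cpp
#include <atomic>
#include <cstddef>
#include <functional>
#include <string>
#include <thread>
#include <vector>

// Runs work(path) once per path on num_threads worker threads.
// Each worker claims the next unprocessed index from a shared atomic counter,
// so no two workers handle the same file or write the same result slot.
// Note: an exception escaping work() would terminate the program.
std::vector<std::size_t> run_on_pool(
    const std::vector<std::string>& paths,
    unsigned num_threads,
    const std::function<std::size_t(const std::string&)>& work) {
    if (num_threads == 0) num_threads = 1;

    std::vector<std::size_t> results(paths.size());
    std::atomic<std::size_t> next{0};

    auto worker = [&]() {
        for (;;) {
            const std::size_t i = next.fetch_add(1);
            if (i >= paths.size()) return;  // nothing left to claim
            results[i] = work(paths[i]);
        }
    };

    std::vector<std::thread> pool;
    for (unsigned t = 0; t < num_threads; ++t) pool.emplace_back(worker);
    for (auto& th : pool) th.join();
    return results;
}
```

With `num_threads` set to 6 and `work` doing the per-file calculation, this matches the 6-thread run: the only shared mutable state is the counter and the result vector, and each slot is written by exactly one thread.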

I am not very familiar with hardware. Is there a hard I/O limit that caps my speed? Can anyone point me to something I can read about this?

Sorry, the code is too complex for me to put together a valid demo here.

