Reputation: 1395
I found that boost thread overhead has three order of magnitude timing overhead in the following simple program. Is there anyway to reduce this overhead and speedup the fooThread()
call ?
#include <iostream>
#include <time.h>
#include <boost/thread.hpp>
#include <boost/date_time.hpp>
typedef uint64_t tick_t;
#define rdtscll(val) do { \
unsigned int __a,__d; \
__asm__ __volatile__("rdtsc" : "=a" (__a), "=d" (__d)); \
(val) = ((unsigned long long)__a) | (((unsigned long long)__d)<<32); \
} while(0)
class baseClass {
public:
void foo(){
//Do nothing
}
void threadFoo(){
threadObjOne = boost::thread(&baseClass::foo, this);
threadObjOne.join();
}
private:
boost::thread threadObjOne;
};
int main(){
std::cout<< "main startup"<<std::endl;
baseClass baseObj;
tick_t startTime,endTime;
rdtscll(startTime);
baseObj.foo();
rdtscll(endTime);
std::cout<<"native foo() call takes "<< endTime-startTime <<" clock cycles"<<std::endl;
rdtscll(startTime);
baseObj.threadFoo();
rdtscll(endTime);
std::cout<<"Thread foo() call takes "<< endTime-startTime <<" clock cycles"<<std::endl;
}
You can compile it with g++ -lboost_thread-mt main.cpp
and here is the sample output in my machine:
main startup
native foo() call takes 2187 clock cycles
Thread foo() call takes 29630434 clock cycles
Upvotes: 3
Views: 2896
Reputation: 264331
What you really want is a thread pool:
#include "threadpool.hpp"
int main()
{
boost::threadpool::pool threadpool(8); // I have 4 cpu's
// Might be overkill need to time
// to get exact numbers but depends on
// blocking and other factors.
for(int loop = 0;loop < 100000; ++loop)
{
// schedule 100,000 small tasks to be run in the 8 threads in the pool
threadpool.schedule(task(loop));
}
// Destructor of threadpool
// will force the main thread to wait
// for all tasks to complete before exiting
}
Upvotes: 7