multiple threads using same value fetched by database DAO class method

Question

Status: solved

I had to make a pastebin as I had to point out line numbers.

note: not using executorsService or thread pools. just to understand that what is wrong in starting and using threads this way. If I use 1 thread. the app works Perfect!

related links:

http://www.postgresql.org/docs/9.1/static/transaction-iso.html http://www.postgresql.org/docs/current/static/explicit-locking.html

main app, http://pastebin.com/i9rVyari logs, http://pastebin.com/2c4pU1K8 , http://pastebin.com/2S3301gD

I am starting many threads (10) in a for loop with instantiating a runnable class but it seems I am getting same result from db (I am geting some string from db, then changing it) but with each thread, I get same string (despite each thread changed it.) . using jdbc for postgresql what might be the usual issues ?

line 252

and line 223

the link is marked as processed. (true) in db. other threads of crawler class also do it. so when line 252 should get a link. it should be processed = false. but I see all threads take same link.

when one of the threads crawled the link . it makes it processed = true. the others then should not crawl it. (get it) is its marked processed = true.

getNonProcessedLinkFromDB() returns a non processed link

public String getNonProcessedLink(){        line 645
public boolean markLinkAsProcesed(String link){   line 705

getNonProcessedLinkFromDB will see for processed = false links and give one out of them . limit 1 each thread has a starting interval gap of 20 secs.
within one thread. 1 or 2 seconds (estimate processing time for crawling)

line 98  keepS threads from grabbing the same url

if you see the result. one thread made it true. still others access it. waaaay after some time.

all thread are seperate. even one races. the db makes the link true at the moment the first thread processes it

Masood Ahmad · Accepted Answer

Despite the comments and response by helpers in this post were also correct.

at the start of crawl() method body.

    synchronized(Crawler.class){
        url = getNonProcessedLinkFromDB();
        new BasicDAO().markLinkAsProcesed(url);
    }

and at the bottom of crawl() method body (when it has done processing):

    crawl(nonProcessedLinkFromDB);

actually solved the issue.

It was the gap between marking a link processed true and fetching a new one and letting other threads get the same link while the current was working on it.

Synchonized block helped further.

Thanks to helper. "Fuber" on IRC channels. Quakenet servers #java and Freenode servers ##javaee

and ALL who supported me!

multiple threads using same value fetched by database DAO class method

Answers (2)

Related Questions