Prolog: optimizing global structures for performance

Question

I have written a prolog program and now am trying to optimize it for performance (yes, this use case actually needs it).

First some background on how the program was originally structured, so you know where I've already been.

The system keeps customer (user) orders in the logic base, which are dynamically asserted into the logic base as they come in (and dynamically removed once processed using retract). Originally the orders were structured like so:

order(RegionID, UserID, UserBalance, OrderID, ProductID, Price, ...) .
order(RegionID, UserID, UserBalance, OrderID, ProductID, Price, ...) .
...
order(RegionID, UserID, UserBalance, OrderID, ProductID, Price, ...) .

I liked this just fine, however during testing, I populated the system with 50,000 orders and found that it took inordinately long to PROCESS (on the order of a few minutes - it needed to be better). I profiled and found the most time was spent going through the logic base scooping up the orders for processing, so decided to try another scheme.

This makes sense because particular users are tied to particular regions:

order(RegionID, [ (UserID, UserBalance, OrderID, ProductID, Price, ...), (UserID, UserBalance, OrderID, ProductID, Price, ...), ...]) .
order(RegionID, [ (UserID, UserBalance, OrderID, ProductID, Price, ...), (UserID, UserBalance, OrderID, ProductID, Price, ...), ...]) .
...
order(RegionID, [ (UserID, UserBalance, OrderID, ProductID, Price, ...), (UserID, UserBalance, OrderID, ProductID, Price, ...), ...]) .

What I am doing here is storing a long list of user orders for each region. To test this, I made the lists within the order structures 50,000 in length (50,000 orders). This performed much better than the original scheme in PROCESSING the orders (25% - 30% of the original time); however in ADDING orders to the system, it performs much worse by at least an order of magnitude if not more.

The order adding procedure is quite simple. I simply retract the order structure with an instantiated the RegionID, then reassert with an additional order tacked on to the head (something like this):

retract( order(california, OldOrders ) ).
assert ( order(california, [ NewOrder | OldOrders ] ) ).

I would have presumed this to be reasonably fast as I'm just adding something to the head, but it isn't. My guess is there is a lot of copying of the long list going on behind the scenes.

My question is simply how to optimize this more for speed. You may suggest a different structuring of the data, a different algorithm, a different mechanism for storing this stuff (I only know assert/retract, but different prologs may have more exotic mechanisms?), or whatever you want. Keep in mind that with any suggestions, I don't want to go backwards on order PROCESSING (vs. ADDING).

I am currently using Eclipse (the prolog, not the IDE), however I could easily switch to XSB, yap, or any other free prolog if your suggestion requires it. Just note that we need to stick to faster prologs rather than slower ones like SWI.

Thanks for any suggestions.

Prolog: optimizing global structures for performance

Answers (1)

Related Questions