SausageBuscuit

Reputation: 1276

Refactor Oracle stored procedure to use BULK COLLECT

I have a database view that tracks hourly data in an Oracle 12c database. The data is arranged like this, with columns for each hour of the day.

SALES_DATE  | LOCATION_CODE | HE1_SALES | HE2_SALES | ... | HE24_SALES
_________________________________________________________________________
12/27/2019  | ABCD          |    40     |    50     | ... |    60
12/26/2019  | ABCD          |    51     |    64     | ... |    68
12/27/2019  | ABCG          |    53     |    54     | ... |    50
12/26/2019  | ABCG          |    45     |    47     | ... |    52

I have a stored procedure that looks through the last 10 years and tries to find dates where the hourly pattern was most similar to a user-defined day. It does this by taking the difference for each hour and adding it to a running total. Any day whose total differential is within 50 is returned in the list (so the result set is usually pretty small). The results are then put in a table so that users can come back and review them later, until they run the process again (when they view the data it is sorted with the most similar day first, based on that "proximity"). Here is the procedure in its current form:

CREATE OR REPLACE PROCEDURE MYDB.COMPARE_SALES_SP (
    p_compare_date       IN DATE,
    p_location_code      IN VARCHAR2,
    p_userid             IN VARCHAR2,
    p_message            OUT VARCHAR2)
IS
    TYPE SalesArray IS TABLE OF NUMBER
        INDEX BY PLS_INTEGER;

    v_hours_count     INTEGER;
    v_difference      NUMBER;
    v_compare_sales   SalesArray;
    v_curr_sales      SalesArray;
BEGIN
    DELETE FROM MYDB.SALES_ANALYSIS
     WHERE userid = p_userid;
    IF TRUNC (SYSDATE) = TRUNC (p_compare_date)
    THEN
        v_hours_count := MYDB.F_HOUR_ENDING_NUMBER (SYSDATE);
    ELSE
        v_hours_count := 24;
    END IF;

    SELECT HE1_SALES, HE2_SALES, HE3_SALES, HE4_SALES, HE5_SALES, HE6_SALES,
           HE7_SALES, HE8_SALES, HE9_SALES, HE10_SALES, HE11_SALES, HE12_SALES,
           HE13_SALES, HE14_SALES, HE15_SALES, HE16_SALES, HE17_SALES, HE18_SALES,
           HE19_SALES, HE20_SALES, HE21_SALES, HE22_SALES, HE23_SALES, HE24_SALES
      INTO v_compare_sales (1), v_compare_sales (2), v_compare_sales (3), v_compare_sales (4),
           v_compare_sales (5), v_compare_sales (6), v_compare_sales (7), v_compare_sales (8),
           v_compare_sales (9), v_compare_sales (10), v_compare_sales (11), v_compare_sales (12),
           v_compare_sales (13), v_compare_sales (14), v_compare_sales (15), v_compare_sales (16),
           v_compare_sales (17), v_compare_sales (18), v_compare_sales (19), v_compare_sales (20),
           v_compare_sales (21), v_compare_sales (22), v_compare_sales (23), v_compare_sales (24)
      FROM MYDB.SALES_BY_DAY
     WHERE reading_date = TRUNC (p_compare_date) AND location_code = p_location_code;

    FOR i
        IN (SELECT *
              FROM MYDB.SALES_BY_DAY sd
             WHERE sd.READING_DATE > (SYSDATE - 3652)
               AND sd.READING_DATE != TRUNC(p_compare_date)
               AND location_code = p_location_code)
    LOOP
        v_difference := 0;

        SELECT i.HE1_SALES, i.HE2_SALES, i.HE3_SALES, i.HE4_SALES, i.HE5_SALES, i.HE6_SALES,
               i.HE7_SALES, i.HE8_SALES, i.HE9_SALES, i.HE10_SALES, i.HE11_SALES, i.HE12_SALES,
               i.HE13_SALES, i.HE14_SALES, i.HE15_SALES, i.HE16_SALES, i.HE17_SALES, i.HE18_SALES,
               i.HE19_SALES, i.HE20_SALES, i.HE21_SALES, i.HE22_SALES, i.HE23_SALES, i.HE24_SALES
          INTO v_curr_sales (1), v_curr_sales (2), v_curr_sales (3), v_curr_sales (4),
               v_curr_sales (5), v_curr_sales (6), v_curr_sales (7), v_curr_sales (8),
               v_curr_sales (9), v_curr_sales (10), v_curr_sales (11), v_curr_sales (12),
               v_curr_sales (13), v_curr_sales (14), v_curr_sales (15), v_curr_sales (16),
               v_curr_sales (17), v_curr_sales (18), v_curr_sales (19), v_curr_sales (20),
               v_curr_sales (21), v_curr_sales (22), v_curr_sales (23), v_curr_sales (24)
          FROM DUAL;

        FOR j IN 1 .. v_hours_count
        LOOP
            v_difference := v_difference + ABS (v_compare_sales (j) - v_curr_sales (j));
        END LOOP;
        IF (v_difference < 50)
        THEN
            INSERT INTO MYDB.SALES_ANALYSIS (READING_DATE, location_code, USERID, PROXIMITY)
                VALUES (i.READING_DATE, i.location_code, p_userid, v_difference);
        END IF;

    END LOOP;
    COMMIT;
    p_message := 'Sales analysis successful. Please review the results';
EXCEPTION
    WHEN OTHERS
    THEN
        ROLLBACK;
        p_message := 'Sales analysis was not successful. Error: ' || SQLERRM;
END;

The procedure itself is pretty quick (~1 second); however, the IDE we're using suggests using BULK COLLECT in the loop, both for code cleanliness and to ensure it continues to perform well. I would like to do this, but I'm having trouble wrapping my head around how I should select the row for the compare date and then compare it to all the other rows when using that method. Would BULK COLLECT be the best way to go about this, or is there a better way to handle this many comparisons?
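
A minimal sketch of what a BULK COLLECT fetch of the candidate rows might look like (assuming the SALES_BY_DAY view described above; the literal values stand in for the procedure's parameters):

DECLARE
    -- Collection of whole rows from the view
    TYPE SalesRowTab IS TABLE OF MYDB.SALES_BY_DAY%ROWTYPE;

    v_rows   SalesRowTab;
BEGIN
    -- Fetch every candidate day in one round trip
    SELECT *
      BULK COLLECT INTO v_rows
      FROM MYDB.SALES_BY_DAY
     WHERE reading_date > (SYSDATE - 3652)
       AND reading_date != TRUNC (SYSDATE)      -- p_compare_date in the real procedure
       AND location_code = 'ABCD';              -- p_location_code in the real procedure

    -- The collection can then be walked with a numeric FOR loop, but each element
    -- still has to be compared hour column by hour column against the compare-date row.
    FOR i IN 1 .. v_rows.COUNT
    LOOP
        NULL;  -- per-row comparison would go here
    END LOOP;
END;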

Edit

The data in this view comes from a table that is structured like this, in case it would be easier to select the data from the table itself.

SALES_DATE |  HOUR_ENDING  |  LOCATION_CODE |  VALUE
__________________________________________________________
12/27/2019        1              ABCD           40
12/27/2019        2              ABCD           50
12/27/2019        3              ABCD           51

The data must be compared hourly, not on a daily total (see the image below). In that example, comparing hourly gives a total difference of 35 (ABS is applied to every hour, since I don't care whether the difference is negative or positive, just how close it is). However, if the daily totals were compared instead, the difference would only be 9.

[Image: Comparison of Total vs Hourly]
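
The original image is not available here, but the following illustrative numbers (not taken from the image) reproduce the same 35-versus-9 contrast using just three hours:

-- Illustrative only: day A = (40, 50, 60), day B = (20, 63, 58)
SELECT ABS (40 - 20) + ABS (50 - 63) + ABS (60 - 58)  AS hourly_difference,  -- 20 + 13 + 2 = 35
       ABS ((40 + 50 + 60) - (20 + 63 + 58))          AS total_difference    -- 150 - 141 = 9
  FROM DUAL;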

Upvotes: 0

Views: 262

Answers (1)

Belayer

Reputation: 14861

--- Revision
With the actual source table (rather than a pivoted view), we can build a much cleaner query. I've kept the structure of multiple CTEs, as this shows how the final result is built. I assume you already have a process in place for joining that view to the requested sales_analysis; after all, it consists of a single row per day, per location, per user. If that is not the case, then you need to post another question.

Since you neglected to specify the source table name, I created hourly_sales to be that source. Also, you will need to test extensively, since you supplied only a partial set of test data (just 3 hour-ending rows from one day) and no actual expected results for sales_analysis. In future questions, please supply the actual table DDL, not just a description, and full test data as text or, better yet, as INSERT statements (never as images). Anyway, the result follows:

create or replace procedure compare_sales_sp(
    p_compare_date       in date,
    p_location_code      in varchar2,
    p_userid             in varchar2,
    p_message            out varchar2)
is
begin
    delete from mydb.sales_analysis
     where userid = p_userid;

     insert into mydb.sales_analysis(
                               reading_date  
                              ,location_code  
                              ,userid         
                              ,proximity
                              )
        with hours_to_include as
             ( select case when trunc (sysdate) = trunc (p_compare_date)
                           then 1+to_number(to_char(sysdate, 'hh24'))
                           else 24
                      end num_hours
                 from dual
             )          
           , compare_to as
             ( select hs.* 
                 from hourly_sales hs     
                 join hours_to_include
                   on hour_ending <= num_hours
                where sales_date = p_compare_date
                  and location_code = p_location_code
             ) 
           , daily_sales as
             ( select hs.* 
                 from hourly_sales hs 
                 join hours_to_include
                   on hour_ending <= num_hours
                where location_code = p_location_code
                  and sales_date > add_months(sysdate, -120)
                  and sales_date != p_compare_date       
              )       
         select distinct   
                sales_date
              , location_code
              , p_userid 
              , prox  
           from ( select ds.sales_date
                       , ds.location_code
                       , sum(abs(ds.value - ct.value)) over( partition by ds.sales_date, ds.location_code) prox 
                    from daily_sales ds 
                    join compare_to  ct
                      on (ds.location_code = ct.location_code and 
                          ds.hour_ending   = ct.hour_ending
                         )
                 )
           where prox < 50;                                       
    commit;
    p_message := 'Sales analysis successful. Please review the results';
exception
    when others
    then
        rollback;
        p_message := 'Sales analysis was not successful. Error: ' || sqlerrm;
end compare_sales_sp;
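
If it helps for testing, the procedure can be exercised from an anonymous block like this (a sketch only; the date, location, and user id are example values, not values from your data):

declare
    v_message   varchar2(4000);
begin
    compare_sales_sp(
        p_compare_date  => date '2019-12-27',
        p_location_code => 'ABCD',
        p_userid        => 'SOME_USER',
        p_message       => v_message);
    -- show the status message returned by the procedure
    dbms_output.put_line(v_message);
end;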

---

In the long run, converting from row-by-row (a.k.a. slow-by-slow) processing to bulk processing could be beneficial. However, bulk processing typically requires more procedural logic, and in this case I would recommend against refactoring to bulk processing because it only goes halfway. Why not go all the way and get rid of the procedural code altogether? Let SQL do all the work. Your loops primarily just compute the daily amounts from the 24 hourly columns, and SQL easily does that calculation. This procedure needs just one SQL statement (not counting the delete of the prior run's results).

create or replace procedure mydb.compare_sales_sp (
    p_compare_date       in date,
    p_location_code      in varchar2,
    p_userid             in varchar2,
    p_message            out varchar2)
is
begin
    delete from mydb.sales_analysis
     where userid = p_userid;

     insert into mydb.sales_analysis(
                               reading_date  
                              ,location_code  
                              ,userid         
                              ,proximity
                              )
        with hours_to_include as
             ( select case when trunc (sysdate) = trunc (p_compare_date)
                           then 1+to_number(to_char(sysdate, 'hh24'))
                           else 24
                      end num_hours
                 from dual
             )          
           , compare_to as
             ( select num_hours, sales_date,location_code
                    , case when num_hours >= 1 then nvl(he1_sales,0) else 0 end
                     +case when num_hours >= 2 then nvl(he2_sales,0) else 0 end
                     +case when num_hours >= 3 then nvl(he3_sales,0) else 0 end
                     +case when num_hours >= 4 then nvl(he4_sales,0) else 0 end
                     +case when num_hours >= 5 then nvl(he5_sales,0) else 0 end 
                     as total_sales
                 from sales_by_day cross join hours_to_include
                where sales_date = p_compare_date
                  and location_code = p_location_code
             ) 
           , daily_sales as
             ( select sales_date,location_code
                    , case when num_hours >= 1 then nvl(he1_sales,0) else 0 end
                     +case when num_hours >= 2 then nvl(he2_sales,0) else 0 end
                     +case when num_hours >= 3 then nvl(he3_sales,0) else 0 end
                     +case when num_hours >= 4 then nvl(he4_sales,0) else 0 end
                     +case when num_hours >= 5 then nvl(he5_sales,0) else 0 end
                     as total_sales
                 from mydb.sales_by_day 
                cross join hours_to_include
                where location_code = p_location_code
                  and sales_date > add_months(sysdate, -120)  -- 120 months = 10 years
             ) 
        select ds.sales_date
             , ds.location_code
             , p_userid
             , abs(ds.total_sales - ct.total_sales) 
          from daily_sales ds
         cross join compare_to  ct 
         where abs(ct.total_sales -ds.total_sales) < 50;

    commit;
    p_message := 'Sales analysis successful. Please review the results';
exception
    when others
    then
        rollback;
        p_message := 'Sales analysis was not successful. Error: ' || sqlerrm;
end; 

I used only 5 of the necessary 24 columns, but that is enough for you to see what's needed. The queries aren't very pretty, but they are better than procedural code. That is a consequence of not normalizing your data and putting a repeating group in a single row.
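
If normalizing the base data isn't an option, one workaround (a sketch only, assuming the column names shown in the question) is to unpivot the 24 HEx_SALES columns on the fly, which yields the same one-row-per-hour shape as the hourly_sales table assumed in the revision above:

-- Unpivot the pivoted view into (sales_date, location_code, hour_ending, value) rows.
-- Note: rows where the hourly value is NULL are excluded by default.
select sales_date, location_code, hour_ending, value
  from mydb.sales_by_day
unpivot ( value for hour_ending in (
            he1_sales  as 1,  he2_sales  as 2,  he3_sales  as 3,  he4_sales  as 4,
            he5_sales  as 5,  he6_sales  as 6,  he7_sales  as 7,  he8_sales  as 8,
            he9_sales  as 9,  he10_sales as 10, he11_sales as 11, he12_sales as 12,
            he13_sales as 13, he14_sales as 14, he15_sales as 15, he16_sales as 16,
            he17_sales as 17, he18_sales as 18, he19_sales as 19, he20_sales as 20,
            he21_sales as 21, he22_sales as 22, he23_sales as 23, he24_sales as 24 ) );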

Upvotes: 3
