Performing Calculations on a Subset of OLAP data in MDX

Question

I'm trying to figure out a way I can filter out data in my cube so that I can perform time-series calculations such as a moving average using only that subset.

For example, let's say that I have a fact table with the following columns:

DayId (Key)
HourId (Key)
Value

I also have a Time dimension with a composite key of DayId and HourId. This dimension has a key for every hour over the span of 100 days, so the keys go from (1,1) to (100,24).

In the fact table there is a value for every point in time so it looks like

DayId HourId Value
1     1       50
1     2       60
1     3       75.2
...   ...     ...
100   23      87
100   24      89

Now, suppose I want to calculate a daily moving average from the beginning of time through some arbitrary point in the middle of the day. Basically, I would want to calculate the average using the last point of every day except the last one, which would use a different point in time in the middle of the day. If I was to do a moving average from day 1 to day 10, ending at noon of the 10th day (HourId 12), the data I would use for my calculation would look like:

DayId HourId Value
1     24     80
2     24     90
3     24     39
4     24     60
...   ...    ...
9     24     10 
10    12     30

In SQL, I could retrieve a set like this pretty easily:

SELECT 
    *
FROM
    [FactTable]
WHERE
    ((DayId BETWEEN 1 AND 9) AND (HourId = 24))
    OR ((DayId = 10) AND (HourId = 12))

I'm pretty new to OLAP and MDX, so I've really been struggling with the right way to do this. So far, the best I've been able to do is to perform a sub-select in my FROM clause, and essentially construct a tuple set of only the rows I want:

WITH
    MEMBER [SMA 10 Value] AS
    AVG (
        ([Time].[DayId].Lag(9):[Time].[DayId], [Time].[HourId])
        , [Value]
    )
SELECT
    {
      [Value]
      , [SMA 10 Value]
    } ON COLUMNS
    , ([Time].[DayId], [Time].[HourId]) ON ROWS
FROM
(
    SELECT
        [Measures] ON COLUMNS
        , {
            ([Time].[DayId].[1]:[Time].[DayId].[9], [Time].[HourId].[24])
            , ([Time].[DayId].[10], [Time].[HourId].[12])
    } ON ROWS
    FROM
        [Cube]
)

However, it doesn't seem to quite work right for my calculations. The moving average seems to be correct over the first 9 days, because their tuples all have the same hour ID, but when I get to the final day, instead of using the values from the previous 9 tuples, it performs the average over the previous 9 days with the 12 Hour ID.

What am I doing wrong here, is there a better way that I can filter my time dimension down to eliminate unwanted rows from my calculations?

Performing Calculations on a Subset of OLAP data in MDX

Answers (1)

Related Questions