Dimension Hierarchy Tables and Slowly Changing Dimensions in the Oracle Business Analytics Warehouse

The blog posts I wrote earlier in the week about ragged hierarchies and bridge tables got me thinking about how these things are represented in the Oracle Business Analytics Warehouse (OBAW), the pre-defined data warehouse that comes with the Oracle BI Applications. The OBAW comes with support for ragged hierarchies in the form of dimension hierarchy tables, I also thought I'd spotted some bridge tables in there, and I've been meaning to look at how it supports slowly changing dimensions, and so I fired up my virtual machine and took a look.

The OBAW comes with a number of tables with the _DH suffix, which complement the usual set of tables with _D (dimension), _F (fact), _A (aggregate), _DS (dimension staging) and so on. Listing out the table names in SQL*Plus, the full set in BI Apps 7.9.5 is:

SQL> select table_name
  2  from   user_tables
  3  where  table_name like '%_DH'
  4  /

TABLE_NAME
------------------------------
W_ALIGNVER_DH
W_AGREEMENT_DH
W_ALIGNMT_DH
W_INDUSTRY_DH
W_INT_ORG_DH
W_PRODUCT_DH
W_POSITION_DH
W_MED_PLAN_DH
W_OPTY_DH
W_FUND_DH
W_GEO_REGION_DH
W_PRODCAT_DH
W_PERIOD_DH
W_ORG_DH
W_REGN_DH
So what are these hierarchy tables for, given that there's already a bunch of dimensions defined in the business model and mapping layer of the repository, with their own hierarchies, levels and so on?

If we take one example, the W_INT_ORG_DH table, you'll find that it's listed in the physical model of the BI Apps repository and is aliased several times, as for example Dim_W_INT_ORG_DH_Employee_Org and Dim_W_INT_ORG_DH_Operating_Unit_Org. Taking a look a the Dim - Operating Unit Org logical table within the Core business model, you can see that this logical table maps to two logical table sources, one of which is called Dim_W_INT_ORG_D_Operating_Unit_Org and maps to a single corresponding physical alias table with the names of operating units in it, and another called Dim_W_INT_ORG_DH_Operating_Unit_Org that maps to the alias above the W_INT_ORG_DH table.

So why does a logical dimension table map to both a dimension table and a dimension hierarchy table? How does this relate to the regular, level-based hierarchies in the business model and mapping layer?

The clue is in the level-based bit. What these _DH dimension hierarchy tables are for is to flatten-out the ragged hierarchies that might be in your source system, in this case Peoplesoft, so that your internal parent-child organizational hierarchy relationships can be represented in the level-based way that OBIEE requires them. These are then used for reports that drill-down through the hierarchy (making use of regular dimension and hierarchy objects in the repository), and other dimension history tables such as W_POSITION_DH are used within the repository to secure fact table information to just staff members and their superiors.

Let's take a look at the structure of the W_INT_ORG_DH table, and the W_INT_ORG_D table that's the primary source for the other logical table source that this logical dimension table uses, to see how this works.

Looking at the W_INT_ORG_D table first, you can see that it contains the list of organizational units within the business

SQL> SELECT row_wid
  2  ,      org_num
  3  ,      org_name
  4  ,      effective_from_dt
  5  ,      effective_to_dt
  6  ,      current_flg
  7  from   w_int_org_d
  8  where  rownum < 10
  9  /

   ROW_WID ORG_NUM ORG_NAME                       EFFECTIVE EFFECTIVE C
---------- ------- ------------------------------ --------- --------- -
         0                                        01-JAN-01           Y
      6001 0       Setup Business Group           01-JAN-99 01-JAN-14 Y
      6002 202     Vision Corporation             01-JAN-99 01-JAN-14 Y
      6003 204     Vision Operations              01-JAN-99 01-JAN-14 Y
      6004 205     Manufacturing                  01-JAN-99 01-JAN-14 Y
      6005 206     Widgets Product Line           01-JAN-99 01-JAN-14 Y
      6006 207     Seattle Manufacturing          01-JAN-99 01-JAN-14 Y
      6007 208     Chicago Subassembly Plant      01-JAN-99 01-JAN-14 Y
      6008 209     Boston Manufacturing           01-JAN-99 01-JAN-14 Y

9 rows selected.
Looking at the names of the organization units, you can probably guess that some are higher in the organizational hierarchy than the others. For many organizations their organization hierarchy is stored in their source systems in a parent-child fashion and can often be unbalanced, and the ETL routine that prepares the internal organization hierarchy in the BI Apps anticipates this, initially storing this parent-child hierarchy in the W_INT_ORG_DH_TMP temporary table, with one row per organization unit per source system hierarchy, like this:
SQL> SELECT org_id
  2  ,      org_num
  3  ,      org_name
  4  ,      par_org_id
  5  ,      par_org_num
  6  ,      par_org_name
  7  ,      hierarchy_name
  8  FROM   w_int_org_dh_tmp
  9  WHERE  org_num between 0 and 209
 10  ORDER BY org_id
 11  /

ORG_ID  ORG_NUM ORG_NAME                       PAR_ORG PAR_ORG PAR_ORG_NAME           HIERARCHY_NAME
------- ------- ------------------------------ ------- ------- ---------------------- ------------------------------
0       0       Setup Business Group
202     202     Vision Corporation
204     204     Vision Operations              202     202     Vision Corporation     Vision Global Reporting
204     204     Vision Operations              202     202     Vision Corporation     US Commercial Sales (DBI)
204     204     Vision Operations              202     202     Vision Corporation     Global
204     204     Vision Operations              202     202     Vision Corporation     Primary Reporting Hierarchy
204     204     Vision Operations              205     205     Manufacturing          Vision Corp EBI Hierarchy
204     204     Vision Operations              202     202     Vision Corporation     Assets Hierarchy
204     204     Vision Operations              202     202     Vision Corporation     Single Establishment Reporting
204     204     Vision Operations              260     260     Vision Administration  PJI
204     204     Vision Operations              202     202     Vision Corporation     Vision Corp - Global
204     204     Vision Operations              458     458     Vision Services        Burden Schedule Hierarchy
204     204     Vision Operations              202     202     Vision Corporation     US DBI Hierarchy
205     205     Manufacturing                  202     202     Vision Corporation     Vision Global Reporting
205     205     Manufacturing                  204     204     Vision Operations      Single Establishment Reporting
205     205     Manufacturing                  204     204     Vision Operations      Primary Reporting Hierarchy
205     205     Manufacturing                  202     202     Vision Corporation     Vision Corp EBI Hierarchy
205     205     Manufacturing                  204     204     Vision Operations      US Commercial Sales (DBI)
205     205     Manufacturing                  204     204     Vision Operations      US DBI Hierarchy
205     205     Manufacturing                  202     202     Vision Corporation     Vision Corp - Global
206     206     Widgets Product Line           1615    1615    Widget Companies       Vision Global Reporting
206     206     Widgets Product Line           1615    1615    Widget Companies       Vision Corp - Global
206     206     Widgets Product Line           205     205     Manufacturing          Primary Reporting Hierarchy
206     206     Widgets Product Line           205     205     Manufacturing          Single Establishment Reporting
207     207     Seattle Manufacturing          204     204     Vision Operations      Global
207     207     Seattle Manufacturing          205     205     Manufacturing          Vision Corp - Global
207     207     Seattle Manufacturing          205     205     Manufacturing          Vision Global Reporting
207     207     Seattle Manufacturing          206     206     Widgets Product Line   Primary Reporting Hierarchy
207     207     Seattle Manufacturing          206     206     Widgets Product Line   Single Establishment Reporting
207     207     Seattle Manufacturing          204     204     Vision Operations      Vision Corp EBI Hierarchy
207     207     Seattle Manufacturing          204     204     Vision Operations      US DBI Hierarchy
207     207     Seattle Manufacturing          204     204     Vision Operations      US Commercial Sales (DBI)
207     207     Seattle Manufacturing          204     204     Vision Operations      DBI - Vision Operations
208     208     Chicago Subassembly Plant      205     205     Manufacturing          Vision Corp - Global
208     208     Chicago Subassembly Plant      206     206     Widgets Product Line   Single Establishment Reporting
208     208     Chicago Subassembly Plant      206     206     Widgets Product Line   Primary Reporting Hierarchy
208     208     Chicago Subassembly Plant      205     205     Manufacturing          Vision Global Reporting
209     209     Boston Manufacturing           205     205     Manufacturing          Vision Global Reporting
209     209     Boston Manufacturing           205     205     Manufacturing          Vision Corp - Global
209     209     Boston Manufacturing           207     207     Seattle Manufacturing  SEATTLE ECO HIERARCHY
209     209     Boston Manufacturing           206     206     Widgets Product Line   Single Establishment Reporting
209     209     Boston Manufacturing           204     204     Vision Operations      DBI - Vision Operations
209     209     Boston Manufacturing           206     206     Widgets Product Line   Primary Reporting Hierarchy

43 rows selected.
Now if you read my posting the other day about ragged hierarchies and OBIEE, you'll know that the semantic model used by OBIEE doesn't (currently, as of the 10gR3 release) support ragged hierarchies, requiring them instead to be "flattened out" into level-based hierarchies. This is exactly the route that the OBAW load routine takes when processing ragged hierarchies, flattening them out into tables such as W_INT_ORG_DH table, so that you now have a row per organization unit, per hierarchy, that shows their path up the organization chart. There's a number of SDE (Source-Dependent Mappings) that carry out this task, flattening the hierarchy through joining the temporary table about several times to itself using a Source Qualifier transformation, with the rest of the mapping now working with a set of flattened out level columns.

Once the data is in the W_ORG_INT_DH table, it looks like this:
SQL> SELECT row_wid
  2  ,      org_wid
  3  ,      org_hier1_num
  4  ,      org_hier2_num
  5  ,      org_hier3_num
  6  ,      org_hier8_num
  7  ,      org_top_num
  8  ,      effective_from_dt
  9  ,      effective_to_dt
 10  ,      current_flg
 11  FROM   w_int_org_dh
 12  WHERE  org_hier1_num between 0 and 209
 13  /

ROW_WID ORG_WID ORG_HIER1_NUM ORG_HIER2_NUM ORG_HIER3_NUM ORG_HIER8_NUM ORG_TOP_NUM EFFECTIVE EFFECTIVE C


    54       6004 205           205           205                         202           01-JAN-99 01-JAN-14 Y
   183       6002 202           202           202           202           202           01-JAN-99 01-JAN-14 Y
   193       6001 0             0             0             0             0             01-JAN-99 01-JAN-14 Y
    91       6008 209           209           209           204           202           01-JAN-99 01-JAN-14 Y
    92       6006 207           207           207           204           202           01-JAN-99 01-JAN-14 Y
    93       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
   102       6007 208           208           208           204           202           01-JAN-99 01-JAN-14 Y
  1230       6001 0             0             0             0             0             01-JAN-99 01-JAN-14 Y
  1411       6008 209           209           209           204           202           01-JAN-99 01-JAN-14 Y
  1412       6006 207           207           207           204           202           01-JAN-99 01-JAN-14 Y
  1413       6005 206           206           206           204           202           01-JAN-99 01-JAN-14 Y
  1414       6004 205           205           205           204           202           01-JAN-99 01-JAN-14 Y
  1415       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  1416       6002 202           202           202           202           202           01-JAN-99 01-JAN-14 Y
  1527       6007 208           208           208           204           202           01-JAN-99 01-JAN-14 Y
  3045       6004 205           205           205           205           202           01-JAN-99 01-JAN-14 Y
  3047       6008 209           209           209           205           202           01-JAN-99 01-JAN-14 Y
  2090       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  2618       6007 208           208           208           205           202           01-JAN-99 01-JAN-14 Y
  2093       6004 205           205           205           204           202           01-JAN-99 01-JAN-14 Y
  2095       6005 206           206           206           204           202           01-JAN-99 01-JAN-14 Y
  2096       6006 207           207           207           204           202           01-JAN-99 01-JAN-14 Y
  2097       6007 208           208           208           204           202           01-JAN-99 01-JAN-14 Y
  2098       6008 209           209           209           204           202           01-JAN-99 01-JAN-14 Y
  2902       6003 204           204           204           458           202           01-JAN-99 01-JAN-14 Y
  2428       6004 205           205           205           205           202           01-JAN-99 01-JAN-14 Y
  2679       6005 206           206           206           1615          202           01-JAN-99 01-JAN-14 Y
  2430       6008 209           209           209           205           202           01-JAN-99 01-JAN-14 Y
  2436       6006 207           207           207           205           202           01-JAN-99 01-JAN-14 Y
  2441       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  2696       6004 205           205           205           205           202           01-JAN-99 01-JAN-14 Y
  2701       6003 204           204           204           205           202           01-JAN-99 01-JAN-14 Y
  2703       6006 207           207           207           205           202           01-JAN-99 01-JAN-14 Y
  2208       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  3490       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  4068       6008 209           209           209           207           202           01-JAN-99 01-JAN-14 Y
  3053       6006 207           207           207           205           202           01-JAN-99 01-JAN-14 Y
  3058       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  3880       6006 207           207           207           204           202           01-JAN-99 01-JAN-14 Y
  3882       6008 209           209           209           204           202           01-JAN-99 01-JAN-14 Y
  3651       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  3666       6006 207           207           207           204           202           01-JAN-99 01-JAN-14 Y
  3183       6007 208           208           208           205           202           01-JAN-99 01-JAN-14 Y
  3979       6003 204           204           204           260           202           01-JAN-99 01-JAN-14 Y
  3465       6003 204           204           204           204           202           01-JAN-99 01-JAN-14 Y
  3236       6005 206           206           206           1615          202           01-JAN-99 01-JAN-14 Y
  3478       6004 205           205           205           204           202           01-JAN-99 01-JAN-14 Y
  3479       6006 207           207           207           204           202           01-JAN-99 01-JAN-14 Y
  3480       6004 205           205           205           204           202           01-JAN-99 01-JAN-14 Y
  3481       6006 207           207           207           204           202           01-JAN-99 01-JAN-14 Y

50 rows selected.

This table is then made available in the OBAW, and added to the Internal Organization logical table as an additional logical table source (not sure why its separate logical table source, I would have thought it'd be more efficient to just add it to the existing logical table source that gets the main dimension member information), so that reports can drill up and down the dimension hierarchy. One slight drawback to this approach that's common to all hierarchy flattening techniques is that you'll need to amend the _DH warehouse table, plus all the dependent staging and temporary tables if your source parent-child hierarchy has more than the set number (nine I think) of flattened levels, this will be particularly interesting as you'll need to amend the tables themselves plus the SDE and SIL (source-independent load) Informatica mapping that load them, quite a big task when you think about it especially as there are SDE mappings for each source and for each hierarchy category.

The eagle-eyed amongst you would also have spotted effective from, and effective to, dates in the above tables, together with a current flag column that you usually see in dimensions configured to store history (known in Kimball terms as a slowly changing dimension). That's precisely what they are, organization dimension entries having start and end dates and a flag to show whether the row you're looking at is the current (i.e. active) row, with the same columns also being present in the dimension hierarchy tables and in fact every seeded dimension table within the OBAW.

The fact that every dimension table, and in fact every dimension SIL mapping, is set up in order to support slowly changing dimensions means that you can turn on this feature just by setting the $$TYPE2_FLG mapping parameter to "Y", though you'll need to start customizing the mapping if you want to go with anything other than the default set of Type 2 trigger columns. If you compare this to SCD handling in OWB it approaches things from a slightly different way, with OWB setting SCD types at the dimension object level and the BI Apps setting it at the mapping level, for the standard set of BI Apps seeded mappings their arrangement works well but it gets a bit messy when you want to alter the set of trigger columns and of course you have to reproduce all of this mapping logic in your own mappings if you want your own custom dimensions to be handled in the same way.

Anyway, that's it for dimension history tables and slowly changing dimensions. In terms of what I want to cover next in this series, there's the integration of the BI Apps with E-Business Suite security (probably the number 1 thing I'm asked) and a low-down on the universal adapter and how it can be used to bring in your own, unsupported data sources. But for now though that's it, let me know if you've got any feedback on the above.