DW Wisdom (3) - departmental business
The first two parts of this discussion covered the physical aspects of conventional DW wisdom. Today I will cover some of the business drivers for data warehouses.
When people found that their transactional systems were unsuited for BI reporting (perhaps because of the performance impact of running BI on a transactional system, or the transactional system did not hold all of the data required for reporting) they started to look towards dedicated data warehouses.
At first departmental DW systems were the norm, typically because it was a department that funded them; “Purchasing” paid for a system to report on product profitability and volumes purchased, marketing were perhaps more interested in promotion performance and customer profitability. DW performance was (or should have been!) good as all of the required data was in one place. But this lead to three distinct disadvantages:
- Departments often used their own pet data sources. Often these sources held similar, but not identical, information the sources used in other departments.
- There was duplication of data as people required similar fact data for their analysis, for example stock levels form key metrics for both purchasing and logistics. This had two effects: the duplication (or worse) of batch data load processes and wasted physical storage
- It was inherently difficult to access information outside of the department’s usual subject domain. Mainly early query tools struggled to give adequate performance when querying multiple databases.
- Identify true duplicates
- Identify records that have a high probability of being duplicates
- Identify records that contain significant errors – that is the records that fail business rules for loading
Restricting queries to a single data warehouse had two immediate paybacks
- boosted query performance as joins to disparate remote sources were avoided
- more security over data sources, authorised people only accessed approved sources