Many companies currently complement their existing relational data warehouses with big data components, such as Spark, HDFS, Kafka, S3, ... This leads to a new form of data warehouse (DW) that we call big data warehouse (BDW). This blog elaborates how BW/4HANA and the SAP Data Hub (DH) are a perfect match for building a BDW.
The idea of a BDW is prevailing in many companies and industries. This blog describes a BDW built at Netflix, this one a BDW at Sears. Many more can be found on the web. All those examples show how big data storage and processing environments complement traditional relational data warehouses by providing
Figure 1 shows a generic setup of a BDW. Usually, there are 2 to 3 storage layers involved; sometimes, the first two are collapsed into one:
Fig. 1: Many BDWs follow the pattern of these storage and processing layers.
Many SAP customers are on the same trajectory as described in the Netflix and Sears examples. All of them have run a relational DW for many years and are now evolving and complementing it with big data components. BW and BW-on-HANA are capable to play the role of the relational DW in such an environment through various connection options. However, BW/4HANA's ambition is to excel this and be well integrated with SAP's Data Hub. The latter manages the ingestion and processing layers to the left of figure 1. This is outlined in figure 2 which represents the pattern of figure 1 implemented with SAP software components.
Fig. 2: BW/4HANA and SAP's Data Hub combined.
Now, what does this tight integration between BW/4HANA and SAP's Data Hub mean? What are the specifics? This is shown in figure 3 and comprises the following features:
Fig. 3: Integration points between BW/4HANA and SAP's Data Hub.
What already exists today and what is planned to be shipped at what time is described in the roadmap shown in figure 4. Click the picture to enlarge.
Fig. 4: Roadmap of planned integration features for BW/4HANA and SAP's Data Hub.
In times of digitalization and the Internet-of-things, traditional and relational data warehouses are complemented with tooling, engines and infrastructure from the big data area. This leads to "big data warehouses" or, sometimes, also labeled "modern data warehouses". BW/4HANA and the SAP Data Hub are a perfect match in that respect.
This blog has also been published here and here. You can follow me on Twitter via @tfxz.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
User | Count |
---|---|
17 | |
11 | |
10 | |
10 | |
9 | |
8 | |
7 | |
5 | |
5 | |
5 |