A novel way to do selective deletion and not impact your change logs.
In many cases we have issues where we want to selectively delete records from DSO and want them to be deleted from the change log as well..
One interesting approach would be to consider using BW load processes to achieve the same.
Lets take an example :
Lets say we have ODS1 which gets loaded daily with Sales Data. Some of the records here are records for internal sales which have to be deleted.There is a cube which is being used for reporting which gets loaded daily from this DSO.
Approach 1 :
The usual way is to have a selective deletion on the DSO and then do a full load to the cube .
Process Steps :
- Load data to ODS1
- Activate data in ODS1
- Selectively delete data in ODS1
- Drop data in Cube
- Load data into cube
- Build the aggregates
Pros :- Simple method
- Easy to implement
Cons :- Repeated load of data to cube
- Aggregates have to be rebuilt every day
- High database activity
- Becomes self defeating with large data volumes
- With large data volumes – Selective deletion takes longer and longer times
Approach 2 :
Here you can look at selectively deleting the data from the DSO and then selectively deleting from the cube as well and reloading the new data alone using selective full loads from the DSO to cube
Process Steps
- Load data into ODS1
- Activate data in ODS1
- Selectively delete data in ODS1
- Selectively delete data in cube for affected time period / Delete Overlapping request
- Load selective full updates from ODS1 to cube
- Rebuild the aggregates
Pros :
- Slightly complex but still easy to achieve
- All can be done using Process chains and some simple ABAP in the infopackage
Cons :
- Aggregates have to be rebuilt every day
- High database activity
- Maintenance and validation heavy
- Gets tougher to manage load times with higher data volumes
Approach 3:
Use the BW load process and some ABAP programs to negate the values.
Process Steps
- Load data to ODS1
- Activate the data in ODS1
- Create a copy of ODS1 as a transaction ODS (ODS2)
- Have a program run in the process chain that identifies these records to be deleted and writes them to the active table of ODS2
- In the program set all the key figures to zero
- Load ODS1 from ODS2 using a full update
- Use the 0recordmode in the update to delete the records
- Continue deltas into the cube
- Drop data from ODS2
Pros :
- BW load processes are used and no deletion takes place
- Change log integrity is preserved
- Incremental building of aggregates can be done
- One time validation is required post which the whole process can be automated.
- This can be automated using a process chain
Cons :
- Requires an ABAP program
- Time to implement is slightly longer than the previous approaches
- Initial validation is required to make sure that only selected records are deleted and no additional records get deleted
A data flow for the same is available below.
We had done this for some really large DSOs and the process worked fine and the time taken was much lesser than Approach 1 or Appproach 2. Initiallly we did think that the ABAP program was going to take too much time and that it was not worth the effort but then some efficient ABAp programming and some big time failures in Approach 1 and 2 convinced us to go down Approach 3!!!