Deprecation for Data Engineer \ Analyst
Key words: divestiture, communication process, identifying stakeholders.
Challenge
As a Data Engineer, I initiate a deprecation process and have to deprecate my data objects mitigating the risks of other pipelines failing.
As a Data Analyst, I have found an outdated data object on books catalogue that my team has inherited from a legacy team. I know that it is of a poor quality and takes significant resources of my team to maintain it. I want to deprecate it and can't predict the consequences.
Solution
The ODD Platform provides a lineage diagram to examine data flows and possibility of marking data owners so that deprecation process goes smoothly.
Scenario
- My team has inherited a data object from a legacy team describing books (ISBN, code, author, publisher). 
- We examined the object and found out that it has not been updated for almost a year due to pipeline failovers and therefore does not contain recent book releases. I have strong doubts that expenses on storage, calculation and team support can be justified. 
- I need to ensure that divestiture won’t impose downstream system failure or block my users: - As a Data analyst, I log in to the ODD Platform to find the stakeholders, SMEs or primary PoCs of the object in doubt. - As a Data Engineer, I explore a lineage diagram to check whether there are any downstream systems sourcing from the object in doubt (pipelines, dashboards, views). 
- I find out that though there is a dashboard sourcing from my object, it was not used for 6 months. 
- We decide to hold a session with stakeholders listed in ODD to double-check my findings and notify them about divestiture. 
- We send out a notification letter that this data object is going to be decommissioned in 3 months. 
- We archive the object and stop object increments; 
- After 3 months we delete the object, DAGs and archive the code base. 
Result: Deprecation process is well-managed, all users are notified in advance and risks mitigated.
Last updated
