Explainability in the ETL Layer: Making Data Transformations Transparent and Traceable
Authors: Sougandhika Tera
DOI: https://doi.org/10.37082/IJIRMPS.v14.i1.232910
Short DOI: https://doi.org/hbnbz3
Country: United States
Full-text Research PDF File:
View |
Download
Abstract: Data transformation processes in Extract, Transform, Load (ETL) pipelines are crucial in creating the inputs to AI and analytics systems, despite the fact that they often operate as "black boxes" with little transparency. This paper presents Explainable ETL, a platform for transparent, traceable, and comprehensible data transformations. We explore the integration of lineage tracking, semantic annotations, and interpretability tools like as SHAP, LIME, and metadata graphs into ETL orchestration to enhance auditability, bias detection, and regulatory compliance. The suggested architecture's direct integration of explainability modules into ETL tools allows data engineers and business users to understand why data appears as it does at each stage of the pipeline. According to experimental results, explainable ETL reduces bias propagation by 65% and improves error traceability by 92%. This tactic encourages more accountability and trust in data-driven systems by bridging the gap between responsible AI and data engineering.
Keywords: Explainable ETL, Data Lineage, SHAP, LIME, Bias Detection, Data Transformation Transparency, Responsible AI, Data Governance, ETL Orchestration, Auditability.
Paper Id: 232910
Published On: 2026-01-28
Published In: Volume 14, Issue 1, January-February 2026
All research papers published in this journal/on this website are openly accessible and licensed under