Comprehensive Study of Data Imputation Techniques For Machine Learning Models
Authors: Vaibhav Tummalapalli
DOI: https://doi.org/10.37082/IJIRMPS.v13.i4.232674
Short DOI: https://doi.org/g9wp48
Country: United States
Full-text Research PDF File:
View |
Download
Abstract: Missing data is a significant challenge in machine learning, particularly in the development of propensity models where accurate predictions depend on complete and reliable data. This paper provides a comprehensive exploration of various imputation techniques tailored for machine learning workflows, specifically in the context of propensity modeling. Each technique is categorized by its applicability to different types of data and scenarios of missingness. The goal is to equip practitioners with the tools and knowledge to effectively handle missing data, ensuring robust and accurate propensity models
Keywords: Imputation, Machine Learning, K-Nearest Neighbors, Distance metrics, Iterative Imputation, Weight of Evidence, Propensity Models.
Paper Id: 232674
Published On: 2025-07-10
Published In: Volume 13, Issue 4, July-August 2025