RESI: A Region-Splitting Imputation method for different types of missing data
作者:
Highlights:
• Only imputing numerical or categorical missing data is unpractical.
• Region-splitting iterative framework (RESI) excels at completing mix-type missing data.
• Mean integrity rate is defined to measure the whole missing degree of a dataset.
• The k-fold validation can alleviate the deviation caused by under- and overlearning.
摘要
•Only imputing numerical or categorical missing data is unpractical.•Region-splitting iterative framework (RESI) excels at completing mix-type missing data.•Mean integrity rate is defined to measure the whole missing degree of a dataset.•The k-fold validation can alleviate the deviation caused by under- and overlearning.
论文关键词:Data mining,Missing data imputation,Region-splitting,k-fold cross validation
论文评审过程:Received 6 June 2019, Revised 29 November 2020, Accepted 29 November 2020, Available online 5 December 2020, Version of Record 9 December 2020.
论文官网地址:https://doi.org/10.1016/j.eswa.2020.114425