Study of some exponential-inverse sine-logarithmic imputation techniques under missing data using simulated data

Non-response in a survey refers to the absence of data from selected participants who fail to provide information for various reasons. Imputation is one of the most effective techniques to address non-response in sample surveys and ensure data completeness.  The most commonly used imputation methods are mean imputation, ratio imputation, and the compromised imputation method.  Mean imputation replaces all missing values with the mean of the responded values, thus reducing the variability in the data set.  To maintain a proportional link between variables, the ratio imputation technique is useful, although it assumes a strong linear relationship between the study and auxiliary variables, which may not always hold.  If violated, this can lead to biased results.  The compromised imputation method combines several techniques but still has limitations and may produce biased outcomes when underlying assumptions are not met.  To address these issues, we propose three Exponential-Inverse Sine-Logarithmic (ESL) imputation techniques along with their corresponding point estimators.  We derive the bias and mean square error (MSE) of the proposed estimators and evaluate their performance both theoretically and numerically in comparison with existing methods.  Additionally, simulated population data sets were generated using statistical software to conduct simulation studies.  Percentage relative efficiencies (PRE) were calculated to compare the performance of all estimators with respect to the mean and ratio methods.  Based on the results, we conclude that the proposed imputation techniques outperform the existing ones.

  1. Meeden G.  A decision theoretic approach to imputation in finite population sampling.  Journal of the American Statistical Association.  95 (450), 586–595 (2000).
  2. Singh S., Horn S.  Compromised imputation in survey sampling.  Metrika.  51 (3), 267–276 (2000).
  3. Singh A. K., Singh P., Singh V. K.  Exponential-Type Compromised Imputation in Survey Sampling. Journal of Statistics Applications & Probability.  3 (2), 211–217 (2014).
  4. Prasad S.  Some product exponential methods of imputation in sample surveys.  Statistics in Transition new series.  19 (1), 159–166 (2018).
  5. Singh G. N., Bhattacharyya D., Bandyopadhyay A.  Some logarithmic and sine-type imputation techniques for missing data in survey sampling in the presence of measurement errors.  Journal of Statistical Computation and Simulation.  91 (4), 713–731 (2020).
  6. Pandey R., Thakur N. S., Yadav K.  Estimation of population mean using exponential ratio type imputation method under survey non-response.  Journal of the Indian Statistical Association.  53 (1&2), 89–107 (2015).
  7. Singh B. K., Gogoi U.  Estimation of Population mean using Exponential Dual to Ratio Type Compromised Imputation for Missing data in Survey Sampling.  Journal of Statistics Applications & Probability.  6 (3), 515–522 (2017).
  8. Lee H., Rancourt E., Särndal C. E.  Experiments with Variance Estimation from Survey Data with Imputed Values.  Journal of Official Statistics.  10 (3), 231–243 (1994).
  9. R Core Team.  R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna (2022).  https://www.R-project.org.
  10. Goldfeld K., Wujciak-Jens J.  Simstudy: Simulation of Study Data.  https://cran.r-project.org/web/packages/simstudy/index.html.