Processing of missing values in survey data using Principal Component Analysis and probabilistic Principal Component Analysis methods

Authors

  • قتيبة نبيل نايف
  • بشرى رحيم جاسم

DOI:

https://doi.org/10.33095/jeas.v24i104.90

Abstract

The idea for this research on incomplete data arose from the circumstances of our dear country and the horrors of war, which led to the loss of much important data in all aspects of life: economic, natural, health, scientific, and others. The reasons for missing data vary. Some are beyond the control of those concerned; others are deliberate and planned, because of cost or risk or because inspection is not feasible. The missing data in this study were processed using Principal Component Analysis and self-organizing map methods, using simulation. The study considered child health variables and the variables that affect children's health: breastfeeding and maternal health. The maternal health variable contained missing values, which were processed in Matlab 2015a using Principal Component Analysis (PCA) and probabilistic Principal Component Analysis. The two methods were then compared using the root mean square error. The best method for processing the missing values was PCA.
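The general idea of PCA-based imputation and its evaluation by root mean square error can be illustrated with a minimal sketch. This is not the authors' Matlab code: it is written in Python with NumPy, the function names `pca_impute` and `rmse` are hypothetical, the data are synthetic, and the iterative low-rank reconstruction shown here is only one common way to impute with PCA. Probabilistic PCA is not shown.

```python
import numpy as np

def pca_impute(X, n_components=1, n_iter=100, tol=1e-8):
    """Fill NaN entries of X by iterating a rank-k PCA reconstruction.

    Hypothetical helper for illustration; not the procedure from the paper.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    # Start from the column means of the observed entries.
    filled = np.where(missing, np.nanmean(X, axis=0), X)
    for _ in range(n_iter):
        mean = filled.mean(axis=0)
        # PCA of the centered, currently filled matrix via SVD.
        U, s, Vt = np.linalg.svd(filled - mean, full_matrices=False)
        k = n_components
        recon = (U[:, :k] * s[:k]) @ Vt[:k] + mean
        # Only the missing cells are overwritten; observed values are kept.
        updated = np.where(missing, recon, X)
        change = np.max(np.abs(updated - filled))
        filled = updated
        if change < tol:
            break
    return filled

def rmse(true_values, estimates):
    """Root mean square error between two equally shaped arrays."""
    diff = np.asarray(true_values, dtype=float) - np.asarray(estimates, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))

# Synthetic example (assumed data): three correlated variables, with
# roughly 20% of the third column removed at random, as in a simulation.
rng = np.random.default_rng(0)
t = rng.normal(size=200)
complete = np.outer(t, [1.0, 2.0, -1.0]) + 0.05 * rng.normal(size=(200, 3))
mask = np.zeros(complete.shape, dtype=bool)
mask[:, 2] = rng.random(200) < 0.2
incomplete = np.where(mask, np.nan, complete)

imputed = pca_impute(incomplete, n_components=1)
pca_error = rmse(complete[mask], imputed[mask])

# Baseline for comparison: replace each missing value by the column mean.
mean_filled = np.where(mask, np.nanmean(incomplete, axis=0), incomplete)
mean_error = rmse(complete[mask], mean_filled[mask])
```

In a simulation the true values of the removed entries are known, so the error of each imputation method can be computed on exactly those entries and the methods compared on that basis.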

Published

2018-10-23

Section

Statistical Researches

How to Cite

نايف ق.ن. and جاسم ب.ر. (2018) “Processing of missing values in survey data using Principal Component Analysis and probabilistic Principal Component Analysis methods”, Journal of Economics and Administrative Sciences, 24(104), p. 354. doi:10.33095/jeas.v24i104.90.