Comparison Between Hotdeck Method and Regression Method in Handling Health Science Missing Data

Onny Priskila, 101414153066 and Soenarnatalina M., NIDN. 0025126011 and Hari Basuki Notobroto, NIDN. 0025066504 (2016) Comparison Between Hotdeck Method and Regression Method in Handling Health Science Missing Data. International Journal of Preventive and Public Health Sciences, 2 (2). pp. 11-13. ISSN 2454-9223

[img] Text (Peer Review)

Download (3MB)
[img] Text (Turnitin)
Comparison Between Hotdeck Method and Regression Method in Handling Health Science Missing Data.pdf

Download (1MB)
Official URL:


Introduction: Missing data or missing value is information that is not available on a subject (case). Missing data occurs because some information on the object is not given, thus it is difficult to find or the actual information does not exist. The case of missing data is ignored as it will certainly make it difficult to obtain a high accuracy for result classification even though the most reliable classification algorithm is used. One method in handling the missing data problem is by imputation. Multiple imputation methods can be used to replace missing data with a constant value, hot deck, regression method, expectation maximization method, and multiple imputation. Purpose: To analyze, compare, and determine the best imputation method of missing data between hot deck and regression methods.Materials and Methods: Data used is the data of respondents who practice family planning in the town of Pasuruan, East Java, Indonesia, and age variable. Variable age is used as the simulation data is lost, then imputated by hot deck or regression. The original data results will be compared with the imputed data using t-test, Pearson correlation, and root mean square error (RMSE) test. Results: Results of imputation using simulated data age variable show that regression method is better than hot deck method in handling missing data on health science. Conclusion: The best method views from the results are not significant P value, r value close +1, and smallest RMSE value. Hot deck method resulted in P value not significant at 5% missing data, but the method has small r values even negative and RMSE were great. Regression method resulted in P value not significant data missing 5% and 10%. Besides looking at the results of the consistency analysis views also repeat values of P, r, and RMSE of value three methods.

Item Type: Article
Uncontrolled Keywords: Age, Hot deck, Imputation, Missing data, Regression
Subjects: R Medicine > RA Public aspects of medicine > RA1-1270 Public aspects of medicine > RA1-418.5 Medicine and the state > RA407-409.5 Health status indicators. Medical statistics and surveys
Divisions: 10. Fakultas Kesehatan Masyarakat
Onny Priskila, 101414153066UNSPECIFIED
Soenarnatalina M., NIDN. 0025126011UNSPECIFIED
Hari Basuki Notobroto, NIDN. 0025066504UNSPECIFIED
Depositing User: Tn Chusnul Chuluq
Date Deposited: 20 Sep 2019 02:28
Last Modified: 20 Sep 2019 02:28
Sosial Share:

Actions (login required)

View Item View Item