Handling Missing Value dengan Pendekatan Regresi pada Dataset Akuakultur Berukuran Kecil

Ricky Afiful Maula, Agus Indra Gunawan, Bima Sena Bayu Dewantara, M. Udin Harun Al Rasyid, Setiawardhana Setiawardhana, Ferry Astika Saputra, Junaedi Ispianto


Shrimp cultivation is strongly influenced by pond water quality conditions. Farmers must know the appropriate action in regulating water quality that is suitable for shrimp survival. The state of water quality can be understood by measuring pond parameters using various sensors. Installing sensors equipped with artificial intelligence modules to inform water quality conditions is the right action. However, the sensor cannot be separated from errors, so it results in not being able to get data or missing data. In this case, the approach of 5 parameters of pond water quality from 13 available parameters is carried out. This paper proposes a technique to obtain lost data caused by sensor error and looks for the best model. A simple approach can be taken, such as the Handling Missing Value (HMV), which is commonly used, namely the mean, with the K-Nearest Neighbors (KNN) classifier optimized using a grid search. However, the accuracy of this technique is still low, reaching 0.739 at 20-fold cross-validation. Calculations were carried out with other methods to further improve the prediction accuracy. It was found that Linear Regression (LR) can increase accuracy up to 0.757, which outperforms different approaches such as the statistical approach to mean 0.739, mode 0.716, median 0.734, and regression approach KNN 0.742, Lasso 0.751, Passive Aggressive Regressor (PAR) 0.737, Support Vector Regression (SVR) 0.739, Kernel Ridge (KR) 0.731, and Stochastic Gradient Descent (SGD) 0.734.


Handling Missing Value; Iterative Imputation; Algoritma Regresi; Akuakultur

Full Text:



Djumanto, Ustadi, Rustadi, and B. Triyatno, “Utilization of wastewater from vannamei shrimp pond for rearing milkfish in keburuhan coast purworejo sub-district,” Aquacultura Indonesiana, vol. 19 (1), pp. 38-46, 2018.

K. Li and L. Liu, “Preliminary research on modeling and simulation technology of artificial intelligence system,” IOP Conf. Series: Materials Science and Engineering, vol. 452, 2018.

L. Yang, Y. Liu, H. Yu, X. Fang, L. Song, D. Li, and Y. Chen, “Computer Vision models in intelligent aquaculture with emphasis on fish detection and behavior analysis: A Review,” Archives of Computational Methods in Engineering, vol. 28, no. 4, pp. 2785–2816, 2020.

S. Spänig, A. Emberger-Klein, J.-P. Sowa, A. Canbay, K. Menrad, and D. Heider, “The virtual doctor: An interactive clinical-decision-support system based on deep learning for non-invasive prediction of diabetes,” Artificial Intelligence in Medicine, vol. 100, p. 101706, 2019.

J. Huan, H. Li, M. Li, and B. Chen, “Prediction of dissolved oxygen in aquaculture based on gradient boosting decision tree and long short-term memory network: A study of chang zhou fishery demonstration base, China,” Computers and Electronics in Agriculture, vol. 175, p. 105530, 2020.

T. A. Sinshaw, C. Q. Surbeck, H. Yasarer, and Y. Najjar, “Artificial Neural Network for prediction of total nitrogen and phosphorus in US lakes,” Journal of Environmental Engineering, vol. 145, no. 6, p. 04019032, 2019.

M. S. Samsudin, A. Azid, S. I. Khalit, M. S. Sani, and F. Lananan, “Comparison of prediction model using spatial discriminant analysis for marine water quality index in mangrove estuarine zones,” Marine Pollution Bulletin, vol. 141, pp. 472–481, 2019.

A. Najah Ahmed, F. Binti Othman, H. Abdulmohsin Afan, R. Khaleel Ibrahim, C. Ming Fai, M. Shabbir Hossain, M. Ehteram, and A. Elshafie, “Machine learning methods for better water quality prediction,” Journal of Hydrology, vol. 578, p. 124084, 2019.

Q. Al-Tashi, S. J. Abdulkadir, H. M. Rais, S. Mirjalili, and H. Alhussian, “Approaches to multi-objective feature selection: A systematic literature review,” IEEE Access, vol. 8, pp. 125076–125096, 2020.

U. Moorthy and U. D. Gandhi, “A novel optimal feature selection technique for medical data classification using ANOVA based whale optimization,” Journal of Ambient Intelligence and Humanized Computing, vol. 12, no. 3, pp. 3527–3538, 2020.

M. J. Azur, E. A. Stuart, C. Frangakis, and P. J. Leaf, “Multiple imputations by chained equations: What is it and how does it work?,” International Journal of Methods in Psychiatric Research, vol. 20, no. 1, pp. 40–49, 2011.

I. R. White, P. Royston, and A. M. Wood, “Multiple imputation using chained equations: Issues and guidance for Practice,” Statistics in Medicine, vol. 30, no. 4, pp. 377–399, 2010.

DOI: https://doi.org/10.17529/jre.v18i3.25903


  • There are currently no refbacks.

View My Stats


Creative Commons License

Jurnal Rekayasa Elektrika (JRE) is published under license of Creative Commons Attribution-ShareAlike 4.0 International License.