Methodologies for Imputation of Missing Values in Rice Pest Data

V. Jinubala *

STPI-Software Technology Parks of India, Hyderabad, India.

P. Jeyakumar

ICAR-Indian Institute of Rice Research, Hyderabad, India.

*Author to whom correspondence should be addressed.


Abstract

Data mining is an emerging research field in the analysis of agricultural data. A major obstacle to extracting knowledge from agricultural data is missing attribute values in the selected data set. Such deficiencies must be handled during data preprocessing in order to obtain a usable data set. The main objective of this paper is to analyse the effectiveness of various imputation methods in producing a complete data set that is better suited to data mining techniques, and to present a comparative analysis of these methods for handling missing values. The rice pest data set collected throughout Maharashtra state under the Crop Pest Surveillance and Advisory Project (CROPSAP) during 2009-2013 was used for the analysis. Four methodologies were analysed for imputation of missing values: deletion of rows, mean and median imputation, linear regression, and predictive mean matching. The comparative analysis shows that predictive mean matching performed better than the other methods and was effective for imputing missing values in a large data set.

Keywords: Agriculture data, data mining, data preprocessing, missing data, imputation methods
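
The four approaches named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the single-predictor setup, and the toy values are assumptions made here for illustration, and the predictive mean matching routine is a simplified variant that skips the random draw of regression coefficients used in full implementations.

```python
import random
import statistics

def drop_incomplete(rows):
    """Deletion of rows: keep only rows with no missing (None) value."""
    return [r for r in rows if None not in r]

def impute_central(values, how="mean"):
    """Mean or median imputation: fill None with a centre of the observed values."""
    observed = [v for v in values if v is not None]
    fill = statistics.mean(observed) if how == "mean" else statistics.median(observed)
    return [fill if v is None else v for v in values]

def fit_line(x, y):
    """Least-squares fit of y = a + b*x using only pairs where y is observed."""
    pairs = [(xi, yi) for xi, yi in zip(x, y) if yi is not None]
    mx = statistics.mean(p[0] for p in pairs)
    my = statistics.mean(p[1] for p in pairs)
    sxx = sum((xi - mx) ** 2 for xi, _ in pairs)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in pairs)
    b = sxy / sxx
    return my - b * mx, b

def impute_regression(x, y):
    """Linear regression imputation: fill None with the fitted value a + b*x."""
    a, b = fit_line(x, y)
    return [a + b * xi if yi is None else yi for xi, yi in zip(x, y)]

def impute_pmm(x, y, k=3, seed=0):
    """Simplified predictive mean matching: for each missing y, find the k
    observed cases whose predicted value is closest to the prediction for the
    missing case, then copy the observed y of one randomly chosen donor."""
    rng = random.Random(seed)
    a, b = fit_line(x, y)
    donors = [(a + b * xi, yi) for xi, yi in zip(x, y) if yi is not None]
    filled = []
    for xi, yi in zip(x, y):
        if yi is not None:
            filled.append(yi)
            continue
        pred = a + b * xi
        nearest = sorted(donors, key=lambda d: abs(d[0] - pred))[:k]
        filled.append(rng.choice(nearest)[1])
    return filled

# Hypothetical toy data (not from CROPSAP): a predictor and pest counts with gaps.
x = [20, 22, 24, 26, 28, 30]
y = [2, 4, None, 8, None, 12]

complete_rows = drop_incomplete(list(zip(x, y)))
y_mean = impute_central(y, "mean")
y_median = impute_central(y, "median")
y_regression = impute_regression(x, y)
y_pmm = impute_pmm(x, y, k=3, seed=0)
```

In this sketch, predictive mean matching differs from regression imputation in that every filled value is copied from an actually observed case, so imputed values always stay within the range of the observed data.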


How to Cite

Jinubala, V., and P. Jeyakumar. 2021. “Methodologies for Imputation of Missing Values in Rice Pest Data”. Current Journal of Applied Science and Technology 40 (5):64-73. https://doi.org/10.9734/cjast/2021/v40i531304.
