A Comparative Study of Detrending Methods on Crop Yield Time Series for Drought Studies
Aekesh Kumar *
Department of Soil and Water Conservation Engineering, College of Technology, G. B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India.
Anil Kumar
Department of Soil and Water Conservation Engineering, College of Technology, G. B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India.
Rajesh Pratap Singh
Department of Irrigation and Drainage Engineering, College of Technology, G. B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India.
Pravendra Kumar
Department of Soil and Water Conservation Engineering, College of Technology, G. B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India.
Praveen Vikram Singh
Department of Soil and Water Conservation Engineering, College of Technology, G. B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India.
*Author to whom correspondence should be addressed.
Abstract
Detrending is a statistical technique that removes systematic variations or trends from time series data, allowing analysts to focus on the underlying patterns or fluctuations. While multiple detrending approaches have been applied but rarely discussed their consistency of outcomes and effectiveness in accurately capturing better yield trends. The validation of drought occurrences has proven to be a challenging task due to the non-stationary characteristics of time series data related to crop yield. This research utilizes time series of cotton yield data from the Marathwada region covering the period from 1998 to 2021. Three traditional trend models, including simple linear regression, second-order polynomial regression and central moving average were applied. Additionally, two machine learning models (random forest and support vector regression) were tested with a novel approach. Moreover, two decomposition models (additive and multiplicative) were used to remove non-linear trends in crop yield time series data. The performance of the chosen models was evaluated based on metrics such as root mean square error, mean absolute error, Nash-Sutcliffe efficiency, and index of agreement. The results suggest that the most effective detrending approach involves combining a random forest machine learning model with an additive decomposition model.
Keywords: Drought, machine learning, detrending, decomposition model