Javascript must be enabled to continue!
Predicting Sorghum Yield in Data-Scarce and Conflict-Affected South Sudan Using Machine Learning Techniques.
View through CrossRef
Abstract
South Sudan continues to experience conflict. Agriculture production remains a crucial food and economic security feature for the overwhelming majority of the population and Sorghum is a key staple crop. In this country, sorghum yield estimation and prediction are of great interest to support interventions from the humanitarian sector, government policies, and for small-scale farmers’ food and economic security. Ongoing conflict, limited access, and poor infrastructure suggest remote sensing and modeling as credible alternative for sorghum yield prediction in this country.
This research compares five popular regression techniques, namely Random Forest(RF), Decision Tree, Extreme Gradient Boosting (XGBoost), Support Vector Machine Learning(SVM) and Artificial Neural Network(ANN) Regressor for predicting sorghum crop yield in conflict setting in Upper Nile and Werstern Barh Al Gazal. The study uses 4 years soghum yield data, remotely sensed weather patterns, and above ground biomass proxies including NDVI, EVI, LAI characteristics
as inputs for model training and evaluation.
Preprocessing method like Fillna is used to handle missing data. The performance of each model is evaluated using metrics like RMSE, MSE, MAE, and R squared. This study wanted to assess the influence of conflict on small-scale farmers’ sorghum yield prediction using remote sensing, climate data, and vegetation proxies in a data-scarce environment.
No research so far was done to investigate the impact of conflict on yield prediction in South Sudan. We constructed a modeling framework designed to incorporate data on the likelihood of conflict occurrences sourced from both Uppsala
University and Acled, farmers perception on conflict, soil data and remotely sensed vegetation proxies to predict sorghum yield. Machine learning models were used to predict end-of-season sorghum yield and results show that Random Forest
model yielded best combination of metrics with an RMSE of 176.27 Kg/ha and an R squared of 58 percent, confirming competitive performance. Outperforming other models, XGBoost records the best RMSE of 171.68 Kg/ha and an R squared of 59 percent. Renowned for its efficacy, XGBoost exceled in capturing intricate relationships within self-declared sorghum yield data, indicating its potential for accurate predictions. Both SVM and ANN achieved relatively lower performance metrics, with an RMSE of 188.95 Kg/ha and an R squared of 50.6 percent for SVM and for ANN a RMSE of 225kg/ha and an R squared of 52.5 percent. These findings for Decision trees, Random Forest, and XGBoost are significant in predicting sorghum yield in South Sudan. The variable importance reveals conflict probability was not a significant factor and did not influence sorghum yield prediction. Cultivated land size was the most significant predictor of sorghum yield in the context of South Sudan. These findings provide insights into the potential efficacy and limitations of machine learning for predicting sorghum yield in conflict settings in support to agricultural planning and humanitarian relief interventions
Title: Predicting Sorghum Yield in Data-Scarce and Conflict-Affected South Sudan Using Machine Learning Techniques.
Description:
Abstract
South Sudan continues to experience conflict.
Agriculture production remains a crucial food and economic security feature for the overwhelming majority of the population and Sorghum is a key staple crop.
In this country, sorghum yield estimation and prediction are of great interest to support interventions from the humanitarian sector, government policies, and for small-scale farmers’ food and economic security.
Ongoing conflict, limited access, and poor infrastructure suggest remote sensing and modeling as credible alternative for sorghum yield prediction in this country.
This research compares five popular regression techniques, namely Random Forest(RF), Decision Tree, Extreme Gradient Boosting (XGBoost), Support Vector Machine Learning(SVM) and Artificial Neural Network(ANN) Regressor for predicting sorghum crop yield in conflict setting in Upper Nile and Werstern Barh Al Gazal.
The study uses 4 years soghum yield data, remotely sensed weather patterns, and above ground biomass proxies including NDVI, EVI, LAI characteristics
as inputs for model training and evaluation.
Preprocessing method like Fillna is used to handle missing data.
The performance of each model is evaluated using metrics like RMSE, MSE, MAE, and R squared.
This study wanted to assess the influence of conflict on small-scale farmers’ sorghum yield prediction using remote sensing, climate data, and vegetation proxies in a data-scarce environment.
No research so far was done to investigate the impact of conflict on yield prediction in South Sudan.
We constructed a modeling framework designed to incorporate data on the likelihood of conflict occurrences sourced from both Uppsala
University and Acled, farmers perception on conflict, soil data and remotely sensed vegetation proxies to predict sorghum yield.
Machine learning models were used to predict end-of-season sorghum yield and results show that Random Forest
model yielded best combination of metrics with an RMSE of 176.
27 Kg/ha and an R squared of 58 percent, confirming competitive performance.
Outperforming other models, XGBoost records the best RMSE of 171.
68 Kg/ha and an R squared of 59 percent.
Renowned for its efficacy, XGBoost exceled in capturing intricate relationships within self-declared sorghum yield data, indicating its potential for accurate predictions.
Both SVM and ANN achieved relatively lower performance metrics, with an RMSE of 188.
95 Kg/ha and an R squared of 50.
6 percent for SVM and for ANN a RMSE of 225kg/ha and an R squared of 52.
5 percent.
These findings for Decision trees, Random Forest, and XGBoost are significant in predicting sorghum yield in South Sudan.
The variable importance reveals conflict probability was not a significant factor and did not influence sorghum yield prediction.
Cultivated land size was the most significant predictor of sorghum yield in the context of South Sudan.
These findings provide insights into the potential efficacy and limitations of machine learning for predicting sorghum yield in conflict settings in support to agricultural planning and humanitarian relief interventions.
Related Results
Effect of sorghum flour substitution on pasting behavior of wheat flour and application of composite flour in bread
Effect of sorghum flour substitution on pasting behavior of wheat flour and application of composite flour in bread
The objective of this study was to investigate the effect of sorghum flour substitution to wheat flour on pasting and thermal properties of the composite flours as well as firmness...
Changes in the root-associated bacteria of sorghum are driven by the combined effects of salt and sorghum development
Changes in the root-associated bacteria of sorghum are driven by the combined effects of salt and sorghum development
Abstract
Background
Sorghum is an important food staple in the developing world, with the capacity to grow under severe conditions such as salinity,...
Effect of Sorghum-Mung Bean Intercropping on Sorghum-Based Cropping System in the Lowlands of North Shewa, Ethiopia
Effect of Sorghum-Mung Bean Intercropping on Sorghum-Based Cropping System in the Lowlands of North Shewa, Ethiopia
Due to decreasing land units and a decline in soil fertility, integrating mung beans into the Sorghum production system is a viable option for increasing productivity and producing...
Yield Performance and Adoption of Released Sorghum Varieties in Ethiopia
Yield Performance and Adoption of Released Sorghum Varieties in Ethiopia
Sorghum national average productivity in Ethiopia is 2.1 tons/ha which is far below the global average of 3.2 tons/ha due to the problem of drought, striga, insect pest (stalk bore...
Flavonoid Biosynthesis Pathway Participating in Salt Resistance in a Landrace Sweet Sorghum Revealed by RNA-Sequencing Comparison With Grain Sorghum
Flavonoid Biosynthesis Pathway Participating in Salt Resistance in a Landrace Sweet Sorghum Revealed by RNA-Sequencing Comparison With Grain Sorghum
Abiotic stresses affect crop productivity worldwide. Plants have developed defense mechanisms against environmental stresses by altering the gene expression pattern which leads to ...
Sorghum Production in Northern Namibia: Farmers’ Perceived Constraints and Trait Preferences
Sorghum Production in Northern Namibia: Farmers’ Perceived Constraints and Trait Preferences
Sorghum (Sorghum bicolor [L.] Moench) is a valuable crop in the dry regions of the world, including Namibia. Due to the intensity and recurrence of drought and heat stress in the t...
LEADERSHIP AND MANAGEMENT IN SOUTH SUDAN: A THOUGHT-PROVOKING REVIEW
LEADERSHIP AND MANAGEMENT IN SOUTH SUDAN: A THOUGHT-PROVOKING REVIEW
The paper has argued the vitality of leadership and management in South Sudan. It does so by thought-provokingly reviewing the current situation of leadership and management in the...
Evaluation of Western Ethiopian Sorghum Landraces for Resistance to Striga hermonthica (Del.) Benth
Evaluation of Western Ethiopian Sorghum Landraces for Resistance to Striga hermonthica (Del.) Benth
Abstract
Striga hermonthica (Del.) Benth is an obligate root parasite that causes severe yield losses in sorghum production in semi-arid areas. It reduces yields in sorghum...


