Growing crops in modern conditions is a complex task and practically combines the practices of experience and the latest methods, including information technology, which has become part of the concept of "smart farming". An important factor in the stable predicted yield is the level of soil moisture, which is the result of changes in climatic factors such as air temperature, soil temperature, intensity of solar radiation, rainfall, wind speed, etc. A methodology for processing real historical indicators of climate change in a certain geographical area with subsequent training and application of machine learning models to predict soil moisture is proposed. To build a machine learning model, the following algorithms were selected and studied: the algorithm of regression trees, random forest, linear regression, M5P algorithms and the K* algorithm. The data source for training the models is the open information resource International Soil Moisture Network (ISMN) from ismn.earth/en. , which provides data on soil moisture and temperature, air temperature, and rainfall. Other data was used from the Open Meteo information service, which provides a free API and allows you to get historical data and weather forecast in specified coordinates during specified days. A data structure was developed to train the model for further prediction of soil moisture. An architecture has been developed and a software system for predicting soil moisture based on machine learning algorithms has been created using the Spring Framework, the WEKA library and Java FX with the ability to select and study the appropriate algorithms. Experiments have been carried out and the results of the duration of model training have been presented, while the algorithms of regression trees and linear regression require the least training time. A comparison of algorithms is made according to the following criteria: learning speed, cross-testing speed, prediction speed, testing performance indicators for real historical data. Based on the results of the study, conclusions are drawn about individual algorithms, the feasibility of using them to predict soil moisture based on climatic indicators. The obtained results will make it possible to evaluate and select the best models of machine learning in the design of the information and analytical system "smart agriculture" for forecasting soil moisture.
[1] Araújo, S. O., Peres, R. S., Barata, J., Lidon, F., & Ramalho, J. C. (2021). Characterising the Agriculture 4.0 Landscape – Emerging Trends, Challenges and Opportunities. Agronomy, 11(4), 2‑37. https://doi.org/10.3390/agronomy11040667
[2] Dawn, N., Ghosh, T., Ghosh, S., Saha, A., Mukherjee, P., Sarkar, S., Guha, S., & Sanyal, T. (2023). Implementation of Artificial Intelligence, Machine Learning, and Internet of Things (IoT) in revolutionizing Agriculture: A review on recent trends and challenges. International Journal of Experimental Research and Review, 30, 190–218. https://doi.org/10.52756/ijerr.2023.v30.018
[3] Singh, A., Gaurav, K. (2023). Deep learning and data fusion to estimate surface soil moisture from multi-sensor satellite images. Sci Repб 13, 2251. https://doi.org/10.1038/s41598-023-28939-9
[4] Li, Q., Li, Z., Shangguan, W., Wang, X., Li, L. & Yu, F. (April 2022). Improving soil moisture prediction using a novel encoder-decoder model with residual learning. Computers and Electronics in Agriculture, 195, April 2022. 106816. https://doi.org/10.1016/j.compag.2022.106816
[5] Ren, Y., Ling, F., & Wang, Y. (2023). Research on Provincial-Level Soil Moisture Prediction Based on Extreme Gradient Boosting Model. Agriculture, 13(5), 927. https://doi.org/10.3390/agriculture13050927
[6] Jiang, K., Pan, Z., Pan, F., Teuling, A. J., Han, G., An, P., Chen, X., ..., & Dong, Z. (16 June 2023). Combined influence of soil moisture and atmospheric humidity on land surface temperature under different climatic background. iScience, 26(6), 106837. https://doi.org/10.1016/j.isci.2023.106837
[7] Ariyanto, D. P., Qudsi, Z. A., Sumani, Dewi, W. S., Rahayu, & Komariah. (2021). The dynamic effect of air temperature and air humidity toward soil temperature in various lands coverat KHDTK Gunung Bromo, Karanganyar – Indonesia, IOP Conf. Ser. Earth Environ. Sci., 724, 1, 012003. https://DOI10.1088/1755-1315/724/1/012003
[8] Gikunda, P. K. & Jouandeau, N. (2019). Modern CNNs for IoT Based Farms. Communications in Computer and Information Science, 1026, 68‑79. https://doi.org/10.1007/978-3-030-26630-1_7
[9] International Soil Moisture Network – [Електронний ресурс]. Retrieved from: https://ismn.earth/en/
[10] Free Weather API – [Електронний ресурс]. Retrieved from: https://open-meteo.com/
[11] Frank, E., Hall, M. A., Witten, I. H. (2016). The WEKA Workbench. In Online Appendix for Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufmann, 4th ed.; Elsevier: San Francisco, CA, USA, 1‑128.
[12] Torgo, L. (2011). Regression Trees. In: Sammut, C., Webb, G.I. (eds) Encyclopaedia of Machine Learning. Springer, Boston, MA. https://doi.org/10.1007/978-0387-30164-8711
[13] Ziqiu, Kang , Cagatay, Catal, Bedir, Tekinerdogan. (2020). Machine learning applications in production lines: A systematic literature review. Computers & Industrial Engineering, 149, 106773. https://doi.org/10.1016/j.cie.2020.106773
[14] KDAG IIT KGP Linear Regression h- [Електронний ресурс]. Retrieved from: https://kdagiit.medium.com/linear-regression-ba3fe4ba38c0
[15] Jaiswal, Jitendra, Samikannu, Rita (2017). Application of Random Forest Algorithm on Feature Subset Selection and Classification and Regression, Conference: 2017 World Congress on Computing and Communication Technologies (WCCCT), 65-68. https://doi.org/10.1109/WCCCT.2016.25
[16] Everingham, Y., Sexton, J., & Skocaj, D. (2016). Accurate prediction of sugarcane yield using a random forest algorithm Agron. Sustain. Dev. 36: 27. https://doi.org/10.1007/s13593-016-0364-z
[17] Hasup Song, Hasup Song, Injong Gi, Jihyuk Ryu, Yonghwan Kwon, Jongpil Jeong. (2023). Production Planning Forecasting System Based on M5P Algorithms and Master Data in Manufacturing Processes. Appl. Sci., 13(13), 7829. https://doi.org/10.3390/app13137829
[18] Goksu Tuysuzoglu, Kokten Ulas, Birant & Derya Birant Rainfall. (2023). Prediction Using an Ensemble Machine Learning Model Based on K-Stars, Sustainability, , 15(7), 5889. https://doi.org/10.3390/su15075889