Software implementation of probabilistic-deterministic design of a chemical experiment on R
Views: 208 / PDF downloads: 37
DOI:
https://doi.org/10.32523/2616-6771-2025-151-2-130-142Keywords:
probabilistic-deterministic design of experiment, R, approximation, modeling of chemical-technological processesAbstract
This paper presents a software implementation of an algorithm for processing experimental data obtained using the probabilistic-deterministic design of experiments (PDDoE) method in the R environment. The tool enables the construction of partial and generalized dependencies between results and varying factors, supports templates of different dimensionalities, automates the selection of approximation models, and evaluates their quality using R², the nonlinear multiple correlation coefficient (RM), and its significance (tRM). Implementation is carried out entirely in R using packages such as openxlsx, dplyr, and tcltk. The system supports arithmetic, geometric, and harmonic averaging methods, automatic orthogonality checks, and prioritized model selection based on tRM, with emphasis on physically meaningful functions. Generalized equations are constructed by enumerating combinations of significant partial dependencies until the maximum tRM is achieved. The tool provides result visualization, table export, and a modular structure allowing easy extension. Users can load custom plan templates, add new metrics and approximating functions, and manually define the form of both partial and generalized models. Designed for researchers, educators, and engineers working with multifactorial systems, the tool is applicable in scientific research, chemical technology, and education. The ChatGPT language model was used in R code generation. Potential applications of AI tools in experimental data processing are briefly discussed.
Downloads
References
Modifitsirovannyy metod veroyatnostno-determinirovannogo planirovaniya eksperimenta (MVE) [Modified method of probabilistic deterministic experiment planning (MPE)]. (2013). Internet Archive: https://web.archive.org/web/20130401063933/http://analiz3.com/
Ai, Q.X., Meng, F.W., Shi, J.L., et al. (2024). Extracting structured data from organic synthesis procedures using a fine-tuned large language model. Digital Discovery 3(9), 1822–1831. https://doi.org/10.1039/d4dd00091a
Akberdin, A.A., Kim, A.S., Sultangaziev, R.B. (2018). Experiment Planning in the Simulation of Industrial Processes. Steel Transl 48, 573–577. https://doi.org/10.3103/S0967091218090024
Akhmetkarimova, Z.S., Baikenov, M.I., Dyusekenov, A.M. (2017). Mathematical simulation of the hydrogenation of borodino coal. Solid Fuel Chemistry 51(2), 111–114. https://doi.org/10.3103/S0361521917020021
Barad, M. (2014). Design of experiments (DOE) - A valuable multi-purpose methodology. Applied Mathematics 5(14), 2120–2129. https://doi.org/10.4236/am.2014.514206
Belyaev, S.V., Malyshev, V.P. (2008). Puti razvitiya veroyatnostno-determinirovannogo planirovaniya eksperimenta [Development paths of probabilistic-deterministic experimental design]. In Kompleksnaya pererabotka mineral'nogo syr'ya Kazakhstana. Sostoyanie, problemy, resheniya [Complex processing of Kazakhstan's mineral raw materials. State, problems, solutions] 9(8), 599–633.
Bose, R.C., Shrikhande, S.S., Parker, E.T. (1960). Further results on the construction of mutually orthogonal Latin squares and the falsity of Euler's conjecture. Canadian Journal of Mathematics 12, 189–203
Box, G.E.P., Jenkins, G.M., Reinsel, G.C., Ljung, G.M. (2015). Time series analysis: Forecasting and control (5th ed.). John Wiley & Sons.
Chatterjee, S. (2021). A new coefficient of correlation. Journal of the American Statistical Association 116(536), 2009–2022. https://doi.org/10.1080/01621459.2020.1773984
Cooper, A.I. (2025). Concluding remarks: Faraday Discussion on data-driven discovery in the chemical sciences. Faraday Discussions 256, 664–690. https://doi.org/10.1039/d4fd00174e
Eriksson, L., Johansson, E., Wold, S. (2014). Design of experiments (DoE) and process optimization: A review of recent publications. Organic Process Research & Development 19(11), 1605–1633. https://doi.org/10.1021/op500169m
Fisher, R.A. (1926). The arrangement of field experiments. Journal of the Ministry of Agriculture of Great Britain 33, 503–513.
Fisher, R.A. (1935). The design of experiments. Oliver and Boyd.
Fomin, V.N., Dik, A.V. (2015). Using one-way analysis of variance in the stochastic determinated design of experiment. Bulletin of Karaganda University. Ser. Chemistry 1(58), 17–20.
Fomin, V.N., et al. (2021). Optimization of the parameters of a laser induced breakdown spectrometer (LIBS) using probabilistic-deterministic design of experiment. Industrial Laboratory Diagnostics of Materials 87(5), 14–19. https://doi.org/10.26896/1028-6861-2021-87-5-14-19
Fomin, V.N., et al. (2021). Optimization of coal tar gas chromatography conditions using probabilistic-deterministic design of experiment. Bulletin of the University of Karaganda-Chemistry 104, 39–46. https://doi.org/10.31489/2021Ch4/39-46
Fomin, V.N., et al. (2022). Method for qualitative and quantitative analysis of ancient lead enamel using laser inducted breakdown spectroscopy. Bulletin of the University of Karaganda-Chemistry 108, 107–117. https://doi.org/10.31489/2022Ch4/4-22-16
Fomin, V.N., et al. (2024). Method of classification and quantitative analysis of vein quartz using LIBS and chemometric techniques. Bulletin of the L. N. Gumilyov Eurasian National University. Chemistry Geography Ecology Series 147(2), 48–60. https://doi.org/10.32523/2616-6771-2024-147-2-48-60
Fomin, V.N., Kovaleva, A.A., Aldabergenova, S.K. (2017). Ispolzovanie mnogofaktornykh peremennykh v veroyatnostno-determinirovannom planirovanii eksperimenta [Use of multifactor variables in probabilistic-deterministic design of experiment]. Vestnik Karagandinskogo Universiteta. Seriya Khimiya [Bulletin of Karaganda University. Chemistry Series] 3(87), 91–100.
Gogol, D.B., Rozhkovoy, I.E., Sadyrbekov, D.T., Makasheva, A.M. (2023). Deposition of transition metal onto carbonate materials surface: Theoretical evaluation of optimal parameters. Eurasian Journal of Chemistry 28, 4(112). https://doi.org/10.31489/2959-0663/4-23-13
Ibishev, K.S., Malyshev, V.P., Kim, S.V., Sarsembaev, B.S., Egorov, N.B. (2017). Preparation of nanosized nickel powder by direct-current electrolysis combined with high-voltage spark discharge. High Energy Chemistry 51(3), 219–223. https://doi.org/10.1134/S0018143917030055
Jankovic, A., Chaudhary, G., Goia, F. (2021). Designing the design of experiments (DOE) – An investigation on the influence of different factorial designs on the characterization of complex systems. Energy and Buildings 250, 111298. https://doi.org/10.1016/j.enbuild.2021.111298
Kallergis, G., Asgari, E., Empting, M., et al. (2025). Domain adaptable language modeling of chemical compounds identifies potent pathoblockers for Pseudomonas aeruginosa. Communications Chemistry 8(1), 114. https://doi.org/10.1038/s42004-025-01484-4
Kumari, M., Chauhan, R., Garg, P. (2025). Can LLMs revolutionize text mining in chemistry? A comparative study. Computer Standards & Interfaces 94, 103997. https://doi.org/10.1016/j.csi.2025.103997
Lee, B.C.Y., Mahtab, M.S., Neo, T.H., Farooqi, I.H., Khursheed, A. (2022). A comprehensive review of Design of Experiment (DOE) for water and wastewater treatment application - Key concepts, methodology and contextualized application. Journal of Water Process Engineering 47, 102673. https://doi.org/10.1016/j.jwpe.2022.102673
Mahmood, S.W., Basheer, G.T., Algamal, Z.Y. (2025). Quantitative structure-activity relationship modeling based on improving kernel ridge regression. Journal of Chemometrics 39(5), e70027. https://doi.org/10.1002/cem.70027
Malyshev, V.P. (1981). Probabilistic-deterministic design of experiment [Veroyatnostno-determinirivannoe planirovanie eksperimenta in Russian]. Nauka, Almaty.
Malyshev, V.P. (1994). Probabilistic-deterministic mapping [Veroyatnostno-determinirovannoe otobrazhenie in Russian]. Gylym, Karaganda.
Protodyakonov, M.M., Tedder, R.I. (1970). Methodology of rational experimental design [Metodika ratsional'nogo planirovaniya eksperimentov in Russian]. Nauka, Moscow.
R Core Team. (2024). R: A language and environment for statistical computing (Version 4.4.1). R Foundation for Statistical Computing. https://www.R-project.org/
Rampal, N., Wang, K.Y., Burigana, M., et al. (2024). Single and multi-hop question-answering datasets for reticular chemistry with GPT-4-Turbo. Journal of Chemical Theory and Computation 20(20), 9128–9137. https://doi.org/10.1021/acs.jctc.4c00805
Schilling-Wilhelmi, M., Ríos-García, M., Shabih, S., et al. (2025). From text to insight: Large language models for chemical data extraction. Chemical Society Reviews 54(3), 1125–1150. https://doi.org/10.1039/d4cs00913d
Shi, S.W., Huang, Z.W., Gu, X.X., et al. (2025). From 2015 to 2023: How machine learning aids natural product analysis. Chemistry Africa 8(2), 505–522. https://doi.org/10.1007/s42250-024-01154-3
Shi, Y., Rampal, N., Zhao, C.B., et al. (2025). Comparison of LLMs in extracting synthesis conditions and generating Q&A datasets for metal-organic frameworks. Digital Discovery. https://doi.org/10.1039/d5dd00081e
Song, W.H., Sun, H.G. (2025). Local reaction condition optimization via machine learning. Journal of Molecular Modeling 31(5), 143. https://doi.org/10.1007/s00894-025-06365-0
Székely, G.J., Rizzo, M.L., Bakirov, N.K. (2007). Measuring and testing dependence by correlation of distances. Annals of Statistics 35(6), 2769–2794. https://doi.org/10.1214/009053607000000505
Taguchi, G. (1986) Introduction to Quality Engineering: Designing Quality into Products and Processes. Asian Productivity Organization, Tokyo.
Troeglasova, A.V. (2020). Veroyatnostno-determinirovannoe planirovanie eksperimenta po razlozheniyu kremniysoderzhashchikh obraztsov [Probabilistic-deterministic design of experiment for decomposition of silicon-containing samples]. Fiziko-khimicheskie issledovaniya [Physicochemical Research] 8(2), 49–55. https://doi.org/10.33764/2618-981X-2020-8-2-49-55
Turovets, M.A., et al. (2024). Chemometric approach for the determination of vanadium by the LIBS method. Eurasian Journal of Chemistry. https://doi.org/10.31489/2959-0663/4-24-10
Downloads
Published
Issue
Section
License
Copyright (c) 2025 V. Fomin (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.