ALGORITHMS FOR CONSTRUCTING A REGRESSION LINEAR WITH RESPECT TO UNKNOWN COEFFICIENTS ON A LIMITED AMOUNT OF EXPERIMENTAL DATA

Authors

DOI:

https://doi.org/10.20998/2079-0023.2025.02.02

Keywords:

multivariate regression, least squares method, least absolute deviations method, linear programming model, simplex method, optimal basis

Abstract

This publication continues the series of scientific works of the authors on the creation of algorithms for constructing multivariate regressions which are linear with respect to unknown coefficients by using linear programming models. To simplify the simulation modeling of their efficiency, we present the algorithms for the multivariate linear regression problem. The use of linear programming models requires minimizing the sum of the absolute differences used in the general procedure of the least squares method. The estimates of the unknown coefficients obtained as a result of solving the linear programming problem are linear with respect to the vector of the values ​​of the regression model in the statistical experiment. It is known that, by virtue of the Markov theorem, the estimates of the unknown coefficients obtained by the general procedure of the least squares method are efficient in the class of linear unbiased estimates. Thus, it would seem that the transition from the least squares method to the least absolute deviations used in the least squares method is a priori unproductive. But this is not so. From the proof of the Markov theorem, it follows that the linear estimation matrix must be constant and independent of the values of the regression model in the statistical experiment. The estimates obtained by the least absolute deviations method do not meet this condition. Indeed, the estimation matrix is the optimal basis for solving the linear programming problem by the simplex method and depends on the values of the regression model in the statistical experiment. Such a formulation of the problem allows introducing, into the optimization model, linear constraints that use the results of statistical tests and implement additional properties of the searched multivariate regression. The first studies of these algorithms have shown their efficiency, this allowed the authors to set the task of creating such algorithms that can not only compete with the general algorithmic procedure of the least squares method, but also be efficient for the case of a limited volume of experimental data, when the ratio of the average absolute value of the realizations of a random factor in the experiment to the average absolute value of the true regression on it is a sufficiently large value. In this case, it is incorrect to raise the problem of finding estimates of unknown coefficients that practically do not differ from the true ones, but, as experiments and, in particular, the examples given in this paper have shown, it is possible to find sufficiently good estimates of the average values of the true regression in the experiments conducted, which can be used, for example, in diagnosing the early stages of the onset of an epidemic of various diseases or in other recognition tasks.

Author Biographies

Alexander Pavlov, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Doctor of Technical Sciences, Full Professor, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Kyiv, Ukraine, Professor of Computer Science and Software Engineering Department

Anton Kushch, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Postgraduate Student of Computer Science and Software Engineering Department, Kyiv, Ukraine

References

Roustaei N. Application and interpretation of linear-regression analysis. Medical hypothesis discovery and innovation in ophthalmology. 2024, vol. 13, no. 3, pp. 151–159. DOI: doi.org/10.51329/mehdiophthal1506.

Liu J., Bellows B., Hu X. J., Wu J., Zhou Z., Soteros C., Wang L. A new time-varying coefficient regression approach for analyzing infectious disease data. Scientific Reports. 2023, vol. 13, article 14687. DOI: doi.org/10.1038/s41598-023-41551-1.

Gao C. Robust regression via multivariate regression depth. Bernoulli. 2020, vol. 26, no. 2, pp. 1139–1170. DOI: doi.org/10.3150/19-BEJ1144.

Haghighat P., Gandara D., Kang L., Anahideh H. Fair multivariate adaptive regression splines for ensuring equity and transparency. Proceedings of the AAAI Conference on Artificial Intelligence. 2024, vol. 38, no. 20, pp. 22076–22086. DOI: doi.org/10.1609/aaai.v38i20.30211.

Zhou Z., Ji J., Wang Y. Zhu Z., Chen J. Hybrid regression model via multi-variate adaptive regression spline and online sequential extreme learning machine and its application in vision servo system. International Journal of Advanced Robotic Systems. 2022, vol. 19, no. 3, pp. 1–13. DOI: doi.org/10.1177/17298806221108603.

Omer A., Sedeeq B., Ali T. A proposed hybrid method for multivariate linear regression model and multivariate wavelets (Simulation study). Polytechnic Journal of Humanities and Social Sciences. 2024, vol. 5, no. 1, pp. 112–124. Available at: https://www.researchgate.net/publication/377558021 (accessed 24.11.2025).

Abdulrahman A. T., Alshammari N. S. Factor analysis and regression analysis to find out the influencing factors that led to the countries’ debt crisis. Advances and Applications in Statistics. 2022, vol. 78, pp. 1–16. DOI: doi.org/10.17654/0972361722047.

Hünermund P., Louw B. On the nuisance of control variables in causal regression analysis. Organizational Research Methods. 2023, vol. 28, no. 1, pp. 138–151. DOI: doi.org/10.1177/10944281231219274.

Szentpéteri Sz., Csáji B. C. Finite-sample identification of linear regression models with residual-permuted sums. IEEE Control Systems Letters. 2024, vol. 8, pp. 1523–1528. DOI: doi.org/10.1109/LCSYS.2024.3412856.

Nie B., Du Y., Du J., Rao Y., Zhang Y., Zheng X., Ye N., Jin H. A novel regression method: partial least distance square regression methodology. Chemometrics and Intelligent Laboratory Systems. 2023, vol. 237, article 104827. DOI: doi.org/10.1016/j.chemolab.2023.104827.

Pavlov A. A., Holovchenko M. N., Drozd V. V. An adaptive method for building a multivariate regression. Vestnik Nats. tekhn. un-ta "KhPI": sb. nauch. tr. Temat. vyp.: Sistemnyy analiz, upravlenie i informatsionnye tekhnologii [Bulletin of the National Technical University "KhPI": a collection of scientific papers. Thematic issue: System analysis, control and information technology]. Kharkov, NTU "KhPI" Publ., 2024. no. 1 (11). pp. 3–8. DOI: doi.org/10.20998/2079-0023.2024.01.01.

Pavlov A. A., Kushch A. V. Zadacha liniynoyi rehresiyi z liniyno zalezhnymy koefitsiyentamy [Formulation of a linear regression problem with linearly dependent coefficients]. Adaptyvni systemy avtomatychnoho upravlinnya [Adaptive Systems of Automatic Control]. Kyiv, Igor Sykorsky KPI, 2025. no. 2 (47). pp. 147–157. DOI: doi.org/10.20535/1560-8956.47.2025.340201. (In Ukr.).

Downloads

Published

2025-12-29

How to Cite

Pavlov, A., & Kushch, A. (2025). ALGORITHMS FOR CONSTRUCTING A REGRESSION LINEAR WITH RESPECT TO UNKNOWN COEFFICIENTS ON A LIMITED AMOUNT OF EXPERIMENTAL DATA. Bulletin of National Technical University "KhPI". Series: System Analysis, Control and Information Technologies, (2 (14), 8–15. https://doi.org/10.20998/2079-0023.2025.02.02

Issue

Section

SYSTEM ANALYSIS AND DECISION-MAKING THEORY