EFFICIENCY SUBSTANTIATION FOR A SYNTHETICAL METHOD OF CONSTRUCTING A MULTIVARIATE POLYNOMIAL REGRESSION GIVEN BY A REDUNDANT REPRESENTATION

Authors

DOI:

https://doi.org/10.20998/2079-0023.2023.01.01

Keywords:

univariate polynomial regression, multivariate polynomial regression, redundant representation, least squares method, test sequence, repeated experiment

Abstract

In recent years, the authors in their publications have developed two different approaches to the construction of a multivariate polynomial (in particular, linear) regressions given by a redundant representation. The first approach allowed us to reduce estimation of coefficients for nonlinear terms of a multivariate polynomial regression to construction of a sequence of univariate polynomial regressions and solution of corresponding nondegenerate systems of linear equations. The second approach was implemented using an example of a multivariate linear regression given by a redundant representation and led to the creation of a method the authors called a modified group method of data handling (GMDH), as it is a modification of the well-known heuristic self-organization method of GMDH (the author of GMDH is an Academician of the National Academy of Sciences of Ukraine O. G. Ivakhnenko). The modification takes into account that giving a multivariate linear regression by redundant representation allows for construction of a set of partial representations, one of which has the structure of the desired regression, to use not a multilevel selection algorithm, but an efficient algorithm for splitting the coefficients of the multivariate linear regression into two classes. As in the classic GMDH, the solution is found using a test sequence of data. This method is easily extended to the case of a multivariate polynomial regression since the unknown coefficients appear in the multivariate polynomial regression in a linear way. Each of the two approaches has its advantages and disadvantages. The obvious next step is to combine both approaches into one. This has led to the creation of a synthetic method that implements the advantages of both approaches, partially compensating for their disadvantages. This paper presents the aggregated algorithmic structure of the synthetic method, the theoretical properties of partial cases and, as a result, the justification of its overall efficiency.

Author Biographies

Alexander Pavlov, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Doctor of Technical Sciences, Full Professor, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Kyiv, Ukraine, Professor of Informatics and Software Engineering Department

Maxim Holovchenko, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Kyiv, Ukraine, Senior Lecturer of Informatics and Software Engineering Department

Valeriia Drozd, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Bachelor of Informatics and Software Engineering Department

References

Yu L. Using negative binomial regression analysis to predict software faults: a study of Apache Ant. International Journal of Information Technology and Computer Science (IJITCS). 2012, vol. 4, no. 8, pp. 63–70. DOI: 10.5815/ijitcs.2012.08.08.

Shahrel M.Z., Mutalib S., Abdul-Rahman S. PriceCop – price monitor and prediction using linear regression and LSVM-ABC methods for e-commerce platform. International Journal of Information Engineering and Electronic Business (IJIEEB). 2021, vol. 13, no. 1, pp. 1–14. DOI: 10.5815/ijieeb.2021.01.01.

Satter A., Ibtehaz N. A regression based sensor data prediction technique to analyze data trustworthiness in cyber-physical system. International Journal of Information Engineering and Electronic Business (IJIEEB). 2018, vol. 10, no. 3, pp. 15–22. DOI: 10.5815/ ijieeb.2018.03.03.

Isabona J., Ojuh D. O. Machine learning based on kernel function controlled gaussian process regression method for in-depth extrapolative analysis of Covid-19 daily cases drift rates. International Journal of Mathematical Sciences and Computing (IJMSC). 2021, vol. 7, no. 2, pp. 14–23. DOI: 10.5815/ ijmsc.2021.02.02.

Sinha P. Multivariate polynomial regression in data mining: methodology, problems and solutions. International Journal of Scientific & Engineering Research. 2013, vol. 4, iss. 12, pp. 962–965.

Kalivas J. H. Interrelationships of multivariate regression methods using eigenvector basis sets. Journal of Chemometrics. 1999, vol. 13 (2), pp. 111–132. DOI: 10.1002/(SICI)1099-128X(199903/ 04)13:2<111::AID-CEM532>3.0.CO;2-N.

Ortiz-Herrero L., Maguregui M. I., Bartolomé L. Multivariate (O)PLS regression methods in forensic dating. TrAC Trends in Analytical Chemistry. 2021, vol. 141, 116278. DOI: 10.1016/ j.trac.2021.116278.

Guo G., Niu G., Shi Q. et al. Multi-element quantitative analysis of soils by laser induced breakdown spectroscopy (LIBS) coupled with univariate and multivariate regression methods. Analytical Methods. 2019, vol. 11, iss. 23, pp. 3006–3013. DOI: 10.1039/C9AY00890J.

Nastenko E., Pavlov V., Boyko G., Nosovets O. Mnogokriterial'nyj algoritm shagovoj regressii. Biomedychna inzheneriya i tekhnolohiya [Biomedical ingeneering and technology]. 2020, no. 3, pp. 48–53. DOI: 10.20535/2617-8974.2020.3.195661.

Babatunde G., Emmanuel A. A., Oluwaseun O. R., Bunmi O. B., Precious A. E. Impact of climatic change on agricultural product yield using k-means and multiple linear regressions. International Journal of Education and Management Engineering (IJEME). 2019, vol. 9, no. 3, pp. 16–26. DOI: 10.5815/ijeme.2019.03.02.

Hudson D. J. Statistics Lectures, Volume 2: Maximum Likelihood and Least Squares Theory. CERN Reports 64(18). Geneva, CERN, 1964. (Russ. ed.: Hudson D. Statistika dlja fizikov: Lekcii po teorii verojat¬nostej i jelementarnoj statistike. Moscow, Mir Publ., 1970. 296 p.). DOI: 10.5170/CERN-1964-018.

Pavlov A. A. Holovchenko M. N., Drozd V. V. Construction of a multivariate polynomial given by a redundant description in stochastic and deterministic formulations using an active experiment. Visnyk Nats. tekhn. un-tu "KhPI": zb. nauk. pr. Temat. vyp.: Systemnyy analiz, upravlinnya ta informatsiyni tekhnologiyi [Bulletin of the National Technical University "KhPI": a collection of scientific papers. Thematic issue: System analysis, management and information technology]. Kharkov, NTU "KhPI" Publ., 2022, no. 1 (7), pp. 3–8. DOI: 10.20998/2079-0023.2022.01.01.

Pavlov A., Holovchenko M., Mukha I. et al. A Modified Method and an Architecture of a Software for a Multivariate Polynomial Regression Building Based on the Results of a Conditional Active Experiment. Advances in Computer Science for Engineering and Education VI (ICCSEEA 2023). 2023. (to appear)

Pavlov A. A., Holovchenko M. N. Modified method of constructing a multivariate linear regression given by a redundant description. Visnyk Nats. tekhn. un-tu "KhPI": zb. nauk. pr. Temat. vyp.: Systemnyy analiz, upravlinnya ta informatsiyni tekhnologiyi [Bulletin of the National Technical University "KhPI": a collection of scientific papers. Thematic issue: System analysis, management and information technology]. Kharkov, NTU "KhPI" Publ., 2022, no. 2 (8), pp. 3–8. DOI: 10.20998/2079-0023.2022.02.01.

Downloads

Published

2023-07-15

How to Cite

Pavlov, A., Holovchenko, M., & Drozd, V. (2023). EFFICIENCY SUBSTANTIATION FOR A SYNTHETICAL METHOD OF CONSTRUCTING A MULTIVARIATE POLYNOMIAL REGRESSION GIVEN BY A REDUNDANT REPRESENTATION. Bulletin of National Technical University "KhPI". Series: System Analysis, Control and Information Technologies, (1 (9), 3–9. https://doi.org/10.20998/2079-0023.2023.01.01

Issue

Section

SYSTEM ANALYSIS AND DECISION-MAKING THEORY