EFFICIENCY SUBSTANTIATION FOR A SYNTHETICAL METHOD OF CONSTRUCTING A MULTIVARIATE POLYNOMIAL REGRESSION GIVEN BY A REDUNDANT REPRESENTATION
DOI:
https://doi.org/10.20998/2079-0023.2023.01.01Keywords:
univariate polynomial regression, multivariate polynomial regression, redundant representation, least squares method, test sequence, repeated experimentAbstract
In recent years, the authors in their publications have developed two different approaches to the construction of a multivariate polynomial (in particular, linear) regressions given by a redundant representation. The first approach allowed us to reduce estimation of coefficients for nonlinear terms of a multivariate polynomial regression to construction of a sequence of univariate polynomial regressions and solution of corresponding nondegenerate systems of linear equations. The second approach was implemented using an example of a multivariate linear regression given by a redundant representation and led to the creation of a method the authors called a modified group method of data handling (GMDH), as it is a modification of the well-known heuristic self-organization method of GMDH (the author of GMDH is an Academician of the National Academy of Sciences of Ukraine O. G. Ivakhnenko). The modification takes into account that giving a multivariate linear regression by redundant representation allows for construction of a set of partial representations, one of which has the structure of the desired regression, to use not a multilevel selection algorithm, but an efficient algorithm for splitting the coefficients of the multivariate linear regression into two classes. As in the classic GMDH, the solution is found using a test sequence of data. This method is easily extended to the case of a multivariate polynomial regression since the unknown coefficients appear in the multivariate polynomial regression in a linear way. Each of the two approaches has its advantages and disadvantages. The obvious next step is to combine both approaches into one. This has led to the creation of a synthetic method that implements the advantages of both approaches, partially compensating for their disadvantages. This paper presents the aggregated algorithmic structure of the synthetic method, the theoretical properties of partial cases and, as a result, the justification of its overall efficiency.
References
Yu L. Using negative binomial regression analysis to predict software faults: a study of Apache Ant. International Journal of Information Technology and Computer Science (IJITCS). 2012, vol. 4, no. 8, pp. 63–70. DOI: 10.5815/ijitcs.2012.08.08.
Shahrel M.Z., Mutalib S., Abdul-Rahman S. PriceCop – price monitor and prediction using linear regression and LSVM-ABC methods for e-commerce platform. International Journal of Information Engineering and Electronic Business (IJIEEB). 2021, vol. 13, no. 1, pp. 1–14. DOI: 10.5815/ijieeb.2021.01.01.
Satter A., Ibtehaz N. A regression based sensor data prediction technique to analyze data trustworthiness in cyber-physical system. International Journal of Information Engineering and Electronic Business (IJIEEB). 2018, vol. 10, no. 3, pp. 15–22. DOI: 10.5815/ ijieeb.2018.03.03.
Isabona J., Ojuh D. O. Machine learning based on kernel function controlled gaussian process regression method for in-depth extrapolative analysis of Covid-19 daily cases drift rates. International Journal of Mathematical Sciences and Computing (IJMSC). 2021, vol. 7, no. 2, pp. 14–23. DOI: 10.5815/ ijmsc.2021.02.02.
Sinha P. Multivariate polynomial regression in data mining: methodology, problems and solutions. International Journal of Scientific & Engineering Research. 2013, vol. 4, iss. 12, pp. 962–965.
Kalivas J. H. Interrelationships of multivariate regression methods using eigenvector basis sets. Journal of Chemometrics. 1999, vol. 13 (2), pp. 111–132. DOI: 10.1002/(SICI)1099-128X(199903/ 04)13:2<111::AID-CEM532>3.0.CO;2-N.
Ortiz-Herrero L., Maguregui M. I., Bartolomé L. Multivariate (O)PLS regression methods in forensic dating. TrAC Trends in Analytical Chemistry. 2021, vol. 141, 116278. DOI: 10.1016/ j.trac.2021.116278.
Guo G., Niu G., Shi Q. et al. Multi-element quantitative analysis of soils by laser induced breakdown spectroscopy (LIBS) coupled with univariate and multivariate regression methods. Analytical Methods. 2019, vol. 11, iss. 23, pp. 3006–3013. DOI: 10.1039/C9AY00890J.
Nastenko E., Pavlov V., Boyko G., Nosovets O. Mnogokriterial'nyj algoritm shagovoj regressii. Biomedychna inzheneriya i tekhnolohiya [Biomedical ingeneering and technology]. 2020, no. 3, pp. 48–53. DOI: 10.20535/2617-8974.2020.3.195661.
Babatunde G., Emmanuel A. A., Oluwaseun O. R., Bunmi O. B., Precious A. E. Impact of climatic change on agricultural product yield using k-means and multiple linear regressions. International Journal of Education and Management Engineering (IJEME). 2019, vol. 9, no. 3, pp. 16–26. DOI: 10.5815/ijeme.2019.03.02.
Hudson D. J. Statistics Lectures, Volume 2: Maximum Likelihood and Least Squares Theory. CERN Reports 64(18). Geneva, CERN, 1964. (Russ. ed.: Hudson D. Statistika dlja fizikov: Lekcii po teorii verojat¬nostej i jelementarnoj statistike. Moscow, Mir Publ., 1970. 296 p.). DOI: 10.5170/CERN-1964-018.
Pavlov A. A. Holovchenko M. N., Drozd V. V. Construction of a multivariate polynomial given by a redundant description in stochastic and deterministic formulations using an active experiment. Visnyk Nats. tekhn. un-tu "KhPI": zb. nauk. pr. Temat. vyp.: Systemnyy analiz, upravlinnya ta informatsiyni tekhnologiyi [Bulletin of the National Technical University "KhPI": a collection of scientific papers. Thematic issue: System analysis, management and information technology]. Kharkov, NTU "KhPI" Publ., 2022, no. 1 (7), pp. 3–8. DOI: 10.20998/2079-0023.2022.01.01.
Pavlov A., Holovchenko M., Mukha I. et al. A Modified Method and an Architecture of a Software for a Multivariate Polynomial Regression Building Based on the Results of a Conditional Active Experiment. Advances in Computer Science for Engineering and Education VI (ICCSEEA 2023). 2023. (to appear)
Pavlov A. A., Holovchenko M. N. Modified method of constructing a multivariate linear regression given by a redundant description. Visnyk Nats. tekhn. un-tu "KhPI": zb. nauk. pr. Temat. vyp.: Systemnyy analiz, upravlinnya ta informatsiyni tekhnologiyi [Bulletin of the National Technical University "KhPI": a collection of scientific papers. Thematic issue: System analysis, management and information technology]. Kharkov, NTU "KhPI" Publ., 2022, no. 2 (8), pp. 3–8. DOI: 10.20998/2079-0023.2022.02.01.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).