When studying a phenomenon or process, very muchit is often necessary to know whether there is a relationship between the factors (variables) and the response function (the dependent quantity), and how close is their interaction. To do this allows regression analysis, which is performed in several stages.
One of the main stages of regression analysisis the calculation of the mathematical relationship between the factors and the response function, which allows you to quantify the relationship between them. This dependence is called the equation of regression. Formally, the least-squares method is considered to be the basic analytical method for determining the specified equation, since this method is optimal and allows smoothing out the points of the correlation field. In practice, however, finding such a function is quite difficult, since we have to rely on theoretical knowledge about the phenomenon being studied, on the experience of our predecessors in this scientific field, or through the "trial and error" method, to perform a simple search and evaluation of various functions. If successful, a regression equation will be obtained, which allows to adequately assess the effect of various factors on the response function, that is, to find the expected value of the response function (dependent variable) for certain values of the factors (dependent variables).
В качестве исходных данных для регрессионного analysis uses the values of the factor x and the corresponding value of the response function Y, obtained during the experimental part of the work. For clarity and more comfortable perception, these values are presented in tabular form.
Линейное уравнение регрессии, как правило, имеет the following form Y = a + b ∙ X. It includes a constant coefficient (constant) a, and a regression coefficient (slope) b multiplied by the value of the variable factor X. The coefficient b shows the average change in the response function when the factor value is changed by one unit. When plotting the regression equation graph using the coefficient b, one can also determine the slope of the line to the abscissa line. It should be noted that this coefficient has certain properties:
· B can take different values;
· B is not symmetric, that is, it changes its value in the case of studying the influence of Y on X;
· Unit of measurement of the correlation coefficient is the ratio of the unit of measure of the response function Y to the unit of measurement of the variables X;
· If the units of measurement of the X and Y variables change, the value of the regression coefficient also changes.
In most cases, the observed values are rareare located exactly on a straight line. In practice, it is always possible to observe a certain scatter of experimental data on the regression line, which I form the predicted values. The deviation of an individual point from the regression line from its theoretical or predicted value is called the remainder.
Very often in practice, a sampleregression equation, the main method of calculating the values of the coefficients of which is the method of least squares. The coefficients are calculated from the initial data representing the sample of the values of the variable factor and the response function.
At first glance it may seem that the calculationThe value of the coefficients entering into the regression equation is quite complex and time-consuming. But this is not so. At the service of researchers are numerous application software packages (the simplest is Microsoft Excel), which according to your input data will not only calculate all the coefficients in the equation, will be able to establish the degree of interrelation between variables and dependent values, but will present the values in graphical form.