
In Correlation we study the linear correlation between two random variables x and y. We now look at the line in the xy plane that best fits the data (x_1, y_1), …, (x_n, y_n).

Recall that the equation for a straight line is y = bx + a, where

b = the slope of the line
a = the y-intercept, i.e. the value of y where the line intersects the y-axis

For our purposes, we write the equation of the best fit line as

ŷ = bx + a

For each i, we define ŷ_i as the y-value corresponding to x_i on this line, and so

ŷ_i = b x_i + a

The best fit line is the line for which the sum of the distances between each of the n data points and the line is as small as possible. A mathematically useful approach is therefore to find the line for which the following sum of squares is a minimum:

Σ (y_i − ŷ_i)^2, where the sum runs over i = 1, …, n
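To make the least-squares criterion concrete, here is a minimal Python sketch. It is an illustration only, not code from this article: the function names and the sample data are hypothetical, and the closed-form expressions for b and a are the standard least-squares formulas, which this section does not derive.

```python
def sum_squared_residuals(xs, ys, b, a):
    """Sum of (y_i - ŷ_i)^2, where ŷ_i = b * x_i + a."""
    return sum((y - (b * x + a)) ** 2 for x, y in zip(xs, ys))


def best_fit_line(xs, ys):
    """Return (b, a) minimizing the sum of squared residuals.

    Uses the standard closed-form least-squares solution (assumed here,
    not derived in the text): b = S_xy / S_xx and a = mean(y) - b * mean(x).
    """
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    s_xy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    s_xx = sum((x - x_mean) ** 2 for x in xs)
    b = s_xy / s_xx
    a = y_mean - b * x_mean
    return b, a


# Made-up sample data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

b, a = best_fit_line(xs, ys)
sse = sum_squared_residuals(xs, ys, b, a)
print(f"slope b = {b:.3f}, intercept a = {a:.3f}, sum of squares = {sse:.3f}")
```

Changing b or a slightly away from the returned values and calling sum_squared_residuals again should give a larger sum, which is exactly the property that defines the best fit line.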
