Non-Negative Least Squares (NNLS) adds a constraint to the least squares equation that all coefficient estimates must be greater than or equal to zero.
If any of the assumptions of linear regression are violated, the model may not be reliable to use for either inference or prediction.
High influence points are observations that most influence, hence the name, the shape of the regression equation.
A high leverage point specifically refers to an observation in which the value of a predictor is considered to be extreme in the feature space.
ANOVA is a special case of regression when all of the independent variables are categorical.
In matrix form, the vector of coefficient estimates is derived using the formula: (X’X)-1X’Y, where X is the design matrix where the rows correspond to the observations and columns to the features, and Y is the vector of target values.
While the original variable will likely be correlated with a higher order term constructed from the same variable, multicollinearity is not an overt cause of concern
Many non-linear relationships can be transformed into linear relationships through logarithmic and power transformations