Modifications to LinearRegression documentation. #22561
Conversation
Thanks for the PR @thomasoliveira !
@@ -13,7 +13,7 @@ value.

 .. math:: \hat{y}(w, x) = w_0 + w_1 x_1 + ... + w_p x_p

-Across the module, we designate the vector :math:`w = (w_1,
+Across the module, we designate the vector :math:`(w_1,
We have two options:

- Still assign `w = (w_1, ...)` because this is used in the penalty terms. Then we need to add `w_0` in many places.
- Adjust the penalty terms, because we can't use `w` anymore as the intercept is not penalized.
The current documentation is formulated around `X` being a design matrix (including the constant 1 column), which implies that `w_0` is in the `w` vector. In general, I think using `X` as the design matrix is consistent with the literature on linear regression. With that in mind, I prefer option 2, where we rewrite the penalty terms with summations that do not include the `w_0` term.
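To make the design-matrix convention concrete, here is a minimal NumPy sketch (my own illustration with made-up data, not scikit-learn's implementation): prepending a constant-1 column to `X` puts the intercept `w_0` inside the coefficient vector `w`.

```python
import numpy as np

# Toy data generated from y = 2 + 3x, so the true intercept w_0 is 2.
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = 2.0 + 3.0 * X[:, 0]

# Design-matrix view: prepend a constant-1 column so w_0 is part of w.
X_design = np.hstack([np.ones((X.shape[0], 1)), X])
w, *_ = np.linalg.lstsq(X_design, y, rcond=None)

print(w)  # approximately [2., 3.]: w[0] is the intercept, w[1] the slope
```

Under this convention the fitted model is simply `X_design @ w`, which is what makes writing the prediction as `Xw` natural, but it is also why a penalty on the full `w` would wrongly penalize the intercept.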
In this case we should state that by :math:`Xw` we mean `w_0 + w_1 x_1 + ...`. A priori, I'm fine with all ways as long as we are consistent throughout the user guide (within the linear models chapter).
With option 2, how do we write an L2 penalty? Should we define a new intercept-less vector? With which name? (I would use `\tilde{w}`, but this only works in math mode, not in Python code.)
I don't have a strong opinion which option to choose as long as it is correct and consistent.
I think using `\tilde{w}` would be a nice solution from a LaTeX notation perspective. If we want to keep it simple, we could also name the vector `u`; however, that would be a departure from other literature (not a bad one, imo). I prefer the former.
@lorentzenchr @thomasjpfan @Micky774 Regarding this, I feel `\tilde{w}` is the better option for the penalty term. I can take this forward from here if everyone agrees, and I can open a new PR with all the changes.
Reference Issues/PRs
Fixes #22551.
What does this implement/fix? Explain your changes.
From the documentation at https://scikit-learn.org/stable/modules/linear_model.html#ordinary-least-squares, one could understand that LinearRegression fits a model where the intercept $w_0$ is absent, that is, where $w_0 = 0$.
To prevent this misinterpretation, the present PR rephrases:
Any other comments?