FIX use objective instead of loss for convergence in SGD #30031

Draft
glemaitre wants to merge 3 commits into main

Conversation

glemaitre (Member) commented Oct 8, 2024

closes #30027

When early stopping is not used in an SGD estimator, the stopping criterion should be based on the value of the objective function (loss function + regularization terms). However, currently only the loss function is used to decide whether or not the estimator has converged.

As a solution, I propose to modify the WeightVector Cython class so that it also computes the L1-norm on the fly. Then, when early stopping is not activated, we compute the objective function instead of the loss function alone and use it as the stopping criterion.
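
To make the quantity explicit, here is a minimal sketch of what "objective function" means in this context. The hinge loss and the Elastic-Net style penalty below mirror SGDClassifier's defaults; the function and argument names are purely illustrative, not the actual Cython implementation:

```python
import numpy as np

def sgd_objective(y_true, raw_prediction, w, alpha=1e-4, l1_ratio=0.15):
    """Average loss plus penalty, i.e. the quantity proposed as stopping criterion."""
    # Average hinge loss over the epoch (SGDClassifier's default loss);
    # y_true is in {-1, +1} and raw_prediction = X @ w + intercept.
    loss = np.mean(np.maximum(0.0, 1.0 - y_true * raw_prediction))
    # Elastic-Net style penalty:
    # alpha * (l1_ratio * ||w||_1 + 0.5 * (1 - l1_ratio) * ||w||_2 ** 2)
    penalty = alpha * (
        l1_ratio * np.abs(w).sum()
        + 0.5 * (1.0 - l1_ratio) * np.dot(w, w)
    )
    return loss + penalty
```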

Things I'm not sure about at this stage:

  • I did not check whether I handle the intercept properly.
  • I am not sure about the interaction with the tolerance: is the parameter very sensitive, and should it always be tweaked depending on the problem at hand (i.e. regression or classification)?

github-actions bot commented Oct 8, 2024

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 8c38943.

glemaitre (Member, Author)

@ogrisel @jeremiedbb @antoinebaker

I'm not super familiar with SGD myself, and I would be happy to have some eyes on this PR to tell me whether it goes in the right direction or does not make sense.

jeremiedbb (Member)

To me it makes sense to base the stopping criterion on the objective function instead of the loss, at least because of the way convergence is checked here, i.e. a small enough decrease.

Indeed, the objective function is guaranteed to decrease at each epoch, which is not the case for the loss. So there are two ways to fix this: use the objective function, or check for a small enough change instead of a decrease. I think the first option, chosen in this PR, is fine.
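
For illustration, the two options could look roughly like this (illustrative names, not the actual Cython variables):

```python
def sufficient_decrease(previous_objective, current_objective, tol):
    # Option 1 (chosen in this PR): stop once the objective no longer
    # decreases by more than tol between epochs.
    return previous_objective - current_objective < tol

def small_change(previous_loss, current_loss, tol):
    # Option 2: keep the loss but stop on a small enough change in absolute
    # value, which tolerates the loss occasionally going up between epochs.
    return abs(previous_loss - current_loss) < tol
```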

Looking at the diff here, it looks like the tol is compared with the absolute difference of the loss/objective. It's not user-friendly because scaling the dataset by some factor will lead to a different result if you keep the same tol. I really think that we should compare the tol against a relative diff of the objective function.
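
A possible sketch of such a relative criterion (the epsilon guard and the names are assumptions, not part of the PR):

```python
def converged_relative(previous_objective, current_objective, tol):
    # Compare tol against a relative decrease so that rescaling the dataset
    # (and hence the objective) does not change the stopping behaviour.
    denominator = max(abs(previous_objective), abs(current_objective), 1e-12)
    return (previous_objective - current_objective) / denominator < tol
```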

glemaitre (Member, Author)

I really think that we should compare the tol against a relative diff of the objective function.

This is a good point and we should fix it.

Development

Successfully merging this pull request may close these issues.

SGDOneClassSVM model does not converge with default stopping criteria (stops prematurely)