Change the stopping criterion by MaxenceGollier · Pull Request #281 · JuliaSmoothOptimizers/RegularizedOptimization.jl

MaxenceGollier · 2026-01-28T18:39:24Z

Switch from $$√\xi/\nu$$ to $$‖s_{cp}‖/\nu$$, following multiple discussions, I think that this will really improve the robustness of the solvers.

I am not sure what I should do for R2DH and TRDH;
We compute

spectral_test ? prox!(s, ψ, mν∇fk, ν₁) : iprox!(s, ψ, ∇fk, dkσk)

So if the spectral_test is false, what should be the measure ? $$‖s‖/‖dkσk‖_{\infty}$$ ?

I will also make the changements for LM and LMTR. We can compare in terms of number of iteration we the other CI runs.

codecov · 2026-01-28T18:45:39Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.46%. Comparing base (e0f214d) to head (252da43).
⚠️ Report is 265 commits behind head on master.

Additional details and impacted files

@@             Coverage Diff             @@
##           master     #281       +/-   ##
===========================================
+ Coverage   61.53%   84.46%   +22.93%     
===========================================
  Files          11       13        +2     
  Lines        1292     1590      +298     
===========================================
+ Hits          795     1343      +548     
+ Misses        497      247      -250

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…y point

MaxenceGollier · 2026-01-28T19:15:01Z

At least on the CIs, it looks like this does not really change the number of iterations, but we will definitely win in terms of robustness.

dpo · 2026-01-29T16:32:03Z

In TRDH, if we assume that $D_k \succ 0$ and that $h = 0$, the unconstrained step is $s_k = -D_k^{-1} \nabla f_k$. If we also assume that that step lies inside the trust region, we have $\nabla f_k = -D_k s_k$, and thus, $\Vert \nabla f_k \Vert = \Vert D_k s_k \Vert \leq \Vert d_k \Vert_{\infty} \Vert s_k \Vert \leq \Vert s_k \Vert / \nu_k$. The inequalities are equalities if $D_k$ is a multiple of the identity. So I would use $\Vert s_k \Vert / \nu_k$.

dpo · 2026-01-29T16:32:46Z

src/LM_alg.jl

 - `sub_kwargs::NamedTuple = NamedTuple()`: a named tuple containing the keyword arguments to be sent to the subsolver. The solver will fail if invalid keyword arguments are provided to the subsolver. For example, if the subsolver is `R2Solver`, you can pass `sub_kwargs = (max_iter = 100, σmin = 1e-6,)`.

-The algorithm stops either when `√(ξₖ/νₖ) < atol + rtol*√(ξ₀/ν₀) ` or `ξₖ < 0` and `√(-ξₖ/νₖ) < neg_tol` where ξₖ := f(xₖ) + h(xₖ) - φ(sₖ; xₖ) - ψ(sₖ; xₖ), and √(ξₖ/νₖ) is a stationarity measure.
+The algorithm stops when `‖sᶜᵖ‖/ν < atol + rtol*‖s₀ᶜᵖ‖/ν ` where sᶜᵖ ∈ argminₛ f(xₖ) + ∇f(xₖ)ᵀs + ψ(s; xₖ) ½ ν⁻¹ ‖s‖².


I would keep both stopping conditions. It's not always true that one measure is smaller than the other.

Having two stopping conditions in one solver is odd in my opinion.
I could add a keyword argument to switch between these if we really want to keep it.

Numerically, I strongly believe that taking the square root of something that can suffer from catastrophic cancellation and/or be very small is a bad idea that makes the solvers not robust. Please take a look at my comment below where I see a 25% increase in problems I can solve with this PR.
I think neg_tol is also a weird argument to have and that it should be removed from all solvers.

MohamedLaghdafHABIBOULLAH · 2026-01-30T00:46:41Z

In TRDH, if we assume that D k ≻ 0 and that h = 0 , the unconstrained step is s k = − D k − 1 ∇ f k . If we also assume that that step lies inside the trust region, we have ∇ f k = − D k s k , and thus, ‖ ∇ f k ‖ = ‖ D k s k ‖ ≤ ‖ d k ‖ ∞ ‖ s k ‖ ≤ ‖ s k ‖ / ν k . The inequalities are equalities if D k is a multiple of the identity. So I would use ‖ s k ‖ / ν k .

Same argument applies on the implementation of R2DH, we have $\nabla f_k = -(D_k + \sigma_k I) s_k$, and thus, $\Vert \nabla f_k \Vert = \Vert(D_k + \sigma_k I)s_k \Vert \leq (\Vert d_k \Vert_{\infty} + \sigma_k) \Vert s_k \Vert \leq \Vert d_k \sigma_k \Vert_{\infty}\Vert s_k \Vert$.

So I suggest to use rather $\Vert d_k \sigma_k \Vert_{\infty}\Vert s_k \Vert$

MaxenceGollier · 2026-01-30T15:29:28Z

Same argument applies on the implementation of R2DH, we have ∇ f k = − ( D k + σ k I ) s k , and thus, ‖ ∇ f k ‖ = ‖ ( D k + σ k I ) s k ‖ ≤ ( ‖ d k ‖ ∞ + σ k ) ‖ s k ‖ ≤ ‖ d k σ k ‖ ∞ ‖ s k ‖ .

So I suggest to use rather ‖ d k σ k ‖ ∞ ‖ s k ‖

I don't see how $$\lVert d_k \rVert_{\infty} + \sigma_k \leq \lVert d_k \sigma_k \rVert_{\infty}$$, am I missing something ?

see JuliaSmoothOptimizers/RegularizedOptimization.jl#281

MohamedLaghdafHABIBOULLAH · 2026-01-30T20:19:11Z

Ok let me correct myself, we have $\Vert \nabla f_k \Vert = \Vert(D_k + \sigma_k I)s_k \Vert \leq \Vert d_k \sigma_k \Vert_{\infty}\Vert s_k \Vert$.
By definition of $d_k \sigma_k$ as $D_k + \sigma_k I$.

MaxenceGollier · 2026-02-03T01:47:01Z

At least on the CIs, it looks like this does not really change the number of iterations, but we will definitely win in terms of robustness.

To prove my point, see the benchmark results in my penalty solver:
MaxenceGollier/ExactPenalty.jl#48

In particular the line Hessian model = σI; Tolerance = 1e-9. This is the line where I use R2 as a subsolver; I am seeing a 25% increase in numbers of problems I can solve.

The other lines do not see such improvements because for Tolerance = 1e-3, the problem of negative ξ does not occur too often and the lines with R2N as a subsolver still fail due to the negative ξ (I only removed the negative ξ1 issue in R2N).

Can you please take a look @dpo ?

MaxenceGollier · 2026-02-03T02:16:47Z

Ok let me correct myself, we have ‖ ∇ f k ‖ = ‖ ( D k + σ k I ) s k ‖ ≤ ‖ d k σ k ‖ ∞ ‖ s k ‖ . By definition of d k σ k as D k + σ k I .

To keep things similar, can't we use the last inequality from this comment:

In TRDH, if we assume that D k ≻ 0 and that h = 0 , the unconstrained step is s k = − D k − 1 ∇ f k . If we also assume that that step lies inside the trust region, we have ∇ f k = − D k s k , and thus, ‖ ∇ f k ‖ = ‖ D k s k ‖ ≤ ‖ d k ‖ ∞ ‖ s k ‖ ≤ ‖ s k ‖ / ν k . The inequalities are equalities if D k is a multiple of the identity. So I would use ‖ s k ‖ / ν k .

to have a measure based on ν instead of d_k σ_k in R2DH?

change stopping criterion from xi to the norm of the cauchy point

84b36bc

MaxenceGollier requested review from MohamedLaghdafHABIBOULLAH and dpo January 28, 2026 18:39

MaxenceGollier added 2 commits January 28, 2026 14:03

LM & LMTR: change stopping criterion from xi to the norm of the cauch…

1ce02a6

…y point

remove deprecated

252da43

dpo reviewed Jan 29, 2026

View reviewed changes

MaxenceGollier added a commit to MaxenceGollier/ExactPenalty.jl that referenced this pull request Jan 30, 2026

Benchmark the switch-stopping-criterion branch

679acab

see JuliaSmoothOptimizers/RegularizedOptimization.jl#281

MaxenceGollier mentioned this pull request Jan 30, 2026

Benchmark the switch-stopping-criterion branch MaxenceGollier/ExactPenalty.jl#48

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change the stopping criterion#281

Change the stopping criterion#281
MaxenceGollier wants to merge 3 commits intoJuliaSmoothOptimizers:masterfrom
MaxenceGollier:switch_stopping_criterion

MaxenceGollier commented Jan 28, 2026 •

edited

Loading

Uh oh!

codecov bot commented Jan 28, 2026 •

edited

Loading

Uh oh!

MaxenceGollier commented Jan 28, 2026

Uh oh!

dpo commented Jan 29, 2026

Uh oh!

dpo Jan 29, 2026

Uh oh!

MaxenceGollier Feb 3, 2026 •

edited

Loading

Uh oh!

MohamedLaghdafHABIBOULLAH commented Jan 30, 2026

Uh oh!

MaxenceGollier commented Jan 30, 2026

Uh oh!

MohamedLaghdafHABIBOULLAH commented Jan 30, 2026 •

edited

Loading

Uh oh!

MaxenceGollier commented Feb 3, 2026 •

edited

Loading

Uh oh!

MaxenceGollier commented Feb 3, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

MaxenceGollier commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jan 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

MaxenceGollier commented Jan 28, 2026

Uh oh!

dpo commented Jan 29, 2026

Uh oh!

dpo Jan 29, 2026

Choose a reason for hiding this comment

Uh oh!

MaxenceGollier Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MohamedLaghdafHABIBOULLAH commented Jan 30, 2026

Uh oh!

MaxenceGollier commented Jan 30, 2026

Uh oh!

MohamedLaghdafHABIBOULLAH commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MaxenceGollier commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MaxenceGollier commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

MaxenceGollier commented Jan 28, 2026 •

edited

Loading

codecov bot commented Jan 28, 2026 •

edited

Loading

MaxenceGollier Feb 3, 2026 •

edited

Loading

MohamedLaghdafHABIBOULLAH commented Jan 30, 2026 •

edited

Loading

MaxenceGollier commented Feb 3, 2026 •

edited

Loading

MaxenceGollier commented Feb 3, 2026 •

edited

Loading