
Use DifferentiationInterface for autodiff, allow ADTypes #153

Draft · wants to merge 10 commits into master
Conversation

@gdalle (Contributor) commented Dec 4, 2024

Warning

This PR bumps the Julia compat to 1.10 instead of 1.5.

This PR introduces the use of ADTypes for autodiff package selection + DifferentiationInterface for autodiff calls (fixes #132).

  • bump package version to 7.9.0
  • bump Julia compat to 1.10
  • add ADTypes and DifferentiationInterface to dependencies
  • modify oncedifferentiable.jl, twicedifferentiable.jl and constraints.jl
  • add docs for passing autodiff=ADTypes.AutoSomething()
  • adjust count of function/gradient calls?

Tests pass locally but I might be misinterpreting the interface somehow so please double-check. I think we can handle performance improvements in a follow-up PR.
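For readers of this thread, a minimal sketch of what the new user-facing API looks like (the backend choice is illustrative; any ADTypes backend object should work the same way):

```julia
using NLSolversBase, ADTypes

f(x) = sum(abs2, x)
x0 = zeros(3)

# Previously autodiff took a Symbol such as :forward or :finite;
# with this PR, an ADTypes backend object can be passed instead.
od = OnceDifferentiable(f, x0; autodiff = ADTypes.AutoForwardDiff())
```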

@gdalle gdalle marked this pull request as draft December 4, 2024 12:02
@pkofod (Member) commented Dec 4, 2024

Looks like I never updated CI here for GitHub Actions :) I like the simplifications so far.

Comment on lines +144 to +145
function j!(_j, _x)
DI.jacobian!(c!, ccache, _j, jac_prep, backend, _x)
Contributor:

Maybe these could be made callable structs to avoid closing over c!, ccache, jac_prep and backend?
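The suggestion could look something like the sketch below (field names taken from the diff; the struct name itself is made up):

```julia
# Hypothetical callable struct replacing the j! closure above, so the
# captured objects (c!, ccache, jac_prep, backend) become concretely
# typed fields instead of closed-over variables.
struct JacobianWrapper{C,CA,P,B}
    c!::C
    ccache::CA
    jac_prep::P
    backend::B
end

function (jw::JacobianWrapper)(_j, _x)
    DI.jacobian!(jw.c!, jw.ccache, _j, jw.jac_prep, jw.backend, _x)
end
```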

Contributor Author:

These were already closures in the existing code; I thought I'd go for "minimum diff" changes first, and then we could improve later on.

Contributor:

Yeah, I saw that. It seems you're closing over more variables in the DI version, though, so it might be even more valuable to change the design.

Contributor Author:

I seem to recall that closures are not an issue as long as the captured variable is assigned only once, so we might be fine here.
In any case, to hunt down this kind of type-inference barrier we would also need better type annotations on the function fields stored in the package's various structs.
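The rule being recalled here is Julia's closure-capture behavior: a captured variable that is assigned exactly once is captured with its concrete type, whereas one that is reassigned gets wrapped in a `Core.Box`, which defeats type inference. A toy illustration, unrelated to the PR's actual code:

```julia
function captures_ok()
    a = 1.0
    f = () -> a + 1   # `a` is never reassigned: captured as a Float64
    return f()
end

function captures_boxed()
    a = 1.0
    f = () -> a + 1   # `a` is reassigned below: captured in a Core.Box
    a = 2.0
    return f()
end

# @code_warntype captures_boxed() shows the Core.Box;
# captures_ok() infers cleanly.
```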

Contributor:

I don't know. Even if it's currently the case, personally I wouldn't rely on it since in my experience the exact behaviour of the compiler is inherently unstable.


@gdalle (Contributor Author) commented Dec 4, 2024

Do we need complex number support? Because it is not available in every AD backend so at the moment DI is only tested on real numbers.

@gdalle (Contributor Author) commented Dec 4, 2024

I ran the Optim.jl test suite with the version of NLSolversBase from this PR, and the tests seem to pass (https://github.com/gdalle/Optim.jl/actions/runs/12166910179/job/33934234795), except on Julia 1.6, because recent versions of DI no longer support it.

@pkofod (Member) commented Dec 6, 2024

CI PR is merged

@gdalle (Contributor Author) commented Dec 6, 2024

Should I do another PR which adds code coverage to the test action?

@gdalle gdalle mentioned this pull request Dec 6, 2024
@pkofod (Member) commented Dec 6, 2024

@devmotion added Codecov. It should be set up now.

@gdalle (Contributor Author) commented Dec 6, 2024

Alright, closing and reopening to get coverage (hopefully)

@gdalle gdalle closed this Dec 6, 2024
@gdalle gdalle reopened this Dec 6, 2024
@gdalle (Contributor Author) commented Dec 6, 2024

Any idea why we didn't get a Codecov comment? The report is available here, but perhaps it doesn't have the right permissions to post on the PR?

@devmotion (Contributor):

Might be codecov/codecov-action#1662? It's a bit unclear what the problem actually is; the same config works fine in other repos. I wonder if the Codecov app has to be installed for the comments to appear.

@gdalle (Contributor Author) commented Dec 6, 2024

What's also weird is that the Codecov report (on the website) tells me "No file covered by tests were changed", which is obviously wrong

@devmotion (Contributor):

https://app.codecov.io/github/JuliaNLSolvers/NLSolversBase.jl/pull/153 states that this PR changes coverage

@gdalle (Contributor Author) commented Dec 6, 2024

Fixed the missed line. Should we add some docs? Or keep it on the down low until downstream tests are running smoothly too?

@gdalle gdalle changed the title Start DI integration Use DifferentiationInterface for autodiff, allow ADTypes Dec 6, 2024
@pkofod (Member) commented Dec 6, 2024

> Fixed the missed line. Should we add some docs? Or keep it on the down low until downstream tests are running smoothly too?

To be honest, I think adding docs now is better. The risk (which often materializes) is that we forget later. It will only be in the dev docs anyway, so there's no harm.

@gdalle (Contributor Author) commented Dec 6, 2024

Done

lc::AbstractVector, uc::AbstractVector,
autodiff::Symbol = :central,
chunk::ForwardDiff.Chunk = checked_chunk(lx))
# TODO: is con_jac! still useful? we ignore it here
Contributor Author:

What should we do about this? The new version of the code directly computes the Hessian of the sum of constraints

DF = copy(DF)

x_f, x_df = x_of_nans(x_seed), x_of_nans(x_seed)
f_calls, j_calls = [0,], [0,]
Contributor Author:

Should we try to preserve this call count? I don't think it makes a lot of sense for autodiff in general, because some backends do not go through the actual code to compute the gradient (unlike ForwardDiff and FiniteDiff).

Contributor Author:

In addition, I don't think this call count was present for every operator or every autodiff method

Member:

I have to look. It's done this way because people complained that it didn't match what they printed from their objective, but I agree that people will have to calculate the number themselves in the special case of finite differences.
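One way to let users recover the true evaluation count themselves, independent of how a backend traverses the code, is a small counting wrapper around the objective. This is purely illustrative and not part of the PR:

```julia
# Hypothetical counter: wrap the objective before handing it to the solver.
mutable struct Counted{F}
    f::F
    calls::Int
end
Counted(f) = Counted(f, 0)

function (cf::Counted)(x)
    cf.calls += 1
    return cf.f(x)
end

cf = Counted(x -> sum(abs2, x))
cf([1.0, 2.0]); cf([3.0])
cf.calls  # 2
```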

@pkofod (Member) commented Dec 6, 2024

> Do we need complex number support? Because it is not available in every AD backend so at the moment DI is only tested on real numbers.

Well! We do support complex numbers in Optim. I'm unsure about the current testing situation and whether it tests AD with complex inputs.

@gdalle (Contributor Author) commented Dec 6, 2024

> Well! We do support complex numbers in Optim. I'm unsure about the current testing situation and if it tests AD with complex

The thing is that it should work out of the box, because DI forwards arguments to the relevant backend without constraining their element types. However, DI has no specific tests on complex inputs, so there is nothing to prevent accidental breakage at the moment.

@gdalle (Contributor Author) commented Dec 6, 2024

Moreover, not all backends support complex numbers. For instance, ForwardDiff will fail

@devmotion (Contributor):

> Moreover, not all backends support complex numbers. For instance, ForwardDiff will fail

Complex outputs are supported: https://github.com/JuliaDiff/ForwardDiff.jl/blob/37c1d50d0f8a68fc410484057581262e1ec6d67d/test/DerivativeTest.jl#L113

@gdalle (Contributor Author) commented Dec 6, 2024

Sure, but I'm assuming we want gradients of real-valued functions, for the purposes of optimization? If those functions have complex inputs, it will fail:

julia> using ForwardDiff

julia> norm(x) = sum(abs2, x)
norm (generic function with 1 method)

julia> ForwardDiff.gradient(norm, [0.0 + im])
ERROR: ArgumentError: Cannot create a dual over scalar type ComplexF64. If the type behaves as a scalar, define ForwardDiff.can_dual(::Type{ComplexF64}) = true.

@gdalle (Contributor Author) commented Dec 6, 2024

And that's also why I'm reluctant to promise complex support in DI: every backend handles it differently, sometimes with different conventions; sometimes one operator works (ForwardDiff.derivative) and another doesn't (ForwardDiff.gradient), etc. All of this makes it really hard to obtain homogeneous behavior.

@pkofod (Member) commented Dec 6, 2024

I think we require user-written input in the complex case… @antoine-levitt might remember, but it's a long time ago :D I can also check later.

@gdalle (Contributor Author) commented Dec 6, 2024

If autodiff is only used for real numbers, that will make our life easier. Otherwise, we can also add select complex tests to DI.

@antoine-levitt (Contributor):

I don't remember, but it's reasonable to treat complex numbers just like any user-defined struct. If the AD backend has support for that (i.e. it can vjp a CustomType -> Real function and return a CustomType), great; otherwise, just ask the user to convert it to a vector-of-reals -> Real function.
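The conversion being suggested can be sketched as follows, assuming a real-valued function of complex inputs (the function and variable names are made up for illustration):

```julia
using ForwardDiff

f(z) = sum(abs2, z)  # Vector{ComplexF64} -> Real

# View the input as interleaved real/imaginary parts, so ForwardDiff
# only ever sees a real vector.
freal(v) = f(complex.(v[1:2:end], v[2:2:end]))

v = [1.0, 2.0, 3.0, -1.0]           # encodes [1 + 2im, 3 - 1im]
g = ForwardDiff.gradient(freal, v)  # for sum(abs2, ·) this is 2v
```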

@gdalle (Contributor Author) commented Dec 13, 2024

So what do we do about this PR? At the moment it passes all the tests here and all the ones in Optim.jl. Should we wait until complex numbers have more reliable handling in DI (for instance, erroring when a Hessian is requested)?

@pkofod (Member) commented Dec 21, 2024

> So what do we do about this PR? At the moment it passes all the tests here and all the ones in Optim.jl. Should we wait until complex numbers have more reliable handling in DI (for instance, erroring when a hessian is requested)?

Let me have a look over the holiday break. I have to go back and check, because I cannot remember what we actually support here, and if we do support something, we cannot just remove it in a feature release.


Successfully merging this pull request may close these issues.

Extending autodiff compatibilities
4 participants