
[WIP] add SGHMC, SGLD trajectories #113


Closed · sivapvarma wants to merge 4 commits

Conversation


@sivapvarma commented Oct 13, 2019

These are just placeholders for now; I am still getting a feel for how things are organized in AHMC, but I wanted to get started at the same time.

The goal is to fix Issue #60.

Comments are welcome.

h::Hamiltonian,
z::PhasePoint
) where {T<:Real}
z′ = step(rng, τ.integrator, h, z, τ.n_steps)
We probably only need to change this line to implement SGHMC.
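
For concreteness, a minimal sketch of what that replacement might look like, assuming the discrete SGHMC update of Chen et al. (2014). sghmc_step, η (step size), and α (friction/momentum decay) are hypothetical names; only PhasePoint, phasepoint, and the h.∂ℓπ∂θ field come from AHMC:

# Hypothetical sketch, not AHMC's actual API: a naive SGHMC transition
# replacing the leapfrog call above.
using AdvancedHMC: Hamiltonian, PhasePoint, phasepoint
using Random: AbstractRNG

function sghmc_step(rng::AbstractRNG, h::Hamiltonian, z::PhasePoint, n_steps::Int;
                    η::Real = 1e-2, α::Real = 0.1)
    θ, v = z.θ, z.r
    for _ in 1:n_steps
        θ = θ .+ v
        _, ∇ℓπ = h.∂ℓπ∂θ(θ)  # value-and-gradient tuple
        # friction-damped momentum update plus injected noise N(0, 2αη)
        v = (1 - α) .* v .+ η .* ∇ℓπ .+ sqrt(2 * α * η) .* randn(rng, length(θ))
    end
    return phasepoint(h, θ, v)  # re-wrap as a PhasePoint
end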

h::Hamiltonian,
z::PhasePoint
) where {T<:Real}
z′ = step(rng, τ.integrator, h, z, τ.n_steps)

We probably only need to change this line to implement SGLD.
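
Similarly, a minimal sketch of the SGLD variant, assuming the update rule of Welling & Teh (2011); sgld_step and ϵ (step size) are hypothetical names:

# Hypothetical sketch, not AHMC's actual API: SGLD has no momentum, so each
# step is a half-gradient move plus Gaussian noise of variance ϵ.
using AdvancedHMC: Hamiltonian
using Random: AbstractRNG

function sgld_step(rng::AbstractRNG, h::Hamiltonian, θ::AbstractVector, n_steps::Int;
                   ϵ::Real = 1e-3)
    for _ in 1:n_steps
        _, ∇ℓπ = h.∂ℓπ∂θ(θ)  # value-and-gradient tuple
        θ = θ .+ (ϵ / 2) .* ∇ℓπ .+ sqrt(ϵ) .* randn(rng, length(θ))
    end
    return θ
end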


@xukai92 (Member) commented Oct 16, 2019

I left some comments on how to proceed.


@xukai92 (Member) commented Apr 9, 2020

The commit history is messed up. Please rebase and force-push.

ToDo: computing stochastic gradients.

@sivapvarma (Author) left a comment


@xukai92 I have added the updates for SGHMC.

I am still stuck on how we compute the stochastic gradients. My main source of confusion is that there are many AD frameworks supported by Turing (ForwardDiff, ReverseDiff, Zygote, Tracker). I know Zygote is the way forward, but I keep getting lost in how all of them are handled by Turing. Any pointers to AD documentation for Turing would help.

Furthermore, we need to compute gradients on minibatches, so there are still more details of the interface design to work out.


@xukai92 (Member) commented Apr 13, 2020

> I am still stuck on how we compute the stochastic gradients. My main source of confusion is that there are many AD frameworks supported by Turing (ForwardDiff, ReverseDiff, Zygote, Tracker). I know Zygote is the way forward, but I keep getting lost in how all of them are handled by Turing. Any pointers to AD documentation for Turing would help.

I think the design here is Turing/AD-agnostic. All we need to assume is that Hamiltonian.∂ℓπ∂θ returns a tuple of value and gradient.
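
For example, a minimal sketch of that contract; ForwardDiff is just one possible backend, and ℓπ here is an arbitrary toy log-density:

# The only contract AHMC relies on: ∂ℓπ∂θ returns (log-density, gradient).
using ForwardDiff

ℓπ(θ) = -0.5 * sum(abs2, θ)                      # standard-normal log-density, up to a constant
∂ℓπ∂θ(θ) = (ℓπ(θ), ForwardDiff.gradient(ℓπ, θ))  # the (value, gradient) tuple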

> Furthermore, we need to compute gradients on minibatches, so there are still more details of the interface design to work out.

I guess there are two options:

  1. Assume the gradient is scaled correctly by the user when implementing ∂ℓπ∂θ, or
  2. Include the batch size (M) and the whole-dataset size (N) in SGLD/SGHMC and scale the gradient inside AHMC (a rough sketch follows below).
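
A rough sketch of option 2, with hypothetical names (SGLDTrajectory and scaled_gradient are illustrative, not AHMC's actual types): the trajectory carries M and N, and AHMC rescales the minibatch gradient by N/M so it is an unbiased estimate of the full-data gradient:

# Hypothetical sketch of option 2.
struct SGLDTrajectory{I}
    integrator::I
    n_steps::Int
    M::Int  # minibatch size
    N::Int  # whole-dataset size
end

function scaled_gradient(τ::SGLDTrajectory, h, θ)
    ℓ, ∇ℓ = h.∂ℓπ∂θ(θ)           # log-density and gradient on the current minibatch
    return ℓ, (τ.N / τ.M) .* ∇ℓ  # rescale to an unbiased full-data estimate
end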

@yebai closed this Apr 21, 2020