Implement gradlogpartition for Exponential Family Distributions #149

Nimrais · 2023-12-14T14:04:59Z

This PR aims to close issue #130: it implements getgradlogpartition for exponential family distribution types from Distributions.jl

Derivations to do this feature we already prepared.

To test new functionality a new type of test for the exponential family interface was added: test_gradlogpartion_against_expectation. Based on the fact that expectation of the sufficient statistics is equal to the gradient of the logpartition:

$$\mathbf{E}[T(x)] = \nabla_{\eta} A(\eta)$$

However the straightforward check would be to long to run:

$$\mathbf{E}[T(x)]$$

can be generically approximated with a monte-carlo estimator

$$\frac{1}{N}\sum T(z_{i})$$

but this estimator is too slow to converge for the testing purposes.

So another with a faster convergence rate (the linear convergence rate)

$$\mathbf{E}[T(x)]^{T} F(\eta)^{-1} \mathbf{E}[T(x)]= \nabla_{\eta} A(\eta)^{T} F(\eta)^{-1} \nabla_{\eta} A(\eta)$$

is used and a sanity check that dimensionality of the gradient and the natural parameters are the same.

Tasks to do:

So please open a PR with getgradlogpartition for your type implemented into this branch.

…gument

…nential

add getgradlogpartition Exponential

…gument

add getgradlogpartition function for Poisson

add getgradlogpartition function for Bernoulli

…nted

Add gradient of binomial

Add gradient of beta

biaslab to reactivebayes

Add gradient of Erlang

Nimrais · 2024-01-22T19:58:27Z

@bvdmitri, I believe this PR is now ready for review.

Please note that the Contingency, Multinomial, and Continuous Bernoulli distributions will not be implemented in this PR. This is because we currently lack comprehensive functionality for them, and they haven’t been added to the package yet.

I’ve suppressed the test for MvNormalWishart. This is because, first, we need to implement its variate type, I believe. Subsequently, we should refactor this PR with its gradlogpartition: #173 accordingly.

The only issue I foresee is with the MatrixDirichlet. Sampling for it seems to be numerically problematic. Perhaps we could choose a non-random point for evaluation as a workaround?

Nimrais · 2024-01-25T13:48:21Z

@bvdmitri @ismailsenoz The problem is completely numerical as I see for MatrixDirichlet, it manifests it with even Beta sometimes samples are 0 exactly and log sufficient statistics function is producing NaNs. I tested scipy for this scenario as well and it also can produce samples that are exactly 0.

bvdmitri · 2024-01-31T09:46:42Z

@Nimrais can you provide a MWE? My very simple thus naive test does not produce 0 samples

julia> any(iszero, rand(Beta(2, 7), 1000000))
false

julia> any(sample -> sample ≈ 0, rand(Beta(2, 7), 1000000))
false

Nimrais · 2024-01-31T09:48:13Z

@Nimrais can you provide a MWE? My very simple thus naive test does not produce 0 samples
julia> any(iszero, rand(Beta(2, 7), 1000000))
false

julia> any(sample -> sample ≈ 0, rand(Beta(2, 7), 1000000))
false

julia> rand(Beta(0.001, 10), 1000)
1000-element Vector{Float64}:
 2.501697090499602e-61
 1.5363386828054128e-117
 0.0
 2.2934573763809164e-19
 0.0
 5.025249983628666e-188
 3.5214471095207316e-46
 ⋮
 1.2648082681768658e-96
 0.0
 8.892674549717065e-308
 4.820971913271643e-100
 0.0
 5.115976241095818e-165
 0.0

Nimrais · 2024-01-31T09:50:28Z

You can check our Direchlet test distributions, some marginal of them are pretty close to ill-behaved Betas, one parameter is super small and another is big.

bvdmitri · 2024-01-31T10:08:23Z

Got it! Lets address this in a separate issue

codecov · 2024-01-31T10:40:28Z

Codecov Report

Attention: 47 lines in your changes are missing coverage. Please review.

Comparison is base (65b1623) 79.77% compared to head (6f31e89) 80.21%.
Report is 23 commits behind head on main.

Files	Patch %	Lines
src/distributions/wishart.jl	46.15%	7 Missing ⚠️
src/distributions/lognormal.jl	50.00%	5 Missing ⚠️
src/distributions/matrix_dirichlet.jl	50.00%	5 Missing ⚠️
src/distributions/normal_family/normal_family.jl	76.47%	4 Missing ⚠️
src/distributions/weibull.jl	42.85%	4 Missing ⚠️
src/distributions/dirichlet.jl	50.00%	3 Missing ⚠️
src/distributions/gamma_family/gamma_family.jl	50.00%	3 Missing ⚠️
src/distributions/geometric.jl	50.00%	3 Missing ⚠️
src/distributions/negative_binomial.jl	50.00%	3 Missing ⚠️
src/distributions/pareto.jl	50.00%	3 Missing ⚠️
... and 5 more

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #149      +/-   ##
==========================================
+ Coverage   79.77%   80.21%   +0.44%     
==========================================
  Files          39       39              
  Lines        2887     3094     +207     
==========================================
+ Hits         2303     2482     +179     
- Misses        584      612      +28

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

bvdmitri

I trust the tests, didn't check the actual math. Well done guys!!!

feat: implement MvNormalMeanCovariance gradlogpartition

b1a130c

Nimrais marked this pull request as draft December 14, 2023 14:05

test(fix): make nsample to compute sufficient_statistics a keyword ar…

f831e49

…gument

Nimrais added good first issue Good for newcomers enhancement New feature or request labels Dec 14, 2023

Nimrais and others added 25 commits December 14, 2023 19:24

test(fix): make gradlogpartition test a bit more obvious

9921d34

feat: add getgradlogpartition for VonMises

65f5e6b

add beta grad log partition

ed317d2

add grad binomial

7086c08

add gradlogpartition function

3389b2a

add getgradlogpartition poisson

f274500

add getgradlogparition exponential

0306250

Merge branch 'implement-grad-logpartition' into gradlogpartition_expo…

f05cfd3

…nential

fix: grad is a vector, not a scalar

9ad7b0a

Merge pull request #152 from biaslab/gradlogpartition_exponential

decb514

add getgradlogpartition Exponential

fix: return gradlog as vector

ae07421

fix: return gradlog as vector

5f849dc

add getgradlogparition exponential

4c591f9

feat: implement MvNormalMeanCovariance gradlogpartition

f6a760b

test(fix): make nsample to compute sufficient_statistics a keyword ar…

076b656

…gument

test(fix): make gradlogpartition test a bit more obvious

0fc2d84

feat: add getgradlogpartition for VonMises

9a585bd

fix: grad is a vector, not a scalar

223719b

Merge pull request #151 from biaslab/gradlogpartition_poisson

e25b3f7

add getgradlogpartition function for Poisson

Merge pull request #150 from biaslab/gradlogpartition_bernoulli

bd4856a

add getgradlogpartition function for Bernoulli

Dirichlet, Gamma and Geometric distributions gradlogpartition impleme…

8138d8f

…nted

Add gradient calculation for log partition

85a651e

Merge pull request #153 from biaslab/grad_binomial

094558d

Add gradient of binomial

Merge pull request #154 from biaslab/grad_beta

3954fa3

Add gradient of beta

fix wishart gradient

4991bb6

İsmail Şenöz and others added 17 commits January 22, 2024 18:43

remove cov=var statement

575a16a

add tests

d1301e2

make format

05d7930

Revert piracy = false change

5093e67

Use Julia 1.10 for tests

4c8fa10

2prev

a7df4ef

Fix isapprox for Normal family of distributions

2d87607

Update README.md

f563f7f

biaslab to reactivebayes

Update examples.md

8413bf4

Update make.jl

ca8e43b

Update gamma_shape_rate_tests.jl

8b9df24

adjust docs deployment settings

9edc9ea

fix: use StableRNG

46d662b

test: do not test gradient for MvNormalWishart

8e5eeff

add gradient of erlang

6be1d71

Merge pull request #168 from ReactiveBayes/grad_erlang

ee9c9fa

Add gradient of Erlang

docs: remove TODO from error messages

64c3330

Nimrais marked this pull request as ready for review January 22, 2024 19:50

Nimrais requested a review from bvdmitri January 23, 2024 17:13

Add more tests

6f31e89

bvdmitri approved these changes Jan 31, 2024

View reviewed changes

bvdmitri merged commit 246b2e7 into main Jan 31, 2024
3 of 4 checks passed

bvdmitri deleted the implement-grad-logpartition branch January 31, 2024 10:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement gradlogpartition for Exponential Family Distributions #149

Implement gradlogpartition for Exponential Family Distributions #149

Nimrais commented Dec 14, 2023 •

edited

Loading

Nimrais commented Jan 22, 2024 •

edited

Loading

Nimrais commented Jan 25, 2024

bvdmitri commented Jan 31, 2024

Nimrais commented Jan 31, 2024 •

edited

Loading

Nimrais commented Jan 31, 2024 •

edited

Loading

bvdmitri commented Jan 31, 2024

codecov bot commented Jan 31, 2024

bvdmitri left a comment

Implement gradlogpartition for Exponential Family Distributions #149

Implement gradlogpartition for Exponential Family Distributions #149

Conversation

Nimrais commented Dec 14, 2023 • edited Loading

Nimrais commented Jan 22, 2024 • edited Loading

Nimrais commented Jan 25, 2024

bvdmitri commented Jan 31, 2024

Nimrais commented Jan 31, 2024 • edited Loading

Nimrais commented Jan 31, 2024 • edited Loading

bvdmitri commented Jan 31, 2024

codecov bot commented Jan 31, 2024

Codecov Report

bvdmitri left a comment

Choose a reason for hiding this comment

Nimrais commented Dec 14, 2023 •

edited

Loading

Nimrais commented Jan 22, 2024 •

edited

Loading

Nimrais commented Jan 31, 2024 •

edited

Loading

Nimrais commented Jan 31, 2024 •

edited

Loading