
Avoid log(0) in KL divergence #12237

Closed

Conversation

bz-e commented Oct 22, 2024

…denominator and added a test case

Describe your change:

Fixes #12233
Added the type `None` to pass type checking, added a small constant in the `kullback_leibler_divergence` method to fix the bug where the numerator or denominator of the log is 0, and added a test case.

  • Add an algorithm?
  • Fix a bug or typo in an existing algorithm?
  • Add or change doctests? -- Note: Please avoid changing both code and tests in a single pull request.
  • Documentation change?

Checklist:

  • I have read CONTRIBUTING.md.
  • This pull request is all my own work -- I have not plagiarized.
  • I know that pull requests will not be merged if they fail the automated tests.
  • This PR only changes one algorithm file. To ease review, please open separate PRs for separate algorithms.
  • All new Python files are placed inside an existing directory.
  • All filenames are in all lowercase characters with no spaces or dashes.
  • All functions and variable names follow Python naming conventions.
  • All function parameters and return values are annotated with Python type hints.
  • All functions have doctests that pass the automated testing.
  • All new algorithms include at least one URL that points to Wikipedia or another similar explanation.
  • If this pull request resolves one or more open issues then the description above includes the issue number(s) with a closing keyword: "Fixes #12233".

imSanko (Contributor) left a comment

What do you actually want to change?

bz-e (Author) commented Oct 22, 2024

> What do you actually want to change?

To be precise, I want to change the kullback_leibler_divergence method to fix the bug where the returned value is infinite when the numerator or denominator of np.log(y_true / y_pred) is zero. The small loss of precision introduced by the change will not affect the calculation of machine learning model parameters. However, to strictly follow the latest contribution guidelines, I also need to fix a type error in the previous version before the PR can pass.
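
For illustration, a minimal sketch of the epsilon approach described here might look like the following (assuming kullback_leibler_divergence takes two 1-D numpy arrays; the epsilon value and the length check are illustrative assumptions, not the PR's exact diff):

    import numpy as np

    def kullback_leibler_divergence(y_true: np.ndarray, y_pred: np.ndarray) -> float:
        """KL divergence with a small constant added inside the log to avoid log(0) and 0/0."""
        if len(y_true) != len(y_pred):
            raise ValueError("Input arrays must have the same length.")
        epsilon = 1e-10  # illustrative small constant; keeps both ratio terms away from zero
        kl_loss = y_true * np.log((y_true + epsilon) / (y_pred + epsilon))
        return float(np.sum(kl_loss))

With the constant added to both sides of the ratio, inputs containing zeros no longer produce inf or nan, at the cost of a tiny bias in the returned value.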

kevin1kevin1k commented

@bz-e I think we do not intend to change the default behavior when all y_true are nonzero.
A better way might be to mask out all the zero entries and only sum over the remaining ones.

kevin1kevin1k mentioned this pull request on Oct 23, 2024
bz-e (Author) commented Oct 23, 2024

> @bz-e I think we do not intend to change the default behavior when all y_true are nonzero. A better way might be to mask out all the zero entries and only sum over the remaining ones.

I think this is the solution with the lowest time complexity for issue #12233.

kevin1kevin1k commented

> I think this is the solution with the lowest time complexity for issue #12233.

I meant you could do something like the following (it need not be identical; it is just a demo), and the complexity stays linear.
Also, IMHO, since this repo is more for educational purposes than for practical/production usage, correctness is preferable to efficiency:

    mask = y_true != 0
    y_true_filtered = y_true[mask]
    y_pred_filtered = y_pred[mask]
    kl_loss = y_true_filtered * np.log(y_true_filtered / y_pred_filtered)
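
For illustration, folding this masking idea into a complete function might look like the sketch below (the length check mirrors the kind of validation the existing function performs; the rest is a demonstration under the assumption that both inputs are 1-D numpy arrays, not the final repo code):

    import numpy as np

    def kullback_leibler_divergence(y_true: np.ndarray, y_pred: np.ndarray) -> float:
        """KL divergence that skips entries where y_true == 0, treating their contribution as 0."""
        if len(y_true) != len(y_pred):
            raise ValueError("Input arrays must have the same length.")
        mask = y_true != 0  # drop terms where y_true is 0, so log(0/...) never arises from them
        y_true_filtered = y_true[mask]
        y_pred_filtered = y_pred[mask]
        kl_loss = y_true_filtered * np.log(y_true_filtered / y_pred_filtered)
        return float(np.sum(kl_loss))

    # Example: the zero entry in y_true no longer turns the result into nan/inf
    print(kullback_leibler_divergence(np.array([0.0, 0.3, 0.7]), np.array([0.2, 0.3, 0.5])))

Entries where y_pred is 0 but y_true is not are intentionally left alone, since the KL divergence is genuinely infinite in that case.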

cclauss changed the title from "Fixes issue #12233" to "Avoid log(0) in KL divergence" on Oct 23, 2024
bz-e (Author) commented Oct 24, 2024

> I meant you could do something like the following (it need not be identical; it is just a demo), and the complexity stays linear. Also, IMHO, since this repo is more for educational purposes than for practical/production usage, correctness is preferable to efficiency:
>
>     mask = y_true != 0
>     y_true_filtered = y_true[mask]
>     y_pred_filtered = y_pred[mask]
>     kl_loss = y_true_filtered * np.log(y_true_filtered / y_pred_filtered)

For educational purposes I think you are right; this is the approach with the fewest changes.

bz-e closed this on Oct 24, 2024
algorithms-keeper bot added the "awaiting reviews" (This PR is ready to be reviewed) label on Oct 24, 2024