[MRG] PredictionEntropyScorer output negative scores #63
Conversation
**Codecov Report** — All modified and coverable lines are covered by tests ✅

```
@@           Coverage Diff            @@
##             main      #63    +/-  ##
========================================
+ Coverage   88.83%   88.95%   +0.12%
========================================
  Files          41       41
  Lines        2678     2708      +30
========================================
+ Hits         2379     2409      +30
  Misses        299      299
```

☔ View full report in Codecov by Sentry.
Actually there was another mistake. Now we directly compute the mean of the per-sample entropies.
I reread the paper. For the entropy scorer, we want to minimize the entropy, so
Maybe we can change the name |
Yes, my bad, you're right.
I think the best thing is to use a sum instead of the mean and implement what is done in the paper. I think the mean was a mistake. We can plot it like a loss.
Not a fan of removing the mean. If we do that, the output will be proportional to the number of samples, so it will be hard to compare methods' effectiveness across different datasets.
Yeah, maybe a parameter
Good idea to handle both (state in the doc that one of them corresponds to the paper).
That's an interesting question... Usually, I'm a 'theory absolutist' when it comes to questions like this one. If we call it 'entropy', it should perform a 'sum'; otherwise it's no longer entropy.

In PyTorch, the 'reduction' parameter for losses is typically (as it should be) a batch-related setting: it only tells the system what to do with the fact that the loss is typically a per-item quantity, which means that for a batch we have an array of those, while gradient descent requires a scalar. Thus it provides 'sum' or 'mean' options for how to collapse the array of values into a scalar. This doesn't change the nature of the loss itself, just the mini-batch GD. Also, from the gradient perspective it's just a matter of a multiplicative scalar.

You are right that with a larger number of samples the max entropy of the system increases. As it should... The hypothesis that 'entropy divided by the number of samples' makes for a better comparison between different methods is interesting, though by no means obvious or intuitive.
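The 'reduction' idea discussed above could be sketched as follows. This is a minimal illustration, not the actual skada implementation; the function name and signature are hypothetical:

```python
import numpy as np

def prediction_entropy(proba, reduction="sum", eps=1e-12):
    """Entropy of predicted class probabilities.

    proba : array of shape (n_samples, n_classes), rows summing to 1.
    reduction : "sum" matches the paper's definition (total entropy
        over the set); "mean" divides by n_samples, making values
        comparable across datasets of different sizes.
    """
    # Per-sample entropy: -<f(x), log f(x)>, always >= 0 since log(p) <= 0.
    per_sample = -np.sum(proba * np.log(proba + eps), axis=1)
    if reduction == "sum":
        return per_sample.sum()
    elif reduction == "mean":
        return per_sample.mean()
    raise ValueError(f"unknown reduction: {reduction!r}")
```

With this design, "sum" and "mean" differ only by the constant factor `n_samples`, which is consistent with the point above that reduction does not change the nature of the quantity.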
Also,
Where is this stated for sklearn scores to be from a particular range? I'm not sure I've seen this requirement. |
It's not a requirement, but to the best of my recollection, every sklearn metric whose name ends in 'score'/'scorer' is upper-bounded by 1; otherwise, they are named 'loss', 'error', and so on.
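Related to the naming convention: in scikit-learn a scorer's return value is always interpreted as "higher is better", so wrapping a loss with `make_scorer(..., greater_is_better=False)` negates it, which is why such scorers return negative values. A small illustration (the variable names are mine):

```python
import numpy as np
from sklearn.dummy import DummyRegressor
from sklearn.metrics import make_scorer, mean_squared_error

# Wrapping a loss with greater_is_better=False makes make_scorer negate
# it, so the resulting scorer outputs values <= 0.
neg_mse = make_scorer(mean_squared_error, greater_is_better=False)

X = np.zeros((4, 1))
y = np.array([0.0, 1.0, 2.0, 3.0])
# DummyRegressor with strategy="mean" always predicts y.mean() == 1.5.
est = DummyRegressor(strategy="mean").fit(X, y)

print(neg_mse(est, X, y))  # negated MSE, a non-positive number
```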
Issue: `PredictionEntropyScorer` outputs negative scores.

Solution: In `PredictionEntropyScorer` we compute the following equation (page 3 of the paper: https://arxiv.org/pdf/1711.10288.pdf):

$$E(X_T) = - \sum_{x_t \in T} \langle f(x_t; \theta), \log f(x_t; \theta) \rangle$$

In the code we forgot to use the `greater_is_better` flag, and we used a double minus sign in the formula.

+ Add a test case checking that all computed scores are > 0.
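One plausible reading of the double-minus bug, sketched in isolation (illustrative only, not the actual skada code):

```python
import numpy as np

rng = np.random.default_rng(0)
# Valid probability rows: each row is drawn from a Dirichlet, sums to 1.
proba = rng.dirichlet(np.ones(3), size=5)
log_proba = np.log(proba)

# Buggy version: an extra minus sign flips the result negative,
# i.e. it computes -E(X_T) <= 0.
buggy = np.sum(proba * log_proba)

# Fixed version: single minus, matching the paper's equation, so
# E(X_T) >= 0 always (since log(p) <= 0 for p in (0, 1]).
fixed = -np.sum(proba * log_proba)

assert buggy <= 0 and fixed >= 0
```

A test like the last assertion, run over arbitrary valid probability matrices, is what the "all scores > 0" test case would check.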