Conversation

@pgardratzama (Contributor) commented Jun 13, 2025

Also adds noise measurement tests for the 2M128 HPU parameter set.



@pgardratzama pgardratzama requested a review from Bapt-Roux June 13, 2025 16:04
@cla-bot cla-bot bot added the cla-signed label Jun 13, 2025
```rust
max_msg_val: u64,
nb_tests: usize,
nb_pbs_per_test: usize,
check_variance: bool,
```
Contributor:

Instead of a boolean, I would have used an enum like:

```rust
enum HpuNoiseMode {
    Variance,
    Normality,
}
```

And replace the if statement with a match.
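
For illustration, a minimal sketch of the suggested match-based dispatch (the function name and branch bodies are hypothetical, not taken from the PR):

```rust
fn run_noise_checks(mode: HpuNoiseMode) {
    match mode {
        // Variance path: the noise measurement checks
        HpuNoiseMode::Variance => { /* compare measured variance to the expected bound */ }
        // Normality path: normality test plus expected value score
        HpuNoiseMode::Normality => { /* run the normality and zero-mean checks */ }
    }
}
```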

Contributor Author:

It has not been easy 😓 but I think it is done

@pgardratzama (Contributor Author):

To be clear, I had occurrences of failure on the expected value score check with ~7% probability. But most of the time it works well and stays below 6.25%.

@IceTDrinker (Member):

> To be clear, I had occurrences of failure on the expected value score check with ~7% probability. But most of the time it works well and stays below 6.25%.

When do those tests run? If they fail with such high probability we can't have them like this in CI. I know they are statistical tests, but we can't have very flaky tests.

@pgardratzama (Contributor Author):

At this point, these tests fail very rarely on my local machine but a bit more often in the CI runs.
I do not know how to define a test that is not always run.
I have been told that a 7% failure rate on the expected value is kind of OK, so there are 2 possible ways forward:

  1. increase the acceptable failure rate from 6.25% to 7.5%
  2. make these tests optional or comment them out

@IceTDrinker (Member):

Which one is failing? The normality check or the noise measurement?

@pgardratzama (Contributor Author):

The noise measurements are OK and stable.
The normality tests check 2 things: normality (stable, under the 6.25% failure rate) and an expected value near enough to 0 according to a score (failure rate between 3% and 7%).
I will just raise the expected failure rate of the expected value score and we will continue like that.

@IceTDrinker (Member):

Well, there is at least something to discuss with R&D: the normality is degraded after the 21-bit keyswitch, and we know quantization on low bit widths is a topic. If we expect 5% and can get +50%, it is something we need to discuss theoretically.

@IceTDrinker (Member):

The normality being unachievable after the mod switch because of normalization apparently requires us to have normality before the mod switch, so if we are seeing trouble there as well it's something worth verifying.

If it's after the PBS it's even more concerning.

@pgardratzama (Contributor Author):

Both checks are done after the KS (and *nu before):
normality_test_f64() is fine, with failure probability < 5%.
What is not always < 5% is the expected value score: mean(samples) / std_dev(samples) * sqrt(samples.len()) ∈ [-1.96, 1.96].
@mballandras did not seem surprised by this failure probability.
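
For reference, a minimal sketch of that score as stated above (a z-test on the sample mean at the 5% significance level; the function names and the use of the unbiased variance estimator are assumptions, not taken from the PR's actual code):

```rust
/// Score = mean / std_dev * sqrt(n); under a zero-mean Gaussian it is
/// approximately standard normal, so it should land in [-1.96, 1.96]
/// about 95% of the time.
fn expected_value_score(samples: &[f64]) -> f64 {
    let n = samples.len() as f64;
    let mean = samples.iter().sum::<f64>() / n;
    let var = samples.iter().map(|x| (x - mean).powi(2)).sum::<f64>() / (n - 1.0);
    mean / var.sqrt() * n.sqrt()
}

fn expected_value_check(samples: &[f64]) -> bool {
    expected_value_score(samples).abs() <= 1.96
}
```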

@IceTDrinker (Member):

> Both checks are done after the KS (and *nu before): normality_test_f64() is fine, with failure probability < 5%. What is not always < 5% is the expected value score: mean(samples) / std_dev(samples) * sqrt(samples.len()) ∈ [-1.96, 1.96]. @mballandras did not seem surprised by this failure probability.

OK, still a bit surprising; I thought the mean issue was after the MS. There may be something I'm missing on the average value stuff.

@mballandras:

These results are expected random fluctuations. Indeed, we expect on average a 5% failure rate. If I remember correctly the test is performed 100 times and samples.len() = 100. In this setting, for a perfect Gaussian distribution with zero mean, the probability of observing a failure rate of 7% or more is roughly 1/4. We could set a bound at a 12% failure rate for the test; then the probability that a perfect zero-mean Gaussian reaches this 12% threshold would be 0.4%, which seems acceptable for a CI failure rate. We could go up to a 13% threshold, and then the probability of a false positive would be 0.15%.

Sorry, I should have been more precise about the expected results of the test in the first place.
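
These tail probabilities can be sanity-checked directly from the binomial distribution; a small self-contained sketch (the helper below is illustrative, not part of the PR):

```rust
/// P(X >= k) for X ~ Bin(n, p), building the PMF iteratively via
/// pmf(i + 1) = pmf(i) * (n - i) / (i + 1) * p / (1 - p).
fn binom_tail_ge(n: u32, k: u32, p: f64) -> f64 {
    let mut pmf = (1.0 - p).powi(n as i32); // pmf(0)
    let mut tail = 0.0;
    for i in 0..=n {
        if i >= k {
            tail += pmf;
        }
        if i < n {
            pmf *= (n - i) as f64 / (i + 1) as f64 * p / (1.0 - p);
        }
    }
    tail
}

fn main() {
    // 100 repetitions of a test that fails with probability 5%
    println!("{:.3}", binom_tail_ge(100, 7, 0.05)); // ~0.234, i.e. roughly 1/4
    println!("{:.4}", binom_tail_ge(100, 12, 0.05)); // ~0.0042, i.e. the ~0.4% quoted
    println!("{:.4}", binom_tail_ge(100, 13, 0.05)); // ~0.0014, i.e. the ~0.15% quoted
}
```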

@mballandras:

Just to add that the test will still be powerful. In this same setting (100 tests with 100 samples each), if instead of zero the mean of the Gaussian is std_dev/10 (which would affect the p_fail by less than a bit), the failure rate of the test will be 17%, which will almost always be caught having set the threshold to 13%.
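
To see where the 17% comes from: with a true mean of std_dev/10 and 100 samples, the score above is distributed roughly as N(1, 1), so the per-run failure probability is about P(|N(1, 1)| > 1.96) ≈ 0.17. The detection probability at the 13% threshold can then be checked with the same iterative-PMF trick as in the previous sketch (again just an illustrative sanity check):

```rust
fn main() {
    // Number of failures over 100 repetitions ~ Bin(100, 0.17);
    // probability of reaching the 13% threshold:
    let (n, p, k) = (100u32, 0.17f64, 13u32);
    let mut pmf = (1.0 - p).powi(n as i32); // pmf(0)
    let mut tail = 0.0;
    for i in 0..=n {
        if i >= k {
            tail += pmf;
        }
        if i < n {
            pmf *= (n - i) as f64 / (i + 1) as f64 * p / (1.0 - p);
        }
    }
    println!("detection probability at 13%: {tail:.3}"); // ~0.89
}
```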

@IceTDrinker (Member) left a comment:

Thanks, a few nits; otherwise this looks to be in line with what Mathieu proposed for the tests. To check: should the 8% bound be changed to 13% given his recommendations? I'm guessing, @mballandras, that the test for the average value is the one we will want to add for other primitives as well; is there a write-up about the specs of the test? It will be of interest when we update older core noise tests for the CPU part.

Reviewed all commit messages.
Reviewable status: 0 of 1 files reviewed, 4 unresolved discussions (waiting on @Bapt-Roux)


tfhe/src/core_crypto/algorithms/test/noise_distribution/lwe_hpu_noise.rs line 128 at r4 (raw file):

```rust
    ct_width: 64,
    ksk_width: 21,
    //norm2: 8,
```

Comment to remove?


tfhe/src/core_crypto/algorithms/test/noise_distribution/lwe_hpu_noise.rs line 735 at r4 (raw file):

```rust
create_parameterized_test_hpu!(
    hpu_noise_distribution {
        //HPU_TEST_PARAMS_4_BITS_NATIVE_U64,
```

Commented param sets could be removed?


tfhe/src/core_crypto/algorithms/test/noise_distribution/lwe_hpu_noise.rs line 752 at r4 (raw file):

```rust
create_parameterized_test_hpu!(
    hpu_noise_distribution {
        //HPU_TEST_PARAMS_4_BITS_HPU_64_KS_21_132_GAUSSIAN,
```

Same here.

@mballandras:

> Thanks, a few nits; otherwise this looks to be in line with what Mathieu proposed for the tests. To check: should the 8% bound be changed to 13% given his recommendations? I'm guessing, @mballandras, that the test for the average value is the one we will want to add for other primitives as well; is there a write-up about the specs of the test? It will be of interest when we update older core noise tests for the CPU part.

So far the write-up about the specs is just a Slack thread in research-fpga; I will transfer it to you. I can do a more precise spec detailing the 13% bound.

…e of both normality and expected value score

and raise expected value score check acceptable failure rate to 8%

@pgardratzama pgardratzama force-pushed the hw-team/pg/normality_expvalue_tests branch from 88ab60b to 198c626 on June 19, 2025 at 20:11
@pgardratzama (Contributor Author):

Removed useless commented lines, rebased, and reduced to 2 commits.

@IceTDrinker (Member) left a comment:

Normality check failed; this is too flaky to merge as is.

@mballandras:

The normality test is similar to the zero-mean test: it is a null hypothesis test at a 5% level of significance. If it is performed 100 times and we set the limit at 13% failures, we should see only 0.15% false positives.

@pgardratzama (Contributor Author):

Agreed. What is really strange is that I get a failure rate on the normality check below 5% on my laptop and at the same time 7% on the CI. I have the impression the failure rate is always worse on CI...
Is this because of what the test does:

```rust
// iter A: 100 outer iterations
for _ in 0..100 {
    // generate keys
    // encrypt msg
    // iter B: 160 inner iterations
    for _ in 0..160 {
        // * nu
        // KS
        // sample noise into vec
        // PBS
    }
    // check normality & expected value
}
```

Could it be because I do not re-encrypt the msg at each run of iter B?

@pgardratzama (Contributor Author):

@mballandras you mean I should check that both failure probabilities are below 13%?

@IceTDrinker (Member):

@pgardratzama do you run strictly the same tests on CI and on your laptop?

You could try to seed the test to compare both CI and laptop runs; given it is the NTT PBS, the polynomial mul algo should be the same on both.
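
One lightweight way to do that, sketched below, is to pull the seed from an environment variable and log it, so a CI failure can be replayed locally with the same randomness (the variable name and helper are hypothetical, not an existing tfhe-rs API):

```rust
use std::env;
use std::time::{SystemTime, UNIX_EPOCH};

/// Returns the seed to use for the noise test: taken from the environment when
/// set (to replay a CI run locally), otherwise derived from the clock and logged.
fn hpu_noise_test_seed() -> u128 {
    match env::var("HPU_NOISE_TEST_SEED") {
        Ok(s) => s.parse().expect("HPU_NOISE_TEST_SEED must be a u128"),
        Err(_) => {
            let seed = SystemTime::now()
                .duration_since(UNIX_EPOCH)
                .unwrap()
                .as_nanos();
            eprintln!("HPU_NOISE_TEST_SEED not set, using {seed}");
            seed
        }
    }
}
```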
