Feature: adaptive learning rate schedulers #1629
base: main
Conversation
- Add scheduler infrastructure to the base NeuralInference class
- Implement _create_lr_scheduler() supporting 6 scheduler types:
  * plateau (ReduceLROnPlateau)
  * exponential (ExponentialLR)
  * cosine (CosineAnnealingLR)
  * step (StepLR)
  * multistep (MultiStepLR)
  * cyclic (CyclicLR)
- Enhance _converged() with an optional minimum-LR threshold
- Update NPE_A and NPE_C train() method signatures
- Integrate scheduler stepping in the NPE training loop
- Add learning rate tracking and TensorBoard logging
- Maintain full backward compatibility

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
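For illustration, a minimal sketch of how such a string-to-scheduler dispatch could look. This is a hypothetical standalone version, not the PR's actual `_create_lr_scheduler()` method on the base class, and the default kwargs below are assumptions:

```python
from typing import Any, Dict, Optional

from torch.optim import Optimizer, lr_scheduler

# Hypothetical mapping from string names to torch scheduler classes.
_SCHEDULERS = {
    "plateau": lr_scheduler.ReduceLROnPlateau,
    "exponential": lr_scheduler.ExponentialLR,
    "cosine": lr_scheduler.CosineAnnealingLR,
    "step": lr_scheduler.StepLR,
    "multistep": lr_scheduler.MultiStepLR,
    "cyclic": lr_scheduler.CyclicLR,
}

# Assumed defaults so each scheduler can be built from just its name.
_DEFAULT_KWARGS = {
    "plateau": {"factor": 0.5, "patience": 5},
    "exponential": {"gamma": 0.95},
    "cosine": {"T_max": 50},
    "step": {"step_size": 10, "gamma": 0.1},
    "multistep": {"milestones": [30, 60], "gamma": 0.1},
    "cyclic": {"base_lr": 1e-5, "max_lr": 1e-3},
}


def create_lr_scheduler(
    optimizer: Optimizer,
    name: str,
    kwargs: Optional[Dict[str, Any]] = None,
):
    """Build a torch LR scheduler from a string name plus optional kwargs."""
    if name not in _SCHEDULERS:
        raise ValueError(f"Unknown lr_scheduler: {name!r}")
    # User-provided kwargs override the defaults.
    merged = {**_DEFAULT_KWARGS[name], **(kwargs or {})}
    return _SCHEDULERS[name](optimizer, **merged)
```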
- 16 comprehensive tests covering all scheduler functionality
- Tests all 6 scheduler types with parameter validation
- Verifies actual learning rate reduction behavior
- Tests dictionary-based configuration
- Validates minimum LR threshold convergence
- Ensures backward compatibility (no scheduler = no change)
- Tests all NPE variants (NPE_A, NPE_B, NPE_C)
- Error handling for invalid scheduler types
- Parameter override functionality
- Resume training with scheduler state preservation

Utilizes existing SBI test fixtures and follows established patterns. All tests pass with the new scheduler implementation.

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
- Add scheduler parameters (lr_scheduler, lr_scheduler_kwargs, min_lr_threshold) to the NLE base class
- Integrate scheduler creation and stepping in the nle_base.py training loop
- Add learning rate tracking and logging to the summary
- Update the MNLE train() method signature to support scheduler parameters
- Support all 6 scheduler types: plateau, exponential, cosine, step, multistep, cyclic
- Maintain backward compatibility - no scheduler defaults to a constant learning rate
- Add min_lr_threshold for early stopping when the learning rate becomes too low

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
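For context, a runnable sketch of how the scheduler stepping and the min_lr_threshold check described above could fit into a training loop. The dummy model and random "validation loss" are placeholders; this is not the PR's actual nle_base.py code:

```python
import torch
from torch import nn
from torch.optim.lr_scheduler import ReduceLROnPlateau

model = nn.Linear(2, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=5e-4)
scheduler = ReduceLROnPlateau(optimizer, factor=0.5, patience=3)
min_lr_threshold = 1e-6  # assumed early-stopping threshold

for epoch in range(100):
    val_loss = torch.rand(1).item()  # stand-in for a real validation loss

    # Plateau schedulers step on the validation metric;
    # all other schedulers step once per epoch.
    if isinstance(scheduler, ReduceLROnPlateau):
        scheduler.step(val_loss)
    else:
        scheduler.step()

    current_lr = optimizer.param_groups[0]["lr"]  # track LR for logging
    if current_lr < min_lr_threshold:
        break  # stop early once the LR has decayed below the threshold
```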
- Add scheduler parameters (lr_scheduler, lr_scheduler_kwargs, min_lr_threshold) to the NRE base class
- Integrate scheduler creation and stepping in the nre_base.py training loop
- Add learning rate tracking and logging to the summary
- Update all NRE variant train() method signatures (NRE_A, NRE_B, NRE_C, BNRE)
- Add missing 'Any' imports to typing statements in the NRE files
- Support all 6 scheduler types: plateau, exponential, cosine, step, multistep, cyclic
- Maintain backward compatibility - no scheduler defaults to a constant learning rate
- Add min_lr_threshold for early stopping when the learning rate becomes too low

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
- Test scheduler creation and integration for NLE_A and MNLE
- Test all scheduler types: plateau, exponential, step, cosine
- Test dictionary configuration for schedulers
- Test learning rate reduction and tracking
- Test min_lr_threshold early stopping
- Test backward compatibility without schedulers
- Test error handling for invalid scheduler types
- Test scheduler kwargs override functionality
- Test resume training with scheduler state preservation

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
- Test scheduler creation and integration for all NRE variants (NRE_A, NRE_B, NRE_C, BNRE)
- Test all scheduler types: plateau, exponential, step, multistep, cyclic
- Test dictionary configuration for schedulers
- Test learning rate reduction and tracking
- Test min_lr_threshold early stopping
- Test backward compatibility without schedulers
- Test error handling for invalid scheduler types
- Test scheduler kwargs override functionality
- Test resume training with scheduler state preservation
- Test CyclicLR scheduler with learning rate variation

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
- Add how-to guide for learning rate schedulers with practical examples
- Add advanced tutorial notebook with comparative analysis and visualizations
- Update documentation index files to include new scheduler docs
- Cover all supported schedulers: plateau, exponential, step, multistep, cosine, cyclic
- Include configuration examples, best practices, and troubleshooting
- Demonstrate usage across NPE, NLE, and NRE methods

🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>
Thanks a lot @christopherlovell for creating this PR, looks like a great addition. We will do a review of the proposed changes soon!
I did a first high-level pass over the changes - great additions overall! 👏
Before we go into the details, please have a look at my two main comments on the class structure for the LR scheduler options and their kwargs, and on the tests. Thanks!
lr_scheduler: Optional[Union[str, Dict[str, Any]]],
lr_scheduler_kwargs: Optional[Dict[str, Any]],
Instead of passing both lr_scheduler and the corresponding kwargs, I suggest using dataclasses like we recently introduced for the PosteriorParameters, see #1619. You would need to define a base class LrSchedulerParameters defining the interface and holding all shared parameters, and then a subclass for every LR scheduler type.
We should still let basic users pass a string with the type; then we would just pick the corresponding parameter class and use its defaults. But as soon as users want to specify particular options, they should create the parameter class themselves and pass it here.
This will reduce some of the if-else below and strongly improve type hints, IDE suggestions, and debugging. Let me know if there are any questions or whether I am missing something.
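A rough sketch of the suggested dataclass interface. All class names, fields, and defaults below are hypothetical, modeled on the comment above rather than on sbi's actual code (note that torch.optim.lr_scheduler.LRScheduler requires torch >= 2.0; older versions use _LRScheduler):

```python
from dataclasses import dataclass

from torch.optim import Optimizer
from torch.optim.lr_scheduler import ExponentialLR, LRScheduler, StepLR


@dataclass
class LrSchedulerParameters:
    """Hypothetical base class holding parameters shared by all schedulers."""

    def build(self, optimizer: Optimizer) -> LRScheduler:
        raise NotImplementedError


@dataclass
class StepLrParameters(LrSchedulerParameters):
    step_size: int = 10
    gamma: float = 0.1

    def build(self, optimizer: Optimizer) -> LRScheduler:
        return StepLR(optimizer, step_size=self.step_size, gamma=self.gamma)


@dataclass
class ExponentialLrParameters(LrSchedulerParameters):
    gamma: float = 0.95

    def build(self, optimizer: Optimizer) -> LRScheduler:
        return ExponentialLR(optimizer, gamma=self.gamma)


# A plain string could still map to the default parameter class:
_DEFAULTS = {"step": StepLrParameters, "exponential": ExponentialLrParameters}
```

A user wanting non-default options would then pass, e.g., StepLrParameters(step_size=5) directly to train(), giving typed, IDE-discoverable configuration instead of a free-form kwargs dict.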
The fixture on top is great, but the tests are quite verbose and repetitive at the moment. I suggest using pytest.mark.parametrize to test the LR schedulers for the different trainers (NPE, NLE, NRE) and scheduler types all in one test file (see the sketch below). As MNLE is basically NLE, I think it's fine to not include it here.
Another thing to include would be the vector field estimators, e.g., FMPE and NPSE, because they handle convergence criteria a bit differently, see #1544.
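For illustration, a condensed parametrized test could look roughly like this. The fixtures `prior` and `simulated_data` are hypothetical, and the lr_scheduler argument to train() is the PR's proposed parameter, not a released sbi API:

```python
import pytest

from sbi.inference import NLE, NPE, NRE  # assumed import path


@pytest.mark.parametrize("trainer_cls", [NPE, NLE, NRE])
@pytest.mark.parametrize(
    "scheduler", ["plateau", "exponential", "cosine", "step", "multistep", "cyclic"]
)
def test_lr_scheduler(trainer_cls, scheduler, prior, simulated_data):
    # `prior` and `simulated_data` are assumed fixtures providing a prior
    # distribution and a small (theta, x) simulation budget.
    theta, x = simulated_data
    trainer = trainer_cls(prior=prior)
    trainer.append_simulations(theta, x)
    # A short run is enough to exercise scheduler creation and stepping.
    trainer.train(lr_scheduler=scheduler, max_num_epochs=5)
```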
Add Adaptive Learning Rate Schedulers to Training
Implements adaptive learning rate schedulers for the NPE, NLE, and NRE methods. All changes maintain backward compatibility: existing code continues to work unchanged.
This feature was implemented with assistance from https://claude.ai/code
Co-Authored-By: Claude [email protected]
Changes Made
Core Implementation
Support for six PyTorch schedulers: ReduceLROnPlateau, ExponentialLR, CosineAnnealingLR, StepLR, MultiStepLR, and CyclicLR
Testing
Documentation
API Usage
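As a sketch of the proposed usage, based on the parameters this PR introduces (lr_scheduler, lr_scheduler_kwargs, min_lr_threshold); the toy simulator is a placeholder and exact signatures may differ:

```python
import torch

from sbi.inference import NPE
from sbi.utils import BoxUniform

# Toy setup: a 2D uniform prior and a trivial Gaussian-noise "simulator".
prior = BoxUniform(low=-2 * torch.ones(2), high=2 * torch.ones(2))
theta = prior.sample((1000,))
x = theta + 0.1 * torch.randn_like(theta)

trainer = NPE(prior=prior)
trainer.append_simulations(theta, x)

# Proposed scheduler arguments from this PR (not yet a released sbi API):
density_estimator = trainer.train(
    lr_scheduler="plateau",  # one of the six supported types
    lr_scheduler_kwargs={"factor": 0.5, "patience": 5},
    min_lr_threshold=1e-6,  # stop training once the LR decays below this
)
```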