How can I make non-trainable variable using nnx.Module? #4533
Unanswered
SangminLee0828 asked this question in Q&A
Replies: 2 comments 3 replies
-
My quick take is: by assigning those parameters to self, they are found by jax.grad as parameters to differentiate with respect to (although I am surprised, because nnx.Param exists for a good reason). However, if you want something purely static, maybe you don't need nnx.Module, or a class at all (if you just want to compute the variance and mean on the fly)?
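(As a sketch of that default behaviour, assuming flax.nnx: nnx.grad differentiates only nnx.Param leaves unless told otherwise, so statistics stored in another Variable collection such as nnx.BatchStat are left alone even though they live on self. The Scale module below is a made-up example, not library code.)

import jax.numpy as jnp
from flax import nnx

class Scale(nnx.Module):
    def __init__(self):
        self.w = nnx.Param(jnp.ones((3,)))           # trainable parameter
        self.shift = nnx.BatchStat(jnp.zeros((3,)))  # non-trainable statistic

    def __call__(self, x):
        return self.w.value * x + self.shift.value

model = Scale()
loss_fn = lambda m: (m(jnp.ones((3,))) ** 2).sum()
grads = nnx.grad(loss_fn)(model)  # contains only the nnx.Param leaf ('w'), not 'shift'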
-
The solution is to create a filter for the trainable Variables and pass it to both the nnx.Optimizer (via wrt) and nnx.grad (via nnx.DiffState):

import jax.numpy as jnp
import optax
from flax import nnx

class Classifier(nnx.Module):
    def __init__(self, embed_dim, num_classes, backbone, rngs):
        self.backbone = backbone
        self.head = nnx.Linear(embed_dim, num_classes, rngs=rngs)

    def __call__(self, x):
        x = self.backbone(x)
        x = self.head(x)
        return x

def load_model():
    return nnx.Linear(784, 1024, rngs=nnx.Rngs(0))

backbone = load_model()
classifier = Classifier(1024, 10, backbone, rngs=nnx.Rngs(1))

# filter to select only Params on the 'head' path
head_params = nnx.All(nnx.Param, nnx.PathContains('head'))

optimizer = nnx.Optimizer(
    classifier,
    tx=optax.adamw(3e-4),
    wrt=head_params,  # only optimize the head params
)

# simple train step
@nnx.jit
def train_step(model, optimizer, x, y):
    def loss_fn(model):
        logits = model(x)
        return optax.softmax_cross_entropy_with_integer_labels(logits, y).mean()

    diff_state = nnx.DiffState(0, head_params)  # differentiate only the head params of the first argument
    grads = nnx.grad(loss_fn, argnums=diff_state)(model)
    optimizer.update(grads)

x = jnp.ones((1, 784))
y = jnp.ones((1,), jnp.int32)
train_step(classifier, optimizer, x, y)
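(If it helps to double-check, nnx.state accepts the same filters, so you can inspect exactly which variables head_params selects before training; a small sketch using the objects defined above.)

# Only the head's kernel and bias should show up here:
print(nnx.state(classifier, head_params))

# The backbone params stay out of both the optimizer and the gradients:
print(nnx.state(classifier, nnx.All(nnx.Param, nnx.PathContains('backbone'))))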
-
Hi,
I am trying to create a normalization layer. This layer stores 'mean' and 'variance' internally, so when values come in, the output is the normalized value computed from the stored mean and variance.
https://www.tensorflow.org/api_docs/python/tf/keras/layers/Normalization
How can I make 'self.mean' and 'self.variance' non-trainable?
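Roughly, I want something like the sketch below (hypothetical names; right now I only know nnx.Param, which makes them trainable):

import jax.numpy as jnp
from flax import nnx

class Normalization(nnx.Module):
    def __init__(self, num_features: int):
        # Currently stored as Params, but these should not be trained:
        self.mean = nnx.Param(jnp.zeros((num_features,)))
        self.variance = nnx.Param(jnp.ones((num_features,)))

    def __call__(self, x):
        return (x - self.mean.value) / jnp.sqrt(self.variance.value + 1e-6)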