Question about decoder loss

Hi,

I have noticed that in the following loss functions, the `loss_decoder` only updates the decoder but does not update the quantizer, because a `.detach()` operation is applied before entering the decoder.

https://github.com/Neur-IO/ReVQ/blob/0d6634d823fd64ad7d0a3315382356948d08dfd0/scripts/train.py#L133-L139

May I ask why you design it this way, rather than using the decoder loss to update the decoder and quantizer altogether (i.e., without the `.detach()` operation before entering the decoder)?

	data_shuffle = viewer.shuffle(data)
	quant_shuffle = quantizer(data_shuffle)["x_quant"]
	quant = viewer.unshuffle(quant_shuffle)
	data_rec = decoder(quant.detach())
	loss_quant = F.mse_loss(data_shuffle, quant_shuffle)
	loss_decoder = F.mse_loss(data, data_rec)
	loss = loss_quant + loss_decoder

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about decoder loss #4

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Question about decoder loss #4

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions