Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR is about revisiting/improving the timeout handling of components.
Logic of the processing/timeout flow
It is well-known that every component has a processing timeout. Components can specify the timeout value by implementing the
component.TimeoutConfiguration
interface. Otherwise (or if a zero timeout is specified), it will be defaulted by the effective requeue interval, which defaults to 10 minutes.Then, note that a component can be in a 'processing' or 'non-processing' state (which is not directly related to
status.state
beingProcessing
). Here, 'processing' means thatstatus.processingSince
is non-initial. Now, if a component is reconciled, a certain component digest is calculated from the component's annotations, spec and references in the spec (see below for more details about references). Whenever this component digest differs from the currentstatus.processingDigest
, thenstatus.processingSince
is set to the current time, andstatus.processingDigest
is set to the new component digest.Roughly spoken, that means a new timeout countdown is started.
In addition to 'processing' a component can be in a 'timeout' state; this is the case if the
status.processingSince
timestamp lies more than the specified timeout duration in the past. If a component gets into the 'timeout' statestatus.state
) will be set toError
with condition reasonTimeout
Error
orPending
), and the condition reason is set toTimeout
.That means, a timeout can always be reliably detected by checking if the condition reason equals
Timeout
.A 'processing' component will be set to 'non-processing' (that is,
status.processingSince
is cleared) if the component becomes ready (in that case, in addition, one immediate requeue is triggered).Calculation of the component digest
At the beginning of the reconcilation of a component, a (component) digest is calculated that considers
metadata.annotations
of the componentmetadata.generation
resp. the spec of the componentConfigMapReference
,ConfigMapKeyReference
,SecretReference
,SecretKeyReference
,Reference
.Such references will be automatically loaded at the beginning of the reconcile iteration; for the builtin
ConfigMap
andSecret
reference types the logic is part of the framework, and for types implementing theinterface, the loading and digest logic is to be provided by the implementation. Besides being used in the timeout handling as
status.processingDigest
, the component digestOnObjectOrComponentChange
.Roughly speaking, the component digest should identify result of reconciling the component as exact as possible; that means: applying two components with identical digest should produce the same cluster state.
Incompatible changes
Besides the changes outlined above (which should have a bad impact) that PR contains the following incompatible changes:
status.state
was set toPending
with reasonPending
, respectively toDeletionPending
with reasonDeletionPending
; the reason values are changed toRetrying
andDeletionRetrying
, respectivelyRestarting
was added, that will be used withstatus.state
beingPending
, if the processing state of a component is reset due to a component digest change.