1. Summary
This KEP proposes a new, optional policy field within the Deployment workload API to enable automatic Pod deletion and recreation (a "Hard Reset") after the container's native CrashLoopBackOff restarts fail repeatedly on the same node.
The primary goal is to improve the self-healing capacity of Kubernetes for long-running workloads by automatically resolving persistent, node-local failures (e.g., stuck volume mounts, non-transient resource contention) without requiring external monitoring or manual intervention (kubectl delete pod).
2. Motivation
2.1 The Problem with Current Behavior
The default `restartPolicy: Always` relies on the Kubelet performing in-place container restarts with exponential backoff (capped at 5 minutes). While excellent for handling transient application errors, this mechanism fails when the root cause is tied to the Pod's local environment on its current Node:
- **Node-Local Persistent Errors:** The Kubelet is unable to resolve issues like a corrupted `emptyDir` or a host-specific lock.
- **Indefinite Resource Consumption:** The Pod remains in a non-functional state, consuming resources and generating restart events indefinitely, limited only by the 5-minute backoff interval.
- **Requires Manual Intervention:** The only reliable fix is a human operator running `kubectl delete pod <name>`, which forces the Deployment Controller to schedule a replacement Pod on a potentially different, healthy node.
2.2 User Story (Operator Persona)
As a Cluster Operator, I need Kubernetes to automatically perform a "Hard Reset" (Pod delete and reschedule) when a Pod has been stuck in a repeated CrashLoopBackOff cycle for a significant duration, so that I don't have to write custom monitoring and automation to handle obvious node-local resource failures.
3. Goals
- Introduce a new policy field to the `Deployment` API to define the Hard Reset condition.
- Define sensible default values for this policy that trigger a Hard Reset for any Deployment that does not specify the field.
- Ensure the Hard Reset action is to delete the Pod (not the Deployment), forcing a fresh reschedule via the existing Deployment Controller logic.
- Prevent scheduler overload by implementing an internal cooldown between Hard Resets for the same Deployment.
4. Proposal Detail
4.1 New API Field (Alpha)
Introduce a new optional field, `failurePolicy`, to `Deployment.spec.template.spec` (mirroring the `podFailurePolicy` concept in Jobs, but with different actions):

```yaml
apiVersion: apps/v1
kind: Deployment
spec:
  template:
    spec:
      containers: [...]
      # New optional policy field:
      failurePolicy:
        # Action to take when maxRestartsOnNode is exceeded.
        action: DeletePod
        # Hard limit on the number of container restarts on a single node
        # before the action is taken. This counter should be reset after
        # sustained successful container uptime (e.g., 10 minutes).
        maxRestartsOnNode: 7
```
4.2 Proposed Automatic Default Behavior (The Core Enhancement)
To meet the goal of working by default without explicit configuration, the Deployment Controller will observe the Kubelet's Pod Status for all Pods it manages.
1. **Condition Check:** The Controller will check `Pod.status.containerStatuses[*].restartCount`.
2. **Default Threshold:** If the `restartCount` for any container in a Pod exceeds 7 (matching the default `Job.spec.backoffLimit` + 1), AND the Pod has not successfully achieved a "Ready" state within the preceding 10 minutes, the Hard Reset is triggered.
3. **Action:** The Deployment Controller performs a direct DELETE operation on the Pod object.
4. **Result:** The deletion event triggers the Deployment Controller's main reconciliation loop, which immediately notices one fewer ready replica than desired and creates a new, replacement Pod. This replacement will be scheduled by the Kubernetes Scheduler, likely on a different node.
4.3 Cooldown and Throttling (Resilience)
To prevent a cascade of Hard Resets from overwhelming the control plane (the "Thundering Herd" on the Scheduler):
The Deployment Controller will implement a Deployment-wide cooldown of 2 minutes between Hard Resets for a single Deployment.
If a Pod is flagged for Hard Reset during this cooldown period, the action is deferred until the cooldown expires.
5. Test Plan (Alpha)
Unit Tests: Verify that the Deployment Controller correctly increments and resets the failure counter based on Kubelet events.
Integration Tests:
- Deploy a simple application designed to crash instantly (e.g., `exit 1`).
- Verify the container's restart count exceeds the default threshold of 7.
- Verify the Deployment Controller automatically deletes the Pod and creates a new one.
- Verify a Hard Reset is triggered only when the new `failurePolicy` field is omitted (default behavior) or set to the proposed action.
- Verify the 2-minute throttling mechanism correctly limits Hard Resets.
6. Graduation Criteria
Alpha -> Beta
- API field is merged into `Deployment` as Alpha.
- Successful implementation of the Deployment Controller logic.
- Positive feedback from at least two non-owner contributors/end-users.
- Controller-level throttling mechanism is proven stable in tests.
Beta -> GA
- API changes are proven stable for at least two releases.
- Extensive end-to-end testing in large-scale clusters, demonstrating the automatic healing of node-local failures without increasing scheduler burden.
- Agreement with SIG Node on Kubelet interaction and event monitoring.
7. Risks and Mitigation