Legal claims defining the scope of protection, as filed with the USPTO.
1. A computer-implemented method for intelligent container management, the computer-implemented method comprising: identifying, by a computer, using a group of solution agents, an issue that caused a failed state of a container based on an analysis of exit code information, container log information, host node agent log information, and resource utilization data corresponding to the container; determining, by the computer, whether the issue that caused the failed state of the container is a resolvable issue with a fix; applying, by the computer, using a runtime mutating webhook controller, the fix to the container prior to restart of the container in response to the computer determining that the issue that caused the failed state of the container is the resolvable issue with the fix; and performing, by the computer, the restart of the container on a host node after applying the fix.
2. The computer-implemented method of claim 1, further comprising: recording, by the computer, using the runtime mutating webhook controller, the fix applied to the container in a source code management tracker for application to one or more other containers in a container-based environment upon container restart after failure.
3. The computer-implemented method of claim 1, further comprising: determining, by the computer, whether the issue that caused the failed state of the container is an unresolvable issue; and terminating, by the computer, the container and marking the container as no restart in response to the computer determining that the issue that caused the failed state of the container is the unresolvable issue.
4. The computer-implemented method of claim 1, further comprising: determining, by the computer, whether the issue that caused the failed state of the container is a resource allocation issue; determining, by the computer, using the group of solution agents, an additional amount of resources needed by the container in response to the computer determining that the issue that caused the failed state of the container is the resource allocation issue; allocating, by the computer, the additional amount of resources to the container; and performing, by the computer, the restart of the container on the host node.
5. The computer-implemented method of claim 1, further comprising: receiving, by the computer, an input to generate the container on the host node in a container-based environment; generating, by the computer, the container on the host node in response to receiving the input; receiving, by the computer, information regarding a status of the container from a group of probes located on the host node; and determining, by the computer, whether the information regarding the status of the container received from the group of probes indicates that the container is in the failed state.
6. The computer-implemented method of claim 5, further comprising: retrieving, by the computer, the exit code information, the container log information, the host node agent log information, and the resource utilization data corresponding to the container in response to the computer determining that the information regarding the status of the container received from the group of probes indicates that the container is in the failed state.
7. The computer-implemented method of claim 1, further comprising: determining, by the computer, that the issue causing the failed state of the computer is an unknown issue in response to the computer determining that the issue that caused the failed state of the container is not the resolvable issue with the fix; and terminating, by the computer, the container and marking the container as no restart.
8. A computer system for intelligent container management, the computer system comprising: a communication fabric; a set of computer-readable storage media connected to the communication fabric, wherein the set of computer-readable storage media collectively stores program instructions; and a set of processors connected to the communication fabric, wherein the set of processors executes the program instructions to: identify, using a group of solution agents, an issue that caused a failed state of a container based on an analysis of exit code information, container log information, host node agent log information, and resource utilization data corresponding to the container; determine whether the issue that caused the failed state of the container is a resolvable issue with a fix; apply, using a runtime mutating webhook controller, the fix to the container prior to restart of the container in response to determining that the issue that caused the failed state of the container is the resolvable issue with the fix; and perform the restart of the container on a host node after applying the fix.
9. The computer system of claim 8, wherein the set of processors further executes the program instructions to: record, using the runtime mutating webhook controller, the fix applied to the container in a source code management tracker for application to one or more other containers in a container-based environment upon container restart after failure.
10. The computer system of claim 8, wherein the set of processors further executes the program instructions to: determine whether the issue that caused the failed state of the container is an unresolvable issue; and terminate the container and mark the container as no restart in response to determining that the issue that caused the failed state of the container is the unresolvable issue.
11. The computer system of claim 8, wherein the set of processors further executes the program instructions to: determine whether the issue that caused the failed state of the container is a resource allocation issue; determine, using the group of solution agents, an additional amount of resources needed by the container in response to determining that the issue that caused the failed state of the container is the resource allocation issue; allocate the additional amount of resources to the container; and perform the restart of the container on the host node.
12. The computer system of claim 8, wherein the set of processors further executes the program instructions to: receive an input to generate the container on the host node in a container-based environment; generate the container on the host node in response to receiving the input; receive information regarding a status of the container from a group of probes located on the host node; and determine whether the information regarding the status of the container received from the group of probes indicates that the container is in the failed state.
13. The computer system of claim 12, wherein the set of processors further executes the program instructions to: retrieve the exit code information, the container log information, the host node agent log information, and the resource utilization data corresponding to the container in response to determining that the information regarding the status of the container received from the group of probes indicates that the container is in the failed state.
14. A computer program product for intelligent container management, the computer program product comprising a set of computer-readable storage media having program instructions collectively stored therein, the program instructions executable by a computer to cause the computer to: identify, using a group of solution agents, an issue that caused a failed state of a container based on an analysis of exit code information, container log information, host node agent log information, and resource utilization data corresponding to the container; determine whether the issue that caused the failed state of the container is a resolvable issue with a fix; apply, using a runtime mutating webhook controller, the fix to the container prior to restart of the container in response to determining that the issue that caused the failed state of the container is the resolvable issue with the fix; and perform the restart of the container on a host node after applying the fix.
15. The computer program product of claim 14, wherein the program instructions further cause the computer to: record, using the runtime mutating webhook controller, the fix applied to the container in a source code management tracker for application to one or more other containers in a container-based environment upon container restart after failure.
16. The computer program product of claim 14, wherein the program instructions further cause the computer to: determine whether the issue that caused the failed state of the container is an unresolvable issue; and terminate the container and mark the container as no restart in response to determining that the issue that caused the failed state of the container is the unresolvable issue.
17. The computer program product of claim 14, wherein the program instructions further cause the computer to: determine whether the issue that caused the failed state of the container is a resource allocation issue; determine, using the group of solution agents, an additional amount of resources needed by the container in response to determining that the issue that caused the failed state of the container is the resource allocation issue; allocate the additional amount of resources to the container; and perform the restart of the container on the host node.
18. The computer program product of claim 14, wherein the program instructions further cause the computer to: receive an input to generate the container on the host node in a container-based environment; generate the container on the host node in response to receiving the input; receive information regarding a status of the container from a group of probes located on the host node; and determine whether the information regarding the status of the container received from the group of probes indicates that the container is in the failed state.
19. The computer program product of claim 18, wherein the program instructions further cause the computer to: retrieve the exit code information, the container log information, the host node agent log information, and the resource utilization data corresponding to the container in response to determining that the information regarding the status of the container received from the group of probes indicates that the container is in the failed state.
20. The computer program product of claim 14, wherein the program instructions further cause the computer to: determine that the issue causing the failed state of the computer is an unknown issue in response to determining that the issue that caused the failed state of the container is not the resolvable issue with the fix; and terminate the container and mark the container as no restart.
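The decision flow recited across claims 1, 3, 4, and 7 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`Diagnostics`, `Container`, `identify_issue`, `handle_failure`), the log strings it matches, the 50% memory increase, and the `CONFIG_PATH` fix are hypothetical and chosen only for illustration. The `identify_issue` function stands in for the "group of solution agents", and the in-place mutation of the container stands in for the "runtime mutating webhook controller"; no real orchestrator API is called.

```python
from dataclasses import dataclass, field
from enum import Enum, auto


class IssueKind(Enum):
    RESOLVABLE = auto()    # a known fix exists (claim 1)
    RESOURCE = auto()      # resource allocation issue (claim 4)
    UNRESOLVABLE = auto()  # no fix possible (claim 3)
    UNKNOWN = auto()       # not recognized (claim 7)


@dataclass
class Diagnostics:
    """The four inputs named in claim 1 (memory stands in for resource data)."""
    exit_code: int
    container_log: str
    node_agent_log: str
    memory_used_mib: int
    memory_limit_mib: int


@dataclass
class Container:
    name: str
    memory_limit_mib: int
    env: dict = field(default_factory=dict)
    restart_allowed: bool = True
    running: bool = False


def identify_issue(diag: Diagnostics) -> IssueKind:
    """Hypothetical stand-in for the solution agents; the rules are illustrative."""
    # Exit code 137 is 128 + SIGKILL (9), which often accompanies an OOM kill.
    if diag.exit_code == 137 or diag.memory_used_mib >= diag.memory_limit_mib:
        return IssueKind.RESOURCE
    if "missing env CONFIG_PATH" in diag.container_log:
        return IssueKind.RESOLVABLE
    if "image not found" in diag.node_agent_log:
        return IssueKind.UNRESOLVABLE
    return IssueKind.UNKNOWN


def handle_failure(container: Container, diag: Diagnostics) -> str:
    """Apply a fix or more resources before restart, otherwise stop for good."""
    kind = identify_issue(diag)
    if kind is IssueKind.RESOLVABLE:
        # Assumed fix: inject the missing setting before the restart.
        container.env["CONFIG_PATH"] = "/etc/app/config.yaml"
        container.running = True
        return "fixed-and-restarted"
    if kind is IssueKind.RESOURCE:
        # Assumed sizing rule: raise the memory limit by 50%.
        container.memory_limit_mib += container.memory_limit_mib // 2
        container.running = True
        return "resized-and-restarted"
    # Unresolvable or unknown issue: terminate and mark as no restart.
    container.running = False
    container.restart_allowed = False
    return "terminated-no-restart"
```

In this sketch the unresolvable and unknown branches share one outcome, mirroring how claims 3 and 7 both end in terminating the container and marking it as no restart.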
September 9, 2025