Parameters That Control Kernel Panics
The following parameters control the circumstances under which a kernel panic can occur.
Parameter | Description |
---|---|
kernel.hung_task_panic
|
If set to 1, the kernel panics if any kernel or user thread sleeps in the
The default value is 0, which disables the panic. Tip:
To diagnose a hung thread, you can examine
|
kernel.hung_task_timeout_secs
|
Specifies how long a user or kernel thread can remain in D
state before a warning message is generated or the kernel
panics, if the value of
|
kernel.nmi_watchdog
|
If set to 1 (default), enables the nonmaskable interrupt (NMI) watchdog thread in the
kernel. To use the NMI switch or the OProfile system profiler to generate an undefined
NMI, set the value of |
kernel.panic
|
Specifies the number of seconds after a panic before a system automatically resets itself. If the value is 0, which is the default value, the system becomes suspended, and you can collect detailed information about the panic for troubleshooting. To enable automatic reset, set a nonzero value. If you require a memory image
( |
kernel.panic_on_io_nmi
|
If set to 0 (default), the system tries to continue operations if the kernel detects an I/O channel check (IOCHK) NMI that typically indicates a uncorrectable hardware error. If set to 1, the system panics. |
kernel.panic_on_oops
|
If set to 0, the system tries to continue operations if the kernel detects an
In an OCFS2 cluster. set the value to 1 to specify that a system must panic if a kernel oops occurs. If a kernel thread required for cluster operation fails, the system must reset itself. Otherwise, another node might not detect whether a node is slow to respond or unable to respond, causing cluster operations to halt. |
kernel.panic_on_unrecovered_nmi
|
If set to 0 (default), the system tries to continue operations if the kernel detects an NMI that might indicate an uncorrectable parity or ECC memory error. If set to 1, the system panics. |
kernel.softlockup_panic
|
If set to 0 (default), the system tries to continue operations if the kernel detects a
soft-lockup error that causes the NMI watchdog thread to fail to update its
timestamp for more than twice the value of |
kernel.unknown_nmi_panic
|
If set to |
kernel.watchdog_thresh
|
Specifies the interval between generating an NMI
performance monitoring interrupt that the kernel uses to
check for hard-lockup and
soft-lockup errors. A hard-lockup
error is assumed if a CPU is unresponsive to the interrupt
for more than |
vm.panic_on_oom
|
If set to 0 (default), the kernel’s OOM-killer scans through the entire task list and stops a memory-hogging process to avoid a panic. If set to 1, the kernel panics but can survive under certain conditions. If a process limits allocations to certain nodes by using memory policies or cpusets, and those nodes reach memory exhaustion status, the OOM-killer can stop one process. No panic occurs in this case because other nodes’ memory might be free and the system as a whole might not yet be out of memory. If set to 2, the kernel always panics when an OOM condition occurs. Settings of 1 and 2 are for intended for use with clusters, depending on the defined failover policy. |