NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s!

Date: 2023-03-10 07:44:20

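Today a virtual machine in our test environment suddenly reported an error while running. It was a kernel error I had not seen before, so I searched Google for it.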

  System log:

Nov  :: dev- kernel: NMI watchdog: BUG: soft lockup - CPU# stuck for 22s! [kworker/::]
Nov :: dev- kernel: Modules linked in: binfmt_misc ip6t_rpfilter ipt_REJECT nf_reject_ipv4 ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter vmw_vsock_vmci_transport vsock sb_edac coretemp iosf_mbi crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper ppdev cryptd sg pcspkr vmw_balloon joydev vmw_vmci parport_pc parport shpchp i2c_piix4 ip_tables xfs libcrc32c sr_mod cdrom ata_generic pata_acpi vmwgfx sd_mod drm_kms_helper crc_t10dif crct10dif_generic
Nov :: dev- kernel: syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm ata_piix crct10dif_pclmul crct10dif_common libata crc32c_intel serio_raw vmxnet3 i2c_core vmw_pvscsi floppy
Nov :: dev- kernel: CPU: PID: Comm: kworker/: Kdump: loaded Not tainted 3.10.-862.11..el7.x86_64 #
Nov :: dev- kernel: Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 //
Nov :: dev- kernel: Workqueue: events_freezable vmballoon_work [vmw_balloon]
Nov :: dev- kernel: task: ffff9041caaf1fa0 ti: ffff90418f3ac000 task.ti: ffff90418f3ac000
Nov :: dev- kernel: RIP: :[<ffffffffc02f7297>] [<ffffffffc02f7297>] vmballoon_lock_page+0x57/0x150 [vmw_balloon]
Nov :: dev- kernel: RSP: :ffff90418f3afda0 EFLAGS:
Nov :: dev- kernel: RAX: RBX: RCX:
Nov :: dev- kernel: RDX: RSI: RDI: ffffffffc02fa438
Nov :: dev- kernel: RBP: ffff90418f3afdc0 R08: ffffffffac47600f R09:
Nov :: dev- kernel: R10: ffffe6bbca00e100 R11: fffffffffffffffa R12: ffff9041fffcf008
Nov :: dev- kernel: R13: ffff9041caaf1fa0 R14: ffff9041caaf1fa0 R15: ffff9041caaf1fa0
Nov :: dev- kernel: FS: () GS:ffff9041ffc00000() knlGS:
Nov :: dev- kernel: CS: DS: ES: CR0:
Nov :: dev- kernel: CR2: 00007fed8de1a8d0 CR3: 00000004177ec000 CR4: 00000000000407f0
Nov :: dev- kernel: Call Trace:
Nov :: dev- kernel: [<ffffffffc02f80d5>] vmballoon_work+0x5a5/0x6ff [vmw_balloon]
Nov :: dev- kernel: [<ffffffffabab613f>] process_one_work+0x17f/0x440
Nov :: dev- kernel: [<ffffffffabab71d6>] worker_thread+0x126/0x3c0
Nov :: dev- kernel: [<ffffffffabab70b0>] ? manage_workers.isra.+0x2a0/0x2a0
Nov :: dev- kernel: [<ffffffffababdf21>] kthread+0xd1/0xe0
Nov :: dev- kernel: [<ffffffffababde50>] ? insert_kthread_work+0x40/0x40
Nov :: dev- kernel: [<ffffffffac1255f7>] ret_from_fork_nospec_begin+0x21/0x21
Nov :: dev- kernel: [<ffffffffababde50>] ? insert_kthread_work+0x40/0x40
Nov :: dev- kernel: Code: c1 0f cc bb 6f 6d 6c cf c4 d8 f6 ba b9 4c cb ed <> c0 c6 1f 0f 9b f8 0f
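
To check whether a log contains more reports like the one above, the soft lockup lines can be picked out with a small script. This is only a sketch: the regular expression and the function name `find_soft_lockups` are my own illustration, and they assume the message keeps the wording shown in the title (`soft lockup - CPU#N stuck for Ns!`, optionally followed by a task name in square brackets).

```python
import re

# Assumed message format, based on the line in the title of this post.
# The trailing "[task]" part is optional.
SOFT_LOCKUP_RE = re.compile(
    r"soft lockup - CPU#(?P<cpu>\d+) stuck for (?P<seconds>\d+)s!"
    r"(?:\s+\[(?P<task>[^\]]*)\])?"
)


def find_soft_lockups(lines):
    """Return one dict (cpu, seconds, task) per soft lockup line found."""
    events = []
    for line in lines:
        match = SOFT_LOCKUP_RE.search(line)
        if match:
            events.append({
                "cpu": int(match.group("cpu")),
                "seconds": int(match.group("seconds")),
                "task": match.group("task"),  # None when no [task] suffix
            })
    return events


if __name__ == "__main__":
    # The only sample used here is the message from the title of this post.
    sample = ["NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s!"]
    for event in find_soft_lockups(sample):
        print(event)
```

In practice the `lines` argument would be an open log file (for example the system log the excerpt above came from), since iterating over a file yields its lines.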

Key points:

watchdog: a watchdog is a mechanism that keeps the system running normally, or recovers it from abnormal states such as infinite loops and deadlocks.
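
As a rough illustration of how the watchdog decides that a CPU is "stuck": as far as I understand the kernel documentation, the soft lockup threshold is twice the `kernel.watchdog_thresh` sysctl, so a setting of 10 seconds gives a 20 second threshold, which would fit the "stuck for 22s" message. The sketch below reads that value from `/proc/sys/kernel/watchdog_thresh`; the helper names are my own, and the file may be missing or unreadable on some systems.

```python
from pathlib import Path

# Standard procfs location of the kernel.watchdog_thresh sysctl (Linux only).
WATCHDOG_THRESH = Path("/proc/sys/kernel/watchdog_thresh")


def soft_lockup_threshold(watchdog_thresh):
    """Soft lockup threshold in seconds, taken as 2 * watchdog_thresh."""
    return 2 * watchdog_thresh


def read_watchdog_thresh(path=WATCHDOG_THRESH):
    """Return the configured watchdog_thresh, or None if it cannot be read."""
    try:
        return int(path.read_text().strip())
    except (OSError, ValueError):
        return None


if __name__ == "__main__":
    thresh = read_watchdog_thresh()
    if thresh is None:
        print("watchdog_thresh is not readable on this system")
    else:
        print(f"watchdog_thresh={thresh}s, "
              f"soft lockup threshold={soft_lockup_threshold(thresh)}s")
```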

https://blog.****.net/whatday/article/details/73770736

http://oenhan.com/kernel-deadlock-check

Soft lockup: the CPU deadlock problem

https://blog.****.net/sunny05296/article/details/82858071