Skip to content

Instantly share code, notes, and snippets.

@kwilczynski
Created January 12, 2025 11:10
Show Gist options
  • Save kwilczynski/e80c410998b4d851b10bf5affed83839 to your computer and use it in GitHub Desktop.
Save kwilczynski/e80c410998b4d851b10bf5affed83839 to your computer and use it in GitHub Desktop.
Jan 11 11:48:06 rocinante kernel: [17610.105025] amdgpu 0000:03:00.0: amdgpu: Dumping IP State
Jan 11 11:48:06 rocinante kernel: [17610.106837] amdgpu 0000:03:00.0: amdgpu: Dumping IP State Completed
Jan 11 11:48:06 rocinante kernel: [17610.106864] amdgpu 0000:03:00.0: amdgpu: ring sdma0 timeout, signaled seq=165765, emitted seq=165768
Jan 11 11:48:06 rocinante kernel: [17610.106867] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
Jan 11 11:48:06 rocinante kernel: [17610.106918] amdgpu 0000:03:00.0: amdgpu: Failed to disallow df cstate
Jan 11 11:48:06 rocinante kernel: [17610.137182] ------------[ cut here ]------------
Jan 11 11:48:06 rocinante kernel: [17610.137183] WARNING: CPU: 18 PID: 2156760 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.137349] Modules linked in: uhid rfcomm cmac algif_hash algif_skcipher af_alg xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bridge stp llc xfrm_user xfrm_algo xt_addrtype nft_compat vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) snd_seq_dummy snd_hrtimer nf_tables nfnetlink bnep zram 842_decompress amd_atl intel_rapl_msr 842_compress overlay intel_rapl_common lz4hc_compress lz4_compress snd_usb_audio snd_usbmidi_lib snd_hwdep btusb snd_ump btrtl btintel snd_pcm btbcm btmtk snd_seq_midi snd_seq_midi_event snd_rawmidi binfmt_misc snd_seq uvcvideo videobuf2_vmalloc uvc snd_seq_device videobuf2_memops mt7921e snd_timer videobuf2_v4l2 mt7921_common videobuf2_common mt792x_lib snd mt76_connac_lib nls_iso8859_1 videodev mt76 edac_mce_amd soundcore bluetooth joydev hid_multitouch input_leds mc vfat fat mac80211 kvm_amd asus_nb_wmi cfg80211 asus_wmi platform_profile kvm ccp libarc4 sparse_keymap spd5118 wmi_bmof sch_fq_codel msr kyber_iosched efi_pstore ip_tables x_tables autofs4
Jan 11 11:48:06 rocinante kernel: [17610.137376] dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 tcp_bbr nzxt_kraken3 hid_generic crct10dif_pclmul usbhid hid crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel sha256_ssse3 sha1_ssse3 ahci libahci amdgpu amdxcp drm_exec gpu_sched drm_buddy i2c_algo_bit drm_suballoc_helper nvme drm_ttm_helper ttm nvme_core igc drm_display_helper nvme_auth video wmi aesni_intel crypto_simd cryptd zstd
Jan 11 11:48:06 rocinante kernel: [17610.137391] CPU: 18 UID: 0 PID: 2156760 Comm: kworker/u128:2 Tainted: G OE 6.12.9 #1
Jan 11 11:48:06 rocinante kernel: [17610.137393] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Jan 11 11:48:06 rocinante kernel: [17610.137393] Hardware name: ASUS System Product Name/ROG STRIX B650E-I GAMING WIFI, BIOS 3067 12/10/2024
Jan 11 11:48:06 rocinante kernel: [17610.137394] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.137397] RIP: 0010:amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.137532] Code: 31 f6 31 ff c3 cc cc cc cc 44 89 e2 48 89 de 4c 89 f7 e8 a4 fc ff ff 5b 41 5c 41 5d 41 5e 5d 31 d2 31 f6 31 ff c3 cc cc cc cc <0f> 0b b8 ea ff ff ff eb c3 b8 fe ff ff ff eb bc 90 90 90 90 90 90
Jan 11 11:48:06 rocinante kernel: [17610.137534] RSP: 0018:ffffabcd4639fbd8 EFLAGS: 00010246
Jan 11 11:48:06 rocinante kernel: [17610.137535] RAX: 0000000000000000 RBX: ffff9632b3ca5558 RCX: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.137535] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.137536] RBP: ffffabcd4639fbf8 R08: 0000000000000000 R09: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.137536] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.137537] R13: 0000000000000001 R14: ffff9632b3c80000 R15: ffff9632b3c80000
Jan 11 11:48:06 rocinante kernel: [17610.137537] FS: 0000000000000000(0000) GS:ffff96495d900000(0000) knlGS:0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.137538] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 11 11:48:06 rocinante kernel: [17610.137538] CR2: 00007f7db2b10e24 CR3: 00000016e7d4a000 CR4: 0000000000f50ef0
Jan 11 11:48:06 rocinante kernel: [17610.137539] PKRU: 55555554
Jan 11 11:48:06 rocinante kernel: [17610.137539] Call Trace:
Jan 11 11:48:06 rocinante kernel: [17610.137540] <TASK>
Jan 11 11:48:06 rocinante kernel: [17610.137542] ? show_regs+0x6b/0x80
Jan 11 11:48:06 rocinante kernel: [17610.137545] ? __warn+0x8d/0x150
Jan 11 11:48:06 rocinante kernel: [17610.137548] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.137671] ? report_bug+0x182/0x1b0
Jan 11 11:48:06 rocinante kernel: [17610.137673] ? handle_bug+0x63/0xa0
Jan 11 11:48:06 rocinante kernel: [17610.137675] ? exc_invalid_op+0x18/0x80
Jan 11 11:48:06 rocinante kernel: [17610.137677] ? asm_exc_invalid_op+0x1b/0x20
Jan 11 11:48:06 rocinante kernel: [17610.137680] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.137800] ? amdgpu_irq_put+0x55/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.137919] gfx_v10_0_hw_fini+0x1d/0x120 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138054] gfx_v10_0_suspend+0xe/0x20 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138176] amdgpu_device_ip_suspend_phase2+0x25c/0x490 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138296] amdgpu_device_ip_suspend+0x49/0x80 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138416] amdgpu_device_pre_asic_reset+0xf2/0x610 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138535] amdgpu_device_gpu_recover+0x327/0xf10 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138655] amdgpu_job_timedout+0x1ab/0x580 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138804] ? raw_spin_rq_unlock+0x10/0x40
Jan 11 11:48:06 rocinante kernel: [17610.138807] drm_sched_job_timedout+0x6d/0x110 [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.138809] process_one_work+0x177/0x3c0
Jan 11 11:48:06 rocinante kernel: [17610.138811] worker_thread+0x2b6/0x3e0
Jan 11 11:48:06 rocinante kernel: [17610.138812] ? __pfx_worker_thread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.138813] kthread+0xe5/0x120
Jan 11 11:48:06 rocinante kernel: [17610.138816] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.138817] ret_from_fork+0x44/0x70
Jan 11 11:48:06 rocinante kernel: [17610.138820] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.138821] ret_from_fork_asm+0x1a/0x30
Jan 11 11:48:06 rocinante kernel: [17610.138823] </TASK>
Jan 11 11:48:06 rocinante kernel: [17610.138824] ---[ end trace 0000000000000000 ]---
Jan 11 11:48:06 rocinante kernel: [17610.138843] ------------[ cut here ]------------
Jan 11 11:48:06 rocinante kernel: [17610.138844] WARNING: CPU: 18 PID: 2156760 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.138974] Modules linked in: uhid rfcomm cmac algif_hash algif_skcipher af_alg xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bridge stp llc xfrm_user xfrm_algo xt_addrtype nft_compat vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) snd_seq_dummy snd_hrtimer nf_tables nfnetlink bnep zram 842_decompress amd_atl intel_rapl_msr 842_compress overlay intel_rapl_common lz4hc_compress lz4_compress snd_usb_audio snd_usbmidi_lib snd_hwdep btusb snd_ump btrtl btintel snd_pcm btbcm btmtk snd_seq_midi snd_seq_midi_event snd_rawmidi binfmt_misc snd_seq uvcvideo videobuf2_vmalloc uvc snd_seq_device videobuf2_memops mt7921e snd_timer videobuf2_v4l2 mt7921_common videobuf2_common mt792x_lib snd mt76_connac_lib nls_iso8859_1 videodev mt76 edac_mce_amd soundcore bluetooth joydev hid_multitouch input_leds mc vfat fat mac80211 kvm_amd asus_nb_wmi cfg80211 asus_wmi platform_profile kvm ccp libarc4 sparse_keymap spd5118 wmi_bmof sch_fq_codel msr kyber_iosched efi_pstore ip_tables x_tables autofs4
Jan 11 11:48:06 rocinante kernel: [17610.138997] dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 tcp_bbr nzxt_kraken3 hid_generic crct10dif_pclmul usbhid hid crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel sha256_ssse3 sha1_ssse3 ahci libahci amdgpu amdxcp drm_exec gpu_sched drm_buddy i2c_algo_bit drm_suballoc_helper nvme drm_ttm_helper ttm nvme_core igc drm_display_helper nvme_auth video wmi aesni_intel crypto_simd cryptd zstd
Jan 11 11:48:06 rocinante kernel: [17610.139008] CPU: 18 UID: 0 PID: 2156760 Comm: kworker/u128:2 Tainted: G W OE 6.12.9 #1
Jan 11 11:48:06 rocinante kernel: [17610.139009] Tainted: [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Jan 11 11:48:06 rocinante kernel: [17610.139010] Hardware name: ASUS System Product Name/ROG STRIX B650E-I GAMING WIFI, BIOS 3067 12/10/2024
Jan 11 11:48:06 rocinante kernel: [17610.139010] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.139012] RIP: 0010:amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139135] Code: 31 f6 31 ff c3 cc cc cc cc 44 89 e2 48 89 de 4c 89 f7 e8 a4 fc ff ff 5b 41 5c 41 5d 41 5e 5d 31 d2 31 f6 31 ff c3 cc cc cc cc <0f> 0b b8 ea ff ff ff eb c3 b8 fe ff ff ff eb bc 90 90 90 90 90 90
Jan 11 11:48:06 rocinante kernel: [17610.139136] RSP: 0018:ffffabcd4639fbd8 EFLAGS: 00010246
Jan 11 11:48:06 rocinante kernel: [17610.139137] RAX: 0000000000000000 RBX: ffff9632b3ca5570 RCX: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.139138] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.139139] RBP: ffffabcd4639fbf8 R08: 0000000000000000 R09: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.139139] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.139140] R13: 0000000000000001 R14: ffff9632b3c80000 R15: ffff9632b3c80000
Jan 11 11:48:06 rocinante kernel: [17610.139140] FS: 0000000000000000(0000) GS:ffff96495d900000(0000) knlGS:0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.139142] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 11 11:48:06 rocinante kernel: [17610.139142] CR2: 00007f7db2b10e24 CR3: 00000016e7d4a000 CR4: 0000000000f50ef0
Jan 11 11:48:06 rocinante kernel: [17610.139143] PKRU: 55555554
Jan 11 11:48:06 rocinante kernel: [17610.139143] Call Trace:
Jan 11 11:48:06 rocinante kernel: [17610.139144] <TASK>
Jan 11 11:48:06 rocinante kernel: [17610.139145] ? show_regs+0x6b/0x80
Jan 11 11:48:06 rocinante kernel: [17610.139146] ? __warn+0x8d/0x150
Jan 11 11:48:06 rocinante kernel: [17610.139148] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139266] ? report_bug+0x182/0x1b0
Jan 11 11:48:06 rocinante kernel: [17610.139268] ? handle_bug+0x63/0xa0
Jan 11 11:48:06 rocinante kernel: [17610.139269] ? exc_invalid_op+0x18/0x80
Jan 11 11:48:06 rocinante kernel: [17610.139270] ? asm_exc_invalid_op+0x1b/0x20
Jan 11 11:48:06 rocinante kernel: [17610.139273] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139390] ? amdgpu_irq_put+0x55/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139508] gfx_v10_0_hw_fini+0x2e/0x120 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139635] gfx_v10_0_suspend+0xe/0x20 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139754] amdgpu_device_ip_suspend_phase2+0x25c/0x490 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139872] amdgpu_device_ip_suspend+0x49/0x80 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.139990] amdgpu_device_pre_asic_reset+0xf2/0x610 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.140110] amdgpu_device_gpu_recover+0x327/0xf10 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.140228] amdgpu_job_timedout+0x1ab/0x580 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.140365] ? raw_spin_rq_unlock+0x10/0x40
Jan 11 11:48:06 rocinante kernel: [17610.140366] drm_sched_job_timedout+0x6d/0x110 [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.140368] process_one_work+0x177/0x3c0
Jan 11 11:48:06 rocinante kernel: [17610.140370] worker_thread+0x2b6/0x3e0
Jan 11 11:48:06 rocinante kernel: [17610.140371] ? __pfx_worker_thread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.140372] kthread+0xe5/0x120
Jan 11 11:48:06 rocinante kernel: [17610.140373] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.140375] ret_from_fork+0x44/0x70
Jan 11 11:48:06 rocinante kernel: [17610.140376] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.140377] ret_from_fork_asm+0x1a/0x30
Jan 11 11:48:06 rocinante kernel: [17610.140380] </TASK>
Jan 11 11:48:06 rocinante kernel: [17610.140380] ---[ end trace 0000000000000000 ]---
Jan 11 11:48:06 rocinante kernel: [17610.140393] ------------[ cut here ]------------
Jan 11 11:48:06 rocinante kernel: [17610.140394] WARNING: CPU: 18 PID: 2156760 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.140521] Modules linked in: uhid rfcomm cmac algif_hash algif_skcipher af_alg xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bridge stp llc xfrm_user xfrm_algo xt_addrtype nft_compat vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) snd_seq_dummy snd_hrtimer nf_tables nfnetlink bnep zram 842_decompress amd_atl intel_rapl_msr 842_compress overlay intel_rapl_common lz4hc_compress lz4_compress snd_usb_audio snd_usbmidi_lib snd_hwdep btusb snd_ump btrtl btintel snd_pcm btbcm btmtk snd_seq_midi snd_seq_midi_event snd_rawmidi binfmt_misc snd_seq uvcvideo videobuf2_vmalloc uvc snd_seq_device videobuf2_memops mt7921e snd_timer videobuf2_v4l2 mt7921_common videobuf2_common mt792x_lib snd mt76_connac_lib nls_iso8859_1 videodev mt76 edac_mce_amd soundcore bluetooth joydev hid_multitouch input_leds mc vfat fat mac80211 kvm_amd asus_nb_wmi cfg80211 asus_wmi platform_profile kvm ccp libarc4 sparse_keymap spd5118 wmi_bmof sch_fq_codel msr kyber_iosched efi_pstore ip_tables x_tables autofs4
Jan 11 11:48:06 rocinante kernel: [17610.140542] dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 tcp_bbr nzxt_kraken3 hid_generic crct10dif_pclmul usbhid hid crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel sha256_ssse3 sha1_ssse3 ahci libahci amdgpu amdxcp drm_exec gpu_sched drm_buddy i2c_algo_bit drm_suballoc_helper nvme drm_ttm_helper ttm nvme_core igc drm_display_helper nvme_auth video wmi aesni_intel crypto_simd cryptd zstd
Jan 11 11:48:06 rocinante kernel: [17610.140552] CPU: 18 UID: 0 PID: 2156760 Comm: kworker/u128:2 Tainted: G W OE 6.12.9 #1
Jan 11 11:48:06 rocinante kernel: [17610.140553] Tainted: [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Jan 11 11:48:06 rocinante kernel: [17610.140553] Hardware name: ASUS System Product Name/ROG STRIX B650E-I GAMING WIFI, BIOS 3067 12/10/2024
Jan 11 11:48:06 rocinante kernel: [17610.140554] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.140556] RIP: 0010:amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.140673] Code: 31 f6 31 ff c3 cc cc cc cc 44 89 e2 48 89 de 4c 89 f7 e8 a4 fc ff ff 5b 41 5c 41 5d 41 5e 5d 31 d2 31 f6 31 ff c3 cc cc cc cc <0f> 0b b8 ea ff ff ff eb c3 b8 fe ff ff ff eb bc 90 90 90 90 90 90
Jan 11 11:48:06 rocinante kernel: [17610.140674] RSP: 0018:ffffabcd4639fbd8 EFLAGS: 00010246
Jan 11 11:48:06 rocinante kernel: [17610.140674] RAX: 0000000000000000 RBX: ffff9632b3ca5588 RCX: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.140675] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.140675] RBP: ffffabcd4639fbf8 R08: 0000000000000000 R09: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.140676] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.140676] R13: 0000000000000001 R14: ffff9632b3c80000 R15: ffff9632b3c80000
Jan 11 11:48:06 rocinante kernel: [17610.140677] FS: 0000000000000000(0000) GS:ffff96495d900000(0000) knlGS:0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.140677] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 11 11:48:06 rocinante kernel: [17610.140678] CR2: 00007f7db2b10e24 CR3: 00000016e7d4a000 CR4: 0000000000f50ef0
Jan 11 11:48:06 rocinante kernel: [17610.140678] PKRU: 55555554
Jan 11 11:48:06 rocinante kernel: [17610.140678] Call Trace:
Jan 11 11:48:06 rocinante kernel: [17610.140679] <TASK>
Jan 11 11:48:06 rocinante kernel: [17610.140679] ? show_regs+0x6b/0x80
Jan 11 11:48:06 rocinante kernel: [17610.140680] ? __warn+0x8d/0x150
Jan 11 11:48:06 rocinante kernel: [17610.140681] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.140797] ? report_bug+0x182/0x1b0
Jan 11 11:48:06 rocinante kernel: [17610.140799] ? handle_bug+0x63/0xa0
Jan 11 11:48:06 rocinante kernel: [17610.140800] ? exc_invalid_op+0x18/0x80
Jan 11 11:48:06 rocinante kernel: [17610.140802] ? asm_exc_invalid_op+0x1b/0x20
Jan 11 11:48:06 rocinante kernel: [17610.140804] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.140919] ? amdgpu_irq_put+0x55/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141037] gfx_v10_0_hw_fini+0x3f/0x120 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141161] gfx_v10_0_suspend+0xe/0x20 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141279] amdgpu_device_ip_suspend_phase2+0x25c/0x490 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141395] amdgpu_device_ip_suspend+0x49/0x80 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141511] amdgpu_device_pre_asic_reset+0xf2/0x610 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141627] amdgpu_device_gpu_recover+0x327/0xf10 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141744] amdgpu_job_timedout+0x1ab/0x580 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.141879] ? raw_spin_rq_unlock+0x10/0x40
Jan 11 11:48:06 rocinante kernel: [17610.141880] drm_sched_job_timedout+0x6d/0x110 [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.141882] process_one_work+0x177/0x3c0
Jan 11 11:48:06 rocinante kernel: [17610.141883] worker_thread+0x2b6/0x3e0
Jan 11 11:48:06 rocinante kernel: [17610.141884] ? __pfx_worker_thread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.141885] kthread+0xe5/0x120
Jan 11 11:48:06 rocinante kernel: [17610.141887] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.141888] ret_from_fork+0x44/0x70
Jan 11 11:48:06 rocinante kernel: [17610.141890] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.141891] ret_from_fork_asm+0x1a/0x30
Jan 11 11:48:06 rocinante kernel: [17610.141893] </TASK>
Jan 11 11:48:06 rocinante kernel: [17610.141893] ---[ end trace 0000000000000000 ]---
Jan 11 11:48:06 rocinante kernel: [17610.142214] ------------[ cut here ]------------
Jan 11 11:48:06 rocinante kernel: [17610.142215] WARNING: CPU: 18 PID: 2156760 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.142341] Modules linked in: uhid rfcomm cmac algif_hash algif_skcipher af_alg xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bridge stp llc xfrm_user xfrm_algo xt_addrtype nft_compat vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) snd_seq_dummy snd_hrtimer nf_tables nfnetlink bnep zram 842_decompress amd_atl intel_rapl_msr 842_compress overlay intel_rapl_common lz4hc_compress lz4_compress snd_usb_audio snd_usbmidi_lib snd_hwdep btusb snd_ump btrtl btintel snd_pcm btbcm btmtk snd_seq_midi snd_seq_midi_event snd_rawmidi binfmt_misc snd_seq uvcvideo videobuf2_vmalloc uvc snd_seq_device videobuf2_memops mt7921e snd_timer videobuf2_v4l2 mt7921_common videobuf2_common mt792x_lib snd mt76_connac_lib nls_iso8859_1 videodev mt76 edac_mce_amd soundcore bluetooth joydev hid_multitouch input_leds mc vfat fat mac80211 kvm_amd asus_nb_wmi cfg80211 asus_wmi platform_profile kvm ccp libarc4 sparse_keymap spd5118 wmi_bmof sch_fq_codel msr kyber_iosched efi_pstore ip_tables x_tables autofs4
Jan 11 11:48:06 rocinante kernel: [17610.142361] dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 tcp_bbr nzxt_kraken3 hid_generic crct10dif_pclmul usbhid hid crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel sha256_ssse3 sha1_ssse3 ahci libahci amdgpu amdxcp drm_exec gpu_sched drm_buddy i2c_algo_bit drm_suballoc_helper nvme drm_ttm_helper ttm nvme_core igc drm_display_helper nvme_auth video wmi aesni_intel crypto_simd cryptd zstd
Jan 11 11:48:06 rocinante kernel: [17610.142371] CPU: 18 UID: 0 PID: 2156760 Comm: kworker/u128:2 Tainted: G W OE 6.12.9 #1
Jan 11 11:48:06 rocinante kernel: [17610.142372] Tainted: [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Jan 11 11:48:06 rocinante kernel: [17610.142372] Hardware name: ASUS System Product Name/ROG STRIX B650E-I GAMING WIFI, BIOS 3067 12/10/2024
Jan 11 11:48:06 rocinante kernel: [17610.142373] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.142375] RIP: 0010:amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.142491] Code: 31 f6 31 ff c3 cc cc cc cc 44 89 e2 48 89 de 4c 89 f7 e8 a4 fc ff ff 5b 41 5c 41 5d 41 5e 5d 31 d2 31 f6 31 ff c3 cc cc cc cc <0f> 0b b8 ea ff ff ff eb c3 b8 fe ff ff ff eb bc 90 90 90 90 90 90
Jan 11 11:48:06 rocinante kernel: [17610.142492] RSP: 0018:ffffabcd4639fb88 EFLAGS: 00010246
Jan 11 11:48:06 rocinante kernel: [17610.142493] RAX: 0000000000000000 RBX: ffff9632b3c4b008 RCX: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.142493] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.142494] RBP: ffffabcd4639fba8 R08: 0000000000000000 R09: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.142494] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.142495] R13: 0000000000000001 R14: ffff9632b3c80000 R15: ffff9632b3c80000
Jan 11 11:48:06 rocinante kernel: [17610.142495] FS: 0000000000000000(0000) GS:ffff96495d900000(0000) knlGS:0000000000000000
Jan 11 11:48:06 rocinante kernel: [17610.142496] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 11 11:48:06 rocinante kernel: [17610.142496] CR2: 00007f7db2b10e24 CR3: 00000016e7d4a000 CR4: 0000000000f50ef0
Jan 11 11:48:06 rocinante kernel: [17610.142497] PKRU: 55555554
Jan 11 11:48:06 rocinante kernel: [17610.142497] Call Trace:
Jan 11 11:48:06 rocinante kernel: [17610.142497] <TASK>
Jan 11 11:48:06 rocinante kernel: [17610.142498] ? show_regs+0x6b/0x80
Jan 11 11:48:06 rocinante kernel: [17610.142499] ? __warn+0x8d/0x150
Jan 11 11:48:06 rocinante kernel: [17610.142500] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.142615] ? report_bug+0x182/0x1b0
Jan 11 11:48:06 rocinante kernel: [17610.142617] ? handle_bug+0x63/0xa0
Jan 11 11:48:06 rocinante kernel: [17610.142618] ? exc_invalid_op+0x18/0x80
Jan 11 11:48:06 rocinante kernel: [17610.142619] ? asm_exc_invalid_op+0x1b/0x20
Jan 11 11:48:06 rocinante kernel: [17610.142621] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.142736] ? amdgpu_irq_put+0x55/0xb0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.142851] smu_v11_0_disable_thermal_alert+0x17/0x30 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.142988] smu_smc_hw_cleanup+0x7e/0x4d0 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.143133] smu_suspend+0x71/0x110 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.143257] amdgpu_device_ip_suspend_phase2+0x25c/0x490 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.143374] amdgpu_device_ip_suspend+0x49/0x80 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.143490] amdgpu_device_pre_asic_reset+0xf2/0x610 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.143607] amdgpu_device_gpu_recover+0x327/0xf10 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.143724] amdgpu_job_timedout+0x1ab/0x580 [amdgpu]
Jan 11 11:48:06 rocinante kernel: [17610.143856] ? raw_spin_rq_unlock+0x10/0x40
Jan 11 11:48:06 rocinante kernel: [17610.143857] drm_sched_job_timedout+0x6d/0x110 [gpu_sched]
Jan 11 11:48:06 rocinante kernel: [17610.143859] process_one_work+0x177/0x3c0
Jan 11 11:48:06 rocinante kernel: [17610.143861] worker_thread+0x2b6/0x3e0
Jan 11 11:48:06 rocinante kernel: [17610.143862] ? __pfx_worker_thread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.143863] kthread+0xe5/0x120
Jan 11 11:48:06 rocinante kernel: [17610.143864] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.143866] ret_from_fork+0x44/0x70
Jan 11 11:48:06 rocinante kernel: [17610.143867] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:06 rocinante kernel: [17610.143869] ret_from_fork_asm+0x1a/0x30
Jan 11 11:48:06 rocinante kernel: [17610.143870] </TASK>
Jan 11 11:48:06 rocinante kernel: [17610.143871] ---[ end trace 0000000000000000 ]---
Jan 11 11:48:06 rocinante kernel: [17610.143872] amdgpu 0000:03:00.0: amdgpu: Fail to disable thermal alert!
Jan 11 11:48:06 rocinante kernel: [17610.143873] [drm:amdgpu_device_ip_suspend_phase2 [amdgpu]] *ERROR* suspend of IP block <smu> failed -22
Jan 11 11:48:07 rocinante kernel: [17611.193497] igc 0000:0a:00.0 eno1: NIC Link is Up 2500 Mbps Full Duplex, Flow Control: RX/TX
Jan 11 11:48:07 rocinante kernel: [17611.266555] amdgpu 0000:03:00.0: amdgpu: psp gfx command DESTROY_TMR(0x7) failed and response status is (0x80000306)
Jan 11 11:48:07 rocinante kernel: [17611.287868] ------------[ cut here ]------------
Jan 11 11:48:07 rocinante kernel: [17611.287869] WARNING: CPU: 6 PID: 2156760 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.288034] Modules linked in: uhid rfcomm cmac algif_hash algif_skcipher af_alg xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bridge stp llc xfrm_user xfrm_algo xt_addrtype nft_compat vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) snd_seq_dummy snd_hrtimer nf_tables nfnetlink bnep zram 842_decompress amd_atl intel_rapl_msr 842_compress overlay intel_rapl_common lz4hc_compress lz4_compress snd_usb_audio snd_usbmidi_lib snd_hwdep btusb snd_ump btrtl btintel snd_pcm btbcm btmtk snd_seq_midi snd_seq_midi_event snd_rawmidi binfmt_misc snd_seq uvcvideo videobuf2_vmalloc uvc snd_seq_device videobuf2_memops mt7921e snd_timer videobuf2_v4l2 mt7921_common videobuf2_common mt792x_lib snd mt76_connac_lib nls_iso8859_1 videodev mt76 edac_mce_amd soundcore bluetooth joydev hid_multitouch input_leds mc vfat fat mac80211 kvm_amd asus_nb_wmi cfg80211 asus_wmi platform_profile kvm ccp libarc4 sparse_keymap spd5118 wmi_bmof sch_fq_codel msr kyber_iosched efi_pstore ip_tables x_tables autofs4
Jan 11 11:48:07 rocinante kernel: [17611.288066] dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 tcp_bbr nzxt_kraken3 hid_generic crct10dif_pclmul usbhid hid crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel sha256_ssse3 sha1_ssse3 ahci libahci amdgpu amdxcp drm_exec gpu_sched drm_buddy i2c_algo_bit drm_suballoc_helper nvme drm_ttm_helper ttm nvme_core igc drm_display_helper nvme_auth video wmi aesni_intel crypto_simd cryptd zstd
Jan 11 11:48:07 rocinante kernel: [17611.288081] CPU: 6 UID: 0 PID: 2156760 Comm: kworker/u128:2 Tainted: G W OE 6.12.9 #1
Jan 11 11:48:07 rocinante kernel: [17611.288083] Tainted: [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
Jan 11 11:48:07 rocinante kernel: [17611.288083] Hardware name: ASUS System Product Name/ROG STRIX B650E-I GAMING WIFI, BIOS 3067 12/10/2024
Jan 11 11:48:07 rocinante kernel: [17611.288084] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jan 11 11:48:07 rocinante kernel: [17611.288088] RIP: 0010:amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.288228] Code: 31 f6 31 ff c3 cc cc cc cc 44 89 e2 48 89 de 4c 89 f7 e8 a4 fc ff ff 5b 41 5c 41 5d 41 5e 5d 31 d2 31 f6 31 ff c3 cc cc cc cc <0f> 0b b8 ea ff ff ff eb c3 b8 fe ff ff ff eb bc 90 90 90 90 90 90
Jan 11 11:48:07 rocinante kernel: [17611.288229] RSP: 0018:ffffabcd4639fbd8 EFLAGS: 00010246
Jan 11 11:48:07 rocinante kernel: [17611.288230] RAX: 0000000000000000 RBX: ffff9632b3c80c78 RCX: 0000000000000000
Jan 11 11:48:07 rocinante kernel: [17611.288231] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
Jan 11 11:48:07 rocinante kernel: [17611.288231] RBP: ffffabcd4639fbf8 R08: 0000000000000000 R09: 0000000000000000
Jan 11 11:48:07 rocinante kernel: [17611.288232] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Jan 11 11:48:07 rocinante kernel: [17611.288232] R13: 0000000000000001 R14: ffff9632b3c80000 R15: ffff9632b3c80000
Jan 11 11:48:07 rocinante kernel: [17611.288232] FS: 0000000000000000(0000) GS:ffff96495d300000(0000) knlGS:0000000000000000
Jan 11 11:48:07 rocinante kernel: [17611.288233] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 11 11:48:07 rocinante kernel: [17611.288234] CR2: 0000763a3621ca50 CR3: 00000012cd8bc000 CR4: 0000000000f50ef0
Jan 11 11:48:07 rocinante kernel: [17611.288234] PKRU: 55555554
Jan 11 11:48:07 rocinante kernel: [17611.288235] Call Trace:
Jan 11 11:48:07 rocinante kernel: [17611.288235] <TASK>
Jan 11 11:48:07 rocinante kernel: [17611.288237] ? show_regs+0x6b/0x80
Jan 11 11:48:07 rocinante kernel: [17611.288240] ? __warn+0x8d/0x150
Jan 11 11:48:07 rocinante kernel: [17611.288242] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.288377] ? report_bug+0x182/0x1b0
Jan 11 11:48:07 rocinante kernel: [17611.288379] ? handle_bug+0x63/0xa0
Jan 11 11:48:07 rocinante kernel: [17611.288381] ? exc_invalid_op+0x18/0x80
Jan 11 11:48:07 rocinante kernel: [17611.288382] ? asm_exc_invalid_op+0x1b/0x20
Jan 11 11:48:07 rocinante kernel: [17611.288385] ? amdgpu_irq_put+0x9f/0xb0 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.288521] ? amdgpu_irq_put+0x55/0xb0 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.288655] gmc_v10_0_hw_fini+0x67/0xe0 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.288789] gmc_v10_0_suspend+0xe/0x20 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.288923] amdgpu_device_ip_suspend_phase2+0x25c/0x490 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.289048] amdgpu_device_ip_suspend+0x49/0x80 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.289169] amdgpu_device_pre_asic_reset+0xf2/0x610 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.289290] amdgpu_device_gpu_recover+0x327/0xf10 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.289413] amdgpu_job_timedout+0x1ab/0x580 [amdgpu]
Jan 11 11:48:07 rocinante kernel: [17611.289571] ? raw_spin_rq_unlock+0x10/0x40
Jan 11 11:48:07 rocinante kernel: [17611.289574] drm_sched_job_timedout+0x6d/0x110 [gpu_sched]
Jan 11 11:48:07 rocinante kernel: [17611.289576] process_one_work+0x177/0x3c0
Jan 11 11:48:07 rocinante kernel: [17611.289578] worker_thread+0x2b6/0x3e0
Jan 11 11:48:07 rocinante kernel: [17611.289579] ? __pfx_worker_thread+0x10/0x10
Jan 11 11:48:07 rocinante kernel: [17611.289580] kthread+0xe5/0x120
Jan 11 11:48:07 rocinante kernel: [17611.289583] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:07 rocinante kernel: [17611.289584] ret_from_fork+0x44/0x70
Jan 11 11:48:07 rocinante kernel: [17611.289587] ? __pfx_kthread+0x10/0x10
Jan 11 11:48:07 rocinante kernel: [17611.289588] ret_from_fork_asm+0x1a/0x30
Jan 11 11:48:07 rocinante kernel: [17611.289590] </TASK>
Jan 11 11:48:07 rocinante kernel: [17611.289591] ---[ end trace 0000000000000000 ]---
Jan 11 11:48:07 rocinante kernel: [17611.289594] amdgpu 0000:03:00.0: amdgpu: MODE1 reset
Jan 11 11:48:07 rocinante kernel: [17611.289596] amdgpu 0000:03:00.0: amdgpu: GPU mode1 reset
Jan 11 11:48:07 rocinante kernel: [17611.289643] amdgpu 0000:03:00.0: amdgpu: GPU smu mode1 reset
Jan 11 11:48:10 rocinante kernel: [17614.607656] amdgpu 0000:03:00.0: amdgpu: SMU: I'm not done with your previous command: SMN_C2PMSG_66:0x00000036 SMN_C2PMSG_82:0x00000000
Jan 11 11:48:10 rocinante kernel: [17614.607659] amdgpu 0000:03:00.0: amdgpu: GPU mode1 reset failed
Jan 11 11:48:10 rocinante kernel: [17614.607660] amdgpu 0000:03:00.0: amdgpu: ASIC reset failed with error, -62 for drm dev, 0000:03:00.0
Jan 11 11:48:10 rocinante kernel: [17614.607681] amdgpu 0000:03:00.0: amdgpu: GPU reset(1) failed
Jan 11 11:48:10 rocinante kernel: [17614.607723] amdgpu 0000:03:00.0: amdgpu: GPU reset end with ret = -62
Jan 11 11:48:10 rocinante kernel: [17614.607725] amdgpu 0000:03:00.0: amdgpu: GPU Recovery Failed: -62
Jan 11 11:48:20 rocinante kernel: [17624.953388] amdgpu 0000:03:00.0: amdgpu: Dumping IP State
Jan 11 11:48:20 rocinante kernel: [17624.955194] amdgpu 0000:03:00.0: amdgpu: Dumping IP State Completed
Jan 11 11:48:20 rocinante kernel: [17624.955203] amdgpu 0000:03:00.0: amdgpu: ring sdma0 timeout, signaled seq=165768, emitted seq=165770
Jan 11 11:48:20 rocinante kernel: [17624.955205] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
Jan 11 11:48:20 rocinante kernel: [17624.955255] amdgpu 0000:03:00.0: amdgpu: Failed to disallow df cstate
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment