amdgpu driver crash: ring gfx_0.0.0 timeout

Bug #2031289 reported by Alan Pope 🍺🐧🐱 🦄
16
This bug affects 3 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Fix Released
Undecided
Unassigned

Bug Description

ThinkPad Z13 with 3 displays connected.

Using Ubuntu 23.04 with 6.2.0-26-generic, the graphical desktop locks or crashes. dmesg has the following error:

[Mon Aug 14 08:06:06 2023] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=5346515, emitted seq=5346517
[Mon Aug 14 08:06:06 2023] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 3456 thread Xorg:cs0 pid 3464
[Mon Aug 14 08:06:06 2023] amdgpu 0000:63:00.0: amdgpu: GPU reset begin!

This happens every few days, and I either have to restart the desktop or the entire computer. Today it happened at 08:06 when I wasn't at the machine doing anything. But many applications were still running, as I left it on Friday.

I have two external displays attached via a USB type C hub.

alan@ziggy:~$ xrandr | grep -B2 "*"
eDP connected 1920x1200+0+0 (normal left inverted right x axis y axis) 289mm x 186mm
   2880x1800 60.00 +
   1920x1200 60.00*
--
DisplayPort-6 disconnected (normal left inverted right x axis y axis)
DisplayPort-7 connected 1920x1080+3840+0 (normal left inverted right x axis y axis) 527mm x 296mm
   1920x1080 60.00*+ 50.00 59.94
--
   720x400 70.08
DisplayPort-8 connected primary 1920x1080+1920+0 (normal left inverted right x axis y axis) 527mm x 296mm
   1920x1080 60.00*+ 50.00 59.94

ProblemType: Bug
DistroRelease: Ubuntu 23.04
Package: linux-image-6.2.0-26-generic 6.2.0-26.26
ProcVersionSignature: Ubuntu 6.2.0-26.26-generic 6.2.13
Uname: Linux 6.2.0-26-generic x86_64
ApportVersion: 2.26.1-0ubuntu2
Architecture: amd64
CasperMD5CheckResult: unknown
CurrentDesktop: ubuntu:GNOME
Date: Mon Aug 14 08:57:10 2023
DistributionChannelDescriptor:
 # This is the distribution channel descriptor for the OEM CDs
 # For more information see http://wiki.ubuntu.com/DistributionChannelDescriptor
 canonical-oem-sutton-focal-amd64-20220803-89+sutton-focal-amd64+X02
InstallationDate: Installed on 2022-08-05 (373 days ago)
InstallationMedia: Ubuntu 20.04 "Focal" - Build amd64 LIVE Binary 20220803-13:42
MachineType: LENOVO 21D2CTO1WW
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.2.0-26-generic root=UUID=96c77f27-d90f-4d14-a5d5-59ac57b3f6dc ro quiet splash vt.handoff=7
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
RebootRequiredPkgs: Error: path contained symlinks.
RelatedPackageVersions:
 linux-restricted-modules-6.2.0-26-generic N/A
 linux-backports-modules-6.2.0-26-generic N/A
 linux-firmware 20230323.gitbcdcfbcf-0ubuntu1.2
SourcePackage: linux
UpgradeStatus: Upgraded to lunar on 2023-01-02 (223 days ago)
dmi.bios.date: 12/08/2022
dmi.bios.release: 1.27
dmi.bios.vendor: LENOVO
dmi.bios.version: N3GET47W (1.27 )
dmi.board.asset.tag: Not Available
dmi.board.name: 21D2CTO1WW
dmi.board.vendor: LENOVO
dmi.board.version: Not Defined
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: None
dmi.ec.firmware.release: 1.54
dmi.modalias: dmi:bvnLENOVO:bvrN3GET47W(1.27):bd12/08/2022:br1.27:efr1.54:svnLENOVO:pn21D2CTO1WW:pvrThinkPadZ13Gen1:rvnLENOVO:rn21D2CTO1WW:rvrNotDefined:cvnLENOVO:ct10:cvrNone:skuLENOVO_MT_21D2_BU_Think_FM_ThinkPadZ13Gen1:
dmi.product.family: ThinkPad Z13 Gen 1
dmi.product.name: 21D2CTO1WW
dmi.product.sku: LENOVO_MT_21D2_BU_Think_FM_ThinkPad Z13 Gen 1
dmi.product.version: ThinkPad Z13 Gen 1
dmi.sys.vendor: LENOVO

Revision history for this message
Alan Pope 🍺🐧🐱 🦄 (popey) wrote :
description: updated
Revision history for this message
Juerg Haefliger (juergh) wrote :

There are some AMD FW updates in lunar-proposed linux-firmware 20230323.gitbcdcfbcf-0ubuntu1.6. Can you give that a try?

Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
Alan Pope 🍺🐧🐱 🦄 (popey) wrote :

Thanks! I have just had another crash with linux-firmware 20230323.gitbcdcfbcf-0ubuntu1.2, so will try 20230323.gitbcdcfbcf-0ubuntu1.6 on my 23.04 system.

Revision history for this message
Alan Pope 🍺🐧🐱 🦄 (popey) wrote :

Four days later, no crash yet. Left the machine on over the weekend, and I see no crashes in the amdgpu driver. So it looks like the newer firmware seems to be better for me.

Revision history for this message
Juerg Haefliger (juergh) wrote :

Can I close the bug then?

Revision history for this message
Alan Pope 🍺🐧🐱 🦄 (popey) wrote :

Another weekend has passed, and no crash. Yes, thanks, you can close the bug.

Juerg Haefliger (juergh)
Changed in linux (Ubuntu):
status: Confirmed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.