Neon won't boot without nomodeset after upgrading CPU

I had a Ryzen 5 2600 CPU until today, when I replaced it with a Ryzen 9 5900X.

Windows doesn’t seem to have any problems at all (I dual boot) but KDE Neon runs into some problem while booting and reboots immediately.

I can’t see what the problem might be as it’s gone too fast, often before my monitor has time to change resolution (partway through booting).

If I add “nomodeset” to the kernel’s options, the system boots, but of course I’m not using the graphics drivers.

Since it seems to be relevant (unless it’s some other drivers that nomodeset removes), my graphics card is a Radeon RX 6800.

I’m on the latest kernel I can find in the repositories, 6.5.0-14-generic. As far as I can see, the CPU is supported since the 5.x series.

Incidentally, the Live version of KDE Neon (from a USB) doesn’t boot without nomodeset either.

What could the problem be? Is there a way to figure out what problem the kernel runs into on a normal boot?

#check boot logs and messages for issues
journalctl -b -p3
systemctl --failed
inxi -Fazu
sudo dmesg -tdH

Thanks. Here is the output of each of those. I didn’t notice anything very obvious, but I’m not sure how much of that is warnings that would appear anyway.

dmesg: [< 0.000000>] Linux version 6.5.0-14-generic (buildd@lcy02-amd64-110) (x86_64 - Pastebin.com

inxi:

System:
  Kernel: 6.5.0-14-generic x86_64 bits: 64 compiler: N/A
    parameters: BOOT_IMAGE=/boot/vmlinuz-6.5.0-14-generic
    root=UUID=19060cbe-6a72-41b4-9c80-395331524432 ro nomodeset
  Desktop: KDE Plasma 5.27.10 tk: Qt 5.15.11 wm: kwin_x11 vt: 2 dm: SDDM
    Distro: KDE neon 22.04 5.27 base: Ubuntu 22.04 LTS Jammy
Machine:
  Type: Desktop Mobo: ASRock model: B450 Pro4 serial: <superuser required>
    UEFI: American Megatrends v: P8.02 date: 02/06/2023
CPU:
  Info: model: AMD Ryzen 9 5900X bits: 64 type: MT MCP arch: Zen 3
    family: 0x19 (25) model-id: 0x21 (33) stepping: 2 microcode: 0xA20120A
  Topology: cpus: 1x cores: 12 tpc: 2 threads: 24 smt: enabled cache:
    L1: 768 KiB desc: d-12x32 KiB; i-12x32 KiB L2: 6 MiB desc: 12x512 KiB
    L3: 64 MiB desc: 2x32 MiB
  Speed (MHz): avg: 2345 high: 3700 min/max: 2200/4950 boost: enabled
    scaling: driver: acpi-cpufreq governor: schedutil cores: 1: 2193 2: 2200
    3: 2200 4: 2200 5: 2200 6: 2192 7: 3700 8: 2200 9: 2200 10: 2870 11: 2200
    12: 2200 13: 2200 14: 2200 15: 2193 16: 2200 17: 2200 18: 2200 19: 2200
    20: 2200 21: 2200 22: 2200 23: 2873 24: 2869 bogomips: 177254
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
  Vulnerabilities:
  Type: gather_data_sampling status: Not affected
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: retbleed status: Not affected
  Type: spec_rstack_overflow mitigation: safe RET, no microcode
  Type: spec_store_bypass
    mitigation: Speculative Store Bypass disabled via prctl
  Type: spectre_v1
    mitigation: usercopy/swapgs barriers and __user pointer sanitization
  Type: spectre_v2 mitigation: Retpolines, IBPB: conditional, IBRS_FW,
    STIBP: always-on, RSB filling, PBRSB-eIBRS: Not affected
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: AMD Navi 21 [Radeon RX 6800/6800 XT / 6900 XT] vendor: Gigabyte
    driver: N/A alternate: amdgpu pcie: gen: 4 speed: 16 GT/s lanes: 16
    bus-ID: 0c:00.0 chip-ID: 1002:73bf class-ID: 0300
  Device-2: WaveRider USB 2.0 Camera type: USB
    driver: snd-usb-audio,uvcvideo bus-ID: 3-1:2 chip-ID: 0c46:64ab
    class-ID: 0102 serial: <filter>
  Display: x11 server: X.Org v: 1.21.1.4 compositor: kwin_x11 driver: X:
    loaded: ati,vesa unloaded: fbdev,modesetting,radeon gpu: N/A display-ID: :0
    screens: 1
  Screen-1: 0 s-res: 800x600 s-dpi: 96 s-size: 212x159mm (8.3x6.3")
    s-diag: 265mm (10.4")
  Monitor-1: default res: 800x600 hz: 75 size: N/A
  OpenGL: renderer: llvmpipe (LLVM 15.0.7 256 bits)
    v: 4.5 Mesa 23.0.4-0ubuntu1~22.04.1 direct render: Yes
Audio:
  Device-1: AMD Navi 21 HDMI Audio [Radeon RX 6800/6800 XT / 6900 XT]
    driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
    bus-ID: 0c:00.1 chip-ID: 1002:ab28 class-ID: 0403
  Device-2: AMD Starship/Matisse HD Audio vendor: ASRock
    driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
    bus-ID: 0e:00.4 chip-ID: 1022:1487 class-ID: 0403
  Device-3: WaveRider USB 2.0 Camera type: USB
    driver: snd-usb-audio,uvcvideo bus-ID: 3-1:2 chip-ID: 0c46:64ab
    class-ID: 0102 serial: <filter>
  Sound Server-1: ALSA v: k6.5.0-14-generic running: yes
  Sound Server-2: PulseAudio v: 15.99.1 running: yes
  Sound Server-3: PipeWire v: 0.3.48 running: yes
Network:
  Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: ASRock driver: r8169 v: kernel pcie: gen: 1 speed: 2.5 GT/s
    lanes: 1 port: d000 bus-ID: 09:00.0 chip-ID: 10ec:8168 class-ID: 0200
  IF: enp9s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IF-ID-1: wg-mullvad state: unknown speed: N/A duplex: N/A mac: N/A
Bluetooth:
  Device-1: Realtek Bluetooth Radio type: USB driver: btusb v: 0.8
    bus-ID: 1-1:2 chip-ID: 0bda:8771 class-ID: e001 serial: <filter>
  Report: hciconfig ID: hci0 rfk-id: 0 state: up address: <filter>
    bt-v: 3.0 lmp-v: 5.1 sub-v: d922 hci-v: 5.1 rev: dfc6
  Info: acl-mtu: 1021:6 sco-mtu: 255:12
    link-policy: rswitch hold sniff park link-mode: peripheral accept
    service-classes: rendering, capturing, object transfer, audio, telephony
Drives:
  Local Storage: total: 5 TiB used: 2.7 TiB (53.9%)
  SMART Message: Unable to run smartctl. Root privileges required.
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Western Digital
    model: WDS500G3X0C-00SJG0 size: 465.76 GiB block-size: physical: 512 B
    logical: 512 B speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter>
    rev: 111110WD temp: 38.9 C scheme: GPT
  ID-2: /dev/sda maj-min: 8:0 vendor: Seagate model: ST1000DM010-2EP102
    size: 931.51 GiB block-size: physical: 4096 B logical: 512 B
    speed: 6.0 Gb/s type: HDD rpm: 7200 serial: <filter> rev: CC43
    scheme: GPT
  ID-3: /dev/sdb maj-min: 8:16 vendor: Seagate model: ST4000NE001-2MA101
    size: 3.64 TiB block-size: physical: 4096 B logical: 512 B speed: 6.0 Gb/s
    type: HDD rpm: 7200 serial: <filter> rev: EN01 scheme: GPT
Partition:
  ID-1: / raw-size: 42.97 GiB size: 41.99 GiB (97.73%) used: 26.3 GiB (62.6%)
    fs: ext4 dev: /dev/nvme0n1p6 maj-min: 259:6
    uuid: 19060cbe-6a72-41b4-9c80-395331524432
  ID-2: /boot/efi raw-size: 99 MiB size: 95 MiB (95.96%)
    used: 33.8 MiB (35.6%) fs: vfat dev: /dev/nvme0n1p2 maj-min: 259:2
    uuid: DA3C-9F94
  ID-3: /home raw-size: 931.51 GiB size: 916.7 GiB (98.41%)
    used: 539.83 GiB (58.9%) fs: ext4 dev: /dev/sda1 maj-min: 8:1
    uuid: 70b57ee2-abf0-4d2a-bd64-984ee423991a
Swap:
  Kernel: swappiness: 60 (default) cache-pressure: 100 (default)
  ID-1: swap-1 type: partition size: 7.81 GiB used: 0 KiB (0.0%)
    priority: -2 dev: /dev/sdb2 maj-min: 8:18
    uuid: 8828b589-dbf7-4047-952f-94799a518ef6
Sensors:
  Message: No sensor data found. Is lm-sensors configured?
Info:
  Processes: 384 Uptime: 7m wakeups: 0 Memory: 15.53 GiB
  used: 2.27 GiB (14.6%) Init: systemd v: 249 runlevel: 5 tool: systemctl
  Compilers: gcc: 11.4.0 alt: 11/12 Packages: 2509 apt: 2501 lib: 1481
  flatpak: 8 Shell: fish v: 3.3.1 running-in: konsole inxi: 3.3.13

journalctl:

Dec 14 21:43:16 zoig-neon kernel: mce: [Hardware Error]: CPU 19: Machine Check: 0 Bank 5: bea0000001000108
Dec 14 21:43:16 zoig-neon kernel: mce: [Hardware Error]: TSC 0 ADDR ffffffc0680b4e MISC d0130fff00000000 SYND 4d000000 IPID 500b000000000 
Dec 14 21:43:16 zoig-neon kernel: mce: [Hardware Error]: PROCESSOR 2:a20f12 TIME 1702590189 SOCKET 0 APIC 13 microcode a20120a
Dec 14 21:43:16 zoig-neon systemd-udevd[541]: /etc/udev/rules.d/40-libsane.rules:26: GOTO="libsane_rules_end" has no matching label, ignoring
Dec 14 21:43:16 zoig-neon systemd-udevd[541]: /etc/udev/rules.d/S99-2000S1.rules:26: GOTO="libsane_rules_end" has no matching label, ignoring
Dec 14 21:43:18 zoig-neon kernel: usb 3-1: Failed to query (GET_INFO) UVC control 5 on unit 1: 0 (exp. 1).
Dec 14 21:43:18 zoig-neon kernel: usb 3-1: Failed to query (GET_INFO) UVC control 7 on unit 1: 0 (exp. 1).
Dec 14 21:43:18 zoig-neon kernel: usb 3-1: 3:1: cannot get freq at ep 0x84
Dec 14 21:43:20 zoig-neon systemd[1189]: pulseaudio.socket: Socket service pulseaudio.service not loaded, refusing.
Dec 14 21:43:20 zoig-neon systemd[1189]: Failed to listen on Sound System.
Dec 14 21:43:21 zoig-neon systemd[1189]: Failed to start Daemon to manage razer devices in userspace.
Dec 14 21:43:30 zoig-neon sddm-helper[1354]: gkr-pam: unable to locate daemon control file
Dec 14 21:43:31 zoig-neon kernel: usb 3-1: 3:1: cannot get freq at ep 0x84
Dec 14 21:43:31 zoig-neon kernel: usb 3-1: 3:1: cannot get freq at ep 0x84
Dec 14 21:43:34 zoig-neon hp-systray[2505]: hp-systray[2505]: error: option -s not recognized
Dec 14 21:43:40 zoig-neon kup-daemon[2584]: kup.daemon: "Kup is not enabled, enable it from the system settings module. You can do that by running kcmshell5 kup"

systemctl:

  UNIT LOAD ACTIVE SUB DESCRIPTION
0 loaded units listed.

i’m just a bystander reading the tea leaves but maybe there’s something wrong with CPU and or it doesn’t work with your onboard audio chip?

it’s possible that windows can limp along without core 19 but linux is being more picky?

those hardware errors are a problem.

I’m not entirely sure those are related. I found this thread: System auto-rebooted... mce: [Hardware Error] in dmesg related to CPU - #9 by Daniel-I - Support - Manjaro Linux Forum which is about a different model, but I tried that setting in the BIOS and those “hardware error” messages have disappeared from journalctl, every time I’ve looked at it since.

However the problem continues. One time it did boot, somehow, but I restarted it immediately to see if it would happen again, and it went back to not booting.

well you cleaned up 3 errors… just keep going i guess

this is what the output of journalctl -b -p3 looks like on my machine

Dec 14 12:32:52 FXPC kernel: hid-generic 0003:1532:026C.0004: No inputs registered, leaving

that’s it, just one line… which has to do with my razer keyboard and there doesn’t seem to be anything i can do about it… it’s certainly not stopping me from booting.

perhaps you need a firmware update… either your bios or the microcode (or both)

in discover, you can check under settings to see if the sources are showing linux firmware updates are turned on, just in case that matters, and then check your mothboard maker to see if there is a bios update for newer CPUs

Those errors are back, despite that change, and I’ve no idea what to look at really.

BIOS is updated, done that days ago, and no newer microcode is available. I re-installed the package anyway, no change.

I wonder if I should try to just put back the old CPU, assuming I haven’t broken it. It’s just too much of a hassle, compared to any other component.

maybe.

or maybe just try re-seating the CPU in the socket once before you give up.

ur sure this CPU is fully supported by your old motherboard?

I usually trust pcpartpicker for that sort of thing, and it says yes.

see post #15

https://bbs.archlinux.org/viewtopic.php?id=266210

or

try sudo less -f /dev/mcelog and see what it shows (if anything).

Absolutely nothing in mcelog.
I think I’ll try to replace the old CPU later today.

I put the old one back and everything seems OK thus far…
If it was an overheating issue, hopefully I did better with the paste this time around. But I’m not experimenting more any time soon.