MCE
-
https://wiki.gentoo.org/wiki/Ryzen#Random_reboots_with_mce_events
-
https://www.reddit.com/r/AMDHelp/comments/nh7dlo/i_just_rmaed_my_cpu_to_get_the_same_mce_hardware/
-
https://github.com/qrwteyrutiyoup/ryzen-stabilizator
-
https://www.reddit.com/r/linuxhardware/comments/nsqz2l/amd_ryzen_9_5900x_mce_hardware_error/
-
https://www.reddit.com/r/AMDHelp/comments/pcp4fs/what_is_power_supply_idle_control_and_other_bios/
-
https://www.chiphell.com/forum.php?mod=viewthread&tid=2322849&extra=&mobile=1
-
https://www.overclock.net/threads/replaced-3950x-with-5950x-whea-and-reboots.1774627/
-
https://rog.asus.com/forum/showthread.php?121917-Rog-crosshair-viii-dark-hero-5950x-constant-reboots
-
https://www.reddit.com/r/ryzen/comments/kc3m6f/rog_crosshair_viii_dark_hero_5950x_constant/
1. OC → Advanced CPU Configuration → AMD CBS → Global C-State Control → Disabled
2. OC → Advanced CPU Configuration → PSS Support → Disabled
3. OC → Advanced CPU Configuration → Power Supply Idle Control → Typical Current Idle
Peter 2021-01-24 10:53:24 UTC
Rich and Joel,
I can’t say enough how much you made my day! The kernel parameter amdgpu.ppfeaturemask=0xffffbffd solved it after months of debugging.
For those coming here without having gone through the entire thread:
If 'journalctl | grep -i "hardware err"' returns errors like bea0000000000108 and microcode 8701021 or 8701013, and the BIOS is updated to the latest version, the kernel is up to date, and several passes of memtest86+ have run without errors, then if you have an AMD GPU, the problem might be related to that.
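The check above can also be scripted. This is a minimal Python sketch, assuming the journal has been saved as text lines; the function name mce_status_codes and the sample lines are made up for illustration and are not taken from a real journal.

```python
import re

# Same filter as grep -i "hardware err": case-insensitive substring match.
HW_ERR = re.compile(r"hardware err", re.IGNORECASE)
# A 16-hex-digit status word such as the one quoted in the comment.
STATUS = re.compile(r"\b[0-9a-f]{16}\b", re.IGNORECASE)

def mce_status_codes(log_lines):
    """Return the distinct 16-digit status words found on hardware-error lines."""
    codes = []
    for line in log_lines:
        if not HW_ERR.search(line):
            continue  # not a hardware-error line
        for code in STATUS.findall(line):
            code = code.lower()
            if code not in codes:
                codes.append(code)
    return codes

# Hypothetical sample lines, for illustration only.
sample = [
    "kernel: mce: [Hardware Error]: Machine check events logged",
    "kernel: mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 5: bea0000000000108",
    "kernel: usb 1-1: new high-speed USB device number 2",
]
print(mce_status_codes(sample))
```

Feeding it the output of journalctl (for example, saved to a file and read line by line) gives a quick list of status words to compare against the ones people report in the linked threads.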
GPU
Many report success by booting their kernels with amdgpu.ppfeaturemask=0xffffbffd. If that does not help, try amdgpu.dpm=0. If that works, either keep it as is, or remove it again and experiment with the other, less invasive ppfeaturemask settings discussed earlier in the thread. If none of this helps, the problem might be related to the CPU.
CPU
The first recommendation is generally to set “Cool’n’Quiet” to Disabled in the BIOS. If that does not help, also set “Power Supply Idle Control” to “Typical Current Idle”. This should already disable the problematic C-states (checking and even disabling C6 can also be done with https://github.com/r4m0n/ZenStates-Linux). If none of this helps, then the recommendation is to also set “Global C State Control” to Disabled. The next step would be to also set “SMT” in the Overclocking settings to Disabled.
Peter
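To see what amdgpu.ppfeaturemask=0xffffbffd actually changes, it helps to look at which bits are cleared compared with an all-ones 32-bit mask. The sketch below only does the bit arithmetic; the helper name cleared_bits is made up here, and which power feature each bit controls is defined in the kernel's amdgpu headers, so that mapping should be looked up for the kernel version in use rather than taken from this note.

```python
def cleared_bits(mask, width=32):
    """Return the bit positions that are 0 in `mask` within `width` bits."""
    return [bit for bit in range(width) if not (mask >> bit) & 1]

mask = 0xFFFFBFFD  # value from the comment above

print(cleared_bits(mask))        # [1, 14]
print(hex(0xFFFFFFFF & ~mask))   # 0x4002
```

So this value turns off two feature bits (bit 1 and bit 14) and leaves everything else enabled, which is why it is less invasive than amdgpu.dpm=0.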
A
- Power Supply Idle Control→Typical Current Idle
- XMP 3000 MHz
- PBO Disabled
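
After changing BIOS options like the ones above, it can be useful to check from Linux which CPU idle states the kernel exposes. This is a rough Python sketch that reads the standard cpuidle sysfs directory for cpu0; the function name list_idle_states is made up for illustration. It shows only what the kernel reports and cannot confirm the BIOS setting itself, and whether a given BIOS option changes this list depends on the board and firmware.

```python
from pathlib import Path

def list_idle_states(cpu_dir="/sys/devices/system/cpu/cpu0/cpuidle"):
    """Return (state_dir, name, disable_flag) for each idle state under cpu_dir."""
    states = []
    base = Path(cpu_dir)
    if not base.is_dir():
        return states  # cpuidle not exposed on this system
    # Sort numerically so state10 comes after state2.
    for state in sorted(base.glob("state[0-9]*"), key=lambda p: int(p.name[5:])):
        name = (state / "name").read_text().strip()
        disable_file = state / "disable"
        # "1" means the state is disabled at runtime, "0" means it is allowed.
        disabled = disable_file.read_text().strip() if disable_file.exists() else "?"
        states.append((state.name, name, disabled))
    return states

for entry in list_idle_states():
    print(entry)
```

Running it before and after a BIOS change gives two lists to compare.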