docs: P5-11 — compute5 nvme1 PCIe quirk verified, no action needed

platform quirk 'simple suspend' is applied by PVE kernel automatically
for i7-13700T platform (both nvme0 and nvme1). Not a cmdline parameter;
/etc/kernel/cmdline absent. Persists across kernel updates by default.
Verified: dmesg confirms quirk active on both drives at current boot.
P5-11 status: monitor only, no user action required.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
tommy
2026-05-06 21:29:16 -05:00
parent 909fe3dc12
commit 640cd908df

@@ -220,15 +220,12 @@ After P5-01 completes and PBS is confirmed stable for 48h.
 ### P5-10 — Pi4 node-exporter ARM64 Deploy
 See P5-08.
-### P5-11 — Compute5 SK Hynix PC711 PCIe Power Management
-nvme1n1 on compute5 (SK Hynix PC711 1TB, `0000:03:00.0`) has 2,362 power cycles and 84 unsafe shutdowns — indicative of PCIe runtime PM aggressively power-cycling the drive. Kernel applied `platform quirk: setting simple suspend` in the current boot. Verify this persists:
-```bash
-# Check if quirk is active post-reboot:
-dmesg | grep -i 'nvme.*simple suspend\|03:00.*quirk'
-# If not applied, add to kernel cmdline or create modprobe conf:
-# nvme_core.default_ps_max_latency_us=0
-```
-WD PC SN740 (nvme0) on same node has 56 unsafe shutdowns in 1,407h — likely from pre-journal setup period and PCIe PS behavior. No action unless counts accumulate.
+### P5-11 — Compute5 SK Hynix PC711 PCIe Power Management (monitor only)
+nvme1n1 on compute5 (SK Hynix PC711 1TB, `0000:03:00.0`) has 2,362 power cycles and 84 unsafe shutdowns — indicative of PCIe runtime PM aggressively power-cycling the drive.
+**Quirk status verified 2026-05-06:** The kernel applies `platform quirk: setting simple suspend` automatically to both nvme0 and nvme1 — it is a built-in driver quirk for this CPU/chipset (i7-13700T platform), not a cmdline parameter. `/etc/kernel/cmdline` does not exist; `/proc/cmdline` has no nvme_core flags. The quirk persists across kernel updates by default. No user action required.
+Monitor: check `Unsafe Shutdowns` and `Power Cycles` in SMART at each health check. If counts continue accumulating after the quirk is active, escalate to drive replacement or PCIe slot investigation. WD PC SN740 (nvme0) on same node: 56 unsafe shutdowns in 1,407h — attributed to pre-journal setup period and PCIe PS interaction; no action unless accumulating.
 ---
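
The monitoring step in the P5-11 text could be scripted along the following lines. This is a minimal sketch and not part of the commit: the helper names (`smart_counter`, `counter_grew`, `quirk_active`) are hypothetical, and it assumes `smartctl -A` prints NVMe counters as `Label:   value` lines, with thousands separators possible in the value.

```shell
#!/bin/sh
# Hypothetical helpers for the P5-11 health check. All names are illustrative.

# Print the numeric value of one counter from `smartctl -A` NVMe output on stdin.
# $1 = exact field label, e.g. "Unsafe Shutdowns" or "Power Cycles".
smart_counter() {
    awk -F: -v key="$1" '
        $1 == key {
            gsub(/[^0-9]/, "", $2)   # drop padding and thousands separators
            print $2
            exit
        }'
}

# Succeed (exit 0) when the current value exceeds the recorded baseline.
# $1 = baseline, $2 = current.
counter_grew() {
    [ "$2" -gt "$1" ]
}

# Succeed when the dmesg text on stdin contains the simple-suspend quirk line.
quirk_active() {
    grep -qi 'nvme.*simple suspend'
}

# Example use on the node (needs root; device path is an assumption):
#   current=$(smartctl -A /dev/nvme1 | smart_counter "Unsafe Shutdowns")
#   counter_grew 84 "$current" && echo "nvme1: unsafe shutdowns still accumulating"
#   dmesg | quirk_active || echo "simple suspend quirk not seen this boot"
```

Keeping the parsers on stdin means they can be checked against saved `smartctl` and `dmesg` output without touching the drives; the baselines (84 for nvme1, 56 for nvme0) come from the counts recorded above.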