NVMe Identify Controller failed after 9 minutes reading directory '.': Input/output error

Hi All, I have been chasing my tail on and off for months.

I have a Raspberry Pi 5 Model B 16GB with PCIe Expansion Board For Raspberry Pi 5, Raspberry Pi 5 M.2 HAT, Raspberry Pi AI HAT+ 26 TOPS for Raspberry Pi 5, Raspberry Pi 5 Active Cooler, Raspberry Pi 5 White Power Supply (Official) USB-C PD 5.1V 27W with a SD card boot and a 1 tb Corsair MP600 MICRO NVMe SSD mounted as storage for video. It is running frigate NVR without issues for around 9 minutes then the NVMe becomes no longer accessible. The only solution is reboot the pi. Disk testing for I/O reveals no errors and speeds are as expected for a PCIE gen 2. Running Frigate NVR on the SD card without mounting the SSD for storage works as expected.

NVMe formatted as:

/dev/nvme0n1: UUID=“da24ca08-80c5-4286-84eb-81dffd3fb015” BLOCK_SIZE=“4096” TYPE=“ext4”

Errors seen after failure from testing:

Read NVMe Identify Controller failed: NVME_IOCTL_ADMIN_CMD: Input/output error

Smartctl open device: /dev/nvme0 failed: Resource temporarily unavailable

I have reinserted the PCI cable and NVMe SSD drive in the HAT.

I am lost to what to try next.

Any ideas please?

Cheers Matt