Hey, so I set up a few nodes on a new PC and I see this happening every day on all 3 nodes. 2 of the disks are used and 1 is brand new, and all of them crash at the same time,
all of them with the same fatal error:
2024-09-04T19:57:56+03:00
ERROR
failure during run
{error: piecestore monitor: timed out after 1m0s while verifying readability of storage directory, errorVerbose: piecestore monitor: timed out after 1m0s while verifying readability of storage directory\n\tstorj.io/storj/storagenode/monitor.(*Service).Run.func1.1:153\n\tstorj.io/common/sync2.(*Cycle).Run:160\n\tstorj.io/storj/storagenode/monitor.(*Service).Run.func1:140\n\tgolang.org/x/sync/errgroup.(*Group).Go.func1:78}
2024-09-04T19:57:56+03:00
FATAL
Unrecoverable error
{error: piecestore monitor: timed out after 1m0s while verifying readability of storage directory, errorVerbose: piecestore monitor: timed out after 1m0s while verifying readability of storage directory\n\tstorj.io/storj/storagenode/monitor.(*Service).Run.func1.1:153\n\tstorj.io/common/sync2.(*Cycle).Run:160\n\tstorj.io/storj/storagenode/monitor.(*Service).Run.func1:140\n\tgolang.org/x/sync/errgroup.(*Group).Go.func1:78}
I know Alexey will say the disks are too slow, but there is nothing else running on them. They sit at almost 0% all day after they finish the filewalkers.
Well, it says it timed out waiting for a response back from the drive. So the drive didn’t return the data within one minute. Perhaps your drive is going to sleep at that time due to some power setting? Or your drive is indeed slow.
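To see whether a drive really takes that long to answer, you could time a small read yourself. This is only a rough sketch of the idea and not how the storagenode actually performs its check; the function name `probe_readability` and the 4 KiB read size are made up for illustration, and the 60 second default just mirrors the `1m0s` in the log.

```python
import os
import threading
import time


def probe_readability(directory, timeout_s=60.0):
    """Try to list `directory` and read a little data from one file in it.

    Returns (ok, elapsed_seconds). ok is False if the probe hit an OSError
    or did not finish within timeout_s.
    """
    outcome = {"done": False, "error": None}

    def work():
        try:
            for name in os.listdir(directory):
                path = os.path.join(directory, name)
                if os.path.isfile(path):
                    with open(path, "rb") as fh:
                        fh.read(4096)  # a small read is enough to touch the disk
                    break
        except OSError as exc:
            outcome["error"] = exc
        outcome["done"] = True

    start = time.monotonic()
    # Daemon thread: a read that hangs on a stalled disk will not block exit.
    worker = threading.Thread(target=work, daemon=True)
    worker.start()
    worker.join(timeout_s)
    elapsed = time.monotonic() - start
    ok = outcome["done"] and outcome["error"] is None
    return ok, elapsed
```

Call it with the node's storage folder, for example `probe_readability(r"D:\storagenode\storage")` (that path is hypothetical), ideally around the time the nodes usually crash. A fast result can come from the OS cache, so only a slow or failed probe tells you much.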
Something in your pc (assuming the disks are internal) is a bottleneck. You could check Process Monitor (from MS) to check the I/O and/or CPU/RAM usage. That might give you some clues.
There are only two things that can freeze/seize up Windows: the GPU driver and the HDD. In the latter case, you may not necessarily notice that your entire disk subsystem is frozen. As long as there is RAM, the kernel will continue to service requests and the system will appear fine, but any disk access will freeze until the bus flushes/clears/reconnects.
#1. Check your OS drive for errors. #2. Check your three remaining drives; any one drive can freeze a system. #3. Check whether any of those are dynamic .vhdx and whether you have previously expanded any volumes on them.
If you don’t do a DISM repair first, SFC may find errors but still not solve your problems. If you skipped it, go back and do the DISM online repair, then run SFC again. DISM repairs the repository that SFC uses to do its repairs.
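The order described above can be sketched as a small script. The post doesn't give exact flags, so `/Online /Cleanup-Image /RestoreHealth` is my assumption for what "DISM online repair" means here, and `run_repairs` is just an illustrative name. Both tools need an elevated (Administrator) prompt.

```python
import platform
import subprocess

# Order matters: DISM repairs the component store that SFC pulls its files from.
# The DISM flags are an assumption; the post only says "DISM online repair".
REPAIR_STEPS = [
    ["DISM", "/Online", "/Cleanup-Image", "/RestoreHealth"],
    ["sfc", "/scannow"],
]


def run_repairs(dry_run=True):
    """Go through the steps in order and return the command lines handled.

    With dry_run=True (or on a non-Windows system) the commands are only
    printed. Otherwise each one is executed, stopping at the first step
    that exits with a non-zero code.
    """
    handled = []
    for cmd in REPAIR_STEPS:
        line = " ".join(cmd)
        handled.append(line)
        if dry_run or platform.system() != "Windows":
            print("would run:", line)
            continue
        if subprocess.run(cmd).returncode != 0:
            break
    return handled
```

Running `run_repairs()` as-is only prints the two commands, so you can check them before running `run_repairs(dry_run=False)` from an elevated prompt.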
You see, it’s a different problem here: all nodes crash at the same time with the same error. This had to be something else, and in my case sfc /scannow helped and the nodes haven’t crashed in 2 days.
sfc /scannow helped, while chkdsk didn’t find anything wrong. I do have my PC on power saver, though. My CPU sometimes goes down to 1.5 GHz, which might be too slow.